Indietro

#bioinformatics

30 APIs con questa etichetta

Genome Assemblies API

Reference genome assemblies as an API — powered by NCBI Assembly, the registry of genome builds for organisms across the tree of life. Search assemblies by organism (or free text) and look up any assembly's metadata: its accession (GCF_… RefSeq or GCA_… GenBank), name (e.g. GRCh38.p14), organism and taxon id, assembly level (complete genome, chromosome, scaffold or contig), contiguity statistics (contig and scaffold N50), sequencing coverage, RefSeq category, UCSC and Ensembl names, the submitting organization, release date and FTP download paths. From the human reference genome to any sequenced microbe, plant or animal, it turns the genome-assembly registry into a clean search-and-fetch API. A genome-assembly registry — distinct from sequence (ENA), genome annotation (Ensembl), variant (ClinVar, dbVar) and gene-expression (GEO) databases. Open data from NCBI Assembly (public domain).

api.oanor.com/genomes-api

Gene Expression API

Functional-genomics experiments as an API — powered by NCBI GEO (Gene Expression Omnibus), the largest public repository of gene-expression data. GEO archives expression series and curated datasets from microarray and high-throughput-sequencing experiments across every organism. Search experiments by keyword and optionally by organism, and look up any series or dataset to get its metadata: title, summary, assay type (expression profiling by array or by sequencing), organism, number of samples, platform and the publication behind it. From β-cell stress studies to cancer transcriptomics across human and mouse, it turns the GEO archive into a simple search-and-fetch API for transcriptomics, bioinformatics and research-data discovery. A gene-expression / functional-genomics dataset repository — distinct from sequence (ENA), variant (ClinVar, dbVar), structure (PDB) and ontology databases. Open data from NCBI GEO (public domain).

api.oanor.com/geodatasets-api

Structural Variants API

Human genomic structural variation as an API — powered by NCBI dbVar, the archive of structural variants (SVs): copy-number variants (CNVs), large deletions, duplications, insertions, inversions and translocations, typically larger than 50 base pairs. This is the structural counterpart to single-nucleotide variant databases: search structural variants overlapping a gene (or by free text) and get each variant's dbVar accession, the study it came from, its type, the genes it overlaps, its genomic placement on GRCh38 and its clinical significance; then look up any variant for the full record — placements on both GRCh37 and GRCh38 assemblies, variant type, genes, clinical significance, study type, methods and variant counts. From BRCA1 CNVs to Cri-du-chat deletions, it is ideal for genomics, cytogenetics, rare-disease and bioinformatics work. A structural-variation / CNV resource — distinct from clinical single-nucleotide variant interpretation (ClinVar), population allele frequencies (gnomAD) and trait associations (GWAS). Open data from NCBI dbVar (public domain).

api.oanor.com/dbvar-api

Protein Interactions API

Protein-protein interaction networks as an API — powered by STRING, the database of known and predicted protein associations that combines evidence from laboratory experiments, curated pathway databases, gene co-expression, genomic context and automated text mining into a single confidence score, across thousands of organisms. Get a protein's top interaction partners (each with the combined confidence score and the seven evidence-channel subscores), the interaction network among any set of proteins as scored edges, and functional enrichment for a gene set — the over-represented GO terms, KEGG pathways, Pfam domains and more, each with its p-value, FDR and member genes. Pass gene symbols (TP53) or STRING/Ensembl ids, for human (default) or any species by NCBI taxon id. It is a cornerstone of systems biology — ideal for network analysis, functional genomics, pathway and bioinformatics tools. A protein-interaction-network resource — distinct from biological pathways (Reactome), curated protein complexes (Complex Portal) and Gene Ontology annotations (QuickGO). Open data from STRING (CC BY 4.0).

api.oanor.com/stringdb-api

Polygenic Scores API

Polygenic (risk) scores as an API — powered by the NHGRI-EBI PGS Catalog, the open database of published polygenic scores: weighted combinations of genetic variants used to estimate a person's genetic predisposition to a trait or disease. Search traits by name to find their ontology ids, list every polygenic score developed for a trait, and read a score's full metadata — the reported and mapped (EFO/MONDO) traits, the number of variants in the score, the development method, genome build, the ancestry distribution of the samples it was built and evaluated on, the publication behind it (title, journal, date, PubMed id), the release date, license and a direct link to the scoring file. From breast cancer and coronary artery disease to type 2 diabetes and BMI, it is ideal for statistical genetics, genomics, risk-prediction research and bioinformatics tools. A polygenic-score / genetic-risk-prediction resource — distinct from single-variant association studies (GWAS Catalog), population allele frequencies (gnomAD) and clinical variant interpretation (ClinVar). Open data from the NHGRI-EBI PGS Catalog (CC BY 4.0).

api.oanor.com/pgs-api

Gene Ontology API

Gene function as an API — powered by EMBL-EBI's QuickGO and the Gene Ontology (GO), the standard vocabulary that describes what gene products do across three aspects: molecular function, biological process and cellular component. Given a gene or protein (a UniProt accession), list every GO annotation made for it — the GO term, its aspect, the qualifier, the evidence code, the supporting reference (e.g. a PubMed id), the organism and who assigned it — optionally filtered by aspect or organism. Look up any GO term to get its definition, aspect, synonyms and number of child terms; and search the ontology by name to find the right GO terms. GO term names are resolved automatically on annotations. From TP53 to any protein in any species, it is the backbone of functional genomics — ideal for enrichment analysis, annotation pipelines, bioinformatics and research tools. A gene-function annotation resource (which genes have which functions, with evidence) — distinct from generic ontology-term lookup. Open data from EMBL-EBI QuickGO and the GO Consortium (CC BY 4.0).

api.oanor.com/quickgo-api

GWAS Catalog API

Human genetic trait associations as an API — powered by the NHGRI-EBI GWAS Catalog, the curated reference of published genome-wide association studies. It answers the core question of statistical genetics: which genetic variants (SNPs) are associated with which traits and diseases, and how strongly. Look up a SNP to get its functional class, genomic location and mapped genes; pull every trait association reported for it — the trait, p-value, effect size (odds ratio or beta), risk allele and frequency, and author-reported genes; and read the study behind the evidence — trait, sample sizes, ancestries, genotyping technology and the publication (PubMed id, authors, journal, date). From type 2 diabetes and Crohn disease to systemic lupus erythematosus and hundreds of thousands of associations, it is ideal for genomics, bioinformatics, statistical-genetics and biomedical research tools. A published genetic-association evidence base — distinct from population allele frequencies (gnomAD), clinical variant interpretation (ClinVar) and genome annotation (Ensembl). Open data from the NHGRI-EBI GWAS Catalog (EMBL-EBI).

api.oanor.com/gwas-api

BioSamples API

BioSamples como API, impulsado por EMBL-EBI — la base de datos que almacena y vincula los metadatos de muestras biológicas, los especímenes físicos detrás de los experimentos biológicos. Una muestra en BioSamples lleva un acceso estable (como SAMEA3231268) y un rico conjunto de características — organismo, tejido o parte del organismo, tipo celular, sexo, enfermedad, etapa de desarrollo, cepa y cualquier atributo proporcionado por el remitente — y es referenciada por otros archivos de EBI, incluidos el Archivo Europeo de Nucleótidos (ENA), ArrayExpress y PRIDE. /v1/search?q=liver busca muestras por texto libre y devuelve el acceso, nombre, organismo y fecha de publicación de cada coincidencia. /v1/sample?id=SAMEA3231268 devuelve los metadatos de una muestra — su acceso, nombre, ID de taxón NCBI, organismo, fechas de publicación y actualización, el número de relaciones con otras muestras, y sus características aplanadas en un mapa limpio clave→valor. Los accesos tienen el formato SAMEA…, SAMN… o SAMD…; obtenga uno desde el endpoint de búsqueda. Ideal para integración de datos en ciencias de la vida, seguimiento de muestras, armonización de metadatos y vinculación de datos de secuenciación o expresión con su espécimen fuente. Datos de EMBL-EBI BioSamples (público). Este es un registro de metadatos de muestras biológicas — distinto de las bases de datos de estudios (BioStudies), secuencias (ENA), variantes (ClinVar) y estructuras.

api.oanor.com/biosamples-api

BioStudies API

BioStudies as an API, powered by EMBL-EBI — the database that holds the descriptions of biological studies and links their data together across EBI resources, including imaging (BioImage Archive), functional genomics (ArrayExpress), proteomics, and the literature (Europe PMC). Each study has an accession, a title and abstract, the collection it belongs to and links to its underlying data and publications. /v1/search?query=covid searches the studies and returns each match's accession (e.g. S-EPMC8017430), title, author, study type, release date and link/file counts. /v1/study?id=S-EPMC8017430 returns a study's metadata — its accession, the collection it belongs to (such as EuropePMC, ArrayExpress or BioImages), title, abstract, release date, authors and the number of linked resources. Accessions look like S-EPMC8017430 or S-BSST123; get one from the search endpoint. Ideal for research-data discovery, linking literature to its underlying datasets, systematic reviews and reproducibility tooling. Data from EMBL-EBI BioStudies (public). This is a studies and datasets metadata index — distinct from the sequence (UniProt, ENA), structure (PDB, EMDB), variant (ClinVar) and ontology databases.

api.oanor.com/biostudies-api

EMDB API

The Electron Microscopy Data Bank (EMDB) as an API, powered by EMBL-EBI — the public archive of three-dimensional electron-microscopy density maps of proteins, nucleic acids and large macromolecular complexes. EMDB is the electron-microscopy counterpart of the Protein Data Bank, holding maps solved by single-particle cryo-EM, electron tomography and electron crystallography, the technique behind the recent "resolution revolution" in structural biology. /v1/search?q=ribosome searches the archive and returns each matching entry's EMDB id (e.g. EMD-1010), title, electron-microscopy method and resolution in ångström. /v1/entry?id=EMD-1010 returns an entry's metadata — its title, the EM method (single particle, tomography, …), the aggregation state, the resolution, the biological sample studied, classification keywords, the deposition, map-release and last-update dates, and the depositing authors. EMDB ids look like EMD-1010, and you may pass just the number. Ideal for structural-biology and cryo-EM tools, structure-comparison and visualisation apps, and education. Data from EMBL-EBI EMDB (public domain). This is the archive of experimental electron-microscopy MAPS — distinct from atomic-coordinate structures (the PDB), predicted structures (AlphaFold) and protein-sequence databases (UniProt).

api.oanor.com/emdb-api

BioModels API

BioModels como API, impulsado por EMBL-EBI — el repositorio más grande del mundo de modelos matemáticos curados y publicados de sistemas biológicos. BioModels recopila modelos computacionales (principalmente en SBML, el Lenguaje de Marcado de Biología de Sistemas) de metabolismo, señalización celular, redes de regulación genética, ciclo celular, procesos de enfermedades y fisiología, cada uno vinculado a la publicación revisada por pares de la que proviene. /v1/search?query=glycolysis busca en el repositorio y devuelve el id de cada modelo coincidente (como BIOMD0000000012), nombre, formato, remitente y fechas de envío/modificación. /v1/model?id=BIOMD0000000012 devuelve los metadatos de un modelo: su nombre y descripción, el formato de codificación, el enfoque de modelado (por ejemplo, modelo de ecuaciones diferenciales ordinarias), el estado de curación, la publicación detrás (título, revista, año, autores) y los archivos del modelo. Los ids de modelo se ven como BIOMD0000000012 para modelos curados o MODEL1234567890 para envíos no curados; obténgalos del endpoint de búsqueda. Ideal para herramientas de biología de sistemas y modelado computacional, flujos de trabajo de investigación reproducible y reutilización de modelos, y enseñanza. Datos de EMBL-EBI BioModels (CC0). Este es un repositorio de modelos computacionales / biología de sistemas — distinto de las bases de datos de secuencias (UniProt, ENA), estructuras (PDB, AlphaFold), rutas y variantes (ClinVar).

api.oanor.com/biomodels-api

UCSC Genome API

The UCSC Genome Browser as an API — reference genome data for hundreds of species, from the renowned UCSC Genome Browser at UC Santa Cruz. /v1/genomes lists the 220+ genome assemblies UCSC hosts, each with its assembly id (such as hg38 for human, mm39 for mouse, danRer11 for zebrafish), organism, description and data source. /v1/chromosomes?genome=hg38 returns an assembly's chromosomes and sequences with their sizes in base pairs, largest first. /v1/sequence?genome=hg38&chrom=chrM&start=0&end=100 retrieves the raw DNA sequence of any genomic region (0-based start, half-open end; regions are capped at 100,000 bases per call). Assembly ids come from /v1/genomes and chromosome names look like chr1, chrX or chrM. Ideal for bioinformatics pipelines, genome-visualisation and primer-design tools, region and sequence lookups, comparative genomics and teaching. Data from the UCSC Genome Browser (free for academic, non-profit and personal use). This is the genome browser's assemblies and raw reference sequence — distinct from gene-annotation and protein-sequence databases such as Ensembl, UniProt and ENA.

api.oanor.com/ucsc-api

ClinVar API

ClinVar como API, alimentado por la Biblioteca Nacional de Medicina de EE. UU. a través de las utilidades E de NCBI. ClinVar es el archivo público de las relaciones entre variantes genéticas humanas y la salud, registrando la interpretación clínica (significado) de cada variante — si es Patogénica, Probablemente patogénica, de Significado incierto, Probablemente benigna o Benigna — junto con las condiciones asociadas. /v1/search?gene=BRCA1 busca en ClinVar por símbolo de gen, o por texto libre con q= (por ejemplo, una enfermedad o expresión HGVS), devolviendo el número total de variantes coincidentes y una lista de identificadores de variantes de ClinVar. /v1/variant?id=4852102 devuelve un resumen de una variante: su acceso de ClinVar (VCV…), título, tipo de variante, los nombres de la variante y cDNA, la clasificación clínica y el estado de revisión, la(s) condición(es) asociada(s), el(los) gen(es) y el gen primario, el cromosoma y la ubicación, el cambio de proteína y la consecuencia molecular, además de un enlace al registro de ClinVar. Obtenga un ID de variante de /v1/search, luego obtenga sus detalles. Ideal para pipelines de genómica clínica y anotación de variantes, herramientas de enfermedades raras y asesoramiento genético, y paneles de investigación. Datos de NCBI ClinVar (dominio público). Esta es interpretación de variantes clínicas — distinta de las bases de datos de frecuencia alélica poblacional (como gnomAD) y de las bases de datos de proteínas/secuencias. Mantenga las tasas de solicitud moderadas según el uso justo de NCBI.

api.oanor.com/clinvar-api

ENA API

The European Nucleotide Archive (ENA) as an API, powered by EMBL-EBI — one of the three INSDC partners alongside NCBI GenBank and DDBJ, and the comprehensive public archive of the world's nucleotide sequence data. ENA holds raw sequencing reads, assembled and annotated genomes, individual sequences, biological samples and the studies behind them, for every domain of life — the backbone resource for genomics, microbiology, ecology, evolution and clinical research. This API gives a clean three-step workflow over that archive. First, /v1/taxon resolves an organism name (e.g. "Homo sapiens") to its NCBI taxon id, scientific name, taxonomic rank and full lineage — or looks a taxon up directly by id. Then /v1/search queries the archive for that taxon's records of a chosen type: genome assemblies (with assembly name, level and base count), sequencing runs (with platform, instrument and read counts), biological samples (with collection date and country), annotated sequences, read experiments, analyses, coding and non-coding sequences, and studies — by default including all descendant taxa, or restricted to the exact taxon. Finally /v1/record returns a summary for any ENA accession — assemblies (GCA_…), studies and projects (PRJ…), samples (SAM…/ERS…), sequencing runs (ERR…/SRR…) and sequences — with its title, data type, taxon, scientific name, base and sequence counts and public status. Ideal for bioinformatics pipelines, genome-data discovery, sequencing-metadata harvesting, biodiversity and metagenomics tooling, and research reproducibility. Taxon ids look like 9606 (human); accessions like GCA_000001405. Data from EMBL-EBI ENA, an INSDC archive, free to use.

api.oanor.com/ena-api

MGnify API

MGnify como API, impulsado por EMBL-EBI, el recurso gratuito más grande del mundo para el análisis y archivo de datos de secuenciación del microbioma, y la hermana metagenómica de PRIDE (proteómica) y MetaboLights (metabolómica). MGnify alberga decenas de miles de estudios públicos de metagenómica y metabarcoding que abarcan el microbioma intestinal humano, ambientes marinos y de agua dulce, suelos, aguas residuales, el entorno construido y comunidades asociadas a hospedadores. Busque estudios por palabra clave, obteniendo el acceso de MGnify de cada estudio (MGYS...), nombre, resumen, bioma, recuento de muestras y el BioProject de secuenciación de origen; lea los metadatos completos de un estudio, incluidos su nombre y resumen, clasificación del bioma, número de muestras, centro de envío, estado público, origen de los datos y fecha de última actualización; y navegue por el árbol de clasificación de biomas estilo GOLD, desde raíz:Asociado al hospedador:Humano:Sistema digestivo hasta raíz:Ambiental:Acuático:Marino, con recuentos de muestras y estudios por bioma, para descubrimiento por entorno. Ideal para investigación en microbioma y genómica ambiental, reutilización de conjuntos de datos y metaanálisis, pipelines de bioinformática y enseñanza. Los accesos de estudio tienen el formato MGYS00006862. Datos de EMBL-EBI MGnify.

api.oanor.com/mgnify-api

Cellosaurus API

Cellosaurus como API, impulsado por el SIB Instituto Suizo de Bioinformática — la enciclopedia de referencia de líneas celulares utilizadas en investigación biomédica. Con más de 150,000 entradas que abarcan líneas celulares cancerosas, hibridomas, células madre pluripotentes inducidas y líneas de cientos de especies, Cellosaurus es el catálogo autorizado que los investigadores utilizan para identificar y validar las líneas celulares detrás de experimentos publicados. Busque líneas celulares por nombre o palabra clave, obteniendo el accession de Cellosaurus de cada línea (CVCL_…), nombre, categoría, especie y enfermedad; y lea el registro completo de una línea celular: su nombre y sinónimos, categoría (por ejemplo, línea celular cancerosa, hibridoma, célula madre), especie con ID de taxonomía de NCBI, sexo, edad, la enfermedad de la que deriva con identificadores de ontología NCIt, el tejido o sitio anatómico de origen, su línea celular parental y el número de líneas hijas derivadas, el número de referencias bibliográficas y las múltiples referencias cruzadas (a ATCC, DSMZ, ECACC, Wikidata y más), páginas web relevantes y — críticamente para la reproducibilidad de la investigación — si la línea está marcada como PROBLEMÁTICA, lo que significa que ha sido mal identificada o contaminada cruzadamente, junto con las notas explicativas. Ideal para control de calidad en laboratorio y autenticación de líneas celulares, investigación biomédica y oncológica, curación de datos y comprobaciones de reproducibilidad. Los accessions se ven como CVCL_0030 (HeLa). Datos de Cellosaurus (CC-BY 4.0).

api.oanor.com/cellosaurus-api

AlphaFold API

The AlphaFold Protein Structure Database as an API, powered by EMBL-EBI and Google DeepMind. AlphaFold predicts the three-dimensional structure of a protein from its amino-acid sequence with experimental-level accuracy, and the database now covers over 200 million proteins — nearly every sequence in UniProt. Look up the AlphaFold model for any protein by its UniProt accession and get its gene and protein description, organism and sequence length, model version and creation date, the global confidence metric, the full amino-acid sequence, and direct download links to the predicted structure as mmCIF, PDB and BinaryCIF together with the Predicted Aligned Error (PAE) plot image and data; and read a protein's structural coverage — the AlphaFold predicted model(s) and any linked structures with their provider, model category, method and the UniProt residue range covered. Ideal for structural biology, drug discovery and target assessment, protein engineering, molecular visualisation and teaching. Proteins are identified by UniProt accession (for example P00520 or P38398). Data from the AlphaFold DB (CC-BY 4.0). For experimentally-determined 3D structures see the PDB API, for protein sequences and functional annotation the UniProt API, and for families & domains InterPro.

api.oanor.com/alphafold-api

Complex Portal API

The Complex Portal as an API, powered by EMBL-EBI — a manually curated, encyclopaedic database of stable macromolecular complexes: assemblies of two or more proteins (and sometimes nucleic acids, ligands or small molecules) that work together as a single functional unit, such as ribosomes, proteasomes, RNA and DNA polymerases, the spliceosome, respiratory-chain complexes and thousands more across many species. Search the complexes by keyword and optionally by organism, getting each complex's Complex Portal accession (CPX-…), name, organism, description and whether it is computationally predicted; read a complex's full curated record including its recommended and systematic names, synonyms, species, biological function, the participating subunits each with its molecule identifier (for example a UniProt accession) and stoichiometry, any associated ligands and diseases, the evidence type and cross-references to UniProt, Gene Ontology, Reactome, Wikidata and more; and pull just the subunit composition of a complex. Ideal for structural and systems biology, pathway and network analysis, protein-function research and bioinformatics pipelines. Complex accessions look like CPX-6036. Data from EMBL-EBI Complex Portal (IMEx consortium, CC-BY). For protein–protein interaction networks see the STRING API, for protein sequences UniProt, for biological pathways Reactome and for families & domains InterPro.

api.oanor.com/complexes-api

Rfam API

La base de datos Rfam de familias de ARN no codificante como API, impulsada por EMBL-EBI. Rfam agrupa ARN funcionales que comparten un origen evolutivo común en familias, cada una modelada por un modelo de covarianza construido a partir de una alineación semilla curada y una estructura secundaria. Busque las familias por nombre, descripción o tipo de ARN — riboswitches y otros elementos reguladores cis, ribozimas, familias de microARN, ARN ribosómicos, ARN de transferencia, ARN nucleares pequeños y ARN nucleolares pequeños, ARN largos no codificantes y repeticiones directas CRISPR — obteniendo el acceso Rfam de cada familia, nombre, descripción, tipo de ARN y curadores; lea el registro completo de una familia, incluyendo su descripción, clasificación por tipo de ARN, los curadores que la construyeron, el número de secuencias en sus alineaciones completas y semilla, la fuente de la estructura, el comentario del curador, el clan (grupo de familias relacionadas) al que pertenece y la versión de Rfam; y navegue por las familias según la clase de ARN. Ideal para biología del ARN, pipelines bioinformáticos, anotación de ARN no codificante, genómica comparativa y enseñanza. Los accesos de las familias se ven como RF00005 (ARN de transferencia). Datos de EMBL-EBI Rfam. Para familias y dominios de proteínas, consulte la API InterPro; para secuencias de proteínas, UniProt; para conjuntos de datos de proteómica, PRIDE; y para metabolómica, MetaboLights.

api.oanor.com/rfam-api

MetaboLights API

MetaboLights como API, impulsado por EMBL-EBI — el repositorio abierto más importante del mundo para experimentos de metabolómica (espectroscopia de RMN y espectrometría de masas) y un recurso hermano de PRIDE para proteómica. Busque estudios públicos de metabolómica por palabra clave (devolviendo el acceso, título, descripción y organismo de cada estudio); lea los metadatos completos de un estudio, incluido su resumen, estado, fechas de envío y publicación, descriptores de diseño del estudio, factores experimentales, los ensayos analíticos con su tipo de medición, tecnología y plataforma, los contribuyentes y sus roles, las publicaciones vinculadas con DOI e identificadores de PubMed, remitentes, recuento de muestras, URL de descarga FTP y licencia de datos; inspeccione el flujo de trabajo analítico — cada protocolo con su nombre, tipo, descripción y parámetros (recolección de muestras, extracción, cromatografía, espectroscopia de RMN/MS, transformación de datos e identificación de metabolitos); y enumere los organismos y partes de organismos estudiados con sus términos de ontología. Ideal para investigación en metabolómica y biología de sistemas, reutilización de conjuntos de datos y metaanálisis, tuberías bioinformáticas y herramientas que integran evidencia experimental. Los accesos de estudio se ven como MTBLS1. Datos de EMBL-EBI MetaboLights.

api.oanor.com/metabolights-api

PRIDE API

The PRIDE proteomics archive as an API, powered by the EMBL-EBI PRIDE Archive — the world's largest public repository of mass-spectrometry proteomics data and a founding member of ProteomeXchange. Search the public proteomics experiments by keyword (returning each project's accession, title, organisms, diseases and instruments); read a project's full metadata including its description, keywords, organisms and organism parts, mass-spectrometry instruments, software, the protein modifications identified, sample- and data-processing protocols, submitters, affiliations and the linked publication (DOI and PubMed); list a project's data files with their category, format, size and a direct download link; and explore facets — the diseases, organisms, instruments, experiment types, software and countries represented across matching projects — for discovery. Ideal for proteomics and systems-biology research, dataset reuse and meta-analysis, bioinformatics pipelines, and tools that integrate experimental evidence. Project accessions look like PXD000001. Data from EMBL-EBI.

api.oanor.com/pride-api

InterPro API

Protein families, domains and functional sites as an API, powered by the EBI InterPro database. InterPro classifies proteins into families and identifies the domains, repeats and important sites they contain, by combining the predictive signatures of many member databases (Pfam, SMART, PROSITE, CDD, PANTHER, SUPERFAMILY, NCBIfam and more) into a single integrated resource. Look up an InterPro entry — a family, domain, repeat, conserved/binding/active site or post-translational modification — with its description, Gene Ontology terms and the member-database signatures that define it; search entries by name and type; read a protein's metadata; and, most usefully, list the InterPro entries found on a protein together with their start–end positions, so you can see a protein's domain architecture. Ideal for protein annotation and function prediction, comparative genomics, structural-biology and bioinformatics pipelines, and research and teaching tools. Entry ids are IPR followed by six digits; protein ids are UniProt accessions. Data from EMBL-EBI.

api.oanor.com/interpro-api

Open Targets API

Drug target–disease associations as an API, powered by the Open Targets Platform. Open Targets integrates human genetics, genomics, transcriptomics, known drugs, animal models and the scientific literature to systematically score how strongly a target (gene/protein) is associated with a disease — the evidence that underpins modern drug discovery. Search across targets, diseases and drugs; read a target for its approved symbol, biotype, function, genomic location and UniProt ids together with the diseases it is most strongly associated with and their overall association scores; read a disease for its description, therapeutic areas and its top associated targets with scores; and read a drug for its modality, maximum clinical stage, trade names, synonyms and mechanisms of action. Ideal for drug-discovery and target-identification pipelines, therapeutic-area research, biomedical data science and pharma intelligence tools. Target ids are Ensembl gene ids, disease ids are EFO/MONDO/Orphanet ids, drug ids are ChEMBL ids. Data is open (CC0).

api.oanor.com/opentargets-api

KEGG API

KEGG分子数据库作为API,由官方KEGG REST服务提供支持。KEGG(京都基因与基因组百科全书)连接基因组、化学和疾病。获取任何KEGG条目并解析为JSON——代谢化合物、KEGG直系同源组(KO)、酶(EC编号)、反应、模块、药物、疾病、聚糖、基因或通路图;按名称搜索任何KEGG数据库;列出数据库的条目;在数据库之间交叉链接条目(基因到其通路、通路到其化合物、酶到其反应);以及将KEGG标识符与外部命名空间(NCBI Gene/Protein、UniProt、ChEBI、PubChem)相互转换。非常适合系统生物学和代谢组学流程、酶和直系同源映射、药物和疾病研究、基因到通路注释以及生物信息学标识符转换。KEGG ID以字母为前缀(C化合物、K直系同源、D药物、H疾病、M模块、R反应、G聚糖)或按生物体编码(hsa人类、eco大肠杆菌)。

api.oanor.com/kegg-api

gnomAD API

Population genetics as an API, powered by the Broad Institute's gnomAD (Genome Aggregation Database) — allele frequencies and gene constraint aggregated from over 800,000 human exomes and genomes. Look up a gene's constraint scores (pLI, LOEUF, observed vs expected loss-of-function, missense Z) and genomic location; get a variant's allele frequencies broken down by ancestry population (African/African-American, Admixed American, Ashkenazi Jewish, East Asian, Finnish, Non-Finnish European, South Asian, Middle Eastern…) across both genome and exome callsets, with rsIDs, homozygote counts and predicted consequence; search genes by symbol; read a transcript's constraint; and list the variants in a small genomic region. Supports GRCh38 and GRCh37 and the gnomAD v4/v3/v2 datasets. Ideal for clinical and population genetics, variant interpretation and prioritisation, rare-disease and pharmacogenomics research, and bioinformatics pipelines. Variant ids are chrom-pos-ref-alt.

api.oanor.com/gnomad-api

STRING API

The STRING protein–protein interaction database as an API — the curated and predicted network of functional associations between proteins, powered by the official STRING API. Resolve gene or protein names to STRING identifiers with annotations; get a protein's top interaction partners with a combined confidence score and per-channel evidence (experimental, curated databases, co-expression, text-mining, gene fusion, neighbourhood and co-occurrence); build the interaction network among a set of proteins as scored edges; run functional enrichment of a gene set over Gene Ontology, KEGG, Reactome, Pfam, InterPro and more with p-values and false-discovery rates; and score homology between proteins. Covers 12,000+ organisms (default human, NCBI taxon 9606). Ideal for systems-biology and network-biology pipelines, gene-set and pathway analysis, drug-target and disease-gene research, and bioinformatics dashboards.

api.oanor.com/string-api

Reactome API

La base de conocimiento de rutas Reactome como API — la base de datos abierta y revisada por pares de rutas y reacciones biológicas, impulsada por el Reactome ContentService oficial. Busque en el archivo curado de rutas, reacciones y moléculas; lea cualquier entidad por su ID estable de Reactome (una ruta, reacción, complejo o proteína: nombre, tipo, especie, compartimentos, resumen y bandera de enfermedad); enumere los eventos (sub-rutas y reacciones) contenidos en una ruta; enumere las moléculas que participan en una ruta o reacción con sus identificadores de referencia; obtenga las rutas de nivel superior para cualquier organismo modelo; asigne una proteína UniProt a las rutas en las que participa; y enumere las especies compatibles. Cubre humanos y más de 15 organismos modelo en metabolismo, transducción de señales, ciclo celular, sistema inmunológico, enfermedades y más. Ideal para pipelines de biología de sistemas y bioinformática, herramientas de enriquecimiento de rutas y dianas farmacológicas, aplicaciones de investigación biomédica, recursos educativos y chatbots de ciencias de la vida.

api.oanor.com/reactome-api

PDB API

The RCSB Protein Data Bank as an API — 3D macromolecular structures of proteins, nucleic acids and complexes, powered by the official RCSB PDB data and search services. Fetch a structure entry by its 4-character PDB id for its title, experimental method (X-ray, cryo-EM, NMR), resolution, keywords, deposit and release dates, authors, primary citation and entity & assembly counts; run full-text search across the whole archive returning matching PDB ids and the total hit count; read a polymer entity for its protein or nucleic-acid name, one-letter sequence, length, source organism, chains and linked UniProt ids; read a biological assembly for its oligomeric state, symmetry and chain & atom counts; list the ligands bound in a structure with their component ids and names; and look up any chemical component (ligand) by code for its formula, weight, SMILES and InChIKey. Ideal for structural-biology and drug-discovery tools, molecular viewers, bioinformatics pipelines, education apps and research dashboards.

api.oanor.com/pdb-api

Ensembl API

The Ensembl genome database as an API, powered by the official Ensembl REST service from EMBL-EBI. Look up any gene by symbol or Ensembl stable id for its biotype, genomic location, strand, description and transcripts; resolve any feature (gene, transcript, exon) by stable id; pull external database cross-references; fetch sequence variants by rsID with their alleles, most-severe consequence, minor-allele frequency, clinical significance and genomic mappings; list the genes, transcripts, exons, variations or repeats overlapping any genomic region; retrieve genomic, cDNA, CDS or protein sequences by id; and read genome-assembly metadata including the karyotype and chromosome lengths. Across human, mouse and 300+ vertebrate species. Ideal for bioinformatics pipelines, genome browsers and variant-annotation tools, genetics research apps, clinical-genomics dashboards and life-science chatbots.

api.oanor.com/ensembl-api

UniProt API

The UniProt protein knowledge base as an API, powered by the official UniProt REST service curated by EMBL-EBI, SIB and PIR. Look up any protein by its UniProt accession for protein and gene names, organism, length, mass, function, keywords, Gene Ontology (GO) terms and linked PDB 3D structures; run full-text protein searches filtered by organism (NCBI taxon id) and Swiss-Prot review status; fetch amino-acid sequences with FASTA, molecular weight and CRC64 checksum; list sequence features such as signal peptides, chains, domains, active and binding sites, modified residues and natural variants, with a by-type breakdown; resolve NCBI taxonomy nodes with their full lineage; and pull reference proteomes with protein counts and genome-assembly ids. Across all kingdoms of life, from human to bacteria. Ideal for bioinformatics pipelines, drug-discovery and proteomics tools, sequence-analysis dashboards, academic research apps and life-science chatbots.

api.oanor.com/uniprot-api