Genomic data has been extended to include 36 human IGK samples from Engelbrecht, Rodriguez, Shields et al., 2024. We have replaced the 22 human IGL samples published in Gibson, Rodriguez, Shields et al., 2022 with 200 that have been processed with more accurate sequencing and pipelines than were available for that study. The IGL and… Continue reading Genomic data extended
Read support for genomic alleles
The annotation file for a genomic sample can be found by clicking on the folder icon in the VDJbase Samples tab. This file lists the alleles that were determined from the sample. In the Genes tab of VDJbase, the ‘Max cov sample’ column shows, for each allele, the sample in which it was found with… Continue reading Read support for genomic alleles
OGRDB Updated
OGRDB has been updated to use binomial (latin) species names instead of colloquial names. The former names can still be used in calls to fetch germline sets, which I hope will prevent automated scripts from breaking, but if you do have problems please try substituting, for example, “Homo sapiens” for “Human”. This turned out to… Continue reading OGRDB Updated
Collaboration Opportunities
OGRDB and VDJbase are part of the AIRR Knowledge Commons project – bringing together immune repertoires from the AIRR Data Commons, epitopes from IEDB, receptor germline analysis from OGRDB and VDJbase with other emerging datasets. We are actively seeking collaborators whose projects will drive use cases and demonstrate the utility of bringing these various sources… Continue reading Collaboration Opportunities
Novel Allele Submission Process Simplified
Following an IARC decision in December 2023, the submission process for inferred novel alleles has been simplified. Previously, it was necessary to deposit the sequence of each inferred allele in GenBank or ENA, and it was usually necessary to accompany this with an extracted set of reads from the repertoire that provided explicit support for… Continue reading Novel Allele Submission Process Simplified
Germline Databases, or adventures into the allelic underworld
If you are interested in receptor germlines, you might enjoy this On-AIRR podcast with Corey Watson and William Lees, hosted by Ulrik Stervbo and Zhaoqing Ding. “In this episode we talk about the recent work by the Germline Database Working Group of the AIRR-Community. The accuracy of V and J gene segment assignment improves with… Continue reading Germline Databases, or adventures into the allelic underworld
Genomic databases updated
We have updated the genomic databases to follow the same project/individual/sample structure that is used for AIRR-seq databases – previously we only structured genomic databases into project/individual. This change makes it easier to handle datasets that have more than one analysis of the same individual – as can be seen, for example, in the human… Continue reading Genomic databases updated
Human IG germline sets now available
Germline sets for human IGH, IGK and IGL are now available for download. They are described in a forthcoming publication, “AIRR-C Human IG Reference Sets: curated sets of immunoglobulin heavy and light chain germline genes”, currently available from Biorxiv as a preprint. Our aim with these sets is to provide high-quality, evidenced collections that are… Continue reading Human IG germline sets now available
Human genomic IGH data now available
A set of ~150 human IGH loci from diverse backgrounds is now available. The data is taken from Rodriguez, Safonova, Silver, Shields, et al. 2023. ‘Genetic Variation in the Immunoglobulin Heavy Chain Locus Shapes the Human Antibody Repertoire’. The data presented here was taken from IGenotyper haplotyped assemblies, processed by the VDJbase automated pipeline. Analysis… Continue reading Human genomic IGH data now available
Downloading germline sets from the command line or API
In this post, we’re going to look briefly at the different ways that you can download germline sets from the command line, or from code. We want to make germline sets on OGRDB as easy to work with as possible, and making them available for download conveniently is an important part of that, so that… Continue reading Downloading germline sets from the command line or API