
Variant Call Format
The Variant Call Format (VCF) specifies the format of a text file used in bioinformatics for storing gene sequence variations. The format has been developed with the advent of large-scale genotyping and DNA sequencing projects, such as the 1000 Genomes Project. Existing formats for genetic data such as General feature format (GFF) stored all of the genetic data, much of which is redundant because it will be shared across the genomes. By using the variant call format only the variations need to be stored along with a reference genome.
- Comment
- enThe Variant Call Format (VCF) specifies the format of a text file used in bioinformatics for storing gene sequence variations. The format has been developed with the advent of large-scale genotyping and DNA sequencing projects, such as the 1000 Genomes Project. Existing formats for genetic data such as General feature format (GFF) stored all of the genetic data, much of which is redundant because it will be shared across the genomes. By using the variant call format only the variations need to be stored along with a reference genome.
- Depiction
- Developer
- index.html
- ExtendedFrom
- Tab-separated values
- Extension
- envcf
- Genre
- enGenomic sequence format
- Has abstract
- enThe Variant Call Format (VCF) specifies the format of a text file used in bioinformatics for storing gene sequence variations. The format has been developed with the advent of large-scale genotyping and DNA sequencing projects, such as the 1000 Genomes Project. Existing formats for genetic data such as General feature format (GFF) stored all of the genetic data, much of which is redundant because it will be shared across the genomes. By using the variant call format only the variations need to be stored along with a reference genome. The standard is currently in version 4.3, although the 1000 Genomes Project has developed its own specification for structural variations such as duplications, which are not easily accommodated into the existing schema. There is also a genomic VCF (gVCF) extended format, which includes additional information about "blocks" that match the reference and their qualities. A set of tools is also available for editing and manipulating the files.
- Is primary topic of
- Variant Call Format
- Label
- enVariant Call Format
- LatestReleaseDate
- 13 January 2021
- LatestReleaseVersion
- 4.30
- Link from a Wikipage to an external page
- samtools.github.io/hts-specs/
- vcftools.github.io/index.html
- spectrum.ieee.org/biomedical/diagnostics/the-race-to-build-a-search-engine-for-your-dna%7Ctitle
- vcftools.sourceforge.net/VCF-poster.pdf
- Link from a Wikipage to another Wikipage
- 1000 Genomes Project
- Alleles
- Bioinformatics
- Category:Biological sequence format
- DbSNP
- DNA sequencing
- FASTA format
- FASTQ format
- File:Binary BCF versus VCF format.png
- General feature format
- Gene sequence
- Genotyping
- GFF3
- Global Alliance for Genomics and Health (GA4GH)
- GVF format
- Human genetic variation
- Human genome
- IEEE Spectrum
- Indel
- Metadata
- SAM (file format)
- Single-nucleotide polymorphism
- Tab-separated values
- Name
- enVariant Call Format
- Open
- enYes
- SameAs
- 4xcjD
- Formato Variant Call
- m.0gfgm8y
- Q7915770
- Variant Call Format
- Variant Call Format
- Variant Call Format
- Variant Call Format
- Variant Call Format
- פורמט VCF
- Subject
- Category:Biological sequence format
- Thumbnail
- Url
- https://samtools.github.io/hts-specs/
- WasDerivedFrom
- Variant Call Format?oldid=1111129960&ns=0
- WikiPageLength
- 10657
- Wikipage page ID
- 30712001
- Wikipage revision ID
- 1111129960
- WikiPageUsesTemplate
- Template:Bioinformatics
- Template:Cite news
- Template:Infobox file format
- Template:Mono
- Template:Reflist
- Template:Short description
- Template:Start date and age