Variant Call Format

Variant Call Format

The Variant Call Format (VCF) specifies the format of a text file used in bioinformatics for storing gene sequence variations. The format has been developed with the advent of large-scale genotyping and DNA sequencing projects, such as the 1000 Genomes Project. Existing formats for genetic data such as General feature format (GFF) stored all of the genetic data, much of which is redundant because it will be shared across the genomes. By using the variant call format only the variations need to be stored along with a reference genome.

Comment
enThe Variant Call Format (VCF) specifies the format of a text file used in bioinformatics for storing gene sequence variations. The format has been developed with the advent of large-scale genotyping and DNA sequencing projects, such as the 1000 Genomes Project. Existing formats for genetic data such as General feature format (GFF) stored all of the genetic data, much of which is redundant because it will be shared across the genomes. By using the variant call format only the variations need to be stored along with a reference genome.
Depiction
Binary BCF versus VCF format.png
Developer
index.html
ExtendedFrom
Tab-separated values
Extension
envcf
Genre
enGenomic sequence format
Has abstract
enThe Variant Call Format (VCF) specifies the format of a text file used in bioinformatics for storing gene sequence variations. The format has been developed with the advent of large-scale genotyping and DNA sequencing projects, such as the 1000 Genomes Project. Existing formats for genetic data such as General feature format (GFF) stored all of the genetic data, much of which is redundant because it will be shared across the genomes. By using the variant call format only the variations need to be stored along with a reference genome. The standard is currently in version 4.3, although the 1000 Genomes Project has developed its own specification for structural variations such as duplications, which are not easily accommodated into the existing schema. There is also a genomic VCF (gVCF) extended format, which includes additional information about "blocks" that match the reference and their qualities. A set of tools is also available for editing and manipulating the files.
Is primary topic of
Variant Call Format
Label
enVariant Call Format
LatestReleaseDate
13 January 2021
LatestReleaseVersion
4.30
Link from a Wikipage to an external page
samtools.github.io/hts-specs/
vcftools.github.io/index.html
spectrum.ieee.org/biomedical/diagnostics/the-race-to-build-a-search-engine-for-your-dna%7Ctitle
vcftools.sourceforge.net/VCF-poster.pdf
Link from a Wikipage to another Wikipage
1000 Genomes Project
Alleles
Bioinformatics
Category:Biological sequence format
DbSNP
DNA sequencing
FASTA format
FASTQ format
File:Binary BCF versus VCF format.png
General feature format
Gene sequence
Genotyping
GFF3
Global Alliance for Genomics and Health (GA4GH)
GVF format
Human genetic variation
Human genome
IEEE Spectrum
Indel
Metadata
SAM (file format)
Single-nucleotide polymorphism
Tab-separated values
Name
enVariant Call Format
Open
enYes
SameAs
4xcjD
Formato Variant Call
m.0gfgm8y
Q7915770
Variant Call Format
Variant Call Format
Variant Call Format
Variant Call Format
Variant Call Format
פורמט VCF
Subject
Category:Biological sequence format
Thumbnail
Binary BCF versus VCF format.png?width=300
Url
https://samtools.github.io/hts-specs/
WasDerivedFrom
Variant Call Format?oldid=1111129960&ns=0
WikiPageLength
10657
Wikipage page ID
30712001
Wikipage revision ID
1111129960
WikiPageUsesTemplate
Template:Bioinformatics
Template:Cite news
Template:Infobox file format
Template:Mono
Template:Reflist
Template:Short description
Template:Start date and age