Id: | t2t/gene_models/ncbiRefSeq_110 |
Type: | gene_models |
Version: | 0 |
Summary: |
NCBI RefSeq 110, comprehensive gene annotation on the T2T-CHM13v2.0 assembly (gapless chromosomes, no alternate scaffolds). |
Description: |
The NCBI RefSeq project www.ncbi.nlm.nih.gov/refseq/ provides a curated, non-redundant set of reference sequences for genomes, transcripts, and proteins. RefSeq Release 110 represents the first comprehensive annotation of the telomere-to-telomere CHM13v2.0 human genome assembly, adding gene models in previously unresolved regions such as centromeres, rDNA arrays, and segmental duplications. This annotation expands the catalog of human gene content beyond GRCh38, while maintaining compatibility with widely used biomedical resources and clinical databases, including Gene, ClinVar, and dbSNP. Downloaded on 9/25/2025 https://hgdownload.soe.ucsc.edu/hubs/GCA/009/914/755/GCA_009914755.4/GCA_009914755.4.chromAlias.txt Processing Details: build.sh unzips the original file, replaces chromosome names with ucsc naming (e.g. chr1), rearranges chromosomes by conventional order (chr1, chr2,...) and gzips the resulting file. |
Labels: |
|
GCF_009914755.1_T2T-CHM13v2.0.110.20220412.ucsc.txt.gz
Format:
Transcript number | Protein coding transcript number | Gene number | Protein coding gene number | |
---|---|---|---|---|
Global | 125,270 | 65,506 | 57,682 | 20,514 |
chr1 | 11,328 | 6,225 | 5,350 | 2,045 |
chr2 | 8,691 | 4,594 | 4,011 | 1,324 |
chr3 | 7,340 | 4,274 | 3,068 | 1,057 |
chr4 | 5,215 | 2,565 | 2,557 | 779 |
chr5 | 5,590 | 2,874 | 2,697 | 907 |
chr6 | 6,361 | 3,178 | 3,100 | 1,032 |
chr7 | 6,154 | 3,283 | 2,897 | 1,042 |
chr8 | 4,774 | 2,429 | 2,311 | 757 |
chr9 | 5,123 | 2,567 | 2,338 | 785 |
chr10 | 5,364 | 3,009 | 2,257 | 745 |
chr11 | 6,854 | 4,097 | 3,005 | 1,294 |
chr12 | 5,993 | 3,420 | 2,595 | 1,015 |
chr13 | 3,015 | 1,127 | 1,779 | 346 |
chr14 | 4,300 | 2,205 | 2,227 | 843 |
chr15 | 4,281 | 2,069 | 2,082 | 604 |
chr16 | 4,507 | 2,627 | 2,041 | 853 |
chr17 | 5,420 | 3,302 | 2,512 | 1,160 |
chr18 | 2,301 | 1,217 | 1,032 | 283 |
chr19 | 5,966 | 4,021 | 2,543 | 1,409 |
chr20 | 3,074 | 1,644 | 1,401 | 542 |
chr21 | 1,679 | 696 | 981 | 229 |
chr22 | 2,692 | 1,508 | 1,320 | 525 |
chrX | 4,089 | 2,266 | 2,255 | 851 |
chrY | 5,159 | 309 | 1,352 | 105 |
Filename | Size | md5 |
---|---|---|
GCA_009914755.4.chromAlias.txt | 866.0 B | 3c7f3f0529a46d9da6fee6aee70a1fed |
GCF_009914755.1_T2T-CHM13v2.0.110.20220412.ncbiRefSeq.gtf.gz | 19.96 MB | d5fb63080182480fa9b8e13f27fcaa34 |
GCF_009914755.1_T2T-CHM13v2.0.110.20220412.ncbiRefSeq.gtf.gz.dvc | 144.0 B | d4fa7615a824b56b89b48b070df6b7f0 |
GCF_009914755.1_T2T-CHM13v2.0.110.20220412.ucsc.gtf.gz | 19.39 MB | 7c48479a1e94726a6dbc0528e5ef794f |
GCF_009914755.1_T2T-CHM13v2.0.110.20220412.ucsc.gtf.gz.dvc | 138.0 B | 037b6febecd8ce4090d792036cd28371 |
GCF_009914755.1_T2T-CHM13v2.0.110.20220412.ucsc.txt.gz | 5.95 MB | 4620eb5e0807c428ea82d0f0996ad931 |
GCF_009914755.1_T2T-CHM13v2.0.110.20220412.ucsc.txt.gz.dvc | 137.0 B | 043f50b6d9a53dadff071082149df80c |
build.sh | 277.0 B | 9e4c5060d77a4230252e42a9a6e51637 |
genemodel_prep.py | 1.4 KB | e319bc6aab00dd3ccb967e6d593c829f |
genomic_resource.yaml | 1.65 KB | 809609e2f480638651e5db9a3dcffa87 |
statistics/ |