| UniProt ID | THYG_HUMAN | |
|---|---|---|
| UniProt AC | P01266 | |
| Protein Name | Thyroglobulin {ECO:0000305} | |
| Gene Name | TG {ECO:0000312|HGNC:HGNC:11764} | |
| Organism | Homo sapiens (Human). | |
| Sequence Length | 2768 | |
| Subcellular Localization | Secreted. | |
| Protein Description | Precursor of the iodinated thyroid hormones thyroxine (T4) and triiodothyronine (T3). | |
| Protein Sequence | MALVLEIFTLLASICWVSANIFEYQVDAQPLRPCELQRETAFLKQADYVPQCAEDGSFQTVQCQNDGRSCWCVGANGSEVLGSRQPGRPVACLSFCQLQKQQILLSGYINSTDTSYLPQCQDSGDYAPVQCDVQQVQCWCVDAEGMEVYGTRQLGRPKRCPRSCEIRNRRLLHGVGDKSPPQCSAEGEFMPVQCKFVNTTDMMIFDLVHSYNRFPDAFVTFSSFQRRFPEVSGYCHCADSQGRELAETGLELLLDEIYDTIFAGLDLPSTFTETTLYRILQRRFLAVQSVISGRFRCPTKCEVERFTATSFGHPYVPSCRRNGDYQAVQCQTEGPCWCVDAQGKEMHGTRQQGEPPSCAEGQSCASERQQALSRLYFGTSGYFSQHDLFSSPEKRWASPRVARFATSCPPTIKELFVDSGLLRPMVEGQSQQFSVSENLLKEAIRAIFPSRGLARLALQFTTNPKRLQQNLFGGKFLVNVGQFNLSGALGTRGTFNFSQFFQQLGLASFLNGGRQEDLAKPLSVGLDSNSSTGTPEAAKKDGTMNKPTVGSFGFEINLQENQNALKFLASLLELPEFLLFLQHAISVPEDVARDLGDVMETVLSSQTCEQTPERLFVPSCTTEGSYEDVQCFSGECWCVNSWGKELPGSRVRGGQPRCPTDCEKQRARMQSLMGSQPAGSTLFVPACTSEGHFLPVQCFNSECYCVDAEGQAIPGTRSAIGKPKKCPTPCQLQSEQAFLRTVQALLSNSSMLPTLSDTYIPQCSTDGQWRQVQCNGPPEQVFELYQRWEAQNKGQDLTPAKLLVKIMSYREAASGNFSLFIQSLYEAGQQDVFPVLSQYPSLQDVPLAALEGKRPQPRENILLEPYLFWQILNGQLSQYPGSYSDFSTPLAHFDLRNCWCVDEAGQELEGMRSEPSKLPTCPGSCEEAKLRVLQFIRETEEIVSASNSSRFPLGESFLVAKGIRLRNEDLGLPPLFPPREAFAEQFLRGSDYAIRLAAQSTLSFYQRRRFSPDDSAGASALLRSGPYMPQCDAFGSWEPVQCHAGTGHCWCVDEKGGFIPGSLTARSLQIPQCPTTCEKSRTSGLLSSWKQARSQENPSPKDLFVPACLETGEYARLQASGAGTWCVDPASGEELRPGSSSSAQCPSLCNVLKSGVLSRRVSPGYVPACRAEDGGFSPVQCDQAQGSCWCVMDSGEEVPGTRVTGGQPACESPRCPLPFNASEVVGGTILCETISGPTGSAMQQCQLLCRQGSWSVFPPGPLICSLESGRWESQLPQPRACQRPQLWQTIQTQGHFQLQLPPGKMCSADYADLLQTFQVFILDELTARGFCQIQVKTFGTLVSIPVCNNSSVQVGCLTRERLGVNVTWKSRLEDIPVASLPDLHDIERALVGKDLLGRFTDLIQSGSFQLHLDSKTFPAETIRFLQGDHFGTSPRTWFGCSEGFYQVLTSEASQDGLGCVKCPEGSYSQDEECIPCPVGFYQEQAGSLACVPCPVGRTTISAGAFSQTHCVTDCQRNEAGLQCDQNGQYRASQKDRGSGKAFCVDGEGRRLPWWETEAPLEDSQCLMMQKFEKVPESKVIFDANAPVAVRSKVPDSEFPVMQCLTDCTEDEACSFFTVSTTEPEISCDFYAWTSDNVACMTSDQKRDALGNSKATSFGSLRCQVKVRSHGQDSPAVYLKKGQGSTTTLQKRFEPTGFQNMLSGLYNPIVFSASGANLTDAHLFCLLACDRDLCCDGFVLTQVQGGAIICGLLSSPSVLLCNVKDWMDPSEAWANATCPGVTYDQESHQVILRLGDQEFIKSLTPLEGTQDTFTNFQQVYLWKDSDMGSRPESMGCRKDTVPRPASPTEAGLTTELFSPVDLNQVIVNGNQSLSSQKHWLFKHLFSAQQANLWCLSRCVQEHSFCQLAEITESASLYFTCTLYPEAQVCDDIMESNAQGCRLILPQMPKALFRKKVILEDKVKNFYTRLPFQKLMGISIR
NKVPMSEKSISNGFFECERRCDADPCCTGFGFLNVSQLKGGEVTCLTLNSLGIQMCSEENGGAWRILDCGSPDIEVHTYPFGWYQKPIAQNNAPSFCPLVVLPSLTEKVSLDSWQSLALSSVVVDPSIRHFDVAHVSTAATSNFSAVRDLCLSECSQHEACLITTLQTQPGAVRCMFYADTQSCTHSLQGQNCRLLLREEATHIYRKPGISLLSYEASVPSVPISTHGRLLGRSQAIQVGTSWKQVDQFLGVPYAAPPLAERRFQAPEPLNWTGSWDASKPRASCWQPGTRTSTSPGVSEDCLYLNVFIPQNVAPNASVLVFFHNTMDREESEGWPAIDGSFLAAVGNLIVVTASYRVGVFGFLSSGSGEVSGNWGLLDQVAALTWVQTHIRGFGGDPRRVSLAADRGGADVASIHLLTARATNSQLFRRAVLMGGSALSPAAVISHERAQQQAIALAKEVSCPMSSSQEVVSCLRQKPANVLNDAQTKLLAVSGPFHYWGPVIDGHFLREPPARALKRSLWVEVDLLIGSSQDDGLINRAKAVKQFEESRGRTSSKTAFYQALQNSLGGEDSDARVEAAATWYYSLEHSTDDYASFSRALENATRDYFIICPIIDMASAWAKRARGNVFMYHAPENYGHGSLELLADVQFALGLPFYPAYEGQFSLEEKSLSLKIMQYFSHFIRSGNPNYPYEFSRKVPTFATPWPDFVPRAGGENYKEFSELLPNRQGLKKADCSFWSKYISSLKTSADGAKGGQSAESEEEELTAGSGLREDLLSLQEPGSKTYSK | |

Overview of Protein Modification Sites with Functional and Structural Information

* ASA = Accessible Surface Area

| Locations | Modification | Substrate Peptides & Secondary Structure | ASA (%) | Reference | Orthologous Protein Cluster |
|---|---|---|---|---|---|
| 24 | Iodination | VSANIFEYQVDAQPL HHCCCEEEECCCCCC | 11.77 | 2760035 | |
| 24 | Other | VSANIFEYQVDAQPL HHCCCEEEECCCCCC | 11.77 | 2760035 | |
| 24 | Sulfation | VSANIFEYQVDAQPL HHCCCEEEECCCCCC | 11.77 | - | |
| 76 | N-linked_Glycosylation | SCWCVGANGSEVLGS EEEEEECCCCEEECC | 48.90 | 32025030 | |
| 108 | Iodination | QQILLSGYINSTDTS CHHHHHCCCCCCCCC | 8.11 | 32025030 | |
| 110 | N-linked_Glycosylation | ILLSGYINSTDTSYL HHHHCCCCCCCCCCC | 30.30 | 8530385 | |
| 110 | N-linked_Glycosylation | ILLSGYINSTDTSYL HHHHCCCCCCCCCCC | 30.30 | 32025030 | |
| 149 | Iodination | DAEGMEVYGTRQLGR CCCCCEEEEECCCCC | 10.24 | 2760035 | |
| 198 | N-linked_Glycosylation | PVQCKFVNTTDMMIF CEECEECCCCCCHHH | 40.15 | 32025030 | |
| 199 | Phosphorylation | VQCKFVNTTDMMIFD EECEECCCCCCHHHH | 20.90 | 22210691 | |
| 200 | Phosphorylation | QCKFVNTTDMMIFDL ECEECCCCCCHHHHH | 19.38 | 22210691 | |
| 234 | Iodination | RFPEVSGYCHCADSQ HCCCCCCEEEECCHH | 3.29 | 32025030 | |
| 234 | Phosphorylation | RFPEVSGYCHCADSQ HCCCCCCEEEECCHH | 3.29 | - | |
| 240 | Phosphorylation | GYCHCADSQGRELAE CEEEECCHHCHHHHH | 18.93 | - | |
| 258 | Iodination | ELLLDEIYDTIFAGL HHHHHHHHHHHHCCC | 12.65 | 2760035 | |
| 376 | Phosphorylation | QQALSRLYFGTSGYF HHHHHHHHHCCCCCC | 10.00 | - | |
| 484 | N-linked_Glycosylation | LVNVGQFNLSGALGT EEEEEEEECCCCCCC | 25.96 | 32025030 | |
| 529 | N-linked_Glycosylation | LSVGLDSNSSTGTPE CCCCCCCCCCCCCHH | 39.38 | 8615697 | |
| 704 | Iodination | QCFNSECYCVDAEGQ EEECCEEEEECCCCC | 7.46 | 2760035 | |
| 704 | Other | QCFNSECYCVDAEGQ EEECCEEEEECCCCC | 7.46 | 2760035 | |
| 748 | N-linked_Glycosylation | TVQALLSNSSMLPTL HHHHHHHCCCCCCCC | 37.77 | 8615697 | |
| 785 | Iodination | PEQVFELYQRWEAQN HHHHHHHHHHHHHHH | 6.71 | 2760035 | |
| 816 | N-linked_Glycosylation | YREAASGNFSLFIQS HHHHHCCCHHHHHHH | 22.34 | 9287346 | |
| 866 | Iodination | ENILLEPYLFWQILN CCCCCCHHHHHHHHC | 12.73 | 2760035 | |
| 883 | Iodination | LSQYPGSYSDFSTPL HHCCCCCCCCCCCCC | 20.41 | 2760035 | |
| 939 | Phosphorylation | VLQFIRETEEIVSAS HHHHHHHHHHHHHCC | 29.12 | - | |
| 947 | N-linked_Glycosylation | EEIVSASNSSRFPLG HHHHHCCCCCCCCCC | 43.66 | 9287346 | |
| 992 | Iodination | QFLRGSDYAIRLAAQ HHHCCCHHHHHHHHH | 12.96 | 2760035 | |
| 1005 | Phosphorylation | AQSTLSFYQRRRFSP HHHHHHHHHHCCCCC | 9.57 | - | |
| 1011 | Phosphorylation | FYQRRRFSPDDSAGA HHHHCCCCCCCCCHH | 26.13 | 22210691 | |
| 1019 | Phosphorylation | PDDSAGASALLRSGP CCCCCHHHHHHHCCC | 20.66 | 24719451 | |
| 1082 | Phosphorylation | TTCEKSRTSGLLSSW CCCCHHCCCCCHHHH | 33.95 | 26437602 | |
| 1083 | Phosphorylation | TCEKSRTSGLLSSWK CCCHHCCCCCHHHHH | 26.74 | 26437602 | |
| 1087 | Phosphorylation | SRTSGLLSSWKQARS HCCCCCHHHHHHHHC | 38.91 | 26437602 | |
| 1088 | Phosphorylation | RTSGLLSSWKQARSQ CCCCCHHHHHHHHCC | 37.96 | 26437602 | |
| 1154 | Phosphorylation | SLCNVLKSGVLSRRV HHHHHHHHCCCCCCC | 29.94 | 30576142 | |
| 1158 | Phosphorylation | VLKSGVLSRRVSPGY HHHHCCCCCCCCCCC | 18.68 | 24719451 | |
| 1165 | Phosphorylation | SRRVSPGYVPACRAE CCCCCCCCCCCEECC | 13.47 | - | |
| 1220 | N-linked_Glycosylation | PRCPLPFNASEVVGG CCCCCCCCHHHEECC | 40.39 | 32025030 | |
| 1310 | Thyroxine | GKMCSADYADLLQTF CCCCCCCHHHHHHHH | 11.17 | - | |
| 1310 | Iodination | GKMCSADYADLLQTF CCCCCCCHHHHHHHH | 11.17 | 2760035 | |
| 1310 | Other | GKMCSADYADLLQTF CCCCCCCHHHHHHHH | 11.17 | 2760035 | |
| 1348 | N-linked_Glycosylation | LVSIPVCNNSSVQVG EEEEEECCCCCEEEE | 52.21 | 8615697 | |
| 1349 | N-linked_Glycosylation | VSIPVCNNSSVQVGC EEEEECCCCCEEEEE | 30.45 | 32025030 | |
| 1365 | N-linked_Glycosylation | TRERLGVNVTWKSRL EHHHCCCCEEECHHC | 24.57 | 32025030 | |
| 1421 | Phosphorylation | SKTFPAETIRFLQGD CCCCCHHHHHHHCCC | 21.40 | - | |
| 1449 | Phosphorylation | EGFYQVLTSEASQDG CCCHHHHCCCCCCCC | 26.18 | 27251275 | |
| 1450 | Phosphorylation | GFYQVLTSEASQDGL CCHHHHCCCCCCCCC | 27.31 | 27251275 | |
| 1453 | Phosphorylation | QVLTSEASQDGLGCV HHHCCCCCCCCCCEE | 25.42 | 27251275 | |
| 1467 | Iodination | VKCPEGSYSQDEECI EECCCCCCCCCCCCE | 22.62 | 2760035 | |
| 1498 (in isoform 2) | Phosphorylation | - | 23.66 | 17929957 | |
| 1508 (in isoform 2) | Phosphorylation | - | 18.95 | 17929957 | |
| 1529 | Phosphorylation | QCDQNGQYRASQKDR CCCCCCCEECCCCCC | 15.00 | - | |
| 1577 | Phosphorylation | KFEKVPESKVIFDAN CCCCCCCCCEEEECC | 26.59 | - | |
| 1591 | Phosphorylation | NAPVAVRSKVPDSEF CCCEEEECCCCCCCC | 31.26 | - | |
| 1652 | Phosphorylation | KRDALGNSKATSFGS HHHHCCCCCCCCCCC | 23.13 | 18452278 | |
| 1716 | N-linked_Glycosylation | VFSASGANLTDAHLF EEECCCCCCCHHHHH | 47.18 | 32025030 | |
| 1774 | N-linked_Glycosylation | DPSEAWANATCPGVT CHHHHHHCCCCCCCC | 25.69 | 32025030 | |
| 1839 | Phosphorylation | SMGCRKDTVPRPASP HHCCCCCCCCCCCCC | 35.13 | 24905233 | |
| 1869 | N-linked_Glycosylation | NQVIVNGNQSLSSQK CCEEECCCCCCHHCH | 24.06 | 32025030 | |
| 2013 | N-linked_Glycosylation | CTGFGFLNVSQLKGG CCCEEEEEHHHCCCC | 29.18 | 32025030 | |
| 2122 | N-linked_Glycosylation | VSTAATSNFSAVRDL CCCCCCCCCHHHHHH | 30.45 | 32025030 | |
| 2184 | Iodination | REEATHIYRKPGISL HHHHHHHHCCCCEEE | 12.65 | 2760035 | |
| 2184 | Phosphorylation | REEATHIYRKPGISL HHHHHHHHCCCCEEE | 12.65 | - | |
| 2250 | N-linked_Glycosylation | FQAPEPLNWTGSWDA CCCCCCCCCCCCCCC | 45.00 | 32025030 | |
| 2295 | N-linked_Glycosylation | IPQNVAPNASVLVFF CCCCCCCCCEEEEEE | 34.80 | 32025030 | |
| 2467 | Phosphorylation | NVLNDAQTKLLAVSG HHCCHHHHHHHEECC | 25.94 | - | |
| 2510 | Phosphorylation | EVDLLIGSSQDDGLI EEEEECCCCCCCCHH | 20.17 | 23828894 | |
| 2540 | Iodination | TSSKTAFYQALQNSL CCHHHHHHHHHHHHH | 7.16 | 32025030 | |
| 2573 | Thyroxine | LEHSTDDYASFSRAL CCCCCCCHHHHHHHH | 13.62 | - | |
| 2573 | Iodination | LEHSTDDYASFSRAL CCCCCCCHHHHHHHH | 13.62 | 2760035 | |
| 2573 | Other | LEHSTDDYASFSRAL CCCCCCCHHHHHHHH | 13.62 | 2760035 | |
| 2575 | Phosphorylation | HSTDDYASFSRALEN CCCCCHHHHHHHHHH | 19.95 | 24719451 | |
| 2582 | N-linked_Glycosylation | SFSRALENATRDYFI HHHHHHHHHCCCEEE | 47.68 | 32025030 | |
| 2587 | Thyroxine | LENATRDYFIICPII HHHHCCCEEEEEHHH | 8.06 | - | |
| 2587 | Iodination | LENATRDYFIICPII HHHHCCCEEEEEHHH | 8.06 | 2760035 | |
| 2617 | Iodination | MYHAPENYGHGSLEL EEECCCCCCCCCHHH | 14.61 | 2760035 | |
| 2650 | Phosphorylation | QFSLEEKSLSLKIMQ CCCCCHHHHHHHHHH | 26.86 | 24719451 | |
| 2652 | Phosphorylation | SLEEKSLSLKIMQYF CCCHHHHHHHHHHHH | 34.41 | 24719451 | |
| 2658 | Phosphorylation | LSLKIMQYFSHFIRS HHHHHHHHHHHHHHC | 6.94 | 25884760 | |
| 2670 | Phosphorylation | IRSGNPNYPYEFSRK HHCCCCCCCCCCCCC | 14.40 | 25884760 | |
| 2697 | Iodination | PRAGGENYKEFSELL CCCCCCCHHHHHHHC | 13.95 | 2760035 | |
| 2721 | Phosphorylation | DCSFWSKYISSLKTS CHHHHHHHHHHHCCC | 10.66 | 22817900 | |
| 2723 | Phosphorylation | SFWSKYISSLKTSAD HHHHHHHHHHCCCCC | 26.91 | 24719451 | |
| 2724 | Phosphorylation | FWSKYISSLKTSADG HHHHHHHHHCCCCCC | 25.07 | 29083192 | |
| 2728 | Phosphorylation | YISSLKTSADGAKGG HHHHHCCCCCCCCCC | 23.83 | 19664994 | |
| 2737 | Phosphorylation | DGAKGGQSAESEEEE CCCCCCCCCCCHHHH | 36.96 | 27251275 | |
| 2740 | Phosphorylation | KGGQSAESEEEELTA CCCCCCCCHHHHHHC | 50.45 | 27251275 | |
| 2749 | O-linked_Glycosylation | EEELTAGSGLREDLL HHHHHCCCCHHHHHH | 31.99 | 16679516 | |
| 2766 | Triiodothyronine | QEPGSKTYSK----- CCCCCCCCCC----- | 20.42 | - | |
| 2766 | Iodination | QEPGSKTYSK----- CCCCCCCCCC----- | 20.42 | 2760035 | |
| 2766 | Other | QEPGSKTYSK----- CCCCCCCCCC----- | 20.42 | 2760035 |
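
The Substrate Peptides column above appears to hold a 15-residue window centred on the modified residue (seven residues on each side), with `-` padding where the window runs past the C-terminus, as in the rows for position 2766. The following is a minimal Python sketch under that assumption. The function name `site_window`, its parameters, and the `A` filler residues are hypothetical and are not taken from the database. Padding at the N-terminus is likewise an assumption, since no row in the table shows it.

```python
def site_window(sequence: str, position: int, flank: int = 7, pad: str = "-") -> str:
    """Return the (2 * flank + 1)-residue window centred on a 1-based position.

    Positions that fall outside the sequence are filled with `pad`.
    """
    if not 1 <= position <= len(sequence):
        raise ValueError(f"position {position} is outside 1..{len(sequence)}")
    start = position - 1 - flank   # 0-based start index, may be negative
    end = position + flank         # 0-based exclusive end, may exceed the length
    left_pad = pad * max(0, -start)
    right_pad = pad * max(0, end - len(sequence))
    return left_pad + sequence[max(0, start):min(len(sequence), end)] + right_pad


# First 31 residues of the Protein Sequence field (enough to cover Tyr-24).
N_TERMINAL_PREFIX = "MALVLEIFTLLASICWVSANIFEYQVDAQPL"

# Placeholder sequence of length 2768: "A" filler (NOT real residues)
# followed by the last 10 residues of the Protein Sequence field.
tail_demo = "A" * 2758 + "QEPGSKTYSK"

print(site_window(N_TERMINAL_PREFIX, 24))
print(site_window(tail_demo, 2766))
```

Under these assumptions, the two calls reproduce the peptides listed for positions 24 and 2766. With the full 2768-residue sequence in place of the placeholders, the same function can be used to check any other row.
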

| Modified Location | Modified Residue | Modification | Type of Upstream Proteins | Gene Name of Upstream Proteins | UniProt AC of Upstream Proteins | Sources |
|---|---|---|---|---|---|---|

No upstream regulatory protein records are available for THYG_HUMAN.


| Modified Location | Modified Residue | Modification | Function | Reference |
|---|---|---|---|---|

No descriptions of PTM sites are available for THYG_HUMAN.


* Distance = the distance between the SAP position and the PTM site.

| Modified Location | Modification | Variant Position (Distance <= 10) | Residue Change | SAP | Related Disease | Reference |
|---|---|---|---|---|---|---|

No SNP-PTM records are available for THYG_HUMAN.

| Interacting Protein | Gene Name | Interaction Type | PPI Reference | Domain-Domain Interactions |
|---|---|---|---|---|
| THYG_HUMAN | TG | physical | 11294872 | |

| N-linked Glycosylation | |
| Reference | PubMed |
| "Glycosylation in human thyroglobulin: location of the N-linked oligosaccharide units and comparison with bovine thyroglobulin."; Yang S.X., Pollock H.G., Rawitch A.B.; Arch. Biochem. Biophys. 327:61-70(1996). Cited for: PARTIAL PROTEIN SEQUENCE, GLYCOSYLATION AT ASN-76; ASN-198; ASN-484; ASN-529; ASN-748; ASN-816; ASN-947; ASN-1220; ASN-1348; ASN-1349; ASN-1365; ASN-1716; ASN-1774; ASN-2013; ASN-2250; ASN-2295 AND ASN-2582, AND ABSENCE OF GLYCOSYLATION AT ASN-110; ASN-496; ASN-1869 AND ASN-2122. | |
| O-linked Glycosylation | |
| Reference | PubMed |
| "A single chondroitin 6-sulfate oligosaccharide unit at Ser-2730 of human thyroglobulin enhances hormone formation and limits proteolytic accessibility at the carboxyl terminus. Potential insights into thyroid homeostasis and autoimmunity."; Conte M., Arcaro A., D'Angelo D., Gnata A., Mamone G., Ferranti P., Formisano S., Gentile F.; J. Biol. Chem. 281:22200-22211(2006). Cited for: GLYCOSYLATION AT SER-2749. | |