UniProt ID | THYG_HUMAN | |
---|---|---|
UniProt AC | P01266 | |
Protein Name | Thyroglobulin {ECO:0000305} | |
Gene Name | TG {ECO:0000312|HGNC:HGNC:11764} | |
Organism | Homo sapiens (Human). | |
Sequence Length | 2768 | |
Subcellular Localization | Secreted. | |
Protein Description | Precursor of the iodinated thyroid hormones thyroxine (T4) and triiodothyronine (T3).. | |
Protein Sequence | MALVLEIFTLLASICWVSANIFEYQVDAQPLRPCELQRETAFLKQADYVPQCAEDGSFQTVQCQNDGRSCWCVGANGSEVLGSRQPGRPVACLSFCQLQKQQILLSGYINSTDTSYLPQCQDSGDYAPVQCDVQQVQCWCVDAEGMEVYGTRQLGRPKRCPRSCEIRNRRLLHGVGDKSPPQCSAEGEFMPVQCKFVNTTDMMIFDLVHSYNRFPDAFVTFSSFQRRFPEVSGYCHCADSQGRELAETGLELLLDEIYDTIFAGLDLPSTFTETTLYRILQRRFLAVQSVISGRFRCPTKCEVERFTATSFGHPYVPSCRRNGDYQAVQCQTEGPCWCVDAQGKEMHGTRQQGEPPSCAEGQSCASERQQALSRLYFGTSGYFSQHDLFSSPEKRWASPRVARFATSCPPTIKELFVDSGLLRPMVEGQSQQFSVSENLLKEAIRAIFPSRGLARLALQFTTNPKRLQQNLFGGKFLVNVGQFNLSGALGTRGTFNFSQFFQQLGLASFLNGGRQEDLAKPLSVGLDSNSSTGTPEAAKKDGTMNKPTVGSFGFEINLQENQNALKFLASLLELPEFLLFLQHAISVPEDVARDLGDVMETVLSSQTCEQTPERLFVPSCTTEGSYEDVQCFSGECWCVNSWGKELPGSRVRGGQPRCPTDCEKQRARMQSLMGSQPAGSTLFVPACTSEGHFLPVQCFNSECYCVDAEGQAIPGTRSAIGKPKKCPTPCQLQSEQAFLRTVQALLSNSSMLPTLSDTYIPQCSTDGQWRQVQCNGPPEQVFELYQRWEAQNKGQDLTPAKLLVKIMSYREAASGNFSLFIQSLYEAGQQDVFPVLSQYPSLQDVPLAALEGKRPQPRENILLEPYLFWQILNGQLSQYPGSYSDFSTPLAHFDLRNCWCVDEAGQELEGMRSEPSKLPTCPGSCEEAKLRVLQFIRETEEIVSASNSSRFPLGESFLVAKGIRLRNEDLGLPPLFPPREAFAEQFLRGSDYAIRLAAQSTLSFYQRRRFSPDDSAGASALLRSGPYMPQCDAFGSWEPVQCHAGTGHCWCVDEKGGFIPGSLTARSLQIPQCPTTCEKSRTSGLLSSWKQARSQENPSPKDLFVPACLETGEYARLQASGAGTWCVDPASGEELRPGSSSSAQCPSLCNVLKSGVLSRRVSPGYVPACRAEDGGFSPVQCDQAQGSCWCVMDSGEEVPGTRVTGGQPACESPRCPLPFNASEVVGGTILCETISGPTGSAMQQCQLLCRQGSWSVFPPGPLICSLESGRWESQLPQPRACQRPQLWQTIQTQGHFQLQLPPGKMCSADYADLLQTFQVFILDELTARGFCQIQVKTFGTLVSIPVCNNSSVQVGCLTRERLGVNVTWKSRLEDIPVASLPDLHDIERALVGKDLLGRFTDLIQSGSFQLHLDSKTFPAETIRFLQGDHFGTSPRTWFGCSEGFYQVLTSEASQDGLGCVKCPEGSYSQDEECIPCPVGFYQEQAGSLACVPCPVGRTTISAGAFSQTHCVTDCQRNEAGLQCDQNGQYRASQKDRGSGKAFCVDGEGRRLPWWETEAPLEDSQCLMMQKFEKVPESKVIFDANAPVAVRSKVPDSEFPVMQCLTDCTEDEACSFFTVSTTEPEISCDFYAWTSDNVACMTSDQKRDALGNSKATSFGSLRCQVKVRSHGQDSPAVYLKKGQGSTTTLQKRFEPTGFQNMLSGLYNPIVFSASGANLTDAHLFCLLACDRDLCCDGFVLTQVQGGAIICGLLSSPSVLLCNVKDWMDPSEAWANATCPGVTYDQESHQVILRLGDQEFIKSLTPLEGTQDTFTNFQQVYLWKDSDMGSRPESMGCRKDTVPRPASPTEAGLTTELFSPVDLNQVIVNGNQSLSSQKHWLFKHLFSAQQANLWCLSRCVQEHSFCQLAEITESASLYFTCTLYPEAQVCDDIMESNAQGCRLILPQMPKALFRKKVILEDKVKNFYTRLPFQKLMGISIRNKVPMSEKSISNGFFECERRCDADPCCTGFGFLNVSQLKGGEVTCLTLNSLGIQMCSEENGGAWRILDCGSPDIEVHTYPFGWYQKPIAQNNAPSFCPLVVLPSLTEKVSLDSWQSLALSSVVVDPSIRHFDVAHVSTAATSNFSAVRDLCLSECSQHEACLITTLQTQPGAVRCMFYADTQSCTHSLQGQNCRLLLREEATHIYRKPGISLLSYEASVPSVPISTHGRLLGRSQAIQVGTSWKQVDQFLGVPYAAPPLAERRFQAPEPLNWTGSWDASKPRASCWQPGTRTSTSPGVSEDCLYLNVFIPQNVAPNASVLVFFHNTMDREESEGWPAIDGSFLAAVGNLIVVTASYRVGVFGFLSSGSGEVSGNWGLLDQVAALTWVQTHIRGFGGDPRRVSLAADRGGADVASIHLLTARATNSQLFRRAVLMGGSALSPAAVISHERAQQQAIALAKEVSCPMSSSQEVVSCLRQKPANVLNDAQTKLLAVSGPFHYWGPVIDGHFLREPPARALKRSLWVEVDLLIGSSQDDGLINRAKAVKQFEESRGRTSSKTAFYQALQNSLGGEDSDARVEAAATWYYSLEHSTDDYASFSRALENATRDYFIICPIIDMASAWAKRARGNVFMYHAPENYGHGSLELLADVQFALGLPFYPAYEGQFSLEEKSLSLKIMQYFSHFIRSGNPNYPYEFSRKVPTFATPWPDFVPRAGGENYKEFSELLPNRQGLKKADCSFWSKYISSLKTSADGAKGGQSAESEEEELTAGSGLREDLLSLQEPGSKTYSK | |
Overview of Protein Modification Sites with Functional and Structural Information | ||
* ASA = Accessible Surface Area
Locations | Modification | Substrate Peptides & Secondary Structure |
ASA (%) | Reference | Orthologous Protein Cluster |
---|---|---|---|---|---|
24 | Iodination | VSANIFEYQVDAQPL HHCCCEEEECCCCCC | 11.77 | 2760035 | |
24 | Other | VSANIFEYQVDAQPL HHCCCEEEECCCCCC | 11.77 | 2760035 | |
24 | Sulfation | VSANIFEYQVDAQPL HHCCCEEEECCCCCC | 11.77 | - | |
24 | Sulfation | VSANIFEYQVDAQPL HHCCCEEEECCCCCC | 11.77 | - | |
76 | N-linked_Glycosylation | SCWCVGANGSEVLGS EEEEEECCCCEEECC | 48.90 | 32025030 | |
76 | N-linked_Glycosylation | SCWCVGANGSEVLGS EEEEEECCCCEEECC | 48.90 | 32025030 | |
108 | Iodination | QQILLSGYINSTDTS CHHHHHCCCCCCCCC | 8.11 | 32025030 | |
110 | N-linked_Glycosylation | ILLSGYINSTDTSYL HHHHCCCCCCCCCCC | 30.30 | 8530385 | |
110 | N-linked_Glycosylation | ILLSGYINSTDTSYL HHHHCCCCCCCCCCC | 30.30 | 32025030 | |
149 | Iodination | DAEGMEVYGTRQLGR CCCCCEEEEECCCCC | 10.24 | 2760035 | |
198 | N-linked_Glycosylation | PVQCKFVNTTDMMIF CEECEECCCCCCHHH | 40.15 | 32025030 | |
198 | N-linked_Glycosylation | PVQCKFVNTTDMMIF CEECEECCCCCCHHH | 40.15 | 32025030 | |
199 | Phosphorylation | VQCKFVNTTDMMIFD EECEECCCCCCHHHH | 20.90 | 22210691 | |
200 | Phosphorylation | QCKFVNTTDMMIFDL ECEECCCCCCHHHHH | 19.38 | 22210691 | |
234 | Iodination | RFPEVSGYCHCADSQ HCCCCCCEEEECCHH | 3.29 | 32025030 | |
234 | Phosphorylation | RFPEVSGYCHCADSQ HCCCCCCEEEECCHH | 3.29 | - | |
240 | Phosphorylation | GYCHCADSQGRELAE CEEEECCHHCHHHHH | 18.93 | - | |
258 | Iodination | ELLLDEIYDTIFAGL HHHHHHHHHHHHCCC | 12.65 | 2760035 | |
376 | Phosphorylation | QQALSRLYFGTSGYF HHHHHHHHHCCCCCC | 10.00 | - | |
484 | N-linked_Glycosylation | LVNVGQFNLSGALGT EEEEEEEECCCCCCC | 25.96 | 32025030 | |
484 | N-linked_Glycosylation | LVNVGQFNLSGALGT EEEEEEEECCCCCCC | 25.96 | 32025030 | |
529 | N-linked_Glycosylation | LSVGLDSNSSTGTPE CCCCCCCCCCCCCHH | 39.38 | 8615697 | |
529 | N-linked_Glycosylation | LSVGLDSNSSTGTPE CCCCCCCCCCCCCHH | 39.38 | 8615697 | |
704 | Iodination | QCFNSECYCVDAEGQ EEECCEEEEECCCCC | 7.46 | 2760035 | |
704 | Other | QCFNSECYCVDAEGQ EEECCEEEEECCCCC | 7.46 | 2760035 | |
748 | N-linked_Glycosylation | TVQALLSNSSMLPTL HHHHHHHCCCCCCCC | 37.77 | 8615697 | |
748 | N-linked_Glycosylation | TVQALLSNSSMLPTL HHHHHHHCCCCCCCC | 37.77 | 8615697 | |
785 | Iodination | PEQVFELYQRWEAQN HHHHHHHHHHHHHHH | 6.71 | 2760035 | |
816 | N-linked_Glycosylation | YREAASGNFSLFIQS HHHHHCCCHHHHHHH | 22.34 | 9287346 | |
816 | N-linked_Glycosylation | YREAASGNFSLFIQS HHHHHCCCHHHHHHH | 22.34 | 9287346 | |
866 | Iodination | ENILLEPYLFWQILN CCCCCCHHHHHHHHC | 12.73 | 2760035 | |
883 | Iodination | LSQYPGSYSDFSTPL HHCCCCCCCCCCCCC | 20.41 | 2760035 | |
939 | Phosphorylation | VLQFIRETEEIVSAS HHHHHHHHHHHHHCC | 29.12 | - | |
947 | N-linked_Glycosylation | EEIVSASNSSRFPLG HHHHHCCCCCCCCCC | 43.66 | 9287346 | |
947 | N-linked_Glycosylation | EEIVSASNSSRFPLG HHHHHCCCCCCCCCC | 43.66 | 9287346 | |
992 | Iodination | QFLRGSDYAIRLAAQ HHHCCCHHHHHHHHH | 12.96 | 2760035 | |
1005 | Phosphorylation | AQSTLSFYQRRRFSP HHHHHHHHHHCCCCC | 9.57 | - | |
1011 | Phosphorylation | FYQRRRFSPDDSAGA HHHHCCCCCCCCCHH | 26.13 | 22210691 | |
1019 | Phosphorylation | PDDSAGASALLRSGP CCCCCHHHHHHHCCC | 20.66 | 24719451 | |
1082 | Phosphorylation | TTCEKSRTSGLLSSW CCCCHHCCCCCHHHH | 33.95 | 26437602 | |
1083 | Phosphorylation | TCEKSRTSGLLSSWK CCCHHCCCCCHHHHH | 26.74 | 26437602 | |
1087 | Phosphorylation | SRTSGLLSSWKQARS HCCCCCHHHHHHHHC | 38.91 | 26437602 | |
1088 | Phosphorylation | RTSGLLSSWKQARSQ CCCCCHHHHHHHHCC | 37.96 | 26437602 | |
1154 | Phosphorylation | SLCNVLKSGVLSRRV HHHHHHHHCCCCCCC | 29.94 | 30576142 | |
1158 | Phosphorylation | VLKSGVLSRRVSPGY HHHHCCCCCCCCCCC | 18.68 | 24719451 | |
1165 | Phosphorylation | SRRVSPGYVPACRAE CCCCCCCCCCCEECC | 13.47 | - | |
1220 | N-linked_Glycosylation | PRCPLPFNASEVVGG CCCCCCCCHHHEECC | 40.39 | 32025030 | |
1220 | N-linked_Glycosylation | PRCPLPFNASEVVGG CCCCCCCCHHHEECC | 40.39 | 32025030 | |
1310 | Thyroxine | GKMCSADYADLLQTF CCCCCCCHHHHHHHH | 11.17 | - | |
1310 | Iodination | GKMCSADYADLLQTF CCCCCCCHHHHHHHH | 11.17 | 2760035 | |
1310 | Other | GKMCSADYADLLQTF CCCCCCCHHHHHHHH | 11.17 | 2760035 | |
1348 | N-linked_Glycosylation | LVSIPVCNNSSVQVG EEEEEECCCCCEEEE | 52.21 | 8615697 | |
1348 | N-linked_Glycosylation | LVSIPVCNNSSVQVG EEEEEECCCCCEEEE | 52.21 | 8615697 | |
1349 | N-linked_Glycosylation | VSIPVCNNSSVQVGC EEEEECCCCCEEEEE | 30.45 | 32025030 | |
1365 | N-linked_Glycosylation | TRERLGVNVTWKSRL EHHHCCCCEEECHHC | 24.57 | 32025030 | |
1365 | N-linked_Glycosylation | TRERLGVNVTWKSRL EHHHCCCCEEECHHC | 24.57 | 32025030 | |
1421 | Phosphorylation | SKTFPAETIRFLQGD CCCCCHHHHHHHCCC | 21.40 | - | |
1449 | Phosphorylation | EGFYQVLTSEASQDG CCCHHHHCCCCCCCC | 26.18 | 27251275 | |
1450 | Phosphorylation | GFYQVLTSEASQDGL CCHHHHCCCCCCCCC | 27.31 | 27251275 | |
1453 | Phosphorylation | QVLTSEASQDGLGCV HHHCCCCCCCCCCEE | 25.42 | 27251275 | |
1467 | Iodination | VKCPEGSYSQDEECI EECCCCCCCCCCCCE | 22.62 | 2760035 | |
1498 (in isoform 2) | Phosphorylation | - | 23.66 | 17929957 | |
1508 (in isoform 2) | Phosphorylation | - | 18.95 | 17929957 | |
1529 | Phosphorylation | QCDQNGQYRASQKDR CCCCCCCEECCCCCC | 15.00 | - | |
1577 | Phosphorylation | KFEKVPESKVIFDAN CCCCCCCCCEEEECC | 26.59 | - | |
1591 | Phosphorylation | NAPVAVRSKVPDSEF CCCEEEECCCCCCCC | 31.26 | - | |
1652 | Phosphorylation | KRDALGNSKATSFGS HHHHCCCCCCCCCCC | 23.13 | 18452278 | |
1716 | N-linked_Glycosylation | VFSASGANLTDAHLF EEECCCCCCCHHHHH | 47.18 | 32025030 | |
1716 | N-linked_Glycosylation | VFSASGANLTDAHLF EEECCCCCCCHHHHH | 47.18 | 32025030 | |
1774 | N-linked_Glycosylation | DPSEAWANATCPGVT CHHHHHHCCCCCCCC | 25.69 | 32025030 | |
1774 | N-linked_Glycosylation | DPSEAWANATCPGVT CHHHHHHCCCCCCCC | 25.69 | 32025030 | |
1839 | Phosphorylation | SMGCRKDTVPRPASP HHCCCCCCCCCCCCC | 35.13 | 24905233 | |
1869 | N-linked_Glycosylation | NQVIVNGNQSLSSQK CCEEECCCCCCHHCH | 24.06 | 32025030 | |
2013 | N-linked_Glycosylation | CTGFGFLNVSQLKGG CCCEEEEEHHHCCCC | 29.18 | 32025030 | |
2013 | N-linked_Glycosylation | CTGFGFLNVSQLKGG CCCEEEEEHHHCCCC | 29.18 | 32025030 | |
2122 | N-linked_Glycosylation | VSTAATSNFSAVRDL CCCCCCCCCHHHHHH | 30.45 | 32025030 | |
2184 | Iodination | REEATHIYRKPGISL HHHHHHHHCCCCEEE | 12.65 | 2760035 | |
2184 | Phosphorylation | REEATHIYRKPGISL HHHHHHHHCCCCEEE | 12.65 | - | |
2250 | N-linked_Glycosylation | FQAPEPLNWTGSWDA CCCCCCCCCCCCCCC | 45.00 | 32025030 | |
2250 | N-linked_Glycosylation | FQAPEPLNWTGSWDA CCCCCCCCCCCCCCC | 45.00 | 32025030 | |
2295 | N-linked_Glycosylation | IPQNVAPNASVLVFF CCCCCCCCCEEEEEE | 34.80 | 32025030 | |
2295 | N-linked_Glycosylation | IPQNVAPNASVLVFF CCCCCCCCCEEEEEE | 34.80 | 32025030 | |
2467 | Phosphorylation | NVLNDAQTKLLAVSG HHCCHHHHHHHEECC | 25.94 | - | |
2510 | Phosphorylation | EVDLLIGSSQDDGLI EEEEECCCCCCCCHH | 20.17 | 23828894 | |
2540 | Iodination | TSSKTAFYQALQNSL CCHHHHHHHHHHHHH | 7.16 | 32025030 | |
2573 | Thyroxine | LEHSTDDYASFSRAL CCCCCCCHHHHHHHH | 13.62 | - | |
2573 | Iodination | LEHSTDDYASFSRAL CCCCCCCHHHHHHHH | 13.62 | 2760035 | |
2573 | Other | LEHSTDDYASFSRAL CCCCCCCHHHHHHHH | 13.62 | 2760035 | |
2575 | Phosphorylation | HSTDDYASFSRALEN CCCCCHHHHHHHHHH | 19.95 | 24719451 | |
2582 | N-linked_Glycosylation | SFSRALENATRDYFI HHHHHHHHHCCCEEE | 47.68 | 32025030 | |
2582 | N-linked_Glycosylation | SFSRALENATRDYFI HHHHHHHHHCCCEEE | 47.68 | 32025030 | |
2587 | Thyroxine | LENATRDYFIICPII HHHHCCCEEEEEHHH | 8.06 | - | |
2587 | Iodination | LENATRDYFIICPII HHHHCCCEEEEEHHH | 8.06 | 2760035 | |
2617 | Iodination | MYHAPENYGHGSLEL EEECCCCCCCCCHHH | 14.61 | 2760035 | |
2650 | Phosphorylation | QFSLEEKSLSLKIMQ CCCCCHHHHHHHHHH | 26.86 | 24719451 | |
2652 | Phosphorylation | SLEEKSLSLKIMQYF CCCHHHHHHHHHHHH | 34.41 | 24719451 | |
2658 | Phosphorylation | LSLKIMQYFSHFIRS HHHHHHHHHHHHHHC | 6.94 | 25884760 | |
2670 | Phosphorylation | IRSGNPNYPYEFSRK HHCCCCCCCCCCCCC | 14.40 | 25884760 | |
2697 | Iodination | PRAGGENYKEFSELL CCCCCCCHHHHHHHC | 13.95 | 2760035 | |
2721 | Phosphorylation | DCSFWSKYISSLKTS CHHHHHHHHHHHCCC | 10.66 | 22817900 | |
2723 | Phosphorylation | SFWSKYISSLKTSAD HHHHHHHHHHCCCCC | 26.91 | 24719451 | |
2724 | Phosphorylation | FWSKYISSLKTSADG HHHHHHHHHCCCCCC | 25.07 | 29083192 | |
2728 | Phosphorylation | YISSLKTSADGAKGG HHHHHCCCCCCCCCC | 23.83 | 19664994 | |
2737 | Phosphorylation | DGAKGGQSAESEEEE CCCCCCCCCCCHHHH | 36.96 | 27251275 | |
2740 | Phosphorylation | KGGQSAESEEEELTA CCCCCCCCHHHHHHC | 50.45 | 27251275 | |
2749 | O-linked_Glycosylation | EEELTAGSGLREDLL HHHHHCCCCHHHHHH | 31.99 | 16679516 | |
2766 | Triiodothyronine | QEPGSKTYSK----- CCCCCCCCCC----- | 20.42 | - | |
2766 | Iodination | QEPGSKTYSK----- CCCCCCCCCC----- | 20.42 | 2760035 | |
2766 | Other | QEPGSKTYSK----- CCCCCCCCCC----- | 20.42 | 2760035 |
Modified Location | Modified Residue | Modification | Type of Upstream Proteins | Gene Name of Upstream Proteins | UniProt AC of Upstream Proteins | Sources |
---|---|---|---|---|---|---|
Oops, there are no upstream regulatory protein records of THYG_HUMAN !! |
Modified Location | Modified Residue | Modification | Function | Reference | ||
---|---|---|---|---|---|---|
Oops, there are no descriptions of PTM sites of THYG_HUMAN !! |
* Distance = the distance between SAP position and PTM sites.
Modified Location | Modification | Variant Position (Distance <= 10) |
Residue Change | SAP | Related Disease | Reference |
---|---|---|---|---|---|---|
Oops, there are no SNP-PTM records of THYG_HUMAN !! |
Interacting Protein | Gene Name | Interaction Type | PPI Reference | Domain-Domain Interactions |
---|---|---|---|---|
THYG_HUMAN | TG | physical | 11294872 |
loading...
N-linked Glycosylation | |
Reference | PubMed |
"Glycosylation in human thyroglobulin: location of the N-linkedoligosaccharide units and comparison with bovine thyroglobulin."; Yang S.X., Pollock H.G., Rawitch A.B.; Arch. Biochem. Biophys. 327:61-70(1996). Cited for: PARTIAL PROTEIN SEQUENCE, GLYCOSYLATION AT ASN-76; ASN-198; ASN-484;ASN-529; ASN-748; ASN-816; ASN-947; ASN-1220; ASN-1348; ASN-1349;ASN-1365; ASN-1716; ASN-1774; ASN-2013; ASN-2250; ASN-2295 ANDASN-2582, AND ABSENCE OF GLYCOSYLATION AT ASN-110; ASN-496; ASN-1869AND ASN-2122. | |
O-linked Glycosylation | |
Reference | PubMed |
"A single chondroitin 6-sulfate oligosaccharide unit at Ser-2730 ofhuman thyroglobulin enhances hormone formation and limits proteolyticaccessibility at the carboxyl terminus. Potential insights intothyroid homeostasis and autoimmunity."; Conte M., Arcaro A., D'Angelo D., Gnata A., Mamone G., Ferranti P.,Formisano S., Gentile F.; J. Biol. Chem. 281:22200-22211(2006). Cited for: GLYCOSYLATION AT SER-2749. |