Tag | Content |
WERAM ID | WERAM-Hos-0181 |
Ensembl Protein ID | ENSP00000395929.2 |
Uniprot Accession | Q96L73; NSD1_HUMAN; Q96PD8; Q96RN7 |
Genbank Protein ID | NP_071900.2; NP_758859.1; XP_005266016.1; XP_005266017.1; XP_005266018.1; XP_011532912.1; XP_011532913.1 |
Protein Name | Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20 specific |
Genbank Nucleotide ID | NM_022455.4; NM_172349.2; XM_005265959.1; XM_005265960.1; XM_005265961.1; XM_011534610.1; XM_011534611.1 |
Gene Name | NSD1;STO;KMT3B;SOTOS;ARA267;SOTOS1 |
Ensembl Information | |
Details | |
Status | Reviewed |
Classification |
Type | Family | E-value | Score | Start | End |
HMT | SET2 | 7.90e-52 | 176.7 | 1943 | 2059 |
Me_Reader | PWWP | 3.00e-32 | 112.8 | 323 | 1817 |
HMT | SET1 | 1.00e-28 | 102.1 | 1943 | 2059 |
Me_Reader | PHD | 7.50e-19 | 69.9 | 1544 | 2163 |
|
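The SET2 and SET1 rows in the classification report the same residue range (1943 to 2059), so two profiles match one region. The sketch below shows one way to collapse such rows, assuming the hit with the lowest E-value is preferred when type and coordinates are identical. That rule, the `Hit` tuple and the `best_per_span` helper are illustrative assumptions, not a documented WERAM procedure.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    kind: str      # "Type" column
    family: str    # "Family" column
    evalue: float
    score: float
    start: int     # 1-based, inclusive
    end: int       # 1-based, inclusive


# Rows copied from the classification of ENSP00000395929.2.
HITS = [
    Hit("HMT", "SET2", 7.90e-52, 176.7, 1943, 2059),
    Hit("Me_Reader", "PWWP", 3.00e-32, 112.8, 323, 1817),
    Hit("HMT", "SET1", 1.00e-28, 102.1, 1943, 2059),
    Hit("Me_Reader", "PHD", 7.50e-19, 69.9, 1544, 2163),
]


def best_per_span(hits):
    """Keep the lowest E-value hit for each (type, start, end) span."""
    best = {}
    for hit in hits:
        key = (hit.kind, hit.start, hit.end)
        if key not in best or hit.evalue < best[key].evalue:
            best[key] = hit
    # Report the survivors from most to least significant.
    return sorted(best.values(), key=lambda h: h.evalue)


if __name__ == "__main__":
    for hit in best_per_span(HITS):
        print(hit.kind, hit.family, hit.start, hit.end, hit.evalue)
```

Under this assumption SET1 is dropped in favour of SET2, while PWWP and PHD are kept because neither shares a span with another row of the same type.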
Organism | Homo sapiens |
NCBI Taxa ID | 9606 |
Functional Description |
Histone methyltransferase. Preferentially methylates 'Lys-36' of histone H3 and 'Lys-20' of histone H4 (in vitro). Transcriptional intermediary factor capable of influencing transcription either negatively or positively, depending on the cellular context. |
|
Domain Profile |
HMT SET2
SET2.txt 2 kveliktekkGyGlrakeeikkdefileYvGevidekeakkRlkeyeeekvkdfYlleldkdeviDatkkGnlaRfinhsCePncetqkw 91 +ve+++t ++G+Glr+k++ikk+ef++eYvGe+ide+e+++R++ ++e+++++fY+l+ldkd++iDa kGn+aRf+nh+C+Pncetqkw ENSP00000395929.2 1943 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKW 2032 699*************************************************************************************** PP SET2.txt 92 tvegelrvglfakkkikkgeeltfdYn 118 +v+g++rvglfa ++ik+g+eltf+Yn ENSP00000395929.2 2033 SVNGDTRVGLFALSDIKAGTELTFNYN 2059 **************************8 PP
Me_Reader PWWP
PWWP.txt 1 agdLVwaKlkgYpwWPalvisppleakkl...ktqeaeenkylVlFFgnkherawvkrkklvpys 62 +gdL+waK k+ pwWP++++s+pl ++ + ++ ++y V+ Fg+ erawv k +v ++ ENSP00000395929.2 323 VGDLIWAKFKRRPWWPCRICSDPLINTHSkmkVSNRRPYRQYYVEAFGDPSERAWVAGKAIVMFE 387 69********************98766665466888999******************99998775 PP PWWP.txt 2 gdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63 ++Vw+K+++Y+wWPa++++p+ ++++ ++ ++ ++++VlFFg ++++ w ++ +++py+e ENSP00000395929.2 1757 REIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFG-SNDYLWTHQARVFPYME 1817 589*****************************************.***************87 PP
HMT SET1
SET1.txt 2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeig.vylfrldedaevvvdatkkgniarfinhscepNceak 90 e+e+ + +g+gl++k +i+k+e+v EYvGe+i +e+ r ++ +++ i+ y+ ld+d ++da kgn+arf+nh+c+pNce++ ENSP00000395929.2 1943 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITnFYMLTLDKD--RIIDAGPKGNYARFMNHCCQPNCETQ 2030 6777788889************************99999888887777778789******99..************************** PP SET1.txt 91 vvavdgekkiviyakraIekgeeltydYk 119 +v+g+++++++a +I++g+elt++Y+ ENSP00000395929.2 2031 KWSVNGDTRVGLFALSDIKAGTELTFNYN 2059 ****************************7 PP
Me_Reader PHD
PHD.txt 2 tiClvCgkddegekemvqCde.CddwfHlkCvklplsslpegkswyCpsCke 52 ++C+ C ++ g e++ C+ C +fHl+C++l+ ++p g +++C++C++ ENSP00000395929.2 1544 NVCQNC--EKLG--ELLLCEAqCCGAFHLECLGLT--EMPRG-KFICNECRT 1588 678888..3333..289***99*************..*****.*******96 PP PHD.txt 2 tiClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsC 50 +C+vC++++e+ + C C + +H +Cv+ ++ ++k ++C + ENSP00000395929.2 1591 HTCFVCKQSGED---VKRCLLplCGKFYHEECVQKYPPTVMQNKGFRCSLH 1638 58****777777...55899889**********988777776657999876 PP PHD.txt 3 iClvCgkddegeke.....mvqCdeCddwfHlk..CvklplsslpegkswyCpsC 50 iC++C+ ++ + ++ C C+ ++H++ C+ s++ +++s +Cp++ ENSP00000395929.2 1639 ICITCHAANPANVSaskgrLMRCVRCPVAYHANdfCLAAG-SKILASNSIICPNH 1692 8****86666644456677************965599988.55555558999998 PP PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCke 52 C+vC ++ g+ +++Cd+C+ +fH +C+++ +peg +wyC+ Ck+ ENSP00000395929.2 1709 WCFVC--SEGGS--LLCCDSCPAAFHRECLNID---IPEG-NWYCNDCKA 1750 7****..44443..9******************...****.*******85 PP PHD.txt 4 ClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51 C+ C d+g+ +v C++ C++ +H++C++l+ + p+g +w Cp ++ ENSP00000395929.2 2121 CFSC--GDAGQ--LVSCKKpgCPKVYHADCLNLT--KRPAG-KWECPWHQ 2163 9999..33442..9********************..*****.*****886 PP
|
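The Domain Profile lists several separate aligned segments for PWWP (two) and PHD (five), whereas the classification gives a single Start and End per family. The sketch below transcribes the segment coordinates from the alignments and checks that each family's Start and End equal the smallest start and largest end over its segments. The `SEGMENTS` list and the `envelopes` helper are names introduced here for illustration only.

```python
from collections import defaultdict

# (family, start, end) for each aligned segment in the Domain Profile,
# as 1-based inclusive positions on ENSP00000395929.2.
SEGMENTS = [
    ("SET2", 1943, 2059),
    ("PWWP", 323, 387),
    ("PWWP", 1757, 1817),
    ("SET1", 1943, 2059),
    ("PHD", 1544, 1588),
    ("PHD", 1591, 1638),
    ("PHD", 1639, 1692),
    ("PHD", 1709, 1750),
    ("PHD", 2121, 2163),
]


def envelopes(segments):
    """Return {family: (min start, max end)} over all segments of a family."""
    spans = defaultdict(list)
    for family, start, end in segments:
        spans[family].append((start, end))
    return {
        family: (min(s for s, _ in v), max(e for _, e in v))
        for family, v in spans.items()
    }


if __name__ == "__main__":
    for family, (start, end) in envelopes(SEGMENTS).items():
        print(family, start, end)
```

Read this way, the PWWP range 323 to 1817 and the PHD range 1544 to 2163 are envelopes around several domain copies, not single contiguous domains.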
Protein Sequence (Fasta) | MDQTCELPRR NCLLPFSNPV NLDAPEDKDS PFGNGQSNFS EPLNGCTMQL STVSGTSQNA 60 YGQDSPSCYI PLRRLQDLAS MINVEYLNGS ADGSESFQDP EKSDSRAQTP IVCTSLSPGG 120 PTALAMKQEP SCNNSPELQV KVTKTIKNGF LHFENFTCVD DADVDSEMDP EQPVTEDESI 180 EEIFEETQTN ATCNYETKSE NGVKVAMGSE QDSTPESRHG AVKSPFLPLA PQTETQKNKQ 240 RNEVDGSNEK AALLPAPFSL GDTNITIEEQ LNSINLSFQD DPDSSTSTLG NMLELPGTSS 300 SSTSQELPFC QPKKKSTPLK YEVGDLIWAK FKRRPWWPCR ICSDPLINTH SKMKVSNRRP 360 YRQYYVEAFG DPSERAWVAG KAIVMFEGRH QFEELPVLRR RGKQKEKGYR HKVPQKILSK 420 WEASVGLAEQ YDVPKGSKNR KCIPGSIKLD SEEDMPFEDC TNDPESEHDL LLNGCLKSLA 480 FDSEHSADEK EKPCAKSRAR KSSDNPKRTS VKKGHIQFEA HKDERRGKIP ENLGLNFISG 540 DISDTQASNE LSRIANSLTG SNTAPGSFLF SSCGKNTAKK EFETSNGDSL LGLPEGALIS 600 KCSREKNKPQ RSLVCGSKVK LCYIGAGDEE KRSDSISICT TSDDGSSDLD PIEHSSESDN 660 SVLEIPDAFD RTENMLSMQK NEKIKYSRFA ATNTRVKAKQ KPLISNSHTD HLMGCTKSAE 720 PGTETSQVNL SDLKASTLVH KPQSDFTNDA LSPKFNLSSS ISSENSLIKG GAANQALLHS 780 KSKQPKFRSI KCKHKENPVM AEPPVINEEC SLKCCSSDTK GSPLASISKS GKVDGLKLLN 840 NMHEKTRDSS DIETAVVKHV LSELKELSYR SLGEDVSDSG TSKPSKPLLF SSASSQNHIP 900 IEPDYKFSTL LMMLKDMHDS KTKEQRLMTA QNLVSYRSPG RGDCSTNSPV GVSKVLVSGG 960 STHNSEKKGD GTQNSANPSP SGGDSALSGE LSASLPGLLS DKRDLPASGK SRSDCVTRRN 1020 CGRSKPSSKL RDAFSAQMVK NTVNRKALKT ERKRKLNQLP SVTLDAVLQG DRERGGSLRG 1080 GAEDPSKEDP LQIMGHLTSE DGDHFSDVHF DSKVKQSDPG KISEKGLSFE NGKGPELDSV 1140 MNSENDELNG VNQVVPKKRW QRLNQRRTKP RKRMNRFKEK ENSECAFRVL LPSDPVQEGR 1200 DEFPEHRTPS ASILEEPLTE QNHADCLDSA GPRLNVCDKS SASIGDMEKE PGIPSLTPQA 1260 ELPEPAVRSE KKRLRKPSKW LLEYTEEYDQ IFAPKKKQKK VQEQVHKVSS RCEEESLLAR 1320 GRSSAQNKQV DENSLISTKE EPPVLEREAP FLEGPLAQSE LGGGHAELPQ LTLSVPVAPE 1380 VSPRPALESE ELLVKTPGNY ESKRQRKPTK KLLESNDLDP GFMPKKGDLG LSKKCYEAGH 1440 LENGITESCA TSYSKDFGGG TTKIFDKPRK RKRQRHAAAK MQCKKVKNDD SSKEIPGSEG 1500 ELMPHRTATS PKETVEEGVE HDPGMPASKK MQGERGGGAA LKENVCQNCE KLGELLLCEA 1560 QCCGAFHLEC LGLTEMPRGK FICNECRTGI HTCFVCKQSG EDVKRCLLPL CGKFYHEECV 1620 QKYPPTVMQN KGFRCSLHIC ITCHAANPAN VSASKGRLMR CVRCPVAYHA NDFCLAAGSK 1680 
ILASNSIICP NHFTPRRGCR NHEHVNVSWC FVCSEGGSLL CCDSCPAAFH RECLNIDIPE 1740 GNWYCNDCKA GKKPHYREIV WVKVGRYRWW PAEICHPRAV PSNIDKMRHD VGEFPVLFFG 1800 SNDYLWTHQA RVFPYMEGDV SSKDKMGKGV DGTYKKALQE AAARFEELKA QKELRQLQED 1860 RKNDKKPPPY KHIKVNRPIG RVQIFTADLS EIPRCNCKAT DENPCGIDSE CINRMLLYEC 1920 HPTVCPAGGR CQNQCFSKRQ YPEVEIFRTL QRGWGLRTKT DIKKGEFVNE YVGELIDEEE 1980 CRARIRYAQE HDITNFYMLT LDKDRIIDAG PKGNYARFMN HCCQPNCETQ KWSVNGDTRV 2040 GLFALSDIKA GTELTFNYNL ECLGNGKTVC KCGAPNCSGF LGVRPKNQPI ATEEKSKKFK 2100 KKQQGKRRTQ GEITKEREDE CFSCGDAGQL VSCKKPGCPK VYHADCLNLT KRPAGKWECP 2160 WHQCDICGKE AASFCEMCPS SFCKQHREGM LFISKLDGRL SCTEHDPCGP NPLEPGEIRE 2220 YVPPPVPLPP GPSTHLAEQS TGMAAQAPKM SDKPPADTNQ MLSLSKKALA GTCQRPLLPE 2280 RPLERTDSRP QPLDKVRDLA GSGTKSQSLV SSQRPLDRPP AVAGPRPQLS DKPSPVTSPS 2340 SSPSVRSQPL ERPLGTADPR LDKSIGAASP RPQSLEKTSV PTGLRLPPPD RLLITSSPKP 2400 QTSDRPTDKP HASLSQRLPP PEKVLSAVVQ TLVAKEKALR PVDQNTQSKN RAALVMDLID 2460 LTPRQKERAA SPHQVTPQAD EKMPVLESSS WPASKGLGHM PRAVEKGCVS DPLQTSGKAA 2520 APSEDPWQAV KSLTQARLLS QPPAKAFLYE PTTQASGRAS AGAEQTPGPL SQSPGLVKQA 2580 KQMVGGQQLP ALAAKSGQSF RSLGKAPASL PTEEKKLVTT EQSPWALGKA SSRAGLWPIV 2640 AGQTLAQSCW SAGSTQTLAQ TCWSLGRGQD PKPEQNTLPA LNQAPSSHKC AESEQK 2696Protein Fasta Sequence
>ENSP00000395929.2|NSD1;STO;KMT3B;SOTOS;ARA267;SOTOS1|Homo sapiens MDQTCELPRRNCLLPFSNPVNLDAPEDKDSPFGNGQSNFSEPLNGCTMQLSTVSGTSQNAYGQDSPSCYIPLRRLQDLASMINVEYLNGSADGSESFQDPEKSDSRAQTPIVCTSLSPGGPTALAMKQEPSCNNSPELQVKVTKTIKNGFLHFENFTCVDDADVDSEMDPEQPVTEDESIEEIFEETQTNATCNYETKSENGVKVAMGSEQDSTPESRHGAVKSPFLPLAPQTETQKNKQRNEVDGSNEKAALLPAPFSLGDTNITIEEQLNSINLSFQDDPDSSTSTLGNMLELPGTSSSSTSQELPFCQPKKKSTPLKYEVGDLIWAKFKRRPWWPCRICSDPLINTHSKMKVSNRRPYRQYYVEAFGDPSERAWVAGKAIVMFEGRHQFEELPVLRRRGKQKEKGYRHKVPQKILSKWEASVGLAEQYDVPKGSKNRKCIPGSIKLDSEEDMPFEDCTNDPESEHDLLLNGCLKSLAFDSEHSADEKEKPCAKSRARKSSDNPKRTSVKKGHIQFEAHKDERRGKIPENLGLNFISGDISDTQASNELSRIANSLTGSNTAPGSFLFSSCGKNTAKKEFETSNGDSLLGLPEGALISKCSREKNKPQRSLVCGSKVKLCYIGAGDEEKRSDSISICTTSDDGSSDLDPIEHSSESDNSVLEIPDAFDRTENMLSMQKNEKIKYSRFAATNTRVKAKQKPLISNSHTDHLMGCTKSAEPGTETSQVNLSDLKASTLVHKPQSDFTNDALSPKFNLSSSISSENSLIKGGAANQALLHSKSKQPKFRSIKCKHKENPVMAEPPVINEECSLKCCSSDTKGSPLASISKSGKVDGLKLLNNMHEKTRDSSDIETAVVKHVLSELKELSYRSLGEDVSDSGTSKPSKPLLFSSASSQNHIPIEPDYKFSTLLMMLKDMHDSKTKEQRLMTAQNLVSYRSPGRGDCSTNSPVGVSKVLVSGGSTHNSEKKGDGTQNSANPSPSGGDSALSGELSASLPGLLSDKRDLPASGKSRSDCVTRRNCGRSKPSSKLRDAFSAQMVKNTVNRKALKTERKRKLNQLPSVTLDAVLQGDRERGGSLRGGAEDPSKEDPLQIMGHLTSEDGDHFSDVHFDSKVKQSDPGKISEKGLSFENGKGPELDSVMNSENDELNGVNQVVPKKRWQRLNQRRTKPRKRMNRFKEKENSECAFRVLLPSDPVQEGRDEFPEHRTPSASILEEPLTEQNHADCLDSAGPRLNVCDKSSASIGDMEKEPGIPSLTPQAELPEPAVRSEKKRLRKPSKWLLEYTEEYDQIFAPKKKQKKVQEQVHKVSSRCEEESLLARGRSSAQNKQVDENSLISTKEEPPVLEREAPFLEGPLAQSELGGGHAELPQLTLSVPVAPEVSPRPALESEELLVKTPGNYESKRQRKPTKKLLESNDLDPGFMPKKGDLGLSKKCYEAGHLENGITESCATSYSKDFGGGTTKIFDKPRKRKRQRHAAAKMQCKKVKNDDSSKEIPGSEGELMPHRTATSPKETVEEGVEHDPGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNECRTGIHTCFVCKQSGEDVKRCLLPLCGKFYHEECVQKYPPTVMQNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYHANDFCLAAGSKILASNSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNIDIPEGNWYCNDCKAGKKPHYREIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFGSNDYLWTHQARVFPYMEGDVSSKDKMGKGVDGTYKKALQEAAARFEELKAQKELRQLQEDRKNDKKPPPYKHIKVNRPIGRVQIFTADLSEIPRCNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQN
QCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPKNQPIATEEKSKKFKKKQQGKRRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEMCPSSFCKQHREGMLFISKLDGRLSCTEHDPCGPNPLEPGEIREYVPPPVPLPPGPSTHLAEQSTGMAAQAPKMSDKPPADTNQMLSLSKKALAGTCQRPLLPERPLERTDSRPQPLDKVRDLAGSGTKSQSLVSSQRPLDRPPAVAGPRPQLSDKPSPVTSPSSSPSVRSQPLERPLGTADPRLDKSIGAASPRPQSLEKTSVPTGLRLPPPDRLLITSSPKPQTSDRPTDKPHASLSQRLPPPEKVLSAVVQTLVAKEKALRPVDQNTQSKNRAALVMDLIDLTPRQKERAASPHQVTPQADEKMPVLESSSWPASKGLGHMPRAVEKGCVSDPLQTSGKAAAPSEDPWQAVKSLTQARLLSQPPAKAFLYEPTTQASGRASAGAEQTPGPLSQSPGLVKQAKQMVGGQQLPALAAKSGQSFRSLGKAPASLPTEEKKLVTTEQSPWALGKASSRAGLWPIVAGQTLAQSCWSAGSTQTLAQTCWSLGRGQDPKPEQNTLPALNQAPSSHKCAESEQK
|
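The protein sequence appears twice: once in blocks of ten residues with running position counters, and once as plain FASTA. The sketch below shows how the blocked view could be reduced to a plain string and how a 1-based inclusive Start/End pair would map onto a Python slice. The helper names `strip_block_numbering` and `slice_domain` are hypothetical, and only the first 60 residues are used as input.

```python
import re


def strip_block_numbering(text):
    """Remove position counters and whitespace from a blocked sequence view."""
    return re.sub(r"[\d\s]+", "", text)


def slice_domain(sequence, start, end):
    """Return residues start..end, treating both bounds as 1-based and inclusive."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


# First row of the blocked protein view, copied from the record.
FIRST_ROW = (
    "MDQTCELPRR NCLLPFSNPV NLDAPEDKDS PFGNGQSNFS EPLNGCTMQL STVSGTSQNA 60"
)

if __name__ == "__main__":
    seq = strip_block_numbering(FIRST_ROW)
    print(len(seq), seq)
    print(slice_domain(seq, 1, 10))
```

With the full 2696-residue sequence as input, `slice_domain(seq, 1943, 2059)` would return the region reported for the SET2 hit.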
Nucleotide Sequence (Fasta) | GCTGGCCCGG GAGGGGGCGC GGGGCACGGT TGATGCCGGC CCAGGATGGA TCAGACCTGT 60 GAACTACCCA GAAGAAATTG TCTGCTGCCC TTTTCCAATC CAGTGAATTT AGATGCCCCT 120 GAAGACAAGG ACAGCCCTTT CGGTAATGGT CAATCCAATT TTTCTGAGCC ACTTAATGGG 180 TGTACTATGC AGTTATCGAC TGTCAGTGGA ACATCCCAAA ATGCTTATGG ACAAGATTCT 240 CCATCTTGTT ACATTCCACT GCGGAGACTA CAGGATTTGG CCTCCATGAT CAATGTAGAG 300 TATTTAAATG GGTCTGCTGA TGGATCAGAA TCCTTTCAAG ACCCTGAAAA AAGTGATTCA 360 AGAGCTCAGA CGCCAATTGT TTGCACTTCC TTGAGTCCTG GTGGTCCTAC AGCACTTGCT 420 ATGAAACAGG AACCCTCTTG TAATAACTCC CCTGAACTCC AGGTAAAAGT AACAAAGACT 480 ATCAAGAATG GCTTTCTGCA CTTTGAGAAT TTTACTTGTG TGGACGATGC AGATGTAGAT 540 TCTGAAATGG ACCCAGAACA GCCAGTCACA GAGGATGAGA GTATAGAGGA GATCTTTGAG 600 GAAACTCAGA CCAATGCCAC CTGCAATTAT GAGACTAAAT CAGAGAATGG TGTAAAAGTG 660 GCCATGGGAA GTGAACAAGA CAGCACACCA GAGAGTAGAC ACGGTGCAGT CAAATCGCCA 720 TTCTTGCCAT TAGCTCCTCA GACTGAAACA CAGAAAAATA AGCAAAGAAA TGAAGTGGAC 780 GGCAGCAATG AAAAAGCAGC CCTTCTCCCA GCCCCCTTTT CACTAGGAGA CACAAACATT 840 ACAATAGAAG AGCAATTAAA CTCAATAAAT TTATCTTTTC AGGATGATCC AGATTCCAGT 900 ACCAGTACAT TAGGAAACAT GCTAGAATTA CCTGGAACTT CATCATCATC TACTTCACAG 960 GAATTGCCAT TTTGTCAACC TAAGAAAAAG TCTACGCCAC TGAAGTATGA AGTTGGAGAT 1020 CTCATCTGGG CAAAATTCAA GAGACGCCCA TGGTGGCCCT GCAGGATTTG TTCTGATCCG 1080 TTGATTAACA CACATTCAAA AATGAAAGTT TCCAACCGGA GGCCCTATCG GCAGTACTAC 1140 GTGGAGGCTT TTGGAGATCC TTCTGAGAGA GCCTGGGTGG CTGGAAAAGC AATCGTCATG 1200 TTTGAAGGCA GACATCAATT CGAAGAGCTA CCTGTCCTTA GGAGAAGAGG GAAACAGAAA 1260 GAAAAAGGAT ATAGGCATAA GGTTCCTCAG AAAATTTTGA GTAAATGGGA AGCCAGTGTT 1320 GGACTTGCAG AACAGTATGA TGTTCCCAAG GGGTCAAAGA ACCGAAAATG TATTCCTGGT 1380 TCAATCAAGT TGGACAGTGA AGAAGATATG CCATTTGAAG ACTGCACAAA TGATCCTGAG 1440 TCAGAACATG ACCTGTTGCT TAATGGCTGT TTGAAATCAC TGGCTTTTGA TTCTGAACAT 1500 TCTGCAGATG AGAAGGAAAA GCCTTGCGCT AAATCTCGAG CCAGAAAGAG CTCTGATAAT 1560 CCAAAAAGGA CTAGTGTGAA AAAGGGCCAC ATACAATTTG AAGCACATAA AGATGAACGG 1620 AGGGGAAAGA TTCCAGAGAA CCTTGGCCTA AACTTTATCT CTGGGGATAT ATCTGATACG 
1680 CAGGCCTCTA ATGAACTTTC CAGGATAGCA AATAGCCTCA CAGGGTCCAA CACTGCCCCA 1740 GGAAGTTTTC TGTTTTCTTC CTGTGGAAAA AACACTGCAA AGAAAGAATT TGAGACTTCA 1800 AATGGTGACT CTTTATTGGG CTTGCCTGAG GGTGCTTTGA TCTCAAAGTG TTCTCGAGAG 1860 AAGAATAAAC CCCAACGAAG CCTGGTGTGT GGTTCAAAAG TGAAGCTCTG CTATATTGGA 1920 GCAGGTGATG AGGAAAAGCG AAGTGATTCC ATTAGTATCT GTACCACTTC TGATGATGGA 1980 AGCAGTGACC TGGATCCCAT AGAACACAGC TCAGAGTCTG ATAACAGTGT CCTTGAAATT 2040 CCAGATGCTT TCGATAGAAC AGAGAACATG TTATCTATGC AGAAAAATGA AAAGATAAAG 2100 TATTCTAGGT TTGCTGCCAC AAACACTAGG GTAAAAGCAA AACAGAAGCC TCTCATTAGT 2160 AACTCACATA CAGACCACTT AATGGGTTGT ACTAAGAGTG CAGAGCCTGG AACCGAGACG 2220 TCTCAGGTTA ATCTCTCTGA TCTGAAGGCA TCTACTCTTG TTCACAAACC CCAGTCAGAT 2280 TTTACAAATG ATGCTCTCTC TCCAAAATTC AACCTGTCAT CAAGCATATC CAGTGAGAAC 2340 TCGTTAATAA AGGGTGGGGC AGCAAATCAA GCTCTATTAC ATTCGAAAAG CAAACAGCCC 2400 AAGTTCCGAA GTATAAAGTG CAAACACAAA GAAAATCCAG TTATGGCAGA ACCCCCAGTT 2460 ATAAATGAGG AGTGCAGTTT GAAATGCTGC TCTTCTGATA CCAAAGGCTC TCCTTTGGCC 2520 AGCATTTCTA AAAGTGGGAA AGTGGATGGT CTAAAACTAC TGAACAATAT GCATGAGAAA 2580 ACCAGGGATT CAAGTGACAT AGAAACAGCA GTGGTGAAAC ATGTTTTATC CGAGTTGAAG 2640 GAACTCTCTT ACAGATCCTT AGGTGAGGAT GTCAGTGACT CTGGAACATC AAAGCCATCA 2700 AAACCATTAC TTTTCTCTTC TGCTTCTAGT CAGAATCACA TACCTATTGA ACCAGACTAC 2760 AAATTCAGTA CATTGCTAAT GATGTTGAAA GATATGCATG ATAGTAAGAC GAAGGAGCAG 2820 CGGTTGATGA CTGCTCAAAA CCTGGTCTCT TACCGGAGTC CTGGTCGTGG GGACTGTTCT 2880 ACTAATAGTC CTGTAGGAGT CTCTAAGGTT TTGGTTTCAG GAGGCTCCAC ACACAATTCA 2940 GAGAAAAAGG GAGATGGCAC TCAGAACTCC GCCAATCCTA GCCCTAGTGG GGGTGACTCT 3000 GCATTATCTG GCGAGTTGTC TGCTTCCCTA CCTGGCTTAC TGTCCGACAA GAGAGACCTC 3060 CCTGCTTCTG GTAAAAGTCG TTCAGACTGT GTTACTAGGC GCAACTGTGG ACGATCAAAG 3120 CCTTCATCCA AATTGCGAGA TGCTTTTTCA GCCCAAATGG TAAAGAACAC AGTGAACCGT 3180 AAAGCCTTAA AGACCGAGCG CAAAAGAAAA CTGAATCAGC TTCCAAGTGT GACTCTTGAT 3240 GCTGTACTGC AGGGAGACCG AGAACGTGGA GGTTCATTGA GAGGTGGGGC AGAAGATCCT 3300 AGTAAAGAGG ATCCCCTTCA GATAATGGGC CACTTAACAA GTGAAGATGG TGACCATTTT 3360 
TCTGATGTGC ATTTCGATAG CAAGGTTAAG CAATCTGATC CTGGTAAAAT TTCTGAAAAA 3420 GGACTCTCTT TTGAAAACGG AAAAGGCCCA GAGCTGGACT CTGTAATGAA CAGTGAGAAT 3480 GATGAACTCA ATGGTGTAAA TCAAGTGGTG CCTAAAAAGC GGTGGCAGCG TTTAAACCAA 3540 AGGCGCACTA AACCTCGTAA GCGCATGAAC AGATTTAAAG AGAAAGAAAA CTCTGAGTGT 3600 GCCTTTAGGG TCTTACTTCC TAGTGACCCT GTGCAGGAGG GGCGGGATGA GTTTCCAGAG 3660 CATAGAACTC CTTCAGCAAG CATACTTGAG GAACCACTGA CAGAGCAAAA TCATGCTGAC 3720 TGCTTAGATT CAGCTGGGCC ACGGTTAAAT GTTTGTGATA AATCCAGTGC CAGCATTGGT 3780 GACATGGAAA AGGAGCCAGG AATTCCCAGT TTGACACCAC AGGCTGAGCT CCCTGAACCA 3840 GCTGTGCGGT CAGAGAAGAA ACGCCTTAGG AAGCCAAGCA AGTGGCTTTT GGAATATACA 3900 GAAGAATATG ATCAGATATT TGCTCCTAAG AAAAAACAAA AGAAGGTACA GGAGCAGGTG 3960 CACAAGGTAA GTTCCCGCTG TGAAGAGGAA AGCCTTCTAG CCCGAGGTCG ATCTAGTGCT 4020 CAGAACAAGC AGGTGGACGA GAATTCTTTG ATTTCAACCA AAGAAGAGCC TCCAGTTCTT 4080 GAAAGGGAGG CTCCGTTTTT GGAGGGCCCC TTGGCTCAGT CAGAACTTGG AGGTGGACAT 4140 GCTGAGTTGC CGCAGCTGAC CTTGTCTGTG CCTGTGGCTC CGGAAGTCTC TCCACGGCCT 4200 GCCCTTGAGT CTGAGGAATT GCTAGTTAAA ACGCCAGGAA ATTATGAAAG TAAACGTCAA 4260 AGAAAACCAA CTAAGAAACT TCTTGAATCC AATGATTTAG ACCCTGGATT TATGCCCAAG 4320 AAGGGGGACC TTGGCCTTTC TAAAAAGTGC TATGAAGCTG GTCACCTGGA GAATGGCATA 4380 ACTGAATCTT GTGCCACATC TTATTCAAAA GATTTTGGTG GAGGCACTAC CAAGATATTT 4440 GACAAGCCAA GGAAGCGAAA ACGACAGAGG CATGCTGCAG CCAAGATGCA GTGTAAAAAA 4500 GTGAAAAATG ATGACTCGTC AAAAGAGATT CCAGGCTCAG AGGGAGAACT AATGCCTCAC 4560 AGGACGGCCA CAAGCCCCAA GGAGACTGTT GAGGAAGGTG TAGAACACGA TCCCGGGATG 4620 CCTGCCTCTA AAAAAATGCA GGGTGAACGC GGTGGAGGAG CTGCACTCAA GGAGAATGTC 4680 TGTCAGAATT GTGAAAAATT GGGTGAGCTG CTGTTATGTG AGGCTCAGTG CTGTGGGGCT 4740 TTCCACCTGG AGTGCCTTGG ATTGACTGAG ATGCCAAGAG GAAAATTTAT CTGCAATGAA 4800 TGTCGCACAG GAATCCATAC CTGTTTTGTA TGTAAGCAGA GTGGGGAAGA TGTTAAAAGG 4860 TGCCTTCTAC CCTTGTGTGG AAAGTTTTAC CATGAAGAGT GTGTCCAGAA GTACCCACCC 4920 ACTGTTATGC AGAACAAGGG CTTCCGGTGC TCCCTCCACA TCTGTATAAC CTGTCATGCT 4980 GCTAATCCAG CCAATGTTTC TGCATCTAAA GGTCGGTTGA TGCGCTGTGT CCGCTGTCCT 5040 GTGGCATACC 
ACGCCAATGA CTTTTGCCTG GCTGCTGGGT CAAAGATCCT TGCATCTAAT 5100 AGTATCATCT GCCCTAATCA CTTTACCCCT AGGCGGGGCT GCCGAAATCA TGAGCATGTT 5160 AATGTTAGCT GGTGCTTTGT GTGCTCAGAA GGAGGCAGCC TTCTGTGCTG TGATTCTTGC 5220 CCTGCTGCTT TTCATCGTGA ATGCCTGAAC ATTGATATCC CTGAAGGAAA CTGGTATTGC 5280 AATGACTGTA AAGCAGGCAA AAAGCCACAC TACAGGGAGA TTGTCTGGGT AAAAGTTGGA 5340 CGATACAGGT GGTGGCCAGC TGAGATCTGC CATCCTCGAG CTGTTCCTTC CAACATTGAT 5400 AAGATGAGAC ATGATGTGGG AGAGTTCCCA GTCCTCTTTT TTGGATCTAA TGACTATTTG 5460 TGGACTCACC AGGCCCGAGT CTTCCCTTAC ATGGAGGGTG ACGTGAGCAG CAAGGATAAG 5520 ATGGGCAAAG GAGTGGATGG GACATATAAA AAAGCTCTTC AGGAAGCTGC AGCAAGGTTT 5580 GAGGAATTAA AGGCCCAAAA AGAGCTAAGA CAGCTGCAGG AAGACCGAAA GAATGACAAG 5640 AAGCCACCAC CTTATAAACA TATAAAGGTA AACCGTCCTA TTGGCAGGGT ACAGATCTTC 5700 ACTGCAGACT TATCTGAAAT ACCCCGTTGC AACTGTAAAG CTACTGATGA GAACCCCTGT 5760 GGGATAGACT CTGAATGCAT CAACCGCATG CTGCTCTATG AGTGCCACCC CACAGTGTGT 5820 CCTGCCGGAG GGCGCTGTCA AAACCAGTGC TTTTCCAAGC GCCAATATCC AGAGGTTGAA 5880 ATTTTCCGCA CATTACAGCG GGGTTGGGGT CTACGGACAA AAACAGATAT TAAAAAGGGT 5940 GAATTTGTGA ATGAGTATGT GGGTGAGCTT ATAGATGAAG AAGAATGCAG AGCTCGAATT 6000 CGCTATGCTC AAGAACATGA TATCACTAAT TTCTATATGC TCACCCTAGA CAAAGACCGA 6060 ATCATTGATG CTGGTCCCAA AGGAAACTAT GCTCGGTTCA TGAATCATTG CTGCCAGCCC 6120 AACTGTGAAA CACAGAAGTG GTCTGTGAAT GGAGATACCC GTGTAGGCCT TTTTGCACTA 6180 AGTGACATTA AAGCAGGCAC TGAACTTACC TTCAACTACA ACCTAGAATG TCTTGGGAAT 6240 GGAAAGACTG TTTGCAAATG TGGAGCCCCG AACTGCAGTG GCTTCTTGGG TGTAAGGCCA 6300 AAGAATCAAC CCATTGCCAC GGAAGAAAAG TCAAAGAAAT TCAAGAAGAA GCAACAGGGA 6360 AAGCGCAGGA CCCAGGGTGA AATCACAAAG GAGCGAGAAG ATGAGTGTTT TAGTTGTGGG 6420 GATGCTGGCC AGCTCGTCTC CTGCAAGAAA CCAGGCTGCC CAAAAGTTTA CCACGCAGAC 6480 TGTCTCAATC TGACCAAGCG ACCAGCAGGG AAATGGGAAT GTCCGTGGCA TCAGTGTGAC 6540 ATCTGCGGGA AGGAAGCAGC CTCCTTCTGT GAGATGTGCC CCAGCTCCTT TTGTAAGCAG 6600 CATCGAGAAG GGATGCTTTT CATTTCCAAA CTGGATGGGC GTCTGTCTTG TACTGAGCAT 6660 GACCCCTGTG GGCCCAATCC TCTGGAACCT GGGGAGATCC GTGAGTATGT GCCTCCCCCA 6720 GTACCGCTGC CTCCAGGGCC 
AAGCACTCAC CTGGCAGAGC AATCAACAGG AATGGCTGCT 6780 CAGGCACCCA AAATGTCAGA TAAACCTCCT GCTGACACCA ACCAGATGCT GTCGCTCTCC 6840 AAAAAAGCTC TGGCAGGGAC TTGTCAGAGG CCATTGCTAC CTGAAAGACC TCTTGAGAGA 6900 ACTGACTCCA GGCCCCAGCC TTTAGATAAG GTCAGAGACC TCGCTGGGTC AGGGACCAAA 6960 TCCCAATCCT TGGTTTCCAG CCAGAGGCCA CTGGACAGGC CACCAGCAGT GGCAGGACCA 7020 AGACCCCAGC TAAGCGACAA ACCCTCTCCA GTGACCAGCC CAAGCTCCTC ACCCTCAGTC 7080 AGGTCCCAAC CACTGGAAAG ACCTCTGGGG ACGGCTGACC CAAGGCTGGA TAAATCCATA 7140 GGTGCTGCCA GCCCAAGGCC CCAGTCACTG GAGAAAACCT CAGTTCCCAC TGGCCTGAGA 7200 CTTCCGCCGC CAGACAGACT GCTCATTACT AGCAGTCCCA AACCCCAGAC TTCAGACAGG 7260 CCTACTGACA AACCCCATGC CTCTTTGTCC CAGAGACTCC CACCTCCTGA GAAAGTACTA 7320 TCAGCTGTGG TCCAGACCCT TGTAGCTAAA GAAAAAGCAC TGAGGCCTGT GGACCAGAAT 7380 ACTCAGTCAA AAAATAGAGC TGCTTTGGTG ATGGATCTCA TAGACCTAAC TCCTCGCCAG 7440 AAGGAGCGGG CAGCTTCACC TCATCAGGTC ACACCACAGG CTGATGAGAA GATGCCAGTG 7500 TTGGAGTCAA GTTCATGGCC TGCCAGCAAA GGTCTGGGGC ATATGCCGAG AGCTGTTGAG 7560 AAAGGCTGTG TGTCAGATCC TCTTCAGACA TCTGGGAAAG CAGCAGCCCC TTCAGAGGAC 7620 CCCTGGCAAG CTGTTAAATC ACTCACCCAG GCCAGACTTC TTTCTCAGCC TCCTGCCAAG 7680 GCCTTTTTAT ATGAGCCAAC AACTCAGGCC TCAGGAAGAG CTTCTGCAGG GGCTGAGCAG 7740 ACCCCAGGGC CTCTTAGCCA ATCCCCGGGC CTGGTGAAGC AGGCGAAGCA GATGGTCGGA 7800 GGCCAGCAAC TACCTGCACT TGCCGCCAAG AGTGGGCAAT CTTTTAGGTC TCTCGGGAAG 7860 GCCCCAGCCT CCCTCCCCAC TGAAGAAAAG AAGTTGGTAA CCACAGAGCA AAGTCCCTGG 7920 GCCCTGGGAA AAGCCTCATC ACGGGCAGGG CTCTGGCCCA TAGTGGCTGG ACAGACACTG 7980 GCACAGTCTT GCTGGTCTGC TGGGAGCACA CAGACATTGG CACAGACTTG CTGGTCTCTT 8040 GGAAGAGGGC AAGACCCCAA ACCAGAGCAA AATACACTTC CAGCTCTTAA CCAGGCTCCT 8100 TCCAGTCACA AGTGTGCAGA ATCAGAACAG AAGTAGTACC AATCAATGTC ACATGAACAA 8160 ACAAGCTGCC CCCAGGGTAC CATTTGGGGA GGGGAAATCT TTTCTTTCTT TCCCCCTTAA 8220 AAAAAAACAC ATCTGCCCCG AACACTTTCC CACTGTTATT CTTTCCTCAT ATCCCAACAC 8280 TCAGAACTCT TGTGACATTA GCCAGTGGGG GCTTATGGTT GTGTGAACCA TGTATGAAAA 8340 TCCAGTGGGC CCCAACCAAG GAGACAGACA GACTTGGGTC TCTTTCCCCC AACTTTTCCA 8400 CATGGTCATC GTGAAATAAA AAGTCCACTC 
TGGAGTCAAG TATGGAATTC AATTCCGCTG 8460 GTCAGGTTGG AAGGTATAGG GGCTCTCAAA GCGATTTCCC CAACCAGACA GAGCCCCATT 8520 GAGGGCACCT AGGAACCCTT GGGAGGAAAT GGTGTTCTTT CAAATCAGTG GCGATTTCCT 8580 GAGCATTCAC GTGTTCTAGG CCGGGTGCTA GTCACTGATG AGAGATACAG GCCTCATCCC 8640 TGTGAGCCTG GATTCCAAGG CTTTCAGGAA CCTTTGACCA GGAAGTAACA GGAAGTTCTG 8700 AGGGGCCCTG GGGCTTTAGA CTCATTTTGA AATGTCCTTT GTGGCACCAG AAGTGGTTGT 8760 GTTGAGGAAG TGTCTCTTGG CTGCGGTGTG CATGGGTGCG TGTGCATGCG CGCACACTCA 8820 CAGAGGTCTC CTCTATAGAT GCAAGGGTGC TGCATTGAGG CCAGCAAGGC TGTTGGCTGT 8880 GGGGTCGCCG CTGCTGCTTT TGTCTGGGCT GTGCAGAGTC TCAAGATCAG TCCTTGGAGG 8940 AGCAGGTGGT CAGGGGCAGT CGGGCTCTGT GCGAATGTAG ATTTCCAGCA GTGGAAGAAG 9000 GCATTTGGCA AGCTTCTCTT TCTTTGCTTT TGTTTCTACC TATTTTTCTC TTTGTACATG 9060 AATCCACCCC ATCCCTATTT CCCTAAAACA CTCAGGTGCT TTCAGATTTC AGAGCCTCGG 9120 GCAGTGGACA TAGGGAATCT CTGGCAAGCT CTGAGCTAGA CACACCAGCT TCAGGAAGAG 9180 TACCAGATCC TGATGGGAAA TTTCTTTTCC CCATTCCTTT TCCCTCCTGA GTGGAGGGAG 9240 TCCTCTTCTT CGCCTCCCTG AGAATTGCTG TGCTCTGTAT TGAGAGCACC TGCCTGCTGA 9300 CTTAGCTCAA AGGCAAGCCA GAACCCTTCC CTGAAGACTG GCAAGAGGTG GTGTTTAGAG 9360 CAACGTCCAG GCTAAGAGAT GACTCCTATT AACTGCTGAT TATCTGTTAC TGCTGCCCTG 9420 AGCTGGGGCC CAAGGGCTGG GAAATCTGTT GGTGCTACCC TGCCCTACCA TTCACCCAGC 9480 TCACAGACTG CCAACAGGAA GTGCTGTTTG GCTAGTTTCC TCCCACTTGT CTACCCCTCC 9540 TTTGTCCTTA GACCAACATG TTTACCTCTC TGCTTTGCCA ACTTAGCCAG CAGGCCATCC 9600 CCGGCCCTAA CGTCTCCTGG CCATTATCTC TTAGTTATGG CTTTCACGCT CTCAATAGGA 9660 TTCTGTATTT GGTCCCAATT TCCTCAAGTT CTTATTGAGG TTACTCCCAT CAATTCCACG 9720 GAGGGAACAG TAGTTATTAT AGAAGCATTT GCGCTTTATC TAAAGATTAA AAATAGAATC 9780 TGCTTTTATT TCCCAAAGTC TGTCTCTGAG GTTGAGACAC TTGAACTCAG GCAGAGGGAC 9840 GAGGCTGGGC AGGGCTGTCC TGAGTTTAGG GGCCTATCCC TGCATTTCAC TGAGACCTCG 9900 GAATCTCCTC TGTGAATTCC ACCTGCCTAG TTCTCCCCTT TCATCCTCTC TCTCTTCCCA 9960 CATCATCAAA GAGGAAAAGC TCTTTGTTCA AAAGGAAGAG AAAACGTAAA GCATCTTATT 10020 TTCTTTTAAA AGAATTTTAA ACCATGAAAA AGATATTTTT AAAGAAATTC ACCGAGAACA 10080 TTAAAGTTCA TTATATTAAG TATTTATCAT 
GTGTGAGAAT AATAAATATA TAACTGCAGC 10140 TAGTAGGTCC CTTTCCCTAA TCTTTTAGGT CATATGAGTA GGGTTTGCTT GGTGCCAGTC 10200 CTGTGCCCTT TTCTCTCCAG TCATCTGTAG TTGTGATCAG AAAAAGGTAT CTGCACTGCA 10260 CTGTCAGAGT CTCCTTTCAC TATGTTGTGT GTTAAATTAC CGTAGCTCTT TGTTTCATGA 10320 AATAAACTGT GAATTTGGGG GGGGCGGGGG GAGGGCGTGC AGGCCATGTA AAAATTTTCC 10380 GTGGAGAAGT TTGATTCTAA AGTAGCTTCT CTAAAGTAGG CTTTGGTAGG TAATCAACTT 10440 GACAGCAGTC TAGATGTCTC ACAGGACAGG AGGGAGTGAG GGAAAGGGGC CATGATTGGC 10500 TGCTTTGTGG TTTTATTTTG GTTCTTTCCA TTCTCCGCCA TTCATTGGAG GCTTCGTTCC 10560 AGACCTGCCT GGGAAAACAG CTTCTGAGCC ATTTTGGGGA GCAGTTCTTC ATCTGAATGG 10620 ATGGACATCT GGGCTTCCTT CAAGGGCCAT TGAATGGGAA CTAGAAAACC ACTGGAAACT 10680 AGAAATTTGA GCTATTGGGC CCACCAGTAG CAGCATGTGA TACTAGATGG TTAAAATCAT 10740 GAAAGCAGTC ACTATCCAAT TAGAAGCAGA GTCACAACAA CTGTTGGGAA ATGTGACTCT 10800 TGGAGGAAGG TGGGGAGGGA GTGGCCTTGC CAGCCCTGTG GGACGTCCCC TGAAGTTTGT 10860 AATAAGACCC CTTTTCCAAA GGGATGTGAA TTGGAGTGAA AAGGAAATCT TTCATCTTAG 10920 AAAACTTCTG GTCCTTAACG CAGGGTGGTA TTTGGGTATG TGCTTGGAAA TTGAGATCTC 10980 AAGAGTGTTT GCCTTGGAGC CAGCTCCCCA GGAGGCCTTT TCCAGGGACA AGGCAAAAGT 11040 TGAAATTCTC CATGGGTAGC TAGAAAGCCA ATACATCTAG CCCTGCTAAG TCAGAAAAAG 11100 ATTATGAAAA ATGTTGAAAT TTACATTCAA AGCCTCATTT GCTTATCTTG CTGGAGCCAA 11160 CCCAGTCTAA TAGCAAAATA GCTGTCATTG ATACAGAAAC ATCCTCATTT TTAAATGTCT 11220 GCTTTACCCT GTTACTGAGT TTGAGATGAC TTAAATCACT GTGTTGACCC TCTTCTGAAC 11280 CAAATCTTTA GCATTGATGA AAATAGTTAT TTTATTCTTT ACATCCTTCA CCCCACACTA 11340 TGGTCAGGGC ATGAAACACC CTGTTGATCC CTTCCCAGGC TCGGCACTGT CTGCTCACTG 11400 GAGCCGGACT CCCAGGTTGT AATTCTAATG TTGCCTCATG AGAACAGAAT GGCAGAAAGT 11460 TTAGTCCTGA CAGATTCCCC CATAGGGAGT AATGAGGACA GCATGAAACT TGGATAGGTT 11520 TTACCCTTAG TCCCTATAAG GTGGATTTTA CTAAGGTTTT TTAAATGATA CTGTCATCCT 11580 CTTGGGGTTT ATCAGCCAGG TTAGAGGAGC CCAGTGTCCT AACCTCTCTC AGATCATGGC 11640 AGAGAAGGAG CTGCCTCCAG CCCCTTTCTT GCTGAGTTTC ATTTGAGCAG TTCCATGTGT 11700 AGACATTCCA AGTCACTGCT TGGTAGTTGC TGTGGGAGCC TGTCATTGGC TATGGCCAGT 11760 TAGTTCTCAG 
CTGAGCTTCC TAGGGCCAGT GCAACAGGGC CAGAGGCTGC TATAGTGTAA 11820 ATTGAAATAA GAATAGATCA TTGTTTTGTA CACACACACA ATAAAATGTA ATGATGGTGC 11880 TAATTTCACG GTATAAATAA GCACTGCCAA GGGTTGAGGG ACTGGCAGCT CAAGAAACCC 11940 GGGTTCCTGT TTGGGAGGAG ATTTTATGTA GAAAAGTTTG AGGCTTTGTT AAAAGTGGGG 12000 AGAAGGAAGA TCCTCAGTGA AGCCTGCACC CAACCCTGGA GTGGCCCAGT GCAATCCAGA 12060 GGTGGAAGAG ATCCTATATC CAGGTGAAGG TGGCCATTGA GTTTCTCAGG GCTGGGGCCA 12120 CCTTGTCCAT AGCCTCCGTC CACGCTGCCT GGAGCAGGTT GTTAGAGAGC TCTGGTTGTT 12180 GGGTCTTCCT CAGCTCCCTT CTGCCCCTCT CTACCTCTTC CACTCATGGA AGCCCCTCTA 12240 CTGCTTATGA AGATTAAGGG TAGTATTTTC TAAGGAAGTG GAAAGAATTA AACTAGAAAT 12300 CCACAACCTC GGAAGAAGTG TTTCGAGTTT AACATGCGCT GTTTCTGCTT ATGTGGTTCC 12360 TTCTCTAGAG CTGCTTTCCC ATGGCTTTCA AAACATCAGG TTATTGTGGG GCTTCAGGTG 12420 TAAGGTCCTG GAAGTTCAGC AAAGTTTCGT GGACAAGACA TGGGCACAGA GAGTAGAAGC 12480 AGAAATAAAT GGTTCTATGT TTTCAACTTC CAGGGTTGGG GCAGGCCAGA GCAAGGCGGT 12540 CTCATCGAGG TGGGTGCTAC CTGTGTGTGT GTAGATGAGT GTGCTGAAGG TGGGGAGGGC 12600 AGCACACAGC AGCTCATGGC AGAGCCGCCT CCTAGGTCTT GGCAAAGAGG CAAGCTGACG 12660 ATAGACATCT ACCTATATTG TTAAGAAAGG GGTCGGGGGG ATCAGCCAAG GTCCATCATT 12720 GCTTTTTTGC CGCGCCCCCC CCCCCCCGCC CCCATAGATT GTCAGCTGTA AGTGAAACTC 12780 CTAGTGAAAA AGAGGGGAGC CCTGTGTTAG GAGTCCCCAT AAACATGTAC TGTAATTCTT 12840 TGTATATAGA AAAAAAATTT ACTGTAAAGT AAAGTTTAAC TTTACTCATA TA
12893
Nucleotide Fasta Sequence
>ENSP00000395929.2|PHD|Homo sapiens GCTGGCCCGGGAGGGGGCGCGGGGCACGGTTGATGCCGGCCCAGGATGGATCAGACCTGTGAACTACCCAGAAGAAATTGTCTGCTGCCCTTTTCCAATCCAGTGAATTTAGATGCCCCTGAAGACAAGGACAGCCCTTTCGGTAATGGTCAATCCAATTTTTCTGAGCCACTTAATGGGTGTACTATGCAGTTATCGACTGTCAGTGGAACATCCCAAAATGCTTATGGACAAGATTCTCCATCTTGTTACATTCCACTGCGGAGACTACAGGATTTGGCCTCCATGATCAATGTAGAGTATTTAAATGGGTCTGCTGATGGATCAGAATCCTTTCAAGACCCTGAAAAAAGTGATTCAAGAGCTCAGACGCCAATTGTTTGCACTTCCTTGAGTCCTGGTGGTCCTACAGCACTTGCTATGAAACAGGAACCCTCTTGTAATAACTCCCCTGAACTCCAGGTAAAAGTAACAAAGACTATCAAGAATGGCTTTCTGCACTTTGAGAATTTTACTTGTGTGGACGATGCAGATGTAGATTCTGAAATGGACCCAGAACAGCCAGTCACAGAGGATGAGAGTATAGAGGAGATCTTTGAGGAAACTCAGACCAATGCCACCTGCAATTATGAGACTAAATCAGAGAATGGTGTAAAAGTGGCCATGGGAAGTGAACAAGACAGCACACCAGAGAGTAGACACGGTGCAGTCAAATCGCCATTCTTGCCATTAGCTCCTCAGACTGAAACACAGAAAAATAAGCAAAGAAATGAAGTGGACGGCAGCAATGAAAAAGCAGCCCTTCTCCCAGCCCCCTTTTCACTAGGAGACACAAACATTACAATAGAAGAGCAATTAAACTCAATAAATTTATCTTTTCAGGATGATCCAGATTCCAGTACCAGTACATTAGGAAACATGCTAGAATTACCTGGAACTTCATCATCATCTACTTCACAGGAATTGCCATTTTGTCAACCTAAGAAAAAGTCTACGCCACTGAAGTATGAAGTTGGAGATCTCATCTGGGCAAAATTCAAGAGACGCCCATGGTGGCCCTGCAGGATTTGTTCTGATCCGTTGATTAACACACATTCAAAAATGAAAGTTTCCAACCGGAGGCCCTATCGGCAGTACTACGTGGAGGCTTTTGGAGATCCTTCTGAGAGAGCCTGGGTGGCTGGAAAAGCAATCGTCATGTTTGAAGGCAGACATCAATTCGAAGAGCTACCTGTCCTTAGGAGAAGAGGGAAACAGAAAGAAAAAGGATATAGGCATAAGGTTCCTCAGAAAATTTTGAGTAAATGGGAAGCCAGTGTTGGACTTGCAGAACAGTATGATGTTCCCAAGGGGTCAAAGAACCGAAAATGTATTCCTGGTTCAATCAAGTTGGACAGTGAAGAAGATATGCCATTTGAAGACTGCACAAATGATCCTGAGTCAGAACATGACCTGTTGCTTAATGGCTGTTTGAAATCACTGGCTTTTGATTCTGAACATTCTGCAGATGAGAAGGAAAAGCCTTGCGCTAAATCTCGAGCCAGAAAGAGCTCTGATAATCCAAAAAGGACTAGTGTGAAAAAGGGCCACATACAATTTGAAGCACATAAAGATGAACGGAGGGGAAAGATTCCAGAGAACCTTGGCCTAAACTTTATCTCTGGGGATATATCTGATACGCAGGCCTCTAATGAACTTTCCAGGATAGCAAATAGCCTCACAGGGTCCAACACTGCCCCAGGAAGTTTTCTGTTTTCTTCCTGTGGAAAAAACACTGCAAAGAAAGAATTTGAGACTTCAAATGGTGACTCTTTATTGGGCTTGCCTGAGGGTGCTTTGATCTCAAAGTGTTCTCGAGAGAAGAATAAACCCCAACGAAGCCTGGTGTGTGGTTCAAAAGTGAAGCTCTGCTATATTGGAGCAGGTGATGAGGAAAAGCGAAGTGATTCCATTAGTATCTGTAC
CACTTCTGATGATGGAAGCAGTGACCTGGATCCCATAGAACACAGCTCAGAGTCTGATAACAGTGTCCTTGAAATTCCAGATGCTTTCGATAGAACAGAGAACATGTTATCTATGCAGAAAAATGAAAAGATAAAGTATTCTAGGTTTGCTGCCACAAACACTAGGGTAAAAGCAAAACAGAAGCCTCTCATTAGTAACTCACATACAGACCACTTAATGGGTTGTACTAAGAGTGCAGAGCCTGGAACCGAGACGTCTCAGGTTAATCTCTCTGATCTGAAGGCATCTACTCTTGTTCACAAACCCCAGTCAGATTTTACAAATGATGCTCTCTCTCCAAAATTCAACCTGTCATCAAGCATATCCAGTGAGAACTCGTTAATAAAGGGTGGGGCAGCAAATCAAGCTCTATTACATTCGAAAAGCAAACAGCCCAAGTTCCGAAGTATAAAGTGCAAACACAAAGAAAATCCAGTTATGGCAGAACCCCCAGTTATAAATGAGGAGTGCAGTTTGAAATGCTGCTCTTCTGATACCAAAGGCTCTCCTTTGGCCAGCATTTCTAAAAGTGGGAAAGTGGATGGTCTAAAACTACTGAACAATATGCATGAGAAAACCAGGGATTCAAGTGACATAGAAACAGCAGTGGTGAAACATGTTTTATCCGAGTTGAAGGAACTCTCTTACAGATCCTTAGGTGAGGATGTCAGTGACTCTGGAACATCAAAGCCATCAAAACCATTACTTTTCTCTTCTGCTTCTAGTCAGAATCACATACCTATTGAACCAGACTACAAATTCAGTACATTGCTAATGATGTTGAAAGATATGCATGATAGTAAGACGAAGGAGCAGCGGTTGATGACTGCTCAAAACCTGGTCTCTTACCGGAGTCCTGGTCGTGGGGACTGTTCTACTAATAGTCCTGTAGGAGTCTCTAAGGTTTTGGTTTCAGGAGGCTCCACACACAATTCAGAGAAAAAGGGAGATGGCACTCAGAACTCCGCCAATCCTAGCCCTAGTGGGGGTGACTCTGCATTATCTGGCGAGTTGTCTGCTTCCCTACCTGGCTTACTGTCCGACAAGAGAGACCTCCCTGCTTCTGGTAAAAGTCGTTCAGACTGTGTTACTAGGCGCAACTGTGGACGATCAAAGCCTTCATCCAAATTGCGAGATGCTTTTTCAGCCCAAATGGTAAAGAACACAGTGAACCGTAAAGCCTTAAAGACCGAGCGCAAAAGAAAACTGAATCAGCTTCCAAGTGTGACTCTTGATGCTGTACTGCAGGGAGACCGAGAACGTGGAGGTTCATTGAGAGGTGGGGCAGAAGATCCTAGTAAAGAGGATCCCCTTCAGATAATGGGCCACTTAACAAGTGAAGATGGTGACCATTTTTCTGATGTGCATTTCGATAGCAAGGTTAAGCAATCTGATCCTGGTAAAATTTCTGAAAAAGGACTCTCTTTTGAAAACGGAAAAGGCCCAGAGCTGGACTCTGTAATGAACAGTGAGAATGATGAACTCAATGGTGTAAATCAAGTGGTGCCTAAAAAGCGGTGGCAGCGTTTAAACCAAAGGCGCACTAAACCTCGTAAGCGCATGAACAGATTTAAAGAGAAAGAAAACTCTGAGTGTGCCTTTAGGGTCTTACTTCCTAGTGACCCTGTGCAGGAGGGGCGGGATGAGTTTCCAGAGCATAGAACTCCTTCAGCAAGCATACTTGAGGAACCACTGACAGAGCAAAATCATGCTGACTGCTTAGATTCAGCTGGGCCACGGTTAAATGTTTGTGATAAATCCAGTGCCAGCATTGGTGACATGGAAAAGGAGCCAGGAATTCCCAGTTTGACACCACAGGCTGAGCTCCCTGAACCAGCTGTGCGGTCAGAGAAGAAACGCCTTAGGAAGCCAAGCAAGTGGCTTTTGGAATATACAGAAGAATATGATCAGATATTTGCTCCTAAGAAAAAACAAAAGAAGGTACAGGAGCAGGTGCACA
AGGTAAGTTCCCGCTGTGAAGAGGAAAGCCTTCTAGCCCGAGGTCGATCTAGTGCTCAGAACAAGCAGGTGGACGAGAATTCTTTGATTTCAACCAAAGAAGAGCCTCCAGTTCTTGAAAGGGAGGCTCCGTTTTTGGAGGGCCCCTTGGCTCAGTCAGAACTTGGAGGTGGACATGCTGAGTTGCCGCAGCTGACCTTGTCTGTGCCTGTGGCTCCGGAAGTCTCTCCACGGCCTGCCCTTGAGTCTGAGGAATTGCTAGTTAAAACGCCAGGAAATTATGAAAGTAAACGTCAAAGAAAACCAACTAAGAAACTTCTTGAATCCAATGATTTAGACCCTGGATTTATGCCCAAGAAGGGGGACCTTGGCCTTTCTAAAAAGTGCTATGAAGCTGGTCACCTGGAGAATGGCATAACTGAATCTTGTGCCACATCTTATTCAAAAGATTTTGGTGGAGGCACTACCAAGATATTTGACAAGCCAAGGAAGCGAAAACGACAGAGGCATGCTGCAGCCAAGATGCAGTGTAAAAAAGTGAAAAATGATGACTCGTCAAAAGAGATTCCAGGCTCAGAGGGAGAACTAATGCCTCACAGGACGGCCACAAGCCCCAAGGAGACTGTTGAGGAAGGTGTAGAACACGATCCCGGGATGCCTGCCTCTAAAAAAATGCAGGGTGAACGCGGTGGAGGAGCTGCACTCAAGGAGAATGTCTGTCAGAATTGTGAAAAATTGGGTGAGCTGCTGTTATGTGAGGCTCAGTGCTGTGGGGCTTTCCACCTGGAGTGCCTTGGATTGACTGAGATGCCAAGAGGAAAATTTATCTGCAATGAATGTCGCACAGGAATCCATACCTGTTTTGTATGTAAGCAGAGTGGGGAAGATGTTAAAAGGTGCCTTCTACCCTTGTGTGGAAAGTTTTACCATGAAGAGTGTGTCCAGAAGTACCCACCCACTGTTATGCAGAACAAGGGCTTCCGGTGCTCCCTCCACATCTGTATAACCTGTCATGCTGCTAATCCAGCCAATGTTTCTGCATCTAAAGGTCGGTTGATGCGCTGTGTCCGCTGTCCTGTGGCATACCACGCCAATGACTTTTGCCTGGCTGCTGGGTCAAAGATCCTTGCATCTAATAGTATCATCTGCCCTAATCACTTTACCCCTAGGCGGGGCTGCCGAAATCATGAGCATGTTAATGTTAGCTGGTGCTTTGTGTGCTCAGAAGGAGGCAGCCTTCTGTGCTGTGATTCTTGCCCTGCTGCTTTTCATCGTGAATGCCTGAACATTGATATCCCTGAAGGAAACTGGTATTGCAATGACTGTAAAGCAGGCAAAAAGCCACACTACAGGGAGATTGTCTGGGTAAAAGTTGGACGATACAGGTGGTGGCCAGCTGAGATCTGCCATCCTCGAGCTGTTCCTTCCAACATTGATAAGATGAGACATGATGTGGGAGAGTTCCCAGTCCTCTTTTTTGGATCTAATGACTATTTGTGGACTCACCAGGCCCGAGTCTTCCCTTACATGGAGGGTGACGTGAGCAGCAAGGATAAGATGGGCAAAGGAGTGGATGGGACATATAAAAAAGCTCTTCAGGAAGCTGCAGCAAGGTTTGAGGAATTAAAGGCCCAAAAAGAGCTAAGACAGCTGCAGGAAGACCGAAAGAATGACAAGAAGCCACCACCTTATAAACATATAAAGGTAAACCGTCCTATTGGCAGGGTACAGATCTTCACTGCAGACTTATCTGAAATACCCCGTTGCAACTGTAAAGCTACTGATGAGAACCCCTGTGGGATAGACTCTGAATGCATCAACCGCATGCTGCTCTATGAGTGCCACCCCACAGTGTGTCCTGCCGGAGGGCGCTGTCAAAACCAGTGCTTTTCCAAGCGCCAATATCCAGAGGTTGAAATTTTCCGCACATTACAGCGGGGTTGGGGTCTACGGACAAAAACAGATATTAAAAAGGGTGAATTTGTGAATGAGTATGTGGGT
GAGCTTATAGATGAAGAAGAATGCAGAGCTCGAATTCGCTATGCTCAAGAACATGATATCACTAATTTCTATATGCTCACCCTAGACAAAGACCGAATCATTGATGCTGGTCCCAAAGGAAACTATGCTCGGTTCATGAATCATTGCTGCCAGCCCAACTGTGAAACACAGAAGTGGTCTGTGAATGGAGATACCCGTGTAGGCCTTTTTGCACTAAGTGACATTAAAGCAGGCACTGAACTTACCTTCAACTACAACCTAGAATGTCTTGGGAATGGAAAGACTGTTTGCAAATGTGGAGCCCCGAACTGCAGTGGCTTCTTGGGTGTAAGGCCAAAGAATCAACCCATTGCCACGGAAGAAAAGTCAAAGAAATTCAAGAAGAAGCAACAGGGAAAGCGCAGGACCCAGGGTGAAATCACAAAGGAGCGAGAAGATGAGTGTTTTAGTTGTGGGGATGCTGGCCAGCTCGTCTCCTGCAAGAAACCAGGCTGCCCAAAAGTTTACCACGCAGACTGTCTCAATCTGACCAAGCGACCAGCAGGGAAATGGGAATGTCCGTGGCATCAGTGTGACATCTGCGGGAAGGAAGCAGCCTCCTTCTGTGAGATGTGCCCCAGCTCCTTTTGTAAGCAGCATCGAGAAGGGATGCTTTTCATTTCCAAACTGGATGGGCGTCTGTCTTGTACTGAGCATGACCCCTGTGGGCCCAATCCTCTGGAACCTGGGGAGATCCGTGAGTATGTGCCTCCCCCAGTACCGCTGCCTCCAGGGCCAAGCACTCACCTGGCAGAGCAATCAACAGGAATGGCTGCTCAGGCACCCAAAATGTCAGATAAACCTCCTGCTGACACCAACCAGATGCTGTCGCTCTCCAAAAAAGCTCTGGCAGGGACTTGTCAGAGGCCATTGCTACCTGAAAGACCTCTTGAGAGAACTGACTCCAGGCCCCAGCCTTTAGATAAGGTCAGAGACCTCGCTGGGTCAGGGACCAAATCCCAATCCTTGGTTTCCAGCCAGAGGCCACTGGACAGGCCACCAGCAGTGGCAGGACCAAGACCCCAGCTAAGCGACAAACCCTCTCCAGTGACCAGCCCAAGCTCCTCACCCTCAGTCAGGTCCCAACCACTGGAAAGACCTCTGGGGACGGCTGACCCAAGGCTGGATAAATCCATAGGTGCTGCCAGCCCAAGGCCCCAGTCACTGGAGAAAACCTCAGTTCCCACTGGCCTGAGACTTCCGCCGCCAGACAGACTGCTCATTACTAGCAGTCCCAAACCCCAGACTTCAGACAGGCCTACTGACAAACCCCATGCCTCTTTGTCCCAGAGACTCCCACCTCCTGAGAAAGTACTATCAGCTGTGGTCCAGACCCTTGTAGCTAAAGAAAAAGCACTGAGGCCTGTGGACCAGAATACTCAGTCAAAAAATAGAGCTGCTTTGGTGATGGATCTCATAGACCTAACTCCTCGCCAGAAGGAGCGGGCAGCTTCACCTCATCAGGTCACACCACAGGCTGATGAGAAGATGCCAGTGTTGGAGTCAAGTTCATGGCCTGCCAGCAAAGGTCTGGGGCATATGCCGAGAGCTGTTGAGAAAGGCTGTGTGTCAGATCCTCTTCAGACATCTGGGAAAGCAGCAGCCCCTTCAGAGGACCCCTGGCAAGCTGTTAAATCACTCACCCAGGCCAGACTTCTTTCTCAGCCTCCTGCCAAGGCCTTTTTATATGAGCCAACAACTCAGGCCTCAGGAAGAGCTTCTGCAGGGGCTGAGCAGACCCCAGGGCCTCTTAGCCAATCCCCGGGCCTGGTGAAGCAGGCGAAGCAGATGGTCGGAGGCCAGCAACTACCTGCACTTGCCGCCAAGAGTGGGCAATCTTTTAGGTCTCTCGGGAAGGCCCCAGCCTCCCTCCCCACTGAAGAAAAGAAGTTGGTAACCACAGAGCAAAGTCCCTGGGCCCTGGGAAAAGCCTCATCACGGGCAGGGCTCTGGCCCATAGT
GGCTGGACAGACACTGGCACAGTCTTGCTGGTCTGCTGGGAGCACACAGACATTGGCACAGACTTGCTGGTCTCTTGGAAGAGGGCAAGACCCCAAACCAGAGCAAAATACACTTCCAGCTCTTAACCAGGCTCCTTCCAGTCACAAGTGTGCAGAATCAGAACAGAAGTAGTACCAATCAATGTCACATGAACAAACAAGCTGCCCCCAGGGTACCATTTGGGGAGGGGAAATCTTTTCTTTCTTTCCCCCTTAAAAAAAAACACATCTGCCCCGAACACTTTCCCACTGTTATTCTTTCCTCATATCCCAACACTCAGAACTCTTGTGACATTAGCCAGTGGGGGCTTATGGTTGTGTGAACCATGTATGAAAATCCAGTGGGCCCCAACCAAGGAGACAGACAGACTTGGGTCTCTTTCCCCCAACTTTTCCACATGGTCATCGTGAAATAAAAAGTCCACTCTGGAGTCAAGTATGGAATTCAATTCCGCTGGTCAGGTTGGAAGGTATAGGGGCTCTCAAAGCGATTTCCCCAACCAGACAGAGCCCCATTGAGGGCACCTAGGAACCCTTGGGAGGAAATGGTGTTCTTTCAAATCAGTGGCGATTTCCTGAGCATTCACGTGTTCTAGGCCGGGTGCTAGTCACTGATGAGAGATACAGGCCTCATCCCTGTGAGCCTGGATTCCAAGGCTTTCAGGAACCTTTGACCAGGAAGTAACAGGAAGTTCTGAGGGGCCCTGGGGCTTTAGACTCATTTTGAAATGTCCTTTGTGGCACCAGAAGTGGTTGTGTTGAGGAAGTGTCTCTTGGCTGCGGTGTGCATGGGTGCGTGTGCATGCGCGCACACTCACAGAGGTCTCCTCTATAGATGCAAGGGTGCTGCATTGAGGCCAGCAAGGCTGTTGGCTGTGGGGTCGCCGCTGCTGCTTTTGTCTGGGCTGTGCAGAGTCTCAAGATCAGTCCTTGGAGGAGCAGGTGGTCAGGGGCAGTCGGGCTCTGTGCGAATGTAGATTTCCAGCAGTGGAAGAAGGCATTTGGCAAGCTTCTCTTTCTTTGCTTTTGTTTCTACCTATTTTTCTCTTTGTACATGAATCCACCCCATCCCTATTTCCCTAAAACACTCAGGTGCTTTCAGATTTCAGAGCCTCGGGCAGTGGACATAGGGAATCTCTGGCAAGCTCTGAGCTAGACACACCAGCTTCAGGAAGAGTACCAGATCCTGATGGGAAATTTCTTTTCCCCATTCCTTTTCCCTCCTGAGTGGAGGGAGTCCTCTTCTTCGCCTCCCTGAGAATTGCTGTGCTCTGTATTGAGAGCACCTGCCTGCTGACTTAGCTCAAAGGCAAGCCAGAACCCTTCCCTGAAGACTGGCAAGAGGTGGTGTTTAGAGCAACGTCCAGGCTAAGAGATGACTCCTATTAACTGCTGATTATCTGTTACTGCTGCCCTGAGCTGGGGCCCAAGGGCTGGGAAATCTGTTGGTGCTACCCTGCCCTACCATTCACCCAGCTCACAGACTGCCAACAGGAAGTGCTGTTTGGCTAGTTTCCTCCCACTTGTCTACCCCTCCTTTGTCCTTAGACCAACATGTTTACCTCTCTGCTTTGCCAACTTAGCCAGCAGGCCATCCCCGGCCCTAACGTCTCCTGGCCATTATCTCTTAGTTATGGCTTTCACGCTCTCAATAGGATTCTGTATTTGGTCCCAATTTCCTCAAGTTCTTATTGAGGTTACTCCCATCAATTCCACGGAGGGAACAGTAGTTATTATAGAAGCATTTGCGCTTTATCTAAAGATTAAAAATAGAATCTGCTTTTATTTCCCAAAGTCTGTCTCTGAGGTTGAGACACTTGAACTCAGGCAGAGGGACGAGGCTGGGCAGGGCTGTCCTGAGTTTAGGGGCCTATCCCTGCATTTCACTGAGACCTCGGAATCTCCTCTGTGAATTCCACCTGCCTAGTTCTCCCCTTTCATCCTCTCTCTCTTCCCACATC
ATCAAAGAGGAAAAGCTCTTTGTTCAAAAGGAAGAGAAAACGTAAAGCATCTTATTTTCTTTTAAAAGAATTTTAAACCATGAAAAAGATATTTTTAAAGAAATTCACCGAGAACATTAAAGTTCATTATATTAAGTATTTATCATGTGTGAGAATAATAAATATATAACTGCAGCTAGTAGGTCCCTTTCCCTAATCTTTTAGGTCATATGAGTAGGGTTTGCTTGGTGCCAGTCCTGTGCCCTTTTCTCTCCAGTCATCTGTAGTTGTGATCAGAAAAAGGTATCTGCACTGCACTGTCAGAGTCTCCTTTCACTATGTTGTGTGTTAAATTACCGTAGCTCTTTGTTTCATGAAATAAACTGTGAATTTGGGGGGGGCGGGGGGAGGGCGTGCAGGCCATGTAAAAATTTTCCGTGGAGAAGTTTGATTCTAAAGTAGCTTCTCTAAAGTAGGCTTTGGTAGGTAATCAACTTGACAGCAGTCTAGATGTCTCACAGGACAGGAGGGAGTGAGGGAAAGGGGCCATGATTGGCTGCTTTGTGGTTTTATTTTGGTTCTTTCCATTCTCCGCCATTCATTGGAGGCTTCGTTCCAGACCTGCCTGGGAAAACAGCTTCTGAGCCATTTTGGGGAGCAGTTCTTCATCTGAATGGATGGACATCTGGGCTTCCTTCAAGGGCCATTGAATGGGAACTAGAAAACCACTGGAAACTAGAAATTTGAGCTATTGGGCCCACCAGTAGCAGCATGTGATACTAGATGGTTAAAATCATGAAAGCAGTCACTATCCAATTAGAAGCAGAGTCACAACAACTGTTGGGAAATGTGACTCTTGGAGGAAGGTGGGGAGGGAGTGGCCTTGCCAGCCCTGTGGGACGTCCCCTGAAGTTTGTAATAAGACCCCTTTTCCAAAGGGATGTGAATTGGAGTGAAAAGGAAATCTTTCATCTTAGAAAACTTCTGGTCCTTAACGCAGGGTGGTATTTGGGTATGTGCTTGGAAATTGAGATCTCAAGAGTGTTTGCCTTGGAGCCAGCTCCCCAGGAGGCCTTTTCCAGGGACAAGGCAAAAGTTGAAATTCTCCATGGGTAGCTAGAAAGCCAATACATCTAGCCCTGCTAAGTCAGAAAAAGATTATGAAAAATGTTGAAATTTACATTCAAAGCCTCATTTGCTTATCTTGCTGGAGCCAACCCAGTCTAATAGCAAAATAGCTGTCATTGATACAGAAACATCCTCATTTTTAAATGTCTGCTTTACCCTGTTACTGAGTTTGAGATGACTTAAATCACTGTGTTGACCCTCTTCTGAACCAAATCTTTAGCATTGATGAAAATAGTTATTTTATTCTTTACATCCTTCACCCCACACTATGGTCAGGGCATGAAACACCCTGTTGATCCCTTCCCAGGCTCGGCACTGTCTGCTCACTGGAGCCGGACTCCCAGGTTGTAATTCTAATGTTGCCTCATGAGAACAGAATGGCAGAAAGTTTAGTCCTGACAGATTCCCCCATAGGGAGTAATGAGGACAGCATGAAACTTGGATAGGTTTTACCCTTAGTCCCTATAAGGTGGATTTTACTAAGGTTTTTTAAATGATACTGTCATCCTCTTGGGGTTTATCAGCCAGGTTAGAGGAGCCCAGTGTCCTAACCTCTCTCAGATCATGGCAGAGAAGGAGCTGCCTCCAGCCCCTTTCTTGCTGAGTTTCATTTGAGCAGTTCCATGTGTAGACATTCCAAGTCACTGCTTGGTAGTTGCTGTGGGAGCCTGTCATTGGCTATGGCCAGTTAGTTCTCAGCTGAGCTTCCTAGGGCCAGTGCAACAGGGCCAGAGGCTGCTATAGTGTAAATTGAAATAAGAATAGATCATTGTTTTGTACACACACACAATAAAATGTAATGATGGTGCTAATTTCACGGTATAAATAAGCACTGCCAAGGGTTGAGGGACTGGCAGCTCAAGAAACCCGGGTTCCTGTTTGGGAGGAGATTT
TATGTAGAAAAGTTTGAGGCTTTGTTAAAAGTGGGGAGAAGGAAGATCCTCAGTGAAGCCTGCACCCAACCCTGGAGTGGCCCAGTGCAATCCAGAGGTGGAAGAGATCCTATATCCAGGTGAAGGTGGCCATTGAGTTTCTCAGGGCTGGGGCCACCTTGTCCATAGCCTCCGTCCACGCTGCCTGGAGCAGGTTGTTAGAGAGCTCTGGTTGTTGGGTCTTCCTCAGCTCCCTTCTGCCCCTCTCTACCTCTTCCACTCATGGAAGCCCCTCTACTGCTTATGAAGATTAAGGGTAGTATTTTCTAAGGAAGTGGAAAGAATTAAACTAGAAATCCACAACCTCGGAAGAAGTGTTTCGAGTTTAACATGCGCTGTTTCTGCTTATGTGGTTCCTTCTCTAGAGCTGCTTTCCCATGGCTTTCAAAACATCAGGTTATTGTGGGGCTTCAGGTGTAAGGTCCTGGAAGTTCAGCAAAGTTTCGTGGACAAGACATGGGCACAGAGAGTAGAAGCAGAAATAAATGGTTCTATGTTTTCAACTTCCAGGGTTGGGGCAGGCCAGAGCAAGGCGGTCTCATCGAGGTGGGTGCTACCTGTGTGTGTGTAGATGAGTGTGCTGAAGGTGGGGAGGGCAGCACACAGCAGCTCATGGCAGAGCCGCCTCCTAGGTCTTGGCAAAGAGGCAAGCTGACGATAGACATCTACCTATATTGTTAAGAAAGGGGTCGGGGGGATCAGCCAAGGTCCATCATTGCTTTTTTGCCGCGCCCCCCCCCCCCCGCCCCCATAGATTGTCAGCTGTAAGTGAAACTCCTAGTGAAAAAGAGGGGAGCCCTGTGTTAGGAGTCCCCATAAACATGTACTGTAATTCTTTGTATATAGAAAAAAAATTTACTGTAAAGTAAAGTTTAACTTTACTCATATA
|
Sequence Source |
Ensembl |
Keyword |
KW-0002--3D-structure; KW-0010--Activator; KW-0025--Alternative splicing; KW-0156--Chromatin regulator; KW-0160--Chromosomal rearrangement; KW-0158--Chromosome; KW-0181--Complete proteome; KW-0225--Disease mutation; KW-1017--Isopeptide bond; KW-0479--Metal-binding; KW-0489--Methyltransferase; KW-0539--Nucleus; KW-0597--Phosphoprotein; KW-0621--Polymorphism; KW-0656--Proto-oncogene; KW-1185--Reference proteome; KW-0677--Repeat; KW-0678--Repressor; KW-0949--S-adenosyl-L-methionine; KW-0804--Transcription; KW-0805--Transcription regulation; KW-0808--Transferase; KW-0832--Ubl conjugation; KW-0862--Zinc; KW-0863--Zinc-finger
|
Interpro |
IPR006560--AWS_dom; IPR003616--Post-SET_dom; IPR000313--PWWP_dom; IPR001214--SET_dom; IPR019786--Zinc_finger_PHD-type_CS; IPR011011--Znf_FYVE_PHD; IPR001965--Znf_PHD; IPR019787--Znf_PHD-finger; IPR013083--Znf_RING/FYVE/PHD
|
PROSITE |
PS51215--AWS; PS50868--POST_SET; PS50812--PWWP; PS50280--SET; PS01359--ZF_PHD_1; PS50016--ZF_PHD_2
|
Pfam |
PF00855--PWWP; PF00856--SET
|
Gene Ontology |
GO:0005694--C:chromosome; GO:0005654--C:nucleoplasm; GO:0050681--F:androgen receptor binding; GO:0003682--F:chromatin binding; GO:0030331--F:estrogen receptor binding; GO:0046975--F:histone methyltransferase activity (H3-K36 specific); GO:0042799--F:histone methyltransferase activity (H4-K20 specific); GO:0018024--F:histone-lysine N-methyltransferase activity; GO:0042974--F:retinoic acid receptor binding; GO:0046965--F:retinoid X receptor binding; GO:0000979--F:RNA polymerase II core promoter sequence-specific DNA binding; GO:0046966--F:thyroid hormone receptor binding; GO:0003712--F:transcription cofactor activity; GO:0003714--F:transcription corepressor activity; GO:0008270--F:zinc ion binding; GO:0010452--P:histone H3-K36 methylation; GO:0034770--P:histone H4-K20 methylation; GO:0034968--P:histone lysine methylation; GO:0016571--P:histone methylation; GO:0000122--P:negative regulation of transcription from RNA polymerase II promoter; GO:0045893--P:positive regulation of transcription, DNA-templated; GO:0000414--P:regulation of histone H3-K36 methylation; GO:0033135--P:regulation of peptidyl-serine phosphorylation; GO:1903025--P:regulation of RNA polymerase II regulatory region sequence-specific DNA binding; GO:0006351--P:transcription, DNA-templated
|
Orthology |
|
Created Date |
25-Jun-2016 |