WERAM Information


Tag                    Content
WERAM ID               WERAM-Hos-0208
Ensembl Protein ID     ENSP00000478672.1
Uniprot Accession      P55895; RAG2_HUMAN; A8K9E9; Q8TBL4
Genbank Protein ID     NP_000527.2; NP_001230714.1; NP_001230715.1
Protein Name           V(D)J recombination-activating protein 2
Genbank Nucleotide ID  NM_000536.3; NM_001243785.1; NM_001243786.1
Gene Name              RAG2
Ensembl Information
Ensembl Gene ID     Ensembl Transcript ID  Ensembl Protein ID
ENSG00000175097.7   ENST00000618712.4      ENSP00000478672.1
ENSG00000175097.7   ENST00000311485.7      ENSP00000308620.3
Details
Type       Family  Domain    Substrates  AA  References (PMIDs)
Me_Reader  PHD     PHD-type  H3K4me3     K   18025461
Status Reviewed
Classification
Type       Family  E-value   Score  Start  End
Me_Reader  PHD     1.60e-08  36.8   418    483
Organism Homo sapiens
NCBI Taxa ID 9606
Functional Description
Core component of the RAG complex, a multiprotein complex that mediates the DNA cleavage phase during V(D)J recombination. V(D)J recombination assembles a diverse repertoire of immunoglobulin and T-cell receptor genes in developing B and T-lymphocytes through rearrangement of different V (variable), in some cases D (diversity), and J (joining) gene segments. DNA cleavage by the RAG complex occurs in 2 steps: a first nick is introduced in the top strand immediately upstream of the heptamer, generating a 3'-hydroxyl group that can attack the phosphodiester bond on the opposite strand in a direct transesterification reaction, thereby creating 4 DNA ends: 2 hairpin coding ends and 2 blunt, 5'-phosphorylated ends. The chromatin structure plays an essential role in the V(D)J recombination reactions and the presence of histone H3 trimethylated at 'Lys-4' (H3K4me3) stimulates both the nicking and hairpinning steps. The RAG complex also plays a role in pre-B cell allelic exclusion, a process leading to expression of a single immunoglobulin heavy chain allele to enforce clonality and monospecific recognition by the B-cell antigen receptor (BCR) expressed on individual B-lymphocytes. The introduction of DNA breaks by the RAG complex on one immunoglobulin allele induces ATM-dependent repositioning of the other allele to pericentromeric heterochromatin, preventing accessibility to the RAG complex and recombination of the second allele. In the RAG complex, RAG2 is not the catalytic component but is required for all known catalytic activities mediated by RAG1. It probably acts as a sensor of chromatin state that recruits the RAG complex to H3K4me3 (By similarity).
Domain Profile
  Me_Reader PHD

          PHD.txt   2 tiClvCgkddeg.......eke.....mvqCdeCd.dwfHlkCvklp...lsslpeg.kswyCpsCke 52
                      t+C++C ++d +       + e     m++C+++d +w+H++C++l+   l +l++g +++yC++++e
ENSP00000478672.1 418 TCCPTC-DVDINtwvpfysT-ElnkpaMIYCSHGDgHWVHAQCMDLAertLIHLSAGsNKYYCNEHVE 483
                      79****.9999966666650.155566******9999**********8777777766678******86 PP

Protein Sequence
MSLQMVTVSN NIALIQPGFS LMNFDGQVFF FGQKGWPKRS CPTGVFHLDV KHNHVKLKPT 60
IFSKDSCYLP PLRYPATCTF KGSLESEKHQ YIIHGGKTPN NEVSDKIYVM SIVCKNNKKV 120
TFRCTEKDLV GDVPEARYGH SINVVYSRGK SMGVLFGGRS YMPSTHRTTE KWNSVADCLP 180
CVFLVDFEFG CATSYILPEL QDGLSFHVSI AKNDTIYILG GHSLANNIRP ANLYRIRVDL 240
PLGSPAVNCT VLPGGISVSS AILTQTNNDE FVIVGGYQLE NQKRMICNII SLEDNKIEIR 300
EMETPDWTPD IKHSKIWFGS NMGNGTVFLG IPGDNKQVVS EGFYFYMLKC AEDDTNEEQT 360
TFTNSQTSTE DPGDSTPFED SEEFCFSAEA NSFDGDDEFD TYNEDDEEDE SETGYWITCC 420
PTCDVDINTW VPFYSTELNK PAMIYCSHGD GHWVHAQCMD LAERTLIHLS AGSNKYYCNE 480
HVEIARALHT PQRVLPLKKP PMKSLRKKGS GKILTPAKKS FLRRLFD 527
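The Classification table places the PHD domain at residues 418 to 483, and the Domain Profile shows the matching stretch of ENSP00000478672.1. The following is a minimal Python sketch, not part of the WERAM record, of how the numbered sequence block above could be cleaned and sliced to recover that region. The helper names `clean_sequence` and `domain_slice` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, as the alignment labels suggest.

```python
import re

# Protein sequence as printed on this page: blocks of 10 residues with a
# running residue count at the end of each line.
RAW_PROTEIN = """
MSLQMVTVSN NIALIQPGFS LMNFDGQVFF FGQKGWPKRS CPTGVFHLDV KHNHVKLKPT 60
IFSKDSCYLP PLRYPATCTF KGSLESEKHQ YIIHGGKTPN NEVSDKIYVM SIVCKNNKKV 120
TFRCTEKDLV GDVPEARYGH SINVVYSRGK SMGVLFGGRS YMPSTHRTTE KWNSVADCLP 180
CVFLVDFEFG CATSYILPEL QDGLSFHVSI AKNDTIYILG GHSLANNIRP ANLYRIRVDL 240
PLGSPAVNCT VLPGGISVSS AILTQTNNDE FVIVGGYQLE NQKRMICNII SLEDNKIEIR 300
EMETPDWTPD IKHSKIWFGS NMGNGTVFLG IPGDNKQVVS EGFYFYMLKC AEDDTNEEQT 360
TFTNSQTSTE DPGDSTPFED SEEFCFSAEA NSFDGDDEFD TYNEDDEEDE SETGYWITCC 420
PTCDVDINTW VPFYSTELNK PAMIYCSHGD GHWVHAQCMD LAERTLIHLS AGSNKYYCNE 480
HVEIARALHT PQRVLPLKKP PMKSLRKKGS GKILTPAKKS FLRRLFD 527
"""

# Target row of the Domain Profile alignment ('-' marks gaps, lowercase
# letters mark residues inserted relative to the PHD profile).
ALIGNED_TARGET = (
    "TCCPTC-DVDINtwvpfysT-ElnkpaMIYCSHGDgHWVHAQCMDLAertLIHLSAGsNKYYCNEHVE"
)


def clean_sequence(raw: str) -> str:
    """Drop whitespace and line counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


protein = clean_sequence(RAW_PROTEIN)
phd = domain_slice(protein, 418, 483)
degapped_target = ALIGNED_TARGET.replace("-", "").upper()

print(len(protein))            # total residues parsed from the block
print(phd)                     # residues 418..483
print(phd == degapped_target)  # does the slice agree with the alignment row?
```

Comparing the slice with the de-gapped alignment row is a cheap consistency check that the coordinates and the sequence refer to the same isoform.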
Nucleotide Sequence
ATTAGATCAG TGTTCATAAG AACATCTGTA GGCACACATA CACACTCTCT TTACAGTCAG 60
CCTTCTGCTT GCCACAGTCA TAGTGGGCAG TCAGTGAATC TTCCCCAAGT GCTGACAATT 120
AATACCTGGT TTAGCGGCAA AGATTCAGAG AGGCGTGAGC AGCCCCTCTG GCCTTCAGAA 180
TGAAGATGAA AAGAAGATGT GTTTAGGAAC CTTCTCTGCT TCCCTAAAAG AACCCTAAAA 240
ACAGGAGTAA AAACAGTGAA GAAAACAAAG GACCTTCAAA GTGCACTCGT TAAAACCTAA 300
TAACTGTATT TCTATTACTA AATGTAGAAG TGGTAAACAT CTTTAACAGG CAATGTCTCC 360
CCTGCACTCT AGGGATTCAA AGATCCATCT TTCGGTTCTG TAAGACAGTC ACGGCTTTTG 420
TAACCTCGGT GCCCCCTTCA ACCTCCCGCC CCAAGCACCT CCAGGGTCGT CAGGGGTGTA 480
GTTTTGAGTC GCGCTCCTAA GCATCCAGAC AGGCAGGACA CCGTAACGAC ATCTCTGCCG 540
GGAGTCCCTT CAGACTGCGG TCTCCAGACA AAAATCTACG TACCATCAGA AACTATGTCT 600
CTGCAGATGG TAACAGTCAG TAATAACATA GCCTTAATTC AGCCAGGCTT CTCACTGATG 660
AATTTTGATG GACAAGTTTT CTTCTTTGGA CAAAAAGGCT GGCCCAAAAG ATCCTGCCCC 720
ACTGGAGTTT TCCATCTGGA TGTAAAGCAT AACCATGTCA AACTGAAGCC TACAATTTTC 780
TCTAAGGATT CCTGCTACCT CCCTCCTCTT CGCTACCCAG CCACTTGCAC ATTCAAAGGC 840
AGCTTGGAGT CTGAAAAGCA TCAATACATC ATCCATGGAG GGAAAACACC AAACAATGAG 900
GTTTCAGATA AGATTTATGT CATGTCTATT GTTTGCAAGA ACAACAAAAA GGTTACTTTT 960
CGCTGCACAG AGAAAGACTT GGTAGGAGAT GTTCCTGAAG CCAGATATGG TCATTCCATT 1020
AATGTGGTGT ACAGCCGAGG GAAAAGTATG GGTGTTCTCT TTGGAGGACG CTCATACATG 1080
CCTTCTACCC ACAGAACCAC AGAAAAATGG AATAGTGTAG CTGACTGCCT GCCCTGTGTT 1140
TTCCTGGTGG ATTTTGAATT TGGGTGTGCT ACATCATACA TTCTTCCAGA ACTTCAGGAT 1200
GGGCTATCTT TTCATGTCTC TATTGCCAAA AATGACACCA TCTATATTTT AGGAGGACAT 1260
TCACTTGCCA ATAATATCCG GCCTGCCAAC CTGTACAGAA TAAGGGTTGA TCTTCCCCTG 1320
GGTAGCCCAG CTGTGAATTG CACAGTCTTG CCAGGAGGAA TCTCTGTCTC CAGTGCAATC 1380
CTGACTCAAA CTAACAATGA TGAATTTGTT ATTGTTGGTG GCTATCAGCT TGAAAATCAA 1440
AAAAGAATGA TCTGCAACAT CATCTCTTTA GAGGACAACA AGATAGAAAT TCGTGAGATG 1500
GAGACCCCAG ATTGGACCCC AGACATTAAG CACAGCAAGA TATGGTTTGG AAGCAACATG 1560
GGAAATGGAA CTGTTTTTCT TGGCATACCA GGAGACAATA AACAAGTTGT TTCAGAAGGA 1620
TTCTATTTCT ATATGTTGAA ATGTGCTGAA GATGATACTA ATGAAGAGCA GACAACATTC 1680
ACAAACAGTC AAACATCAAC AGAAGATCCA GGGGATTCCA CTCCCTTTGA AGACTCTGAA 1740
GAATTTTGTT TCAGTGCAGA AGCAAATAGT TTTGATGGTG ATGATGAATT TGACACCTAT 1800
AATGAAGATG ATGAAGAAGA TGAGTCTGAG ACAGGCTACT GGATTACATG CTGCCCTACT 1860
TGTGATGTGG ATATCAACAC TTGGGTACCA TTCTATTCAA CTGAGCTCAA CAAACCCGCC 1920
ATGATCTACT GCTCTCATGG GGATGGGCAC TGGGTCCATG CTCAGTGCAT GGATCTGGCA 1980
GAACGCACAC TCATCCATCT GTCAGCAGGA AGCAACAAGT ATTACTGCAA TGAGCATGTG 2040
GAGATAGCAA GAGCTCTACA CACTCCCCAA AGAGTCCTAC CCTTAAAAAA GCCTCCAATG 2100
AAATCCCTCC GTAAAAAAGG TTCTGGAAAA ATCTTGACTC CTGCCAAGAA ATCCTTTCTT 2160
AGAAGGTTGT TTGATTAGTT TTGCAAAAGC CTTTCAGATT CAGGTGTATG GAATTTTTGA 2220
ATCTATTTTT AAAATCATAA CATTGATTTT AAAAATACAT TTTTGTTTAT TTAAAATGCC 2280
TATGTTTTCT TTTAGTTACA TGAATTAAGG GCCAGAAAAA AGTGTTTATA ATGCAATGAT 2340
AAATAAAGTC ATTCTAGACC CTATACATTT TGAAAATATT TTACCCAAAT ACTCAATTTA 2400
CTAATTTATT CTTCACTGAG GATTTCTGAT CTGATTTTTT ATTCAACAAA CCTTAAACAC 2460
CCAGAAGCAG TAATAATCAT CGAGGTATGT TTATATTTAT TATATAAGTC TTGGTAACAA 2520
ATAACCTATA AAGTGTTTAT GACAAATTTA GCCAATAAAG AAATTAACAC CCAAAAGAAT 2580
TAAATTGATT ATTTTGTGCA ACATAACAAT TCGGCAGTTG GCCAAAACTT AAAAGCAAGA 2640
TCTACTACAT CCCACATTAG TGTTCTTTAT ATACCTTCAA GCAACCCTTT GGATTATGCC 2700
CATGAACAAG TTAGTTTCTC ATAGCTTTAC AGATGTAGAT ATAAATATAA ATATATGTAT 2760
ACATATAGAT AGATAATGTT CTCCACTGAC ACAAAAGAAG AAATAAATAA TCTACATC 2819
Sequence Source Ensembl
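The transcript above contains untranslated sequence on both sides of the coding region. As an illustration only, the Python sketch below translates the two printed lines covering nucleotides 541 to 660 with the standard genetic code. The start position 595 is not stated in the record: it is an assumption obtained by matching the first ATG whose translation reproduces the start of the protein sequence (MSLQMVTVSN...). The names `clean_dna` and `translate` are hypothetical helpers.

```python
import re

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Two lines copied from the nucleotide block above (positions 541..660).
FRAGMENT = """
GGAGTCCCTT CAGACTGCGG TCTCCAGACA AAAATCTACG TACCATCAGA AACTATGTCT 600
CTGCAGATGG TAACAGTCAG TAATAACATA GCCTTAATTC AGCCAGGCTT CTCACTGATG 660
"""
FRAGMENT_START = 541  # 1-based position of the first base in FRAGMENT
CDS_START = 595       # assumed 1-based position of the start codon


def clean_dna(raw: str) -> str:
    """Drop whitespace and line counters, keeping only A/C/G/T."""
    return re.sub(r"[^ACGTacgt]", "", raw).upper()


def translate(dna: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


dna = clean_dna(FRAGMENT)
peptide = translate(dna[CDS_START - FRAGMENT_START:])
print(peptide)
```

The same two helpers could be applied to the full 2819-nucleotide block; only the fragment is embedded here to keep the example short.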
Keyword

KW-0156--Chromatin regulator
KW-0181--Complete proteome
KW-0225--Disease mutation
KW-0233--DNA recombination
KW-0479--Metal-binding
KW-0539--Nucleus
KW-0621--Polymorphism
KW-1185--Reference proteome
KW-0705--SCID
KW-0862--Zinc
KW-0863--Zinc-finger

Interpro

IPR011043--Gal_Oxase/kelch_b-propeller
IPR015915--Kelch-typ_b-propeller
IPR004321--RAG2
IPR025162--RAG2_PHD
IPR011011--Znf_FYVE_PHD

PROSITE
Pfam

PF03089--RAG2
PF13341--RAG2_PHD

Gene Ontology

GO:0005654--C:nucleoplasm
GO:0003682--F:chromatin binding
GO:0003677--F:DNA binding
GO:0035064--F:methylated histone binding
GO:0035091--F:phosphatidylinositol binding
GO:0005547--F:phosphatidylinositol-3,4,5-trisphosphate binding
GO:0043325--F:phosphatidylinositol-3,4-bisphosphate binding
GO:0080025--F:phosphatidylinositol-3,5-bisphosphate binding
GO:0005546--F:phosphatidylinositol-4,5-bisphosphate binding
GO:0061630--F:ubiquitin protein ligase activity
GO:0008270--F:zinc ion binding
GO:0030183--P:B cell differentiation
GO:0002358--P:B cell homeostatic proliferation
GO:0002326--P:B cell lineage commitment
GO:0016568--P:chromatin modification
GO:0046622--P:positive regulation of organ growth
GO:0002331--P:pre-B cell allelic exclusion
GO:0033077--P:T cell differentiation in thymus
GO:0002360--P:T cell lineage commitment
GO:0033151--P:V(D)J recombination
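
Each Gene Ontology entry above follows the pattern ID--aspect:name, where the aspect letter is C (cellular component), F (molecular function), or P (biological process). A minimal Python sketch of splitting such entries and grouping them by aspect is shown below. It uses a handful of entries copied from the list, and `parse_go` and `group_by_aspect` are hypothetical helper names rather than anything provided by WERAM.

```python
from collections import defaultdict

ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}

# A few entries copied from the Gene Ontology list above.
ENTRIES = [
    "GO:0005654--C:nucleoplasm",
    "GO:0035064--F:methylated histone binding",
    "GO:0005547--F:phosphatidylinositol-3,4,5-trisphosphate binding",
    "GO:0002331--P:pre-B cell allelic exclusion",
    "GO:0033151--P:V(D)J recombination",
]


def parse_go(entry: str) -> tuple[str, str, str]:
    """Split 'GO:xxxxxxx--A:term name' into (id, aspect letter, name)."""
    # Split only on the first '--' and the first ':' after it, because
    # term names may themselves contain single hyphens or colons.
    go_id, rest = entry.split("--", 1)
    aspect, name = rest.split(":", 1)
    return go_id, aspect, name


def group_by_aspect(entries: list[str]) -> dict[str, list[tuple[str, str]]]:
    """Group (id, name) pairs under the full aspect label."""
    grouped = defaultdict(list)
    for entry in entries:
        go_id, aspect, name = parse_go(entry)
        grouped[ASPECTS[aspect]].append((go_id, name))
    return dict(grouped)


for label, terms in group_by_aspect(ENTRIES).items():
    print(label)
    for go_id, name in terms:
        print(f"  {go_id}  {name}")
```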

Orthology
WERAM ID        Ensembl Protein ID    Species                     Identity  E-value  Score
WERAM-Gog-0221  ENSGGOP00000027993.1  Gorilla gorilla             99        0.0      1098
WERAM-Pat-0025  ENSPTRP00000006100.3  Pan troglodytes             99        0.0      1098
WERAM-Nol-0195  ENSNLEP00000021807.1  Nomascus leucogenys         99        0.0      1095
WERAM-Mam-0171  ENSMMUP00000024014.1  Macaca mulatta              98        0.0      1090
WERAM-Paa-0074  ENSPANP00000007084.1  Papio anubis                98        0.0      1089
WERAM-Poa-0033  ENSPPYP00000003843.1  Pongo abelii                98        0.0      1087
WERAM-Chs-0222  ENSCSAP00000018723.1  Chlorocebus sabaeus         98        0.0      1087
WERAM-Ict-0110  ENSSTOP00000020649.1  Ictidomys tridecemlineatus  92        0.0      1032
WERAM-Myl-0007  ENSMLUP00000022176.1  Myotis lucifugus            92        0.0      1031
WERAM-Vip-0119  ENSVPAP00000010942.1  Vicugna pacos               94        0.0      1027
WERAM-Tut-0159  ENSTTRP00000013341.1  Tursiops truncatus          93        0.0      1024
WERAM-Mim-0082  ENSMICP00000007843.1  Microcebus murinus          93        0.0      1022
WERAM-Orc-0067  ENSOCUP00000020912.1  Oryctolagus cuniculus       92        0.0      1021
WERAM-Cap-0179  ENSCPOP00000018706.1  Cavia porcellus             90        0.0      1019
WERAM-Eqc-0181  ENSECAP00000019369.1  Equus caballus              92        0.0      1014
WERAM-Aim-0218  ENSAMEP00000020459.1  Ailuropoda melanoleuca      92        0.0      1013
WERAM-Ran-0030  ENSRNOP00000006097.4  Rattus norvegicus           89        0.0      1009
WERAM-Loa-0158  ENSLAFP00000013831.2  Loxodonta africana          90        0.0      1009
WERAM-Sus-0097  ENSSSCP00000014119.1  Sus scrofa                  91        0.0      1004
WERAM-Bot-0203  ENSBTAP00000039446.4  Bos taurus                  91        0.0      1004
WERAM-Mum-0129  ENSMUSP00000106858.1  Mus musculus                88        0.0      1004
WERAM-Mup-0221  ENSMPUP00000019834.1  Mustela putorius furo       91        0.0      1002
WERAM-Ova-0113  ENSOARP00000011219.1  Ovis aries                  91        0.0      998
WERAM-Dan-0130  ENSDNOP00000030314.1  Dasypus novemcinctus        89        0.0      991
WERAM-Ocp-0154  ENSOPRP00000015811.1  Ochotona princeps           89        0.0      986
WERAM-Caf-0079  ENSCAFP00000010147.1  Canis familiaris            89        0.0      986
WERAM-Caj-0116  ENSCJAP00000020447.1  Callithrix jacchus          90        0.0      960
WERAM-Mod-0202  ENSMODP00000034256.1  Monodelphis domestica       78        0.0      867
WERAM-Ptv-0042  ENSPVAP00000004477.1  Pteropus vampyrus           90        3e-160   562
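
Species names in the Orthology table span two or three words, so a naive split on whitespace does not yield fixed columns. One possible approach, sketched in Python below, takes the first two fields from the left and the last three from the right, then treats whatever remains as the species name. The `Ortholog` record and `parse_ortholog` function are hypothetical names introduced for illustration, and the sample rows are copied from the table.

```python
from typing import NamedTuple


class Ortholog(NamedTuple):
    weram_id: str
    protein_id: str
    species: str
    identity: int
    evalue: float
    score: int


# Sample rows copied from the Orthology table above.
ROWS = [
    "WERAM-Gog-0221 ENSGGOP00000027993.1 Gorilla gorilla 99 0.0 1098",
    "WERAM-Mup-0221 ENSMPUP00000019834.1 Mustela putorius furo 91 0.0 1002",
    "WERAM-Ptv-0042 ENSPVAP00000004477.1 Pteropus vampyrus 90 3e-160 562",
]


def parse_ortholog(line: str) -> Ortholog:
    """Parse one whitespace-separated row of the Orthology table."""
    tokens = line.split()
    if len(tokens) < 6:
        raise ValueError(f"expected at least 6 fields, got {len(tokens)}")
    weram_id, protein_id = tokens[0], tokens[1]
    identity, evalue, score = tokens[-3:]
    # Everything between the two IDs and the three numeric columns is
    # the species name, which may contain spaces.
    species = " ".join(tokens[2:-3])
    return Ortholog(weram_id, protein_id, species,
                    int(identity), float(evalue), int(score))


orthologs = [parse_ortholog(row) for row in ROWS]
for o in sorted(orthologs, key=lambda o: o.score, reverse=True):
    print(f"{o.species}: identity {o.identity}, score {o.score}")
```

Because `str.split()` without arguments collapses runs of whitespace, the same function handles rows separated by single spaces or by aligned column padding.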
Created Date 25-Jun-2016