Gene name: nan

Uniprot entry:

O00370

Protein names:

LINE-1 retrotransposable element ORF2 protein (ORF2p) [Includes: Reverse transcriptase (EC 2.7.7.49); Endonuclease (EC 3.1.21.-)]

Protein sequence:

1_MTGSN 6_ SHITI 11_ LTLNV 16_ NGLNS 21_ PIKRH 26_ RLASW 31_ IKSQD 36_ PSVCC 41_ IQETH 46_ LTCRD 51_ THRLK 56_ IKGWR 61_ KIYQA 66_ NGKQK 71_ KAGVA 76_ ILVSD 81_ KTDFK 86_ PTKIK 91_ RDKEG 96_ HYIMV 101_ KGSIQ 106_ QEELT 111_ ILNIY 116_ APNTG 121_ APRFI 126_ KQVLS 131_ DLQRD 136_ LDSHT 141_ LIMGD 146_ FNTPL 151_ SILDR 156_ STRQK 161_ VNKDT 166_ QELNS 171_ ALHQT 176_ DLIDI 181_ YRTLH 186_ PKSTE 191_ YTFFS 196_ APHHT 201_ YSKID 206_ HIVGS 211_ KALLS 216_ KCKRT 221_ EIITN 226_ YLSDH 231_ SAIKL 236_ ELRIK 241_ NLTQS 246_ RSTTW 251_ KLNNL 256_ LLNDY 261_ WVHNE 266_ MKAEI 271_ KMFFE 276_ TNENK 281_ DTTYQ 286_ NLWDA 291_ FKAVC 296_ RGKFI 301_ ALNAY 306_ KRKQE 311_ RSKID 316_ TLTSQ 321_ LKELE 326_ KQEQT 331_ HSKAS 336_ RRQEI 341_ TKIRA 346_ ELKEI 351_ ETQKT 356_ LQKIN 361_ ESRSW 366_ FFERI 371_ NKIDR 376_ PLARL 381_ IKKKR 386_ EKNQI 391_ DTIKN 396_ DKGDI 401_ TTDPT 406_ EIQTT 411_ IREYY 416_ KHLYA 421_ NKLEN 426_ LEEMD 431_ TFLDT 436_ YTLPR 441_ LNQEE 446_ VESLN 451_ RPITG 456_ SEIVA 461_ IINSL 466_ PTKKS 471_ PGPDG 476_ FTAEF 481_ YQRYK 486_ EELVP 491_ FLLKL 496_ FQSIE 501_ KEGIL 506_ PNSFY 511_ EASII 516_ LIPKP 521_ GRDTT 526_ KKENF 531_ RPISL 536_ MNIDA 541_ KILNK 546_ ILANR 551_ IQQHI 556_ KKLIH 561_ HDQVG 566_ FIPGM 571_ QGWFN 576_ IRKSI 581_ NVIQH 586_ INRAK 591_ DKNHV 596_ IISID 601_ AEKAF 606_ DKIQQ 611_ PFMLK 616_ TLNKL 621_ GIDGM 626_ YLKII 631_ RAIYD 636_ KPTAN 641_ IILNG 646_ QKLEA 651_ FPLKT 656_ GTRQG 661_ CPLSP 666_ LLFNI 671_ VLEVL 676_ ARAIR 681_ QEKEI 686_ KGIQL 691_ GKEEV 696_ KLSLF 701_ ADDMI 706_ VYLEN 711_ PIVSA 716_ QNLLK 721_ LISNF 726_ SKVSG 731_ YKINV 736_ QKSQA 741_ FLYNN 746_ NRQTE 751_ SQIMG 756_ ELPFT 761_ IASKR 766_ IKYLG 771_ IQLTR 776_ DVKDL 781_ FKENY 786_ KPLLK 791_ EIKED 796_ TNKWK 801_ NIPCS 806_ WVGRI 811_ NIVKM 816_ AILPK 821_ VIYRF 826_ NAIPI 831_ KLPMT 836_ FFTEL 841_ EKTTL 846_ KFIWN 851_ QKRAR 856_ IAKSI 861_ LSQKN 866_ KAGGI 871_ TLPDF 876_ KLYYK 881_ ATVTK 886_ TAWYW 891_ YQNRD 896_ IDQWN 901_ RTEPS 906_ EIMPH 911_ IYNYL 916_ IFDKP 921_ EKNKQ 926_ WGKDS 931_ LLNKW 936_ CWENW 941_ LAICR 946_ KLKLD 951_ PFLTP 956_ YTKIN 961_ SRWIK 966_ DLNVK 971_ PKTIK 976_ TLEEN 981_ LGITI 986_ QDIGV 991_ GKDFM 996_ SKTPK 1001_ AMATK 1006_ DKIDK 1011_ WDLIK 1016_ LKSFC 1021_ TAKET 1026_ TIRVN 1031_ RQPTT 1036_ WEKIF 1041_ ATYSS 1046_ DKGLI 1051_ SRIYN 1056_ ELKQI 1061_ YKKKT 1066_ NNPIK 1071_ KWAKD 1076_ MNRHF 1081_ SKEDI 1086_ YAAKK 1091_ HMKKC 1096_ SSSLA 1101_ IREMQ 1106_ IKTTM 1111_ RYHLT 1116_ PVRMA 1121_ IIKKS 1126_ GNNRC 1131_ WRGCG 1136_ EIGTL 1141_ VHCWW 1146_ DCKLV 1151_ QPLWK 1156_ SVWRF 1161_ LRDLE 1166_ LEIPF 1171_ DPAIP 1176_ LLGIY 1181_ PKDYK 1186_ SCCYK 1191_ DTCTR 1196_ MFIAA 1201_ LFTIA 1206_ KTWNQ 1211_ PNCPT 1216_ MIDWI 1221_ KKMWH 1226_ IYTME 1231_ YYAAI 1236_ KNDEF 1241_ ISFVG 1246_ TWMKL 1251_ ETIIL 1256_ SKLSQ 1261_ EQKTK 1266_HRIFS

Protein annotations

Protein functions:

1: Has reverse transcriptase activity required for target-primed reverse transcription of the LINE-1 element mRNA, a crucial step in LINE-1 retrotransposition (PubMed:38096901, PubMed:38096902, PubMed:7516468, PubMed:9140393). Selectively binds and reversely transcribes RNA with a poly(A) tail consisting of at least 20 adenosines (PubMed:38096901). Also has endonuclease activity that allows the introduction of nicks in the chromosomal target DNA (PubMed:17626046, PubMed:34554261, PubMed:38096901, PubMed:38096902, PubMed:8945517). Cleaves DNA in AT-rich regions between a 5' stretch of purines and a 3' stretch of pyrimidines, corresponding to the sites of LINE-1 integration in the genome (PubMed:8945517). Conformational properties of the target DNA sequence rather than specific nucleotides are key determinants of the ORF2p capacity for sequence-specific DNA recognition (PubMed:17626046, PubMed:34554261). Unlike related endonucleases, does not bend the DNA helix but causes compression near the cleavage site (PubMed:34554261)