
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146855.1 + phase: 0 /pseudo
(1356 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
YTX2_XENLA (P14381) Transposon TX1 hypothetical 149 kDa protein ... 80 3e-14
LIN1_HUMAN (P08547) LINE-1 reverse transcriptase homolog 63 5e-09
LIN1_NYCCO (P08548) LINE-1 reverse transcriptase homolog 58 2e-07
POL2_MOUSE (P11369) Retrovirus-related Pol polyprotein [Contains... 49 9e-05
RTJK_DROFU (P21329) RNA-directed DNA polymerase from mobile elem... 44 0.003
APEA_DICDI (P51173) DNA-(apurinic or apyrimidinic site) lyase (E... 33 4.1
YFF2_YEAST (P43551) Putative 62.1 kDa transcriptional regulatory... 33 6.9
>YTX2_XENLA (P14381) Transposon TX1 hypothetical 149 kDa protein
(ORF 2)
Length = 1308
Score = 80.5 bits (197), Expect = 3e-14
Identities = 93/405 (22%), Positives = 162/405 (39%), Gaps = 38/405 (9%)
Query: 77 HCSILNYSQHFINMSIQDPVKGPWRLTAFYGYPDHGRRRDSWELLRSLHSQSDDPWCIIG 136
H + + + M++ P GP R F + DS D+ I G
Sbjct: 94 HLRVRESGRTYNLMNVYAPTTGPERARFFESLSAYMETIDS-----------DEALIIGG 142
Query: 137 DFNDHLSPSDKRGGPDRPHWLIRGFQEAVSDCNLIDL----PLNGYQFTWFKSIGTSHAK 192
DFN L D R P + +E ++ +L+D+ FT+ + + H
Sbjct: 143 DFNYTLDARD-RNVPKKRDSSESVLRELIAHFSLVDVWREQNPETVAFTYVR-VRDGHVS 200
Query: 193 EARIDRALCTAPWLELFPHASLQTL-VAPMSDHTPLLLQLDPPPWRAPHNSFRFNNSWLI 251
++RIDR ++ L A T+ +AP SDH + L++ P + FNNS L
Sbjct: 201 QSRIDRIYISS---HLMSRAQSSTIRLAPFSDHNCVSLRMSIAPSLPKAAYWHFNNSLLE 257
Query: 252 EPELAHLVKDNW--------------SYYPSSNIVTKLSYCIEDMKYWSKANHPHFNQRK 297
+ A V+D W ++ + KL C E K S +
Sbjct: 258 DEGFAKSVRDTWRGWRAFQDEFATLNQWWDVGKVHLKL-LCQEYTKSVSGQRNAEIEALN 316
Query: 298 QQLKNQIDVMRNTSDASVDPRLIELQNSLANLILQEDVYWRQRSKIFWLKDGDKNSKFFH 357
++ + + + D ++ +E + +L N+ ++ RS++ L D D+ S+FF+
Sbjct: 317 GEVLDLEQRLSGSEDQALQCEYLERKEALRNMEQRQARGAFVRSRMQLLCDMDRGSRFFY 376
Query: 358 LTASSRRRKNTISKLRHPNGFWLTSQDDMSAHIHDYFSGLFQA--INGDQQPVITRIQAR 415
+ + I+ L +G L + + ++ LF I+ D +
Sbjct: 377 ALEKKKGNRKQITCLFAEDGTPLEDPEAIRDRARSFYQNLFSPDPISPDACEELWDGLPV 436
Query: 416 VTENDNATLTKPFSIDEFKEAVFSMHSDKSPGPDGLNPGFYQFFF 460
V+E L P ++DE +A+ M +KSPG DGL F+QFF+
Sbjct: 437 VSERRKERLETPITLDELSQALRLMPHNKSPGLDGLTIEFFQFFW 481
>LIN1_HUMAN (P08547) LINE-1 reverse transcriptase homolog
Length = 1259
Score = 63.2 bits (152), Expect = 5e-09
Identities = 77/372 (20%), Positives = 152/372 (40%), Gaps = 29/372 (7%)
Query: 109 PDHGRRRDSWELLRSLHSQSDDPWCIIGDFNDHLSPSDKRGGPDRPHWLIRGFQEAVSDC 168
P+ G R ++L L D I+GDFN LS D R + + I+ A+
Sbjct: 116 PNTGAPRFIKQVLSDLQRDLDSHTIIMGDFNTPLSTLD-RSTRQKINKDIQELNSALHQA 174
Query: 169 NLID----LPLNGYQFTWFKSIGTSHAKEARIDRALCTAPWLELFPHASLQTLVAPMSDH 224
+LID L ++T+F + H ++ D L + L + T +SDH
Sbjct: 175 DLIDIYRTLHPKSTEYTFFSA---PHHTYSKTDHILGSKTLLSKCKRTEIITNC--LSDH 229
Query: 225 TPLLLQLDPPPWRAPH------NSFRFNNSWL---IEPELAHLVKDNWSYYPSSNIVTKL 275
+ + L+L H N+ N+ W+ ++ E+ + N + + +
Sbjct: 230 SAIKLELRIKKLTQNHSTTWKLNNLLLNDYWVHNEMKAEIKKFFETNENKDTTYQNLWDT 289
Query: 276 SYCIEDMKYWSKANHPHFNQRKQ------QLKNQIDVMRNTSDASVDPRLIELQNSLANL 329
+ + K+ + H +R + QLK + S AS +I+++ L +
Sbjct: 290 AKAVCRGKFIALNAHKRKQERSKIDTLISQLKELEKQEQTNSKASRRQEIIKIRAELKEI 349
Query: 330 ILQEDVYWRQRSKIFWLKDGDKNSKFFHLTASSRRRKNTISKLRHPNGFWLTSQDDMSAH 389
Q+ + S+ ++ + +K + +R KN I +++ G T ++
Sbjct: 350 ETQKTLQKINESRSWFFEKINKIDRPLARLIKKKREKNQIDTIKNDRGDITTDPTEIQTT 409
Query: 390 IHDYFSGLF----QAINGDQQPVITRIQARVTENDNATLTKPFSIDEFKEAVFSMHSDKS 445
I +Y+ L+ + + + + T R+ + + +L +P + E + + S+ + KS
Sbjct: 410 IREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESLNRPITSSEIEAIINSLPNKKS 469
Query: 446 PGPDGLNPGFYQ 457
PGP+G FYQ
Sbjct: 470 PGPEGFTAEFYQ 481
>LIN1_NYCCO (P08548) LINE-1 reverse transcriptase homolog
Length = 1260
Score = 58.2 bits (139), Expect = 2e-07
Identities = 96/494 (19%), Positives = 192/494 (38%), Gaps = 52/494 (10%)
Query: 1 MNVIAWNCRGLGNVKAVPCIKDLVRVYKPDIVILIETLCNNNKISGLKYAIGFDYHFSVD 60
+++ + N GL + D ++ KPDI + E+ LK G+ F +
Sbjct: 7 LSIFSINVNGLNCPLKRHRLADWIQKLKPDICCIQESHLTLKDKYRLKVK-GWSSIFQAN 65
Query: 61 CIGRSGGIAVLWRNSAHCSILNYSQ----HFINMSIQDPVKGPWR-----LTAFYGYPDH 111
+ GIA+L+ ++ + HFI VKG + + Y P+H
Sbjct: 66 GKQKKAGIAILFADAIGFKPTKIRKDKDGHFIF------VKGNTQYDEISIINIYA-PNH 118
Query: 112 GRRRDSWELLRSLHSQSDDPWCIIGDFNDHLSPSDKRGGPDRPHWLIRGFQEAVSDCNLI 171
+ E L + + ++GDFN L+ D R + I + +L
Sbjct: 119 NAPQFIRETLTDMSNLISSTSIVVGDFNTPLAVLD-RSSKKKLSKEILDLNSTIQHLDLT 177
Query: 172 DLPL----NGYQFTWFKSIGTSHAKEARIDRALCTAPWLELFPHASLQTLVAPMSDHTPL 227
D+ N ++T+F S +H ++ID L L F ++ + SDH +
Sbjct: 178 DIYRTFHPNKTEYTFFSS---AHGTYSKIDHILGHKSNLSKFK--KIEIIPCIFSDHHGI 232
Query: 228 LLQLD--------PPPWRAPHNSFRFNNSWLIEP---ELAHLVKDN-------WSYYPSS 269
++L+ W+ N+ ++W+I+ E+ ++ N + + ++
Sbjct: 233 KVELNNNRNLHTHTKTWKL--NNLMLKDTWVIDEIKKEITKFLEQNNNQDTNYQNLWDTA 290
Query: 270 NIVTKLSYCIEDMKYWSKANHPHFNQRKQQLKNQIDVMRNTSDASVDPRLIELQNSLANL 329
V + + I + K N LK + S + +++ L +
Sbjct: 291 KAVLRGKF-IALQAFLKKTEREEVNNLMGHLKQLEKEEHSNPKPSRRKEITKIRAELNEI 349
Query: 330 ILQEDVYWRQRSKIFWLKDGDKNSKFFHLTASSRRRKNTISKLRHPNGFWLTSQDDMSAH 389
+ + +SK ++ + +K K +R K+ IS +R+ N T ++
Sbjct: 350 ENKRIIQQINKSKSWFFEKINKIDKPLANLTRKKRVKSLISSIRNGNDEITTDPSEIQKI 409
Query: 390 IHDYFSGLFQAINGDQQPVITRIQA----RVTENDNATLTKPFSIDEFKEAVFSMHSDKS 445
+++Y+ L+ + + + ++A R+++ + L +P S E + ++ KS
Sbjct: 410 LNEYYKKLYSHKYENLKEIDQYLEACHLPRLSQKEVEMLNRPISSSEIASTIQNLPKKKS 469
Query: 446 PGPDGLNPGFYQFF 459
PGPDG FYQ F
Sbjct: 470 PGPDGFTSEFYQTF 483
>POL2_MOUSE (P11369) Retrovirus-related Pol polyprotein [Contains:
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1300
Score = 48.9 bits (115), Expect = 9e-05
Identities = 72/349 (20%), Positives = 133/349 (37%), Gaps = 29/349 (8%)
Query: 134 IIGDFNDHLSPSDKRGGPDRPHWLIRGFQEAVSDCNLIDLPLNGYQFT-WFKSIGTSHAK 192
I+GDFN LS D+ ++ E + +L D+ Y T + H
Sbjct: 168 IVGDFNTPLSSKDRSWKQKLNRDTVK-LTEVMKQMDLTDIYRTFYPKTKGYTFFSAPHGT 226
Query: 193 EARIDRALCTAPWLELFPHASLQTLVAPMSDHTPLLLQLD------PPPWRAPHNSFRFN 246
++ID + L + + + + +SDH L L + P + N+ N
Sbjct: 227 FSKIDHIIGHKTGLNRYKNIEIVPCI--LSDHHGLRLIFNNNINNGKPTFTWKLNNTLLN 284
Query: 247 NSWLIEPELAHLVKDNWSYYPSSNIVTKLSYCIEDMKYW------------SKANHPHFN 294
++ L++ + +KD + + N T + MK + K H +
Sbjct: 285 DT-LVKEGIKKEIKDFLEF--NENEATTYPNLWDTMKAFLRGKLIALSASKKKRETAHTS 341
Query: 295 QRKQQLKNQIDVMRNTSDASVDPRLIELQNSLANLILQEDVYWRQRSKIFWLKDGDKNSK 354
LK N+ S +I+L+ + + + + +++ ++ + +K K
Sbjct: 342 SLTTHLKALEKKEANSPKRSRRQEIIKLRGEINQVETRRTIQRINQTRSWFFEKINKIDK 401
Query: 355 FFHLTASSRRRKNTISKLRHPNGFWLTSQDDMSAHIHDYFSGLFQAI--NGDQQP-VITR 411
R K I+K+R+ G T +++ I ++ L+ N D+ + R
Sbjct: 402 PLARLTKGHRDKILINKIRNEKGDITTDPEEIQNTIRSFYKRLYSTKLENLDEMDKFLDR 461
Query: 412 IQARVTENDNAT-LTKPFSIDEFKEAVFSMHSDKSPGPDGLNPGFYQFF 459
Q D L P S E + + S+ + KSPGPDG + FYQ F
Sbjct: 462 YQVPKLNQDQVDHLNSPISPKEIEAVINSLPTKKSPGPDGFSAEFYQTF 510
>RTJK_DROFU (P21329) RNA-directed DNA polymerase from mobile element
jockey (EC 2.7.7.49) (Reverse transcriptase)
Length = 916
Score = 43.9 bits (102), Expect = 0.003
Identities = 62/267 (23%), Positives = 106/267 (39%), Gaps = 34/267 (12%)
Query: 213 SLQTLVAPMSDHTPLLLQLDPPPWRAPHNSFRFNNSWLIEPELAHLVKD---NWSYYPSS 269
++QTL SDHTPLL L P P S IE A+L + + +
Sbjct: 203 NVQTLHELSSDHTPLLADLHAMPINKPPRSCLLARGADIERFKAYLTQHIDLSVGIQGTD 262
Query: 270 NIVTKLSYCIEDMKYWSKANHPHFNQRKQ---------------QLKNQIDVMRNTSDAS 314
+I + ++ +K + + P Q + +LK ++ R +
Sbjct: 263 DIDNAIDSFMDILKRAAIRSAPSHQQNVESSRQLQLPPIVASLIRLKRKV---RREYART 319
Query: 315 VDPRLIELQNSLANLILQEDVYWRQRSKIFWL-----KDGDKNSKFFHLTASSRRRKNTI 369
D R+ ++ + LAN L + + R++S+I L DG N + +T + +
Sbjct: 320 GDARIQQIHSRLANR-LHKVLNRRKQSQIDNLLENLDTDGSTNFSLWRITKRYKTQATPN 378
Query: 370 SKLRHPNGFWLTSQDD----MSAHIHDYFSGLFQAINGDQQPVITRIQARVTENDNATLT 425
S +R+P G W + + + H+ F L A V +Q T A
Sbjct: 379 SAIRNPAGGWCRTSREKTEVFANHLEQRFKALAFAPESHSLMVAESLQ---TPFQMALPA 435
Query: 426 KPFSIDEFKEAVFSMHSDKSPGPDGLN 452
P +++E KE V + K+PG D L+
Sbjct: 436 DPVTLEEVKELVSKLKPKKAPGEDLLD 462
>APEA_DICDI (P51173) DNA-(apurinic or apyrimidinic site) lyase (EC
4.2.99.18) (Class II
apurinic/apyrimidinic(AP)-endonuclease)
Length = 361
Score = 33.5 bits (75), Expect = 4.1
Identities = 18/57 (31%), Positives = 29/57 (50%), Gaps = 1/57 (1%)
Query: 1 MNVIAWNCRGLGNVKAVPCIKDLVRVYKPDIVILIETLCNNNKISGLKYAIGFDYHF 57
M +I+WN G +V + + V PD++ L ET N + I + G++YHF
Sbjct: 105 MKIISWNVAGFKSVLSKG-FTECVEKENPDVLCLQETKINPSNIKKDQMPKGYEYHF 160
>YFF2_YEAST (P43551) Putative 62.1 kDa transcriptional regulatory
protein in DAK3-ALR2 intergenic region
Length = 465
Score = 32.7 bits (73), Expect = 6.9
Identities = 17/55 (30%), Positives = 28/55 (50%), Gaps = 2/55 (3%)
Query: 761 VRNLSLTISKTVFGKSVNLGVLNLCREQVKRCSSNRLLKQFLHIVWELFSFQPLY 815
VRNL VF KS + G++ + + + +C RL L+++W L + LY
Sbjct: 50 VRNLKKVDDVQVFSKSSSGGIMKVPKALIDQCL--RLYNDKLYVIWPLLCYDDLY 102
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.338 0.147 0.484
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 147,560,996
Number of Sequences: 164201
Number of extensions: 5931266
Number of successful extensions: 17534
Number of sequences better than 10.0: 7
Number of HSP's better than 10.0 without gapping: 4
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 17520
Number of HSP's gapped (non-prelim): 11
length of query: 1356
length of database: 59,974,054
effective HSP length: 122
effective length of query: 1234
effective length of database: 39,941,532
effective search space: 49287850488
effective search space used: 49287850488
T: 11
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.8 bits)
S2: 72 (32.3 bits)
Medicago: description of AC146855.1