
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC140850.3 + phase: 0 /pseudo
(650 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
CF922488 110 5e-28
NP334778 reverse transcriptase [Glycine max] 106 3e-23
NP395547 reverse transcriptase [Glycine max] 77 2e-14
NP595172 polyprotein [Glycine max] 76 5e-14
NP395548 reverse transcriptase [Glycine max] 67 3e-11
BG725601 similar to PIR|H86337|H86 protein F5M15.26 [imported] -... 51 2e-06
TC233837 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, part... 44 2e-04
BQ628592 35 0.14
TC219643 weakly similar to UP|Q6WAY7 (Q6WAY7) Gag/pol polyprotei... 33 0.40
BI498328 32 0.68
TC222732 similar to UP|Q9LM48 (Q9LM48) F2E2.18, partial (24%) 30 2.6
TC228017 29 5.7
AW318041 29 5.7
>CF922488
Length = 741
Score = 110 bits (275), Expect(2) = 5e-28
Identities = 53/70 (75%), Positives = 59/70 (83%)
Frame = +3
Query: 111 VPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNLIKMSP 170
V K+DGKV MCVD+RDLN ASPKD FPLPHI+VLVDNT FSFMDGFSGYN IK++P
Sbjct: 3 VLKEDGKV*MCVDYRDLN*ASPKDKFPLPHINVLVDNTTSFSQFSFMDGFSGYNQIKIAP 182
Query: 171 EDREKTSFIT 180
ED EKT+FIT
Sbjct: 183 EDMEKTTFIT 212
Score = 32.7 bits (73), Expect(2) = 5e-28
Identities = 48/161 (29%), Positives = 69/161 (42%)
Frame = +1
Query: 179 ITHGILSATK*CRSA**MLVLPTKEE*LLCFMT*FPKKSKYMWTI*L*SQKMRSNMSNT* 238
+ +G SA + CR * ML T T* ++ + W * *+Q+ R N +
Sbjct: 208 LLYGEPSAIRLCRLG*RMLGQHTSGPWWHYSRT*CTRR*RSTWMT*S*NQERRRNTLSIC 387
Query: 239 QKCSKG*ESTSFD*TLTNVRSASDLENY*ASLSVKRALKSTLIKSVPSEKCQPRRPRNKS 298
+ C +T D* +V + E+ L + + + S + R+KS
Sbjct: 388 ESCLGDYVNTG*D*IPQSVCLR*NPESCSTLLIAREE*RWIRTR*K*SLRWPSHIQRSKS 567
Query: 299 EVSSGD*ITSPDSYLT*PQPAGRSSSYSGRISLLYGTMNAK 339
+VS G * TS DSY +* A S RISL GTM K
Sbjct: 568 KVSWGG*TTS*DSYHS*LPLASLFSYCCARISLSNGTMIVK 690
>NP334778 reverse transcriptase [Glycine max]
Length = 431
Score = 106 bits (265), Expect = 3e-23
Identities = 48/62 (77%), Positives = 55/62 (88%)
Frame = +3
Query: 119 RMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNLIKMSPEDREKTSF 178
RMCVD+RDLN+ASPKDNFPLPHID+L+ N A +FSFMDGFSGYN IKM+PED EKT+F
Sbjct: 3 RMCVDYRDLNRASPKDNFPLPHIDILMANMASFALFSFMDGFSGYNQIKMAPEDMEKTTF 182
Query: 179 IT 180
IT
Sbjct: 183 IT 188
>NP395547 reverse transcriptase [Glycine max]
Length = 762
Score = 77.0 bits (188), Expect = 2e-14
Identities = 36/93 (38%), Positives = 55/93 (58%), Gaps = 18/93 (19%)
Frame = +1
Query: 104 WVANIVPVPKKDGKV------------------RMCVDFRDLNKASPKDNFPLPHIDVLV 145
WV+ + VPKK G RMC+D+R LN+A+ KD++PLP +D ++
Sbjct: 64 WVSPVQVVPKKGGMTVVKNDRNELIPTRRVTRWRMCIDYRKLNEATRKDHYPLPFMDQML 243
Query: 146 DNTAQSKVFSFMDGFSGYNLIKMSPEDREKTSF 178
A+ + F+DG+SGYN I + P+D+EKT+F
Sbjct: 244 KRLARQSFYRFLDGYSGYNQIAVDPQDQEKTAF 342
>NP595172 polyprotein [Glycine max]
Length = 4659
Score = 75.9 bits (185), Expect = 5e-14
Identities = 37/74 (50%), Positives = 50/74 (67%)
Frame = +1
Query: 108 IVPVPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNLIK 167
I+ V KKDG R C D+R LN + KD+FP+P +D L+D ++ FS +D SGY+ I
Sbjct: 1909 ILLVKKKDGSWRFCTDYRALNAITVKDSFPMPTVDELLDELHGAQYFSKLDLRSGYHQIL 2088
Query: 168 MSPEDREKTSFITH 181
+ PEDREKT+F TH
Sbjct: 2089 VQPEDREKTAFRTH 2130
>NP395548 reverse transcriptase [Glycine max]
Length = 762
Score = 66.6 bits (161), Expect = 3e-11
Identities = 31/93 (33%), Positives = 53/93 (56%), Gaps = 18/93 (19%)
Frame = +1
Query: 104 WVANIVPVPKKDGKV------------------RMCVDFRDLNKASPKDNFPLPHIDVLV 145
WV+ ++ V KK+G ++C+D+R LN+A+ KD+FPLP +D ++
Sbjct: 64 WVSPVLVVSKKEGMTVIRNEKNDLIPTRTVTSWKLCIDYRKLNEATRKDHFPLPFMDQML 243
Query: 146 DNTAQSKVFSFMDGFSGYNLIKMSPEDREKTSF 178
+ A + F+D + GYN I + P+D+EK +F
Sbjct: 244 ERLAGHAYYCFLDAYFGYNQIVVDPKDQEKMAF 342
>BG725601 similar to PIR|H86337|H86 protein F5M15.26 [imported] - Arabidopsis
thaliana, partial (1%)
Length = 285
Score = 50.8 bits (120), Expect = 2e-06
Identities = 23/52 (44%), Positives = 34/52 (65%)
Frame = -3
Query: 95 FLMTVEYPEWVANIVPVPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVD 146
F+ + Y + ++V V K +GK R+C D+ DLN A PKD +PLP+ID + D
Sbjct: 163 FIRDINYST*LFSVVMVKKPNGKWRICTDYIDLN*ACPKDAYPLPNIDHMTD 8
>TC233837 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, partial (6%)
Length = 402
Score = 43.9 bits (102), Expect = 2e-04
Identities = 20/27 (74%), Positives = 23/27 (85%)
Frame = +2
Query: 154 FSFMDGFSGYNLIKMSPEDREKTSFIT 180
FSFMDGFSGYN I M+ ED EKT+F+T
Sbjct: 2 FSFMDGFSGYNQI*MAREDVEKTTFVT 82
>BQ628592
Length = 423
Score = 34.7 bits (78), Expect = 0.14
Identities = 26/64 (40%), Positives = 33/64 (50%)
Frame = -2
Query: 338 AKKLLIASRITCCNHLSLSHPWKEGL*LCICQCLMNLWDAYLVNKMRLERKNMLSTI*AR 397
+KK ASRI C+ H +E L C C +LWDA + L ++N TI*AR
Sbjct: 374 SKKSNRASRIPRCS----CHL*QEDLFSCT*LC*TSLWDACWFSTTTLGKRNKPFTI*AR 207
Query: 398 SSPT 401
S PT
Sbjct: 206 SLPT 195
>TC219643 weakly similar to UP|Q6WAY7 (Q6WAY7) Gag/pol polyprotein (Fragment),
partial (8%)
Length = 1320
Score = 33.1 bits (74), Expect = 0.40
Identities = 12/18 (66%), Positives = 14/18 (77%)
Frame = +3
Query: 95 FLMTVEYPEWVANIVPVP 112
FL YP+WVANIVP+P
Sbjct: 1266 FLAVARYPKWVANIVPIP 1319
>BI498328
Length = 335
Score = 32.3 bits (72), Expect = 0.68
Identities = 22/53 (41%), Positives = 25/53 (46%)
Frame = +2
Query: 529 SSMVLSMLMVKESGQSLYPRRGITSLLPPGFCSNVQTIWPNTKRVSLGSMKQL 581
+SM ML E GQSLYPR L V TIWP+TK G + L
Sbjct: 161 ASMGHPMLWATE*GQSLYPRMISVFLSRLD*VLIVPTIWPSTKHAPSGFRRPL 319
>TC222732 similar to UP|Q9LM48 (Q9LM48) F2E2.18, partial (24%)
Length = 930
Score = 30.4 bits (67), Expect = 2.6
Identities = 19/47 (40%), Positives = 24/47 (50%)
Frame = +3
Query: 455 HAGKCSCPNMTLCSRLKKQSKVAFLPIILPTNLLMITNQLSLISPMK 501
H + P MT+ S LKKQ V +IL LLMI L ++ MK
Sbjct: 657 HCSLLTFPLMTMMSTLKKQHLVHMKRLILLITLLMIMILLVMVGLMK 797
>TC228017
Length = 752
Score = 29.3 bits (64), Expect = 5.7
Identities = 16/46 (34%), Positives = 24/46 (51%), Gaps = 3/46 (6%)
Frame = +1
Query: 79 WLSRLRVRFKSRLMRVFLMTVE-YPEW--VANIVPVPKKDGKVRMC 121
W +L + +K + RV ++ YP W V N+V PKK K+ C
Sbjct: 445 WEYKLLIGWKQLVPRVLKFSISCYPTWFLVVNVVS*PKK*KKINFC 582
>AW318041
Length = 310
Score = 29.3 bits (64), Expect = 5.7
Identities = 9/25 (36%), Positives = 16/25 (64%)
Frame = +1
Query: 351 NHLSLSHPWKEGL*LCICQCLMNLW 375
N+++ +HPW G C C ++N+W
Sbjct: 166 NNVASTHPWALG*ETCCCYIMLNVW 240
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.347 0.151 0.514
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 31,396,944
Number of Sequences: 63676
Number of extensions: 476905
Number of successful extensions: 4257
Number of sequences better than 10.0: 26
Number of HSP's better than 10.0 without gapping: 3065
Number of HSP's successfully gapped in prelim test: 125
Number of HSP's that attempted gapping in prelim test: 1153
Number of HSP's gapped (non-prelim): 3246
length of query: 650
length of database: 12,639,632
effective HSP length: 103
effective length of query: 547
effective length of database: 6,081,004
effective search space: 3326309188
effective search space used: 3326309188
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.7 bits)
S2: 62 (28.5 bits)
Medicago: description of AC140850.3