
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0029a.5
(1393 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
NP595172 polyprotein [Glycine max] 1154 0.0
CO982196 245 1e-73
TC211627 116 7e-49
TC211973 weakly similar to UP|Q84ZV5 (Q84ZV5) Polyprotein, parti... 118 1e-43
NP395548 reverse transcriptase [Glycine max] 164 2e-40
BI425021 159 8e-39
BQ299538 108 4e-36
NP395547 reverse transcriptase [Glycine max] 150 5e-36
TC211067 similar to UP|Q9LQH2 (Q9LQH2) F15O4.13, partial (9%) 147 2e-35
AW830191 147 4e-35
AW570005 141 2e-33
CF922488 140 4e-33
CA820403 weakly similar to GP|13273463|gb| pol protein integrase... 108 5e-33
BI317507 124 3e-28
CD487724 120 4e-27
BQ627806 76 2e-26
TC213114 weakly similar to UP|Q8W150 (Q8W150) Polyprotein, parti... 115 1e-25
BI317638 weakly similar to GP|9294238|dbj| contains similarity t... 93 4e-22
BM731326 weakly similar to GP|21740635|em OSJNBb0043H09.2 {Oryza... 102 2e-21
NP334778 reverse transcriptase [Glycine max] 100 4e-21
>NP595172 polyprotein [Glycine max]
Length = 4659
Score = 1154 bits (2986), Expect = 0.0
Identities = 616/1406 (43%), Positives = 858/1406 (60%), Gaps = 27/1406 (1%)
Frame = +1
Query: 15 RVDIPMFNGNDAYGWVTKVERFFRLSRVEEAEKIEMVMIAMEDRALGWFQWWEEQTLERA 74
++D P F+G + W+ K E+FF +A+++ + + ++ + W+Q ++ +
Sbjct: 295 KLDFPRFDGKNVMDWIFKAEQFFDYYATPDADRLIIASVHLDQDVVPWYQMLQKTEPFSS 474
Query: 75 WEPFKQALFRRFQPALLQNPFGPLLSVKQKGSVMEYRENFELLAAPMRNADREVLKGVFL 134
W+ F +AL F P+ P L + Q +V EY F L + E + F+
Sbjct: 475 WQAFTRALELDFGPSAYDCPRATLFKLNQSATVNEYYMQFTALVNRVDGLSAEAILDCFV 654
Query: 135 NGLQEEIKAEMKLYPADDLAELMDRALLLEEKNTAMRGGKPKEEDKRGWKD---LQNKGG 191
+GLQEEI ++K L + + A L EEK T+ K R + K
Sbjct: 655 SGLQEEISRDVKAMEPRTLTKAVALAKLFEEKYTSPPKTKTFSNLARNFTSNTSATQKYP 834
Query: 192 TGNQDTEGKQPE----------KKWN----GGQRLTQTELQERSRKGLCFKCGDKWGKEH 237
NQ + +P K +N ++++ E+Q R K LC+ C +K+ H
Sbjct: 835 PTNQKNDNPKPNLPPLLPTPSTKPFNLRNQNIKKISPAEIQLRREKNLCYFCDEKFSPAH 1014
Query: 238 ICSMKNYQLILMEVEEDEEEEEIFEEAEDGEFVLEGKVLQLSLNSKEGLTSNRSFKVKGK 297
C N Q++L+++EE +E++ + E ++ LSLN+ G + + G+
Sbjct: 1015 KCP--NRQVMLLQLEETDEDQTDEQVMVTEEANMDDDTHHLSLNAMRGSNGVGTIRFTGQ 1188
Query: 298 IGNREVLILIDCGATSNFISQDLVVELEIPVIATSEYVVEVGNGAKERNSGVCKNLKLEV 357
+G V IL+D G++ NFI + L++PV V VGNG G+ + L L +
Sbjct: 1189 VGGIAVKILVDGGSSDNFIQPRVAQVLKLPVEPAPNLRVLVGNGQILSAEGIVQQLPLHI 1368
Query: 358 QGISIMQHFFILGLGGTEVVLGMDWLASLGNIEANFQELIIQWVSQGQKMVLQGEPSVCR 417
QG + ++L + G +V+LG WLA+LG A++ L +++ + + LQGE +
Sbjct: 1369 QGQEVKVPVYLLQISGADVILGSTWLATLGPHVADYAALTLKFFQNDKFITLQGEGNSEA 1548
Query: 418 VTANWKSIK-ITEQQEAEGYYLSYEYQKE-EEKTEAEVPEGMRK----ILEEYPEVFQEP 471
A + + + E + QKE E T ++P + +L Y +VF P
Sbjct: 1549 TQAQLHHFRRLQNTKSIEECFAIQLIQKEVPEDTLKDLPTNIDPELAILLHTYAQVFAVP 1728
Query: 472 KGLPPRRTTDHAIQLQEGASIPNIRPYRYPFYQKNEIEKLVKEMLNSGIIRHSTSPFSSP 531
LPP+R DHAI L++G+ +RPYRYP QK++IEK+++EML GII+ S SPFS P
Sbjct: 1729 ASLPPQREQDHAIPLKQGSGPVKVRPYRYPHTQKDQIEKMIQEMLVQGIIQPSNSPFSLP 1908
Query: 532 AILVKKKDGGWRFCVDYRALNKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIR 591
+LVKKKDG WRFC DYRALN T+ D FP+P +DELLDE+ A FSKLDL+SGYHQI
Sbjct: 1909 ILLVKKKDGSWRFCTDYRALNAITVKDSFPMPTVDELLDELHGAQYFSKLDLRSGYHQIL 2088
Query: 592 MKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFDDILIY 651
++ ED KTAFRTH GHYE+LV+PFGLTNAP+TFQ LMN++ + LRKFVLVFFDDILIY
Sbjct: 2089 VQPEDREKTAFRTHHGHYEWLVMPFGLTNAPATFQCLMNKIFQFALRKFVLVFFDDILIY 2268
Query: 652 SKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYLGHVISQAGVAADPSKIKDML 711
S + + H HL VLQ LK++ L A KCSFG E+ YLGH +S GV+ + +K++ +L
Sbjct: 2269 SASWKDHLKHLESVLQTLKQHQLFARLSKCSFGDTEVDYLGHKVSGLGVSMENTKVQAVL 2448
Query: 712 DWPIPKEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKKNSFQWTEGATQAFVKLKEV 771
DWP P VK LRGFLGLTGYYRRF+K+Y+ +A PL LL+K+SF W A AFVKLK+
Sbjct: 2449 DWPTPNNVKQLRGFLGLTGYYRRFIKSYANIAGPLTDLLQKDSFLWNNEAEAAFVKLKKA 2628
Query: 772 MTTVPVLVPPNFDKPFILETDASGKGLGAVLMQEGRPVAYMSKTLSDRAQAKSVYERELM 831
MT PVL P+F +PFILETDASG G+GAVL Q G P+AY SK L+ R Q +S Y REL+
Sbjct: 2629 MTEAPVLSLPDFSQPFILETDASGIGVGAVLGQNGHPIAYFSKKLAPRMQKQSAYTRELL 2808
Query: 832 AVVLAVQKWRHYLLGSKFVIHTDQRSLRFLADQRIMGEEQQKWMSKLMGYDFEIKYKPGI 891
A+ A+ K+RHYLLG+KF+I TDQRSL+ L DQ + EQQ W+ K +GYDF+I+YKPG
Sbjct: 2809 AITEALSKFRHYLLGNKFIIRTDQRSLKSLMDQSLQTPEQQAWLHKFLGYDFKIEYKPGK 2988
Query: 892 ENKAADALSRKLQFSAISSVQCAEWADLEAEILEDERYRKVLQELATQGNSAVGYQLKRG 951
+N+AADALSR A S +L A ++ D + K L E QG A Y ++ G
Sbjct: 2989 DNQAADALSRMFML-AWSEPHSIFLEELRARLISDP-HLKQLMETYKQGADASHYTVREG 3162
Query: 952 RLLYKDRIVLPKGSTKILTVLKEFHDTALGGHAGIFRTYKRISALFYWEGMKLDIQNYVQ 1011
L +KDR+V+P + +L+E+H + +GGHAGI RT R+ A FYW M+ D++ Y+Q
Sbjct: 3163 LLYWKDRVVIPAEEEIVNKILQEYHSSPIGGHAGITRTLARLKAQFYWPKMQEDVKAYIQ 3342
Query: 1012 KCEVCQRNKYEALNPAGFLQPLPIPSQGWTDISMDFIGGLPKAMGKDTILVVVDRFTKYA 1071
KC +CQ+ K PAG LQPLPIP Q W D++MDFI GLP + G I+VV+DR TKYA
Sbjct: 3343 KCLICQQAKSNNTLPAGLLQPLPIPQQVWEDVAMDFITGLPNSFGLSVIMVVIDRLTKYA 3522
Query: 1072 HFIALSHPYNAKEIAEVFIKEVVRLHGFPTSIVSDRDRVFLSTFWSEMFKLAGTKLKFSS 1131
HFI L YN+K +AE F+ +V+LHG P SIVSDRDRVF STFW +FKL GT L SS
Sbjct: 3523 HFIPLKADYNSKVVAEAFMSHIVKLHGIPRSIVSDRDRVFTSTFWQHLFKLQGTTLAMSS 3702
Query: 1132 AYHPQTDGQTEVVNRCVETYLRCVTGSKPKQWPKWLSWAEFWYNTNYHSAIKTTPFKALY 1191
AYHPQ+DGQ+EV+N+C+E YLRC T PK W K L WAEFWYNT YH ++ TPF+ALY
Sbjct: 3703 AYHPQSDGQSEVLNKCLEMYLRCFTYEHPKGWVKALPWAEFWYNTAYHMSLGMTPFRALY 3882
Query: 1192 GREPPVIFKGNDSLTSVDEVEKLTAERNLILEELKSNLEKAQNRMRQQANKHRRDVQYEV 1251
GREPP + + S+ EV + +R+ +L +LK NL +AQ M++QA+K R DV +++
Sbjct: 3883 GREPPTLTRQACSIDDPAEVREQLTDRDALLAKLKINLTRAQQVMKRQADKKRLDVSFQI 4062
Query: 1252 GDLVYLKIQPYKLKSLAKRSNQKLSPRYYGPYPIIAKINPAAYKLQLPEGSQVHPVFHIS 1311
GD V +K+QPY+ S R NQKLS RY+GP+ ++AKI AYKL+LP +++HPVFH+S
Sbjct: 4063 GDEVLVKLQPYRQHSAVLRKNQKLSMRYFGPFKVLAKIGDVAYKLELPSAARIHPVFHVS 4242
Query: 1312 LLKKAVNAGVQSQPLPAALT-EEWELKVEPEAIMDTR---ENRDGDLEVLIRWKDLPTFE 1367
L K N Q LP LT E ++P I+ +R + ++L++W++ E
Sbjct: 4243 QL-KPFNGTAQDPYLPLPLTVTEMGPVMQPVKILASRIIIRGHNQIEQILVQWENGLQDE 4419
Query: 1368 DSWEDFSKLLDQFPNHQLEDKLNLQG 1393
+WED + +P LEDK+ +G
Sbjct: 4420 ATWEDIEDIKASYPTFNLEDKVVFKG 4497
>CO982196
Length = 812
Score = 245 bits (626), Expect(4) = 1e-73
Identities = 119/201 (59%), Positives = 149/201 (73%)
Frame = +1
Query: 915 EWADLEAEILEDERYRKVLQELATQGNSAVGYQLKRGRLLYKDRIVLPKGSTKILTVLKE 974
E AD E EI ++ Q + T+ GY ++ G+L +KDR+VL K STKI +LKE
Sbjct: 208 ELADWEEEIQAYLELYEIYQGILTKTTKKPGYAIRGGKLYFKDRLVLSKNSTKIPLLLKE 387
Query: 975 FHDTALGGHAGIFRTYKRISALFYWEGMKLDIQNYVQKCEVCQRNKYEALNPAGFLQPLP 1034
D+ LGGH+G FRT+KR++ + +W+GMK ++YV CE+C+RNK L+PAG L LP
Sbjct: 388 LQDSPLGGHSGFFRTFKRVANVVFWQGMKKTTRDYVAACEICRRNKTSTLSPAGLL*LLP 567
Query: 1035 IPSQGWTDISMDFIGGLPKAMGKDTILVVVDRFTKYAHFIALSHPYNAKEIAEVFIKEVV 1094
IP++ WTDISMDFIGGLPKA GKD ILVVVDR TKYAHF ALSHPY AKE+AE+FIKE+V
Sbjct: 568 IPTKVWTDISMDFIGGLPKAQGKDNILVVVDRLTKYAHFFALSHPYTAKEVAELFIKELV 747
Query: 1095 RLHGFPTSIVSDRDRVFLSTF 1115
RLHGFP SIVSD R+F+S F
Sbjct: 748 RLHGFPASIVSDXXRLFMSLF 810
Score = 46.2 bits (108), Expect(4) = 1e-73
Identities = 20/34 (58%), Positives = 26/34 (75%)
Frame = +2
Query: 841 RHYLLGSKFVIHTDQRSLRFLADQRIMGEEQQKW 874
RHY +G KF+I T+ RS +FL +QR+M EEQ KW
Sbjct: 53 RHYPVGKKFIIRTN*RSSKFLNEQRLMSEEQFKW 154
Score = 25.0 bits (53), Expect(4) = 1e-73
Identities = 11/22 (50%), Positives = 15/22 (68%)
Frame = +3
Query: 891 IENKAADALSRKLQFSAISSVQ 912
+ N ++ LSR+ FSAIS VQ
Sbjct: 135 VRNSSSGLLSRQFSFSAISMVQ 200
Score = 21.9 bits (45), Expect(4) = 1e-73
Identities = 10/13 (76%), Positives = 11/13 (83%)
Frame = +3
Query: 825 VYERELMAVVLAV 837
+YERELM VVL V
Sbjct: 6 MYERELMDVVLPV 44
>TC211627
Length = 1034
Score = 116 bits (290), Expect(2) = 7e-49
Identities = 71/213 (33%), Positives = 119/213 (55%), Gaps = 10/213 (4%)
Frame = +3
Query: 1190 LYGREPPVIFKGNDSLTSVDEVEKLTAERNLILEELKSNLEKAQNRMRQQANKHRRDVQY 1249
+YG+ PP + + ++V+ V+ + I L L+K Q+ M++ A+ HRRD+ +
Sbjct: 243 MYGKPPPALPLYSAGTSTVEAVDAILHSLATIHHTLTCRLQKYQDSMKRIADSHRRDLTF 422
Query: 1250 EVGDLVYLKIQPYKLKSLAKRSNQKLSPRYYGPYPIIAKINPAAYKLQLPEGSQVHPVFH 1309
+GD VY+++ PY+ S+ + + KLS R+YGPY I A++ AY+LQLP S++HP+FH
Sbjct: 423 NIGDWVYVRL*PYRQTSI-QSTYTKLSKRFYGPYQIQARVGQVAYRLQLPPTSKIHPIFH 599
Query: 1310 ISLLKKAVNAGVQSQPLPAAL-------TEEWELKVEPEAIMDTRENRDGD---LEVLIR 1359
+SLLK V P+P L T L V+P +D + + +VL++
Sbjct: 600 VSLLK------VHHGPIPPELLALPPFSTTNHPL-VQPLQFLDWKMDESTTPPIPQVLVQ 758
Query: 1360 WKDLPTFEDSWEDFSKLLDQFPNHQLEDKLNLQ 1392
W +L + +WE +++L D + LEDK+ Q
Sbjct: 759 WTNLAPEDTTWESWTQLKDIY---DLEDKVCFQ 848
Score = 98.2 bits (243), Expect(2) = 7e-49
Identities = 44/79 (55%), Positives = 58/79 (72%)
Frame = +2
Query: 1110 VFLSTFWSEMFKLAGTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGSKPKQWPKWLSW 1169
+F+S W E+F ++GTKL+FS+AYHPQTDGQTEV+NR +E YLR P+ W K+LS
Sbjct: 2 IFISGLWHELFHISGTKLRFSTAYHPQTDGQTEVINRILEQYLRAFVHDHPQHWFKFLSL 181
Query: 1170 AEFWYNTNYHSAIKTTPFK 1188
AE YNT+ HS I +PF+
Sbjct: 182 AE*CYNTSVHSGIGFSPFE 238
>TC211973 weakly similar to UP|Q84ZV5 (Q84ZV5) Polyprotein, partial (4%)
Length = 730
Score = 118 bits (296), Expect(2) = 1e-43
Identities = 57/109 (52%), Positives = 78/109 (71%)
Frame = +1
Query: 1208 VDEVEKLTAERNLILEELKSNLEKAQNRMRQQANKHRRDVQYEVGDLVYLKIQPYKLKSL 1267
++EV KL R+ +L L+ NL K+ + M NK RRD++Y VGD V+LK+QPY+ +SL
Sbjct: 13 LEEVNKLIIARDGLLATLRENLLKS*DIM*ANTNKRRRDIEYVVGD*VFLKMQPYRRRSL 192
Query: 1268 AKRSNQKLSPRYYGPYPIIAKINPAAYKLQLPEGSQVHPVFHISLLKKA 1316
AKR N+KLSPR+Y P+ + K+ AYKL LP ++HPVFH+SLLKKA
Sbjct: 193 AKRINEKLSPRFYAPFQVFNKVGTIAYKLDLPSHIKIHPVFHVSLLKKA 339
Score = 78.2 bits (191), Expect(2) = 1e-43
Identities = 34/72 (47%), Positives = 55/72 (76%)
Frame = +3
Query: 1322 QSQPLPAALTEEWELKVEPEAIMDTRENRDGDLEVLIRWKDLPTFEDSWEDFSKLLDQFP 1381
QSQPLP L+E+W+L+ ++++D+RE + G+++VLI+WK+LP E+SWE +KL + F
Sbjct: 339 QSQPLPPMLSEDWKLQTYSDSVLDSRELQPGNVKVLIQWKNLPPSENSWESVAKLQEIFS 518
Query: 1382 NHQLEDKLNLQG 1393
+ LEDK++L G
Sbjct: 519 IYHLEDKVSLLG 554
>NP395548 reverse transcriptase [Glycine max]
Length = 762
Score = 164 bits (416), Expect = 2e-40
Identities = 95/254 (37%), Positives = 137/254 (53%), Gaps = 19/254 (7%)
Frame = +1
Query: 508 IEKLVKEMLNSGIIRH-STSPFSSPAILVKKKDG------------------GWRFCVDY 548
+ K V ++L G+I S S + SP ++V KK+G W+ C+DY
Sbjct: 1 VRKEVLKLLEVGLIYPISDSAWVSPVLVVSKKEGMTVIRNEKNDLIPTRTVTSWKLCIDY 180
Query: 549 RALNKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGH 608
R LN+AT D FP+P +D++L+ + + LD GY+QI + +D K AF G
Sbjct: 181 RKLNEATRKDHFPLPFMDQMLERLAGHAYYCFLDAYFGYNQIVVDPKDQEKMAFTCPFGV 360
Query: 609 YEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFDDILIYSKNEELHKDHLRIVLQV 668
+ Y +PFGL NAP+TFQ M + + K + VF DD ++ + E L +VLQ
Sbjct: 361 FAYRRIPFGLCNAPTTFQMCMLAIFADIVEKSIEVFMDDFSVFVPSLESCLKKLEMVLQR 540
Query: 669 LKENNLVANQKKCSFGQPEIIYLGHVISQAGVAADPSKIKDMLDWPIPKEVKGLRGFLGL 728
E NLV N +KC F E I LGH IS G+ D +KI + P P VKG+R FLG
Sbjct: 541 CVETNLVLNWEKCHFMVREGIVLGHKISTRGIEVDQTKIDVIEKLPPPSNVKGIRSFLGQ 720
Query: 729 TGYYRRFVKNYSKL 742
+YRRF+K+++K+
Sbjct: 721 ARFYRRFIKDFTKV 762
>BI425021
Length = 426
Score = 159 bits (402), Expect = 8e-39
Identities = 74/142 (52%), Positives = 99/142 (69%)
Frame = -1
Query: 1030 LQPLPIPSQGWTDISMDFIGGLPKAMGKDTILVVVDRFTKYAHFIALSHPYNAKEIAEVF 1089
L PLP+P + W D+SMDFI GLP G TI VVV+RF+K H L + A +A +F
Sbjct: 426 LCPLPVPQRPWEDLSMDFIVGLPPYHGHTTIFVVVNRFSKGIHLGTLPTSHTAHMVASLF 247
Query: 1090 IKEVVRLHGFPTSIVSDRDRVFLSTFWSEMFKLAGTKLKFSSAYHPQTDGQTEVVNRCVE 1149
+ V++LHGFP SIVSDRD +F+S FW ++F+L+GT L+ SSAYHPQTDGQTEV+NR +E
Sbjct: 246 LNIVIKLHGFPRSIVSDRDPLFISHFWQDLFRLSGTVLRMSSAYHPQTDGQTEVLNRVIE 67
Query: 1150 TYLRCVTGSKPKQWPKWLSWAE 1171
YLR +P+ +++ W E
Sbjct: 66 QYLRAFVHGRPRNLGRFIPWVE 1
>BQ299538
Length = 426
Score = 108 bits (269), Expect(2) = 4e-36
Identities = 49/99 (49%), Positives = 68/99 (68%)
Frame = +3
Query: 1121 KLAGTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGSKPKQWPKWLSWAEFWYNTNYHS 1180
KL GT LK S++YHP DGQT VN C+ET+LRC +PK +WLSWAE+WYNTN+H+
Sbjct: 135 KLPGTYLKMSTSYHP*IDGQT--VNHCLETFLRCFVADQPKM*VQWLSWAEYWYNTNFHA 308
Query: 1181 AIKTTPFKALYGREPPVIFKGNDSLTSVDEVEKLTAERN 1219
+ TTPF+ +YGR+PPV+ + V+ V + +R+
Sbjct: 309 STGTTPFEVVYGRKPPVLNRFLPGEVRVEAVRRELQDRD 425
Score = 63.5 bits (153), Expect(2) = 4e-36
Identities = 25/44 (56%), Positives = 35/44 (78%)
Frame = +1
Query: 1076 LSHPYNAKEIAEVFIKEVVRLHGFPTSIVSDRDRVFLSTFWSEM 1119
L HPY+A+ +AE+F KEVV LHG P S++SD D +F+S+FW E+
Sbjct: 1 LKHPYSARVLAEIFTKEVVHLHGVPASVLSDEDPIFVSSFWKEL 132
>NP395547 reverse transcriptase [Glycine max]
Length = 762
Score = 150 bits (378), Expect = 5e-36
Identities = 91/254 (35%), Positives = 134/254 (51%), Gaps = 19/254 (7%)
Frame = +1
Query: 508 IEKLVKEMLNSGIIRH-STSPFSSPAILVKKKDG------------------GWRFCVDY 548
+ K V ++L +G+I S S + SP +V KK G WR C+DY
Sbjct: 1 VRKEVFKLLEAGLIYPISDSSWVSPVQVVPKKGGMTVVKNDRNELIPTRRVTRWRMCIDY 180
Query: 549 RALNKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGH 608
R LN+AT D +P+P +D++L + + LD SGY+QI + +D KTAF
Sbjct: 181 RKLNEATRKDHYPLPFMDQMLKRLARQSFYRFLDGYSGYNQIAVDPQDQEKTAFTCPFSV 360
Query: 609 YEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFDDILIYSKNEELHKDHLRIVLQV 668
+ Y +PFGL NA +TFQ M + + K + VF DD + + +L VLQ
Sbjct: 361 FAYRRMPFGLCNASTTFQRCMMAIFDDMVEKCIEVFMDDFSFFGASFGNCLANLEKVLQR 540
Query: 669 LKENNLVANQKKCSFGQPEIIYLGHVISQAGVAADPSKIKDMLDWPIPKEVKGLRGFLGL 728
+++NLV N +KC F E I LGH IS+ G+ K+ + P P VKG+ FLG
Sbjct: 541 CEKSNLVLNWEKCHFMVQEGIVLGHKISKRGIEVVKEKLDVIDKLPPPVNVKGIHSFLGH 720
Query: 729 TGYYRRFVKNYSKL 742
G+YRRF+K+++K+
Sbjct: 721 VGFYRRFIKDFTKV 762
>TC211067 similar to UP|Q9LQH2 (Q9LQH2) F15O4.13, partial (9%)
Length = 589
Score = 147 bits (372), Expect = 2e-35
Identities = 74/163 (45%), Positives = 103/163 (62%), Gaps = 1/163 (0%)
Frame = +1
Query: 714 PIPKEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKKN-SFQWTEGATQAFVKLKEVM 772
P K V +R F GL +YRRFV N+S +A PLN+L+KKN +F W E QAF LKE +
Sbjct: 97 PTLKSVGDIRSFHGLASFYRRFVPNFSTVASPLNELVKKNMAFTWGEKQEQAFALLKEKL 276
Query: 773 TTVPVLVPPNFDKPFILETDASGKGLGAVLMQEGRPVAYMSKTLSDRAQAKSVYERELMA 832
T PVL P+F K F LE DASG G+ AVL+Q G P+AY S+ L Y++EL A
Sbjct: 277 TKAPVLALPDFSKTFELECDASGVGVRAVLLQGGHPIAYFSEKLHSATLNYPTYDKELYA 456
Query: 833 VVLAVQKWRHYLLGSKFVIHTDQRSLRFLADQRIMGEEQQKWM 875
++ A Q W H+L+ +FVIH+D +SL+++ + + + KW+
Sbjct: 457 LIRAPQTWEHFLVCKEFVIHSDHQSLKYIRGKSKLNKRHAKWV 585
Score = 33.1 bits (74), Expect = 0.88
Identities = 15/37 (40%), Positives = 22/37 (58%)
Frame = +2
Query: 688 IIYLGHVISQAGVAADPSKIKDMLDWPIPKEVKGLRG 724
I + G V+ + GV DP KIK + +WP P +V + G
Sbjct: 17 IFFSGFVVGRNGVQMDPEKIKAIQEWPPP*KVWEILG 127
>AW830191
Length = 372
Score = 147 bits (370), Expect = 4e-35
Identities = 66/123 (53%), Positives = 96/123 (77%)
Frame = +3
Query: 1152 LRCVTGSKPKQWPKWLSWAEFWYNTNYHSAIKTTPFKALYGREPPVIFKGNDSLTSVDEV 1211
LRC+TG+KP+QWPK LSWAEFW+NTNY++++K TPFK LYG +PP + KG ++ +EV
Sbjct: 6 LRCLTGTKPQQWPKRLSWAEFWFNTNYNNSLKLTPFKVLYGCDPPHLLKGAIISSTAEEV 185
Query: 1212 EKLTAERNLILEELKSNLEKAQNRMRQQANKHRRDVQYEVGDLVYLKIQPYKLKSLAKRS 1271
+T +R+ +L +LK NL KAQN+M+ A++ RR + VGD VYLK+QPY+ +SLA+++
Sbjct: 186 NVMTNDRDQMLHDLKGNLAKAQNQMK-YADRSRRSIPLNVGDWVYLKLQPYRQRSLARKT 362
Query: 1272 NQK 1274
N+K
Sbjct: 363 NEK 371
>AW570005
Length = 413
Score = 141 bits (355), Expect = 2e-33
Identities = 70/137 (51%), Positives = 88/137 (64%)
Frame = -2
Query: 714 PIPKEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKKNSFQWTEGATQAFVKLKEVMT 773
P P+ + LRGFL LTG+YRRF+K Y+ +A PL+ LL K+SF W+ A AF LK V+T
Sbjct: 412 PPPRTARSLRGFLRLTGFYRRFIKGYAAMAAPLSHLLTKDSFVWSPEADVAFQALKNVVT 233
Query: 774 TVPVLVPPNFDKPFILETDASGKGLGAVLMQEGRPVAYMSKTLSDRAQAKSVYERELMAV 833
VL P+F KPF +ETDASG +GAVL QEG P+A+ SK + S Y EL A+
Sbjct: 232 NTLVLALPDFTKPFTVETDASGSDMGAVLSQEGHPIAFFSKEFCPKLVRSSTYVHELAAI 53
Query: 834 VLAVQKWRHYLLGSKFV 850
V+KWR YLLG V
Sbjct: 52 TNVVKKWRQYLLGHHLV 2
>CF922488
Length = 741
Score = 140 bits (353), Expect = 4e-33
Identities = 89/246 (36%), Positives = 132/246 (53%), Gaps = 1/246 (0%)
Frame = +3
Query: 535 VKKKDGGWRFCVDYRALNKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKE 594
V K+DG CVDYR LN A+ DKFP+P I+ L+D + FS +D SGY+QI++
Sbjct: 3 VLKEDGKV*MCVDYRDLN*ASPKDKFPLPHINVLVDNTTSFSQFSFMDGFSGYNQIKIAP 182
Query: 595 EDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFDDILIYSKN 654
ED+ KT F T G + Y + FGL N +T+Q M + + K + V+ DD+++ S+
Sbjct: 183 EDMEKTTFITLWGTFCYKAMSFGLKNVGATYQRAMVALF*DMMHKEIEVYMDDMIVKSRT 362
Query: 655 EELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYLGHVISQAGVAADPSKIKDMLDWP 714
EE H +LR + + L++ L N KC F L + S G+ D +K+K +L+
Sbjct: 363 EEEHLVNLRKLFRRLRKYRLRLNPAKCMFEVKSRKLLDFIDS*RGIEVDSNKVKVILEMA 542
Query: 715 IPKEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKKNSF-QWTEGATQAFVKLKEVMT 773
P K ++GFLG Y RF+ +PL LL KN F +W AF ++K+ +
Sbjct: 543 KPHTEKQVQGFLGRLNYIVRFIS*LIATCEPLFILLCKNQFVKWDHDC*VAFERIKQCLI 722
Query: 774 TVPVLV 779
VLV
Sbjct: 723 NPHVLV 740
>CA820403 weakly similar to GP|13273463|gb| pol protein integrase region
{Ginkgo biloba}, partial (52%)
Length = 421
Score = 108 bits (269), Expect(2) = 5e-33
Identities = 52/77 (67%), Positives = 62/77 (79%), Gaps = 1/77 (1%)
Frame = -3
Query: 1089 FIKEVVRLHGFPTSIVSDRDRVFL-STFWSEMFKLAGTKLKFSSAYHPQTDGQTEVVNRC 1147
FIKE V+LHG +SIVSD DR+FL S FW+E+FK+ GTKLKFS AYHPQ D T+VVNRC
Sbjct: 335 FIKEAVKLHGCSSSIVSDWDRLFLIS*FWTELFKMEGTKLKFSLAYHPQPDSHTKVVNRC 156
Query: 1148 VETYLRCVTGSKPKQWP 1164
+E L+C+T SK KQWP
Sbjct: 155 IEMNLQCLTTSKRKQWP 105
Score = 53.1 bits (126), Expect(2) = 5e-33
Identities = 22/34 (64%), Positives = 30/34 (87%)
Frame = -1
Query: 1060 ILVVVDRFTKYAHFIALSHPYNAKEIAEVFIKEV 1093
I+V+V RFTKYAHF+ LSHPY AKE++EV +K++
Sbjct: 421 IMVIVYRFTKYAHFVVLSHPYLAKEVSEVLLKKL 320
>BI317507
Length = 359
Score = 124 bits (311), Expect = 3e-28
Identities = 62/114 (54%), Positives = 81/114 (70%), Gaps = 1/114 (0%)
Frame = -1
Query: 645 FDDILIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYLGHVISQAGVAADP 704
F +ILIYS + + H HL VL VLK+ LVAN+KKC F Q I YLGHVIS+ VA D
Sbjct: 359 FYNILIYSPDWKSHIMHLTAVLDVLKKERLVANRKKCYFSQTTIEYLGHVISKDCVAMDS 180
Query: 705 SKIKDMLDWPIPKEVKGLRGFLGLTGYYRRFVKNYSKLA-QPLNQLLKKNSFQW 757
+K+K +++WP+PK VK + FL LTGYYR+F+K+Y KLA +PL L K + F+W
Sbjct: 179 NKVKSVIEWPVPKNVKRVCSFLRLTGYYRKFIKDYGKLAPRPLTDLTKNDGFKW 18
>CD487724
Length = 676
Score = 120 bits (301), Expect = 4e-27
Identities = 67/216 (31%), Positives = 117/216 (54%), Gaps = 2/216 (0%)
Frame = +3
Query: 303 VLILIDCGATSNFISQDLVVELEIPVIATSEYVVEVGNGAKERNSGVCKNLKLEVQGISI 362
++ L+D G+T NF+ Q LV +L +P +T V VGNG + + +C+ + + +Q I
Sbjct: 24 LVYLVDGGSTHNFVQQPLVSQLGLPCRSTPPLRVMVGNGHHLKCTTICEAIPISIQNIEF 203
Query: 363 MQHFFILGLGGTEVVLGMDWLASLGNIEANFQELIIQWVSQGQKMVLQGEPSVCRVTANW 422
+ H ++L + G +VLG+ WL +LG I ++ L +Q+ Q + + L+GE N
Sbjct: 204 LVHLYVLPIVGANIVLGVQWLKTLGPILVDYNSLSMQFFYQHRLVQLKGESEAQLGLLNH 383
Query: 423 KSIKITEQ--QEAEGYYLSYEYQKEEEKTEAEVPEGMRKILEEYPEVFQEPKGLPPRRTT 480
++ Q + ++++ + + +P+ ++ +L+++ +FQ P+GLPP R T
Sbjct: 384 HQLRRLHQTHEPVTYFHIAILTENTSPTSSPPLPQPIQHLLDQFSALFQ*PQGLPPARET 563
Query: 481 DHAIQLQEGASIPNIRPYRYPFYQKNEIEKLVKEML 516
DH I L + N+R Y YP Y NEIE V ML
Sbjct: 564 DHHIHLLP*SEPVNMRLY*YPHY*NNEIEHQVNLML 671
>BQ627806
Length = 435
Score = 75.9 bits (185), Expect(2) = 2e-26
Identities = 36/83 (43%), Positives = 52/83 (62%)
Frame = +1
Query: 808 PVAYMSKTLSDRAQAKSVYERELMAVVLAVQKWRHYLLGSKFVIHTDQRSLRFLADQRIM 867
P+ + + S Y REL A+ +AV+KWR YLLG FVI TD RSL+ L Q +
Sbjct: 181 PLPFSFMPFCSKLLRASTYVRELAAITVAVKKWRQYLLGHHFVILTDHRSLKELMSQAVQ 360
Query: 868 GEEQQKWMSKLMGYDFEIKYKPG 890
EQQ ++++LMG+D+ I+Y+ G
Sbjct: 361 TPEQQIYLARLMGFDYTIQYRAG 429
Score = 63.5 bits (153), Expect(2) = 2e-26
Identities = 30/63 (47%), Positives = 42/63 (66%)
Frame = +3
Query: 749 LLKKNSFQWTEGATQAFVKLKEVMTTVPVLVPPNFDKPFILETDASGKGLGAVLMQEGRP 808
LL K+ F W E A +AF +LK + PVL P+F+ F++ETDASG G+GA+L Q P
Sbjct: 3 LLVKDQFHWNEEADRAFSQLKLALCQAPVLGLPDFNSSFVVETDASGIGMGAILSQNHHP 182
Query: 809 VAY 811
+A+
Sbjct: 183 LAF 191
>TC213114 weakly similar to UP|Q8W150 (Q8W150) Polyprotein, partial (7%)
Length = 810
Score = 115 bits (288), Expect = 1e-25
Identities = 56/98 (57%), Positives = 71/98 (72%)
Frame = +3
Query: 1236 MRQQANKHRRDVQYEVGDLVYLKIQPYKLKSLAKRSNQKLSPRYYGPYPIIAKINPAAYK 1295
M+ A++ RR V VGD VYLK+QPY+LKSLAK+ N+KLSPR+YGPY I +I A++
Sbjct: 3 MKAYADRSRRAVTLSVGDWVYLKLQPYRLKSLAKKRNEKLSPRFYGPYQIKKQIGLVAFE 182
Query: 1296 LQLPEGSQVHPVFHISLLKKAVNAGVQSQPLPAALTEE 1333
L LP ++HPVFH SLLKKAV A QPLP L+E+
Sbjct: 183 LDLPPARKIHPVFHASLLKKAVAATANPQPLPLMLSED 296
Score = 56.6 bits (135), Expect = 7e-08
Identities = 28/71 (39%), Positives = 41/71 (57%)
Frame = +2
Query: 1323 SQPLPAALTEEWELKVEPEAIMDTRENRDGDLEVLIRWKDLPTFEDSWEDFSKLLDQFPN 1382
S P+ + +EL+V P + N +G EVLI+ +DLP FE +WE + +QFP+
Sbjct: 263 SSTTPSDVI*RFELRVFPAEVKAVHNNSNGIAEVLIQLEDLPDFEATWESVEVIKEQFPS 442
Query: 1383 HQLEDKLNLQG 1393
LEDK+ L G
Sbjct: 443 FHLEDKVTLLG 475
>BI317638 weakly similar to GP|9294238|dbj| contains similarity to reverse
transcriptase~gene_id:K11J14.5 {Arabidopsis thaliana},
partial (5%)
Length = 420
Score = 92.8 bits (229), Expect(2) = 4e-22
Identities = 49/117 (41%), Positives = 64/117 (53%)
Frame = -2
Query: 1004 LDIQNYVQKCEVCQRNKYEALNPAGFLQPLPIPSQGWTDISMDFIGGLPKAMGKDTILVV 1063
L+ + C CQ KYE L PL +P + W D+S+DFI GL ILVV
Sbjct: 353 LECAQMLPNCLDCQHTKYETKRIVDLLCPLLVPHRPWEDLSLDFITGLLPYHVHTAILVV 174
Query: 1064 VDRFTKYAHFIALSHPYNAKEIAEVFIKEVVRLHGFPTSIVSDRDRVFLSTFWSEMF 1120
VD F+K H L + A +A +FI V +LHG P S+VSD D +F+S FW E+F
Sbjct: 173 VDHFSKGIHLGMLPSSHTAHTVACLFIDSVAKLHGLPRSLVSDCDLLFVSHFWQELF 3
Score = 32.0 bits (71), Expect(2) = 4e-22
Identities = 13/29 (44%), Positives = 17/29 (57%)
Frame = -1
Query: 978 TALGGHAGIFRTYKRISALFYWEGMKLDI 1006
T GGH GI +T +S YW GM+ D+
Sbjct: 420 TTTGGHTGIAKTLA*LSKNIYWFGMRTDV 334
>BM731326 weakly similar to GP|21740635|em OSJNBb0043H09.2 {Oryza sativa
(japonica cultivar-group)}, partial (3%)
Length = 424
Score = 102 bits (253), Expect = 2e-21
Identities = 53/133 (39%), Positives = 84/133 (62%), Gaps = 2/133 (1%)
Frame = +1
Query: 1229 LEKAQNRMRQQANKHRRDVQYEVGDLVYLKIQPYKLKSLAKRSNQKLSPRYYGPYPIIAK 1288
LE+AQ+ M + AN HRR VGD VYLKI+P++ S+ R + KL+ R+YGPY ++ +
Sbjct: 25 LERAQSLMVKHANNHRRPHDINVGDWVYLKIRPHRQGSMPPRLHPKLTARFYGPYLVMRQ 204
Query: 1289 INPAAYKLQLPEGSQVHPVFHISLLKKAVNAGVQSQPLPAALTEEWEL--KVEPEAIMDT 1346
+ A++LQLP +++HPVFH+S LK+A+ + LP L + EL V+ I +
Sbjct: 205 VGAVAFQLQLPSEARIHPVFHVSQLKRALGNHQAQEELPPDLEHQAELYFPVQILKIREV 384
Query: 1347 RENRDGDLEVLIR 1359
++ + + +VLIR
Sbjct: 385 QKQHEVERQVLIR 423
>NP334778 reverse transcriptase [Glycine max]
Length = 431
Score = 100 bits (249), Expect = 4e-21
Identities = 54/142 (38%), Positives = 84/142 (59%)
Frame = +3
Query: 543 RFCVDYRALNKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAF 602
R CVDYR LN+A+ D FP+P ID L+ + + +FS +D SGY+QI+M ED+ KT F
Sbjct: 3 RMCVDYRDLNRASPKDNFPLPHIDILMANMASFALFSFMDGFSGYNQIKMAPEDMEKTTF 182
Query: 603 RTHEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFDDILIYSKNEELHKDHL 662
T G + Y V+ FGL N +T+ M + + + K + + D+++ S+ EE H +L
Sbjct: 183 ITLWGTFCYKVMSFGLKNFGATYHRAMVALFQDMMHKEIEAYVDEMIAKSRMEEEHLVNL 362
Query: 663 RIVLQVLKENNLVANQKKCSFG 684
+ + L++ L N +KC FG
Sbjct: 363 QNLFGQLRKYRLRLNPRKCVFG 428
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.318 0.136 0.403
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 57,431,894
Number of Sequences: 63676
Number of extensions: 762284
Number of successful extensions: 4492
Number of sequences better than 10.0: 108
Number of HSP's better than 10.0 without gapping: 4317
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4449
length of query: 1393
length of database: 12,639,632
effective HSP length: 109
effective length of query: 1284
effective length of database: 5,698,948
effective search space: 7317449232
effective search space used: 7317449232
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 65 (29.6 bits)
Lotus: description of TM0029a.5