
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0096b.4
(1566 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
NP595172 polyprotein [Glycine max] 1173 0.0
CO982196 206 5e-53
BQ299538 148 5e-51
AI416791 191 2e-48
NP395548 reverse transcriptase [Glycine max] 178 1e-44
BI425021 166 6e-41
NP395547 reverse transcriptase [Glycine max] 166 6e-41
TC212015 123 1e-38
TC211067 similar to UP|Q9LQH2 (Q9LQH2) F15O4.13, partial (9%) 151 2e-36
TC211627 150 4e-36
BI317507 147 3e-35
CF922488 145 2e-34
AW570005 137 5e-32
BQ627806 82 5e-31
BM731326 weakly similar to GP|21740635|em OSJNBb0043H09.2 {Oryza... 121 3e-27
CO979236 120 4e-27
TC211973 weakly similar to UP|Q84ZV5 (Q84ZV5) Polyprotein, parti... 95 1e-26
BI317638 weakly similar to GP|9294238|dbj| contains similarity t... 100 2e-24
NP334778 reverse transcriptase [Glycine max] 107 5e-23
TC213413 weakly similar to UP|Q84ZV5 (Q84ZV5) Polyprotein, parti... 107 5e-23
>NP595172 polyprotein [Glycine max]
Length = 4659
Score = 1173 bits (3035), Expect = 0.0
Identities = 628/1465 (42%), Positives = 898/1465 (60%), Gaps = 8/1465 (0%)
Frame = +1
Query: 70 ADHGSGSGKGSMTRLTGDVLSEFRQSAKKVELPMFDGDDPAGWISRAEVYFRVQDTPPEV 129
+ HG+ + + R S F+ + K++ P FDG + WI +AE +F TP
Sbjct: 229 SSHGASNSQKEQQR------SSFQVRSVKLDFPRFDGKNVMDWIFKAEQFFDYYATPDAD 390
Query: 130 RASLAQLCMEGPTIHFFNSLLSEEENLTWERFKCALLERYGGQGDGDVYEQLTELRQRGT 189
R +A + ++ + ++ L E +W+ F AL +G L +L Q T
Sbjct: 391 RLIIASVHLDQDVVPWYQMLQKTEPFSSWQAFTRALELDFGPSAYDCPRATLFKLNQSAT 570
Query: 190 VEEYITAFEYLTAQIPRLPEKQFLGYFLHGLKGEIRGRVRSMVTMADLSRMKILQIARAV 249
V EY F L ++ L + L F+ GL+ EI R + M + K + +A+
Sbjct: 571 VNEYYMQFTALVNRVDGLSAEAILDCFVSGLQEEIS---RDVKAMEPRTLTKAVALAKLF 741
Query: 250 ERETMGDGGSGHARPTRSSLGGNRANRSGSNRSSDWVFVKGSKETNSGSGYNNSRAGGNG 309
E + + P ++ N A SN S+ + +++ N N
Sbjct: 742 EEK--------YTSPPKTKTFSNLARNFTSNTSATQKYPPTNQK--------NDNPKPNL 873
Query: 310 PRNDRQAQPEKNRSTPRDRGFTHLSYNELMERRQKGLCFKCGGAFHPMHQCPDKQLRVLI 369
P P R++ +S E+ RR+K LC+ C F P H+CP++Q+ +L
Sbjct: 874 P--PLLPTPSTKPFNLRNQNIKKISPAEIQLRREKNLCYFCDEKFSPAHKCPNRQVMLLQ 1047
Query: 370 MEDEEEKEGGGNLLAVEVIEEEESSEGELSSMSLSQVEQVGKDKPQTIKLLGLIQGLPIV 429
+E+ +E + ++ + EE + + + +SL+ + G + TI+ G + G+ +
Sbjct: 1048 LEETDEDQTDEQVM----VTEEANMDDDTHHLSLNAMR--GSNGVGTIRFTGQVGGIAVK 1209
Query: 430 ILIDSGATHNFVSTSLVHKLGKTVVDTPSLRITLGDGSQARTKGKCKELMIIAGNHPLCV 489
IL+D G++ NF+ + L V P+LR+ +G+G +G ++L + + V
Sbjct: 1210 ILVDGGSSDNFIQPRVAQVLKLPVEPAPNLRVLVGNGQILSAEGIVQQLPLHIQGQEVKV 1389
Query: 490 DAQLFELGNVDMVLGIEWLRTLGDMIVNWDKKTMSFWSGHKWVTLQGHEEQEGLLVALQ- 548
L ++ D++LG WL TLG + ++ T+ F+ K++TLQG E L
Sbjct: 1390 PVYLLQISGADVILGSTWLATLGPHVADYAALTLKFFQNDKFITLQGEGNSEATQAQLHH 1569
Query: 549 ------TMISRAGFSGYLGKEKVQLEKDNKGVTGVQQAELDMILERHSVVFQAPKGLPPK 602
T F+ L +++V E K + EL ++L ++ VF P LPP+
Sbjct: 1570 FRRLQNTKSIEECFAIQLIQKEVP-EDTLKDLPTNIDPELAILLHTYAQVFAVPASLPPQ 1746
Query: 603 RNKQHAITLKEGEGPVNVRPYRYPHHQKNEIENQVKELLEGGVIRHSTSSFSSPVILVKK 662
R + HAI LK+G GPV VRPYRYPH QK++IE ++E+L G+I+ S S FS P++LVKK
Sbjct: 1747 REQDHAIPLKQGSGPVKVRPYRYPHTQKDQIEKMIQEMLVQGIIQPSNSPFSLPILLVKK 1926
Query: 663 KDHSWRMCVDYRALNKATIPDKFPIPIIEELLDELHGARYFSKLDLKSGYHQVRVKEEDV 722
KD SWR C DYRALN T+ D FP+P ++ELLDELHGA+YFSKLDL+SGYHQ+ V+ ED
Sbjct: 1927 KDGSWRFCTDYRALNAITVKDSFPMPTVDELLDELHGAQYFSKLDLRSGYHQILVQPEDR 2106
Query: 723 HKTAFRTHEGHYEFLVMPFGLMNAPSTFQSLMNDIFRHLLRKRVLVFFDDILVYSKDWPS 782
KTAFRTH GHYE+LVMPFGL NAP+TFQ LMN IF+ LRK VLVFFDDIL+YS W
Sbjct: 2107 EKTAFRTHHGHYEWLVMPFGLTNAPATFQCLMNKIFQFALRKFVLVFFDDILIYSASWKD 2286
Query: 783 HLEHLQEVLGILREQGLVANRKKCLFGREKVEYLGHMISGQGVEVDPSKVESVTSWPTPK 842
HL+HL+ VL L++ L A KC FG +V+YLGH +SG GV ++ +KV++V WPTP
Sbjct: 2287 HLKHLESVLQTLKQHQLFARLSKCSFGDTEVDYLGHKVSGLGVSMENTKVQAVLDWPTPN 2466
Query: 843 NVKGVRGFLGLTGYYRKFIRDYGKIAKPLTELTKKNGFEWSEKAQEAFETLKKKLTTSPV 902
NVK +RGFLGLTGYYR+FI+ Y IA PLT+L +K+ F W+ +A+ AF LKK +T +PV
Sbjct: 2467 NVKQLRGFLGLTGYYRRFIKSYANIAGPLTDLLQKDSFLWNNEAEAAFVKLKKAMTEAPV 2646
Query: 903 LALPDFSKEFTIECDASGVGVGAILMQEKRPIAYFSKALGVRNLSKSAYEKELMALGLAI 962
L+LPDFS+ F +E DASG+GVGA+L Q PIAYFSK L R +SAY +EL+A+ A+
Sbjct: 2647 LSLPDFSQPFILETDASGIGVGAVLGQNGHPIAYFSKKLAPRMQKQSAYTRELLAITEAL 2826
Query: 963 QHWRPYLLGRHFKVTTDQRSLKELLQQKVVTMEQQNWAAKLLGFDFEISYKPGKLNKGAD 1022
+R YLLG F + TDQRSLK L+ Q + T EQQ W K LG+DF+I YKPGK N+ AD
Sbjct: 2827 SKFRHYLLGNKFIIRTDQRSLKSLMDQSLQTPEQQAWLHKFLGYDFKIEYKPGKDNQAAD 3006
Query: 1023 ALSRVNETLELRQMGSHVDWLGGKDLKEEVSKDEELQRIIKSVHEKKDSSLGYTYENGIL 1082
ALSR+ L H +L ++L+ + D L++++++ + D+S YT G+L
Sbjct: 3007 ALSRM---FMLAWSEPHSIFL--EELRARLISDPHLKQLMETYKQGADAS-HYTVREGLL 3168
Query: 1083 LYEGRLVLPRESPLIHTMLTEFHTTPQGGHSGFYRTYRRLAANVYWRGMKSAVQDFVKQC 1142
++ R+V+P E +++ +L E+H++P GGH+G RT RL A YW M+ V+ ++++C
Sbjct: 3169 YWKDRVVIPAEEEIVNKILQEYHSSPIGGHAGITRTLARLKAQFYWPKMQEDVKAYIQKC 3348
Query: 1143 DVCQRQKYLASSPGGLLQPLPIPERIWEDLSMDFITGLPKSKGFEAILVVVDRLSKYAHF 1202
+CQ+ K + P GLLQPLPIP+++WED++MDFITGLP S G I+VV+DRL+KYAHF
Sbjct: 3349 LICQQAKSNNTLPAGLLQPLPIPQQVWEDVAMDFITGLPNSFGLSVIMVVIDRLTKYAHF 3528
Query: 1203 IPLKHPYTAKSVAEVFGKEIVRLHGVPSSIVSDRDPIFVSNFWRELFKLQGTKLKMSTAY 1262
IPLK Y +K VAE F IV+LHG+P SIVSDRD +F S FW+ LFKLQGT L MS+AY
Sbjct: 3529 IPLKADYNSKVVAEAFMSHIVKLHGIPRSIVSDRDRVFTSTFWQHLFKLQGTTLAMSSAY 3708
Query: 1263 HPESDGQSEVVNRCLETYLRCFIADQPKTWVIWIPWAEYWYNTCFHASTGVTPFEVVYGR 1322
HP+SDGQSEV+N+CLE YLRCF + PK WV +PWAE+WYNT +H S G+TPF +YGR
Sbjct: 3709 HPQSDGQSEVLNKCLEMYLRCFTYEHPKGWVKALPWAEFWYNTAYHMSLGMTPFRALYGR 3888
Query: 1323 PPPTITRWIQGETRVEAVQKELLERDEALRQLRLQLARAQDRMKQFADRKRSDRSFSIGE 1382
PPT+TR V+++L +RD L +L++ L RAQ MK+ AD+KR D SF IG+
Sbjct: 3889 EPPTLTRQACSIDDPAEVREQLTDRDALLAKLKINLTRAQQVMKRQADKKRLDVSFQIGD 4068
Query: 1383 WVFVKLRAHRQKSVVTRIYAKLAAKYYGPYPVVARVGAVAYQLKLPPGSKVHPVFHVSLL 1442
V VKL+ +RQ S V R KL+ +Y+GP+ V+A++G VAY+L+LP +++HPVFHVS L
Sbjct: 4069 EVLVKLQPYRQHSAVLRKNQKLSMRYFGPFKVLAKIGDVAYKLELPSAARIHPVFHVSQL 4248
Query: 1443 KKAVGTYHEGE-ELPDLEGDGGILIEPTEVLATRTVQLQGQSIKQILIQWKGQQPEEATW 1501
K GT + LP + G +++P ++LA+R + I+QIL+QW+ +EATW
Sbjct: 4249 KPFNGTAQDPYLPLPLTVTEMGPVMQPVKILASRIIIRGHNQIEQILVQWENGLQDEATW 4428
Query: 1502 EDVDMIKSQFPSFCLEDKARAYGEG 1526
ED++ IK+ +P+F LEDK GEG
Sbjct: 4429 EDIEDIKASYPTFNLEDKVVFKGEG 4503
>CO982196
Length = 812
Score = 206 bits (525), Expect = 5e-53
Identities = 102/198 (51%), Positives = 134/198 (67%)
Frame = +1
Query: 1047 DLKEEVSKDEELQRIIKSVHEKKDSSLGYTYENGILLYEGRLVLPRESPLIHTMLTEFHT 1106
D +EE+ EL I + + K GY G L ++ RLVL + S I +L E
Sbjct: 217 DWEEEIQAYLELYEIYQGILTKTTKKPGYAIRGGKLYFKDRLVLSKNSTKIPLLLKELQD 396
Query: 1107 TPQGGHSGFYRTYRRLAANVYWRGMKSAVQDFVKQCDVCQRQKYLASSPGGLLQPLPIPE 1166
+P GGHSGF+RT++R+A V+W+GMK +D+V C++C+R K SP GLL LPIP
Sbjct: 397 SPLGGHSGFFRTFKRVANVVFWQGMKKTTRDYVAACEICRRNKTSTLSPAGLL*LLPIPT 576
Query: 1167 RIWEDLSMDFITGLPKSKGFEAILVVVDRLSKYAHFIPLKHPYTAKSVAEVFGKEIVRLH 1226
++W D+SMDFI GLPK++G + ILVVVDRL+KYAHF L HPYTAK VAE+F KE+VRLH
Sbjct: 577 KVWTDISMDFIGGLPKAQGKDNILVVVDRLTKYAHFFALSHPYTAKEVAELFIKELVRLH 756
Query: 1227 GVPSSIVSDRDPIFVSNF 1244
G P+SIVSD +F+S F
Sbjct: 757 GFPASIVSDXXRLFMSLF 810
>BQ299538
Length = 426
Score = 148 bits (374), Expect(2) = 5e-51
Identities = 69/101 (68%), Positives = 81/101 (79%)
Frame = +3
Query: 1248 LFKLQGTKLKMSTAYHPESDGQSEVVNRCLETYLRCFIADQPKTWVIWIPWAEYWYNTCF 1307
+ KL GT LKMST+YHP DGQ+ VN CLET+LRCF+ADQPK V W+ WAEYWYNT F
Sbjct: 129 IVKLPGTYLKMSTSYHP*IDGQT--VNHCLETFLRCFVADQPKM*VQWLSWAEYWYNTNF 302
Query: 1308 HASTGVTPFEVVYGRPPPTITRWIQGETRVEAVQKELLERD 1348
HASTG TPFEVVYGR PP + R++ GE RVEAV++EL +RD
Sbjct: 303 HASTGTTPFEVVYGRKPPVLNRFLPGEVRVEAVRRELQDRD 425
Score = 73.2 bits (178), Expect(2) = 5e-51
Identities = 30/44 (68%), Positives = 40/44 (90%)
Frame = +1
Query: 1205 LKHPYTAKSVAEVFGKEIVRLHGVPSSIVSDRDPIFVSNFWREL 1248
LKHPY+A+ +AE+F KE+V LHGVP+S++SD DPIFVS+FW+EL
Sbjct: 1 LKHPYSARVLAEIFTKEVVHLHGVPASVLSDEDPIFVSSFWKEL 132
>AI416791
Length = 420
Score = 191 bits (485), Expect = 2e-48
Identities = 93/140 (66%), Positives = 110/140 (78%)
Frame = -3
Query: 143 IHFFNSLLSEEENLTWERFKCALLERYGGQGDGDVYEQLTELRQRGTVEEYITAFEYLTA 202
IHFFNSL+ E+E+LTWE K ALLERYGG GDGDVYEQLTEL+Q G+VE+YIT FEYL A
Sbjct: 418 IHFFNSLIGEDEDLTWESLKIALLERYGGHGDGDVYEQLTELKQEGSVEDYITEFEYLIA 239
Query: 203 QIPRLPEKQFLGYFLHGLKGEIRGRVRSMVTMADLSRMKILQIARAVERETMGDGGSGHA 262
QIPRLPEKQF GYFLHGL+ EIRGRVR++ TM ++SR K+LQ+ RAVE+E G GSG
Sbjct: 238 QIPRLPEKQFQGYFLHGLQSEIRGRVRTLATMGEMSRTKLLQVTRAVEKEVKGGNGSGFG 59
Query: 263 RPTRSSLGGNRANRSGSNRS 282
R ++ G R+N G N S
Sbjct: 58 RGPKN--GPYRSNSHGGNNS 5
>NP395548 reverse transcriptase [Glycine max]
Length = 762
Score = 178 bits (452), Expect = 1e-44
Identities = 100/254 (39%), Positives = 147/254 (57%), Gaps = 19/254 (7%)
Frame = +1
Query: 633 IENQVKELLEGGVIRH-STSSFSSPVILVKKKDH------------------SWRMCVDY 673
+ +V +LLE G+I S S++ SPV++V KK+ SW++C+DY
Sbjct: 1 VRKEVLKLLEVGLIYPISDSAWVSPVLVVSKKEGMTVIRNEKNDLIPTRTVTSWKLCIDY 180
Query: 674 RALNKATIPDKFPIPIIEELLDELHGARYFSKLDLKSGYHQVRVKEEDVHKTAFRTHEGH 733
R LN+AT D FP+P ++++L+ L G Y+ LD GY+Q+ V +D K AF G
Sbjct: 181 RKLNEATRKDHFPLPFMDQMLERLAGHAYYCFLDAYFGYNQIVVDPKDQEKMAFTCPFGV 360
Query: 734 YEFLVMPFGLMNAPSTFQSLMNDIFRHLLRKRVLVFFDDILVYSKDWPSHLEHLQEVLGI 793
+ + +PFGL NAP+TFQ M IF ++ K + VF DD V+ S L+ L+ VL
Sbjct: 361 FAYRRIPFGLCNAPTTFQMCMLAIFADIVEKSIEVFMDDFSVFVPSLESCLKKLEMVLQR 540
Query: 794 LREQGLVANRKKCLFGREKVEYLGHMISGQGVEVDPSKVESVTSWPTPKNVKGVRGFLGL 853
E LV N +KC F + LGH IS +G+EVD +K++ + P P NVKG+R FLG
Sbjct: 541 CVETNLVLNWEKCHFMVREGIVLGHKISTRGIEVDQTKIDVIEKLPPPSNVKGIRSFLGQ 720
Query: 854 TGYYRKFIRDYGKI 867
+YR+FI+D+ K+
Sbjct: 721 ARFYRRFIKDFTKV 762
>BI425021
Length = 426
Score = 166 bits (421), Expect = 6e-41
Identities = 78/142 (54%), Positives = 103/142 (71%)
Frame = -1
Query: 1159 LQPLPIPERIWEDLSMDFITGLPKSKGFEAILVVVDRLSKYAHFIPLKHPYTAKSVAEVF 1218
L PLP+P+R WEDLSMDFI GLP G I VVV+R SK H L +TA VA +F
Sbjct: 426 LCPLPVPQRPWEDLSMDFIVGLPPYHGHTTIFVVVNRFSKGIHLGTLPTSHTAHMVASLF 247
Query: 1219 GKEIVRLHGVPSSIVSDRDPIFVSNFWRELFKLQGTKLKMSTAYHPESDGQSEVVNRCLE 1278
+++LHG P SIVSDRDP+F+S+FW++LF+L GT L+MS+AYHP++DGQ+EV+NR +E
Sbjct: 246 LNIVIKLHGFPRSIVSDRDPLFISHFWQDLFRLSGTVLRMSSAYHPQTDGQTEVLNRVIE 67
Query: 1279 TYLRCFIADQPKTWVIWIPWAE 1300
YLR F+ +P+ +IPW E
Sbjct: 66 QYLRAFVHGRPRNLGRFIPWVE 1
>NP395547 reverse transcriptase [Glycine max]
Length = 762
Score = 166 bits (421), Expect = 6e-41
Identities = 96/254 (37%), Positives = 142/254 (55%), Gaps = 19/254 (7%)
Frame = +1
Query: 633 IENQVKELLEGGVIRH-STSSFSSPVILVKKKDHS------------------WRMCVDY 673
+ +V +LLE G+I S SS+ SPV +V KK WRMC+DY
Sbjct: 1 VRKEVFKLLEAGLIYPISDSSWVSPVQVVPKKGGMTVVKNDRNELIPTRRVTRWRMCIDY 180
Query: 674 RALNKATIPDKFPIPIIEELLDELHGARYFSKLDLKSGYHQVRVKEEDVHKTAFRTHEGH 733
R LN+AT D +P+P ++++L L ++ LD SGY+Q+ V +D KTAF
Sbjct: 181 RKLNEATRKDHYPLPFMDQMLKRLARQSFYRFLDGYSGYNQIAVDPQDQEKTAFTCPFSV 360
Query: 734 YEFLVMPFGLMNAPSTFQSLMNDIFRHLLRKRVLVFFDDILVYSKDWPSHLEHLQEVLGI 793
+ + MPFGL NA +TFQ M IF ++ K + VF DD + + + L +L++VL
Sbjct: 361 FAYRRMPFGLCNASTTFQRCMMAIFDDMVEKCIEVFMDDFSFFGASFGNCLANLEKVLQR 540
Query: 794 LREQGLVANRKKCLFGREKVEYLGHMISGQGVEVDPSKVESVTSWPTPKNVKGVRGFLGL 853
+ LV N +KC F ++ LGH IS +G+EV K++ + P P NVKG+ FLG
Sbjct: 541 CEKSNLVLNWEKCHFMVQEGIVLGHKISKRGIEVVKEKLDVIDKLPPPVNVKGIHSFLGH 720
Query: 854 TGYYRKFIRDYGKI 867
G+YR+FI+D+ K+
Sbjct: 721 VGFYRRFIKDFTKV 762
>TC212015
Length = 659
Score = 123 bits (309), Expect(3) = 1e-38
Identities = 62/118 (52%), Positives = 80/118 (67%)
Frame = -2
Query: 127 PEVRASLAQLCMEGPTIHFFNSLLSEEENLTWERFKCALLERYGGQGDGDVYEQLTELRQ 186
P+ L QL ++G TIHFF SLL E +LTWE+ K LLE YGG +GD E+L +RQ
Sbjct: 430 PKCEGELGQLFLDGSTIHFFKSLLDEYPSLTWEKLKSELLE*YGGIDEGDDLERLAVIRQ 251
Query: 187 RGTVEEYITAFEYLTAQIPRLPEKQFLGYFLHGLKGEIRGRVRSMVTMADLSRMKILQ 244
G V++YI FE LTAQ RLP QF GYF+HGLK IRGRVRS+ T+ L + ++++
Sbjct: 250 DGMVDKYILEFETLTAQESRLPNDQFFGYFVHGLKDGIRGRVRSLHTLGPLFQSRMMK 77
Score = 41.6 bits (96), Expect(3) = 1e-38
Identities = 29/71 (40%), Positives = 37/71 (51%)
Frame = -1
Query: 34 TMERNQETLIALLEKSIGKTKVDDDSTGDNVTPAKEADHGSGSGKGSMTRLTGDVLSEFR 93
T E N E L+ LL K+ + DS G++ +P K + L GD L EFR
Sbjct: 653 TAEENHEKLVVLLSKN------NSDSNGES-SPMKLS-----------AILHGDTLDEFR 528
Query: 94 QSAKKVELPMF 104
+S KKVELPMF
Sbjct: 527 KSVKKVELPMF 495
Score = 35.8 bits (81), Expect(3) = 1e-38
Identities = 16/26 (61%), Positives = 18/26 (68%)
Frame = -3
Query: 107 DDPAGWISRAEVYFRVQDTPPEVRAS 132
+DPAGWI EVYF VQ T P V+ S
Sbjct: 489 EDPAGWIICVEVYF*VQGTHPNVKVS 412
>TC211067 similar to UP|Q9LQH2 (Q9LQH2) F15O4.13, partial (9%)
Length = 589
Score = 151 bits (381), Expect = 2e-36
Identities = 77/162 (47%), Positives = 104/162 (63%), Gaps = 1/162 (0%)
Frame = +1
Query: 839 PTPKNVKGVRGFLGLTGYYRKFIRDYGKIAKPLTELTKKN-GFEWSEKAQEAFETLKKKL 897
PT K+V +R F GL +YR+F+ ++ +A PL EL KKN F W EK ++AF LK+KL
Sbjct: 97 PTLKSVGDIRSFHGLASFYRRFVPNFSTVASPLNELVKKNMAFTWGEKQEQAFALLKEKL 276
Query: 898 TTSPVLALPDFSKEFTIECDASGVGVGAILMQEKRPIAYFSKALGVRNLSKSAYEKELMA 957
T +PVLALPDFSK F +ECDASGVGV A+L+Q PIAYFS+ L L+ Y+KEL A
Sbjct: 277 TKAPVLALPDFSKTFELECDASGVGVRAVLLQGGHPIAYFSEKLHSATLNYPTYDKELYA 456
Query: 958 LGLAIQHWRPYLLGRHFKVTTDQRSLKELLQQKVVTMEQQNW 999
L A Q W +L+ + F + +D +SLK + + + W
Sbjct: 457 LIRAPQTWEHFLVCKEFVIHSDHQSLKYIRGKSKLNKRHAKW 582
Score = 33.1 bits (74), Expect = 0.98
Identities = 11/39 (28%), Positives = 23/39 (58%)
Frame = +2
Query: 811 EKVEYLGHMISGQGVEVDPSKVESVTSWPTPKNVKGVRG 849
+ + + G ++ GV++DP K++++ WP P V + G
Sbjct: 11 DNIFFSGFVVGRNGVQMDPEKIKAIQEWPPP*KVWEILG 127
>TC211627
Length = 1034
Score = 150 bits (379), Expect = 4e-36
Identities = 89/236 (37%), Positives = 126/236 (52%), Gaps = 11/236 (4%)
Frame = +3
Query: 1304 NTCFHASTGVTPFEVV----------YGRPPPTITRWIQGETRVEAVQKELLERDEALRQ 1353
N+C ++ P +V YG+PPP + + G + VEAV L
Sbjct: 168 NSCLSLNSATIPLSIVVSASHHLSVMYGKPPPALPLYSAGTSTVEAVDAILHSLATIHHT 347
Query: 1354 LRLQLARAQDRMKQFADRKRSDRSFSIGEWVFVKLRAHRQKSVVTRIYAKLAAKYYGPYP 1413
L +L + QD MK+ AD R D +F+IG+WV+V+L +RQ S+ + Y KL+ ++YGPY
Sbjct: 348 LTCRLQKYQDSMKRIADSHRRDLTFNIGDWVYVRL*PYRQTSIQST-YTKLSKRFYGPYQ 524
Query: 1414 VVARVGAVAYQLKLPPGSKVHPVFHVSLLKKAVGTY-HEGEELPDLEGDGGILIEPTEVL 1472
+ ARVG VAY+L+LPP SK+HP+FHVSLLK G E LP L++P + L
Sbjct: 525 IQARVGQVAYRLQLPPTSKIHPIFHVSLLKVHHGPIPPELLALPPFSTTNHPLVQPLQFL 704
Query: 1473 ATRTVQLQGQSIKQILIQWKGQQPEEATWEDVDMIKSQFPSFCLEDKARAYGEGID 1528
+ + I Q+L+QW PE+ TWE +K + LEDK GID
Sbjct: 705 DWKMDESTTPPIPQVLVQWTNLAPEDTTWESWTQLKDIYD---LEDKVCFQTGGID 863
Score = 99.0 bits (245), Expect = 1e-20
Identities = 43/79 (54%), Positives = 56/79 (70%)
Frame = +2
Query: 1239 IFVSNFWRELFKLQGTKLKMSTAYHPESDGQSEVVNRCLETYLRCFIADQPKTWVIWIPW 1298
IF+S W ELF + GTKL+ STAYHP++DGQ+EV+NR LE YLR F+ D P+ W ++
Sbjct: 2 IFISGLWHELFHISGTKLRFSTAYHPQTDGQTEVINRILEQYLRAFVHDHPQHWFKFLSL 181
Query: 1299 AEYWYNTCFHASTGVTPFE 1317
AE YNT H+ G +PFE
Sbjct: 182 AE*CYNTSVHSGIGFSPFE 238
>BI317507
Length = 359
Score = 147 bits (372), Expect = 3e-35
Identities = 69/116 (59%), Positives = 89/116 (76%), Gaps = 1/116 (0%)
Frame = -1
Query: 770 FDDILVYSKDWPSHLEHLQEVLGILREQGLVANRKKCLFGREKVEYLGHMISGQGVEVDP 829
F +IL+YS DW SH+ HL VL +L+++ LVANRKKC F + +EYLGH+IS V +D
Sbjct: 359 FYNILIYSPDWKSHIMHLTAVLDVLKKERLVANRKKCYFSQTTIEYLGHVISKDCVAMDS 180
Query: 830 SKVESVTSWPTPKNVKGVRGFLGLTGYYRKFIRDYGKIA-KPLTELTKKNGFEWSE 884
+KV+SV WP PKNVK V FL LTGYYRKFI+DYGK+A +PLT+LTK +GF+W +
Sbjct: 179 NKVKSVIEWPVPKNVKRVCSFLRLTGYYRKFIKDYGKLAPRPLTDLTKNDGFKWGD 12
>CF922488
Length = 741
Score = 145 bits (365), Expect = 2e-34
Identities = 91/245 (37%), Positives = 133/245 (54%), Gaps = 1/245 (0%)
Frame = +3
Query: 660 VKKKDHSWRMCVDYRALNKATIPDKFPIPIIEELLDELHGARYFSKLDLKSGYHQVRVKE 719
V K+D MCVDYR LN A+ DKFP+P I L+D FS +D SGY+Q+++
Sbjct: 3 VLKEDGKV*MCVDYRDLN*ASPKDKFPLPHINVLVDNTTSFSQFSFMDGFSGYNQIKIAP 182
Query: 720 EDVHKTAFRTHEGHYEFLVMPFGLMNAPSTFQSLMNDIFRHLLRKRVLVFFDDILVYSKD 779
ED+ KT F T G + + M FGL N +T+Q M +F ++ K + V+ DD++V S+
Sbjct: 183 EDMEKTTFITLWGTFCYKAMSFGLKNVGATYQRAMVALF*DMMHKEIEVYMDDMIVKSRT 362
Query: 780 WPSHLEHLQEVLGILREQGLVANRKKCLFGREKVEYLGHMISGQGVEVDPSKVESVTSWP 839
HL +L+++ LR+ L N KC+F + + L + S +G+EVD +KV+ +
Sbjct: 363 EEEHLVNLRKLFRRLRKYRLRLNPAKCMFEVKSRKLLDFIDS*RGIEVDSNKVKVILEMA 542
Query: 840 TPKNVKGVRGFLGLTGYYRKFIRDYGKIAKPLTELTKKNGF-EWSEKAQEAFETLKKKLT 898
P K V+GFLG Y +FI +PL L KN F +W AFE +K+ L
Sbjct: 543 KPHTEKQVQGFLGRLNYIVRFIS*LIATCEPLFILLCKNQFVKWDHDC*VAFERIKQCLI 722
Query: 899 TSPVL 903
VL
Sbjct: 723 NPHVL 737
>AW570005
Length = 413
Score = 137 bits (344), Expect = 5e-32
Identities = 66/135 (48%), Positives = 89/135 (65%)
Frame = -2
Query: 839 PTPKNVKGVRGFLGLTGYYRKFIRDYGKIAKPLTELTKKNGFEWSEKAQEAFETLKKKLT 898
P P+ + +RGFL LTG+YR+FI+ Y +A PL+ L K+ F WS +A AF+ LK +T
Sbjct: 412 PPPRTARSLRGFLRLTGFYRRFIKGYAAMAAPLSHLLTKDSFVWSPEADVAFQALKNVVT 233
Query: 899 TSPVLALPDFSKEFTIECDASGVGVGAILMQEKRPIAYFSKALGVRNLSKSAYEKELMAL 958
+ VLALPDF+K FT+E DASG +GA+L QE PIA+FSK + + S Y EL A+
Sbjct: 232 NTLVLALPDFTKPFTVETDASGSDMGAVLSQEGHPIAFFSKEFCPKLVRSSTYVHELAAI 53
Query: 959 GLAIQHWRPYLLGRH 973
++ WR YLLG H
Sbjct: 52 TNVVKKWRQYLLGHH 8
>BQ627806
Length = 435
Score = 81.6 bits (200), Expect(2) = 5e-31
Identities = 38/71 (53%), Positives = 50/71 (69%)
Frame = +1
Query: 946 LSKSAYEKELMALGLAIQHWRPYLLGRHFKVTTDQRSLKELLQQKVVTMEQQNWAAKLLG 1005
L S Y +EL A+ +A++ WR YLLG HF + TD RSLKEL+ Q V T EQQ + A+L+G
Sbjct: 220 LRASTYVRELAAITVAVKKWRQYLLGHHFVILTDHRSLKELMSQAVQTPEQQIYLARLMG 399
Query: 1006 FDFEISYKPGK 1016
FD+ I Y+ GK
Sbjct: 400 FDYTIQYRAGK 432
Score = 73.2 bits (178), Expect(2) = 5e-31
Identities = 34/68 (50%), Positives = 45/68 (66%)
Frame = +3
Query: 874 LTKKNGFEWSEKAQEAFETLKKKLTTSPVLALPDFSKEFTIECDASGVGVGAILMQEKRP 933
L K+ F W+E+A AF LK L +PVL LPDF+ F +E DASG+G+GAIL Q P
Sbjct: 3 LLVKDQFHWNEEADRAFSQLKLALCQAPVLGLPDFNSSFVVETDASGIGMGAILSQNHHP 182
Query: 934 IAYFSKAL 941
+A+F A+
Sbjct: 183 LAFFIYAI 206
>BM731326 weakly similar to GP|21740635|em OSJNBb0043H09.2 {Oryza sativa
(japonica cultivar-group)}, partial (3%)
Length = 424
Score = 121 bits (303), Expect = 3e-27
Identities = 61/136 (44%), Positives = 92/136 (66%), Gaps = 1/136 (0%)
Frame = +1
Query: 1356 LQLARAQDRMKQFADRKRSDRSFSIGEWVFVKLRAHRQKSVVTRIYAKLAAKYYGPYPVV 1415
+ L RAQ M + A+ R ++G+WV++K+R HRQ S+ R++ KL A++YGPY V+
Sbjct: 19 IXLERAQSLMVKHANNHRRPHDINVGDWVYLKIRPHRQGSMPPRLHPKLTARFYGPYLVM 198
Query: 1416 ARVGAVAYQLKLPPGSKVHPVFHVSLLKKAVGTYHEGEEL-PDLEGDGGILIEPTEVLAT 1474
+VGAVA+QL+LP +++HPVFHVS LK+A+G + EEL PDLE + P ++L
Sbjct: 199 RQVGAVAFQLQLPSEARIHPVFHVSQLKRALGNHQAQEELPPDLEHQAELYF-PVQILKI 375
Query: 1475 RTVQLQGQSIKQILIQ 1490
R VQ Q + +Q+LI+
Sbjct: 376 REVQKQHEVERQVLIR 423
>CO979236
Length = 729
Score = 120 bits (302), Expect = 4e-27
Identities = 83/214 (38%), Positives = 118/214 (54%), Gaps = 8/214 (3%)
Frame = -1
Query: 120 FRVQDTPPEVRASLAQLCMEGPTIHFFNSLLSEEENLTWERFKCALLERYGGQGDGDVYE 179
F+VQ+T EV+ LAQ+ ME T+HFFN LL E ENLTWE K L++RY G+ G YE
Sbjct: 729 FQVQETSDEVKVRLAQISMENGTVHFFNLLLLENENLTWEELKYELMQRYAGK--GTAYE 556
Query: 180 QLTELRQRGTVEEYITAFEYLTAQIPRLPEKQFLGYFLHGLKGEIRGRVRSMVTMADLSR 239
QL+ L+Q ++++YI FE L AQIP+L ++Q+ F HGLK S+ L+R
Sbjct: 555 QLSALQQTRSIDDYIQRFECLIAQIPKLQDEQYFACFTHGLK--------SLHVANLLTR 400
Query: 240 MKILQIARAVERET-------MGDGGSGHARPTRSSLG-GNRANRSGSNRSSDWVFVKGS 291
+++ IARAV+ E +G G + TR++ G N +G S ++ G
Sbjct: 399 GRLMNIARAVKMEMTSKNRAWVGRGDTCMESGTRTASGQRNPLVATGRTGSRNY----GG 232
Query: 292 KETNSGSGYNNSRAGGNGPRNDRQAQPEKNRSTP 325
K TN G G N A G+G +Q Q ++ P
Sbjct: 231 KGTN-GPGPN---ARGDGAEKYKQGQ*DRGAIVP 142
>TC211973 weakly similar to UP|Q84ZV5 (Q84ZV5) Polyprotein, partial (4%)
Length = 730
Score = 94.7 bits (234), Expect(2) = 1e-26
Identities = 47/111 (42%), Positives = 74/111 (66%)
Frame = +1
Query: 1335 TRVEAVQKELLERDEALRQLRLQLARAQDRMKQFADRKRSDRSFSIGEWVFVKLRAHRQK 1394
+R+E V K ++ RD L LR L ++ D M +++R D + +G+ VF+K++ +R++
Sbjct: 7 SRLEEVNKLIIARDGLLATLRENLLKS*DIM*ANTNKRRRDIEYVVGD*VFLKMQPYRRR 186
Query: 1395 SVVTRIYAKLAAKYYGPYPVVARVGAVAYQLKLPPGSKVHPVFHVSLLKKA 1445
S+ RI KL+ ++Y P+ V +VG +AY+L LP K+HPVFHVSLLKKA
Sbjct: 187 SLAKRINEKLSPRFYAPFQVFNKVGTIAYKLDLPSHIKIHPVFHVSLLKKA 339
Score = 45.4 bits (106), Expect(2) = 1e-26
Identities = 34/110 (30%), Positives = 50/110 (44%)
Frame = +3
Query: 1456 PDLEGDGGILIEPTEVLATRTVQLQGQSIKQILIQWKGQQPEEATWEDVDMIKSQFPSFC 1515
P L D + VL +R +LQ ++K +LIQWK P E +WE V ++ F +
Sbjct: 357 PMLSEDWKLQTYSDSVLDSR--ELQPGNVK-VLIQWKNLPPSENSWESVAKLQEIFSIYH 527
Query: 1516 LEDKARAYGEGIDRTQGPQQASDAPLIADGAVGPKIWKIYSRRSKVGMTH 1565
LEDK G GID+ + P I K+Y+R+ G +
Sbjct: 528 LEDKVSLLGGGIDKHKHK---------------PPIPKVYTRKHPRGANY 632
>BI317638 weakly similar to GP|9294238|dbj| contains similarity to reverse
transcriptase~gene_id:K11J14.5 {Arabidopsis thaliana},
partial (5%)
Length = 420
Score = 99.8 bits (247), Expect(2) = 2e-24
Identities = 55/108 (50%), Positives = 67/108 (61%)
Frame = -2
Query: 1142 CDVCQRQKYLASSPGGLLQPLPIPERIWEDLSMDFITGLPKSKGFEAILVVVDRLSKYAH 1201
C CQ KY LL PL +P R WEDLS+DFITGL AILVVVD SK H
Sbjct: 326 CLDCQHTKYETKRIVDLLCPLLVPHRPWEDLSLDFITGLLPYHVHTAILVVVDHFSKGIH 147
Query: 1202 FIPLKHPYTAKSVAEVFGKEIVRLHGVPSSIVSDRDPIFVSNFWRELF 1249
L +TA +VA +F + +LHG+P S+VSD D +FVS+FW+ELF
Sbjct: 146 LGMLPSSHTAHTVACLFIDSVAKLHGLPRSLVSDCDLLFVSHFWQELF 3
Score = 32.7 bits (73), Expect(2) = 2e-24
Identities = 13/29 (44%), Positives = 19/29 (64%)
Frame = -1
Query: 1107 TPQGGHSGFYRTYRRLAANVYWRGMKSAV 1135
T GGH+G +T L+ N+YW GM++ V
Sbjct: 420 TTTGGHTGIAKTLA*LSKNIYWFGMRTDV 334
>NP334778 reverse transcriptase [Glycine max]
Length = 431
Score = 107 bits (266), Expect = 5e-23
Identities = 55/142 (38%), Positives = 86/142 (59%)
Frame = +3
Query: 668 RMCVDYRALNKATIPDKFPIPIIEELLDELHGARYFSKLDLKSGYHQVRVKEEDVHKTAF 727
RMCVDYR LN+A+ D FP+P I+ L+ + FS +D SGY+Q+++ ED+ KT F
Sbjct: 3 RMCVDYRDLNRASPKDNFPLPHIDILMANMASFALFSFMDGFSGYNQIKMAPEDMEKTTF 182
Query: 728 RTHEGHYEFLVMPFGLMNAPSTFQSLMNDIFRHLLRKRVLVFFDDILVYSKDWPSHLEHL 787
T G + + VM FGL N +T+ M +F+ ++ K + + D+++ S+ HL +L
Sbjct: 183 ITLWGTFCYKVMSFGLKNFGATYHRAMVALFQDMMHKEIEAYVDEMIAKSRMEEEHLVNL 362
Query: 788 QEVLGILREQGLVANRKKCLFG 809
Q + G LR+ L N +KC+FG
Sbjct: 363 QNLFGQLRKYRLRLNPRKCVFG 428
>TC213413 weakly similar to UP|Q84ZV5 (Q84ZV5) Polyprotein, partial (5%)
Length = 761
Score = 107 bits (266), Expect = 5e-23
Identities = 61/153 (39%), Positives = 87/153 (55%), Gaps = 3/153 (1%)
Frame = +1
Query: 1379 SIGEWVFVKLRAHRQKSVVTRIYAKLAAKYYGPYPVVARVGAVAYQLKLPPGSKVHPVFH 1438
+IG+WV VKLR HRQ S Y+KL +YYGP+ V R+G V Y+LKL S++HPVFH
Sbjct: 1 NIGDWVLVKLRPHRQGSASETTYSKLTKRYYGPFEVQERLGKVVYRLKLTAHSRIHPVFH 180
Query: 1439 VSLLKKAVG---TYHEGEELPDLEGDGGILIEPTEVLATRTVQLQGQSIKQILIQWKGQQ 1495
VSLLK VG T H G LP + + P V+ ++ V + +L+QW
Sbjct: 181 VSLLKAFVGDPETTHAG-PLPVMRTEEA-TNTPLTVIDSKLVPADNGPRRMVLVQWPSAS 354
Query: 1496 PEEATWEDVDMIKSQFPSFCLEDKARAYGEGID 1528
++A+WED +++ ++ LEDK + G D
Sbjct: 355 RQDASWEDWQVLRERYN---LEDKVLSEERGDD 444
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.317 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 66,031,313
Number of Sequences: 63676
Number of extensions: 919813
Number of successful extensions: 4640
Number of sequences better than 10.0: 110
Number of HSP's better than 10.0 without gapping: 4526
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4600
length of query: 1566
length of database: 12,639,632
effective HSP length: 110
effective length of query: 1456
effective length of database: 5,635,272
effective search space: 8204956032
effective search space used: 8204956032
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 65 (29.6 bits)
Lotus: description of TM0096b.4