
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC122728.10 + phase: 0
(2239 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
                                                                     Score   E
Sequences producing significant alignments:                          (bits)  Value
TC223727 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, part...     367  e-101
CF922488                                                                331  2e-90
TC224482 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, part...     276  7e-74
TC212032 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, part...     212  2e-68
NP595172 polyprotein [Glycine max]                                      230  5e-60
NP334778 reverse transcriptase [Glycine max]                            219  8e-57
TC233837 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, part...     204  3e-52
BQ628592                                                                106  2e-47
NP395547 reverse transcriptase [Glycine max]                            188  3e-47
NP395548 reverse transcriptase [Glycine max]                            178  3e-44
TC219643 weakly similar to UP|Q6WAY7 (Q6WAY7) Gag/pol polyprotei...     170  7e-42
TC232528 weakly similar to UP|Q6WAY5 (Q6WAY5) Gag/pol polyprotei...     155  2e-37
BI498328                                                                119  2e-26
AW184779                                                                109  2e-23
BG839293                                                                108  3e-23
BI316922                                                                100  1e-20
CF922341                                                                 99  2e-20
BE804087                                                                 83  2e-15
BI321666                                                                 80  1e-14
TC211067 similar to UP|Q9LQH2 (Q9LQH2) F15O4.13, partial (9%)            63  1e-14
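Note: each summary line above ends with two numeric fields, the bit score and the E-value, and the E-value may be abbreviated (for example "e-101" for 1e-101). The following is a minimal sketch of how such a line could be split; the function name parse_hit_line is hypothetical and is not part of BLAST.

```python
def parse_hit_line(line):
    """Split one summary line into (identifier, description, bit score, E-value).

    Hypothetical helper for illustration; assumes the last two
    whitespace-separated fields are the score and the E-value.
    """
    body, score, evalue = line.rstrip().rsplit(None, 2)
    # The report abbreviates 1e-101 as "e-101"; restore the leading digit.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    ident, _, desc = body.partition(" ")
    return ident, desc.strip(), float(score), float(evalue)


if __name__ == "__main__":
    print(parse_hit_line("NP595172 polyprotein [Glycine max] 230 5e-60"))
    print(parse_hit_line("CF922488 331 2e-90"))
```

Splitting from the right keeps descriptions that themselves contain spaces or numbers intact, and lines with no description (bare EST identifiers) yield an empty description string.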
>TC223727 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, partial (9%)
Length = 843
Score = 367 bits (943), Expect = e-101
Identities = 169/274 (61%), Positives = 209/274 (75%)
Frame = +1
Query: 1786 NHWNDVPIIKGQRLERPSHVFAIGDVIDQAGENVVGYRPWYYDIKQFLLSREYPPGASKQ 1845
N D+P I+ +P+H + + D +PWY+DIK++++S+EY P +
Sbjct: 46 NAARDLPYIEFWCRGKPAHCCQVEEERDG--------KPWYFDIKRYVISKEYLPEIADN 201
Query: 1846 DKKTLRRLAGRFLLDGDILYKRNYDMVLLRCVDEHEAEQLVHDVHDGTFGTHATGHTMSR 1905
DK+TLRRLA F + G ILYKRN+DM LRCVD EA ++ +VH+G+FGTHA GH M+R
Sbjct: 202 DKRTLRRLAAGFFMSGSILYKRNHDMKPLRCVDAREANHMIEEVHEGSFGTHANGHAMAR 381
Query: 1906 KLLRAGYYWMAMEHDCYQYARKCHKCQIYADKIHVPPHALNVMSSPWPFSMWGIDMIGRI 1965
K+LRAGYYW+ ME DC + RKCHKCQ +AD ++ PPH LNVMSSPWPFSMWGID+IG I
Sbjct: 382 KILRAGYYWLTMESDCCVHVRKCHKCQAFADNVNAPPHPLNVMSSPWPFSMWGIDVIGAI 561
Query: 1966 EPKASNGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRYGVPSKIITDNGTN 2025
EPKASNGHRFILVAIDYFTKWVEAASYT+V + VV +FIK IICRYG+P KIITDNGTN
Sbjct: 562 EPKASNGHRFILVAIDYFTKWVEAASYTDVMRGVVVRFIKKEIICRYGLPRKIITDNGTN 741
Query: 2026 LNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAA 2059
LNN ++ +CEEFKI+HHN +PYRP+MN AVE A
Sbjct: 742 LNNKMMGEICEEFKIQHHNPTPYRPKMN*AVEVA 843
>CF922488
Length = 741
Score = 331 bits (849), Expect = 2e-90
Identities = 166/246 (67%), Positives = 192/246 (77%)
Frame = +3
Query: 1242 VPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSP 1301
V K+DGKV MCVD+RDLN ASPKD FPLPHI+VLVDNT FSFMDGFSGYNQIK++P
Sbjct: 3 VLKEDGKV*MCVDYRDLN*ASPKDKFPLPHINVLVDNTTSFSQFSFMDGFSGYNQIKIAP 182
Query: 1302 EDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSAD 1361
ED EKT+FIT WGTFCYK M FGL N GATYQR M LF DM+HKE+EVY+DDMIVKS
Sbjct: 183 EDMEKTTFITLWGTFCYKAMSFGLKNVGATYQRAMVALF*DMMHKEIEVYMDDMIVKSRT 362
Query: 1362 EEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP 1421
EE+H+ L K+F RLRKY+LRLNP KC F V+S KLL FI S +GIEVD +KV+ I EM
Sbjct: 363 EEEHLVNLRKLFRRLRKYRLRLNPAKCMFEVKSRKLLDFIDS*RGIEVDSNKVKVILEMA 542
Query: 1422 APQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQEAFDSIKNYLL 1481
P TEKQV+GFLGRLNYI RFIS + ATC P+F LL KNQ + W+ +C AF+ IK L+
Sbjct: 543 KPHTEKQVQGFLGRLNYIVRFIS*LIATCEPLFILLCKNQFVKWDHDC*VAFERIKQCLI 722
Query: 1482 EPPILV 1487
P +LV
Sbjct: 723 NPHVLV 740
>TC224482 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, partial (6%)
Length = 669
Score = 276 bits (706), Expect = 7e-74
Identities = 129/183 (70%), Positives = 157/183 (85%)
Frame = +1
Query: 2057 EAANKNIKRIVQKMVTTYKDWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVE 2116
EAANKNIK+I+QKM +YKDWHEMLP+ALHGYRT+VR+STGATPFSLVYGMEAVLP EVE
Sbjct: 1 EAANKNIKKIIQKMTVSYKDWHEMLPFALHGYRTSVRTSTGATPFSLVYGMEAVLPFEVE 180
Query: 2117 IPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKRMDAMARGHSYQARMKAAFDKKVNPREFK 2176
+PSLR++ E+ L E+EW Q+RYDQLNLIE KR+ AM+ G YQ RMK+AFDKKV R+F
Sbjct: 181 VPSLRILAESGLKESEWAQTRYDQLNLIEGKRLTAMSHGRLYQQRMKSAFDKKVCLRKFH 360
Query: 2177 VGELVLKRRISQQPDPRGKWTPNYEGPYVVKKAFSGGALILTHMDGVELPNPVNADIVKK 2236
G+LVLK+ D RGKW PNYEGP+VVK+AFSGGAL+LT+MDG ELP+P+N+D+VK+
Sbjct: 361 EGDLVLKKMSHAVKDHRGKWAPNYEGPFVVKRAFSGGALVLTNMDGEELPSPMNSDVVKR 540
Query: 2237 YFA 2239
Y+A
Sbjct: 541 YYA 549
>TC212032 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, partial (3%)
Length = 803
Score = 212 bits (540), Expect(2) = 2e-68
Identities = 102/207 (49%), Positives = 141/207 (67%)
Frame = +2
Query: 1708 IEEAIDMRIKHLDIYGDSALVINQIKGEWETHHAKLIPYHDYARRLLTYFTKVELHHIPR 1767
++ AID +K L +YGDSALVI+Q++GE ET LIPY Y + L +F ++ HH+
Sbjct: 206 VQAAIDSNVKLLKVYGDSALVIHQLRGECETRDPNLIPYQAYIKELAGFFDEISFHHVA* 385
Query: 1768 DENQMADALATLSSMFRVNHWNDVPIIKGQRLERPSHVFAIGDVIDQAGENVVGYRPWYY 1827
+ENQMADALATL SMF++ D+P I+ + RP+H + + D +PWY+
Sbjct: 386 EENQMADALATLVSMFQLTPHGDLPYIEFRCRGRPAHCCLVEEERDG--------KPWYF 541
Query: 1828 DIKQFLLSREYPPGASKQDKKTLRRLAGRFLLDGDILYKRNYDMVLLRCVDEHEAEQLVH 1887
DIK+++ S+EYP AS DK+ RRLA F + G ILYKRN+DMVLL CV+ E E ++
Sbjct: 542 DIKRYVESKEYPLEASDNDKRRKRRLAAGFFMSGSILYKRNHDMVLLHCVNGKEVENMLG 721
Query: 1888 DVHDGTFGTHATGHTMSRKLLRAGYYW 1914
+VH+G+FGTH+ GH M+RK+LRAGYYW
Sbjct: 722 EVHEGSFGTHSNGHAMARKILRAGYYW 802
Score = 68.2 bits (165), Expect(2) = 2e-68
Identities = 30/53 (56%), Positives = 37/53 (69%)
Frame = +1
Query: 1652 DPNSKWGLVFDGAVNAYGKGMGAVIVSPQGHHIPFTARILFECTNNMAEYEAC 1704
+ KW + FD A N G G+GA++VSP IPFT R+ F+CTNNMAEYEAC
Sbjct: 37 EDRDKWIVWFDRASNVLGHGVGAILVSPDNQCIPFTTRLGFDCTNNMAEYEAC 195
>NP595172 polyprotein [Glycine max]
Length = 4659
Score = 230 bits (587), Expect = 5e-60
Identities = 151/465 (32%), Positives = 237/465 (50%), Gaps = 1/465 (0%)
Frame = +1
Query: 1152 LEEGVKQKIIQLLREYPDIFAWSYEDMPGLDPMIVEHRIPTKPECPPVRQKLRRTHPDMA 1211
L + ++ LL Y +FA P + +H IP K PV+ + R
Sbjct: 1660 LPTNIDPELAILLHTYAQVFAVPASLPPQREQ---DHAIPLKQGSGPVKVRPYRYPHTQK 1830
Query: 1212 LKIKSEVQKQIDAGFLMTVEYPEWVANIVPVPKKDGKVRMCVDFRDLNKASPKDNFPLPH 1271
+I+ +Q+ + G + P + I+ V KKDG R C D+R LN + KD+FP+P
Sbjct: 1831 DQIEKMIQEMLVQGIIQPSNSP-FSLPILLVKKKDGSWRFCTDYRALNAITVKDSFPMPT 2007
Query: 1272 IDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPFGLINAGAT 1331
+D L+D ++ FS +D SGY+QI + PEDREKT+F T G + + VMPFGL NA AT
Sbjct: 2008 VDELLDELHGAQYFSKLDLRSGYHQILVQPEDREKTAFRTHHGHYEWLVMPFGLTNAPAT 2187
Query: 1332 YQRGMTTLFHDMIHKEVEVYVDDMIVKSADEEQHVEYLTKMFERLRKYKLRLNPNKCTFG 1391
+Q M +F + K V V+ DD+++ SA + H+++L + + L++++L +KC+FG
Sbjct: 2188 FQCLMNKIFQFALRKFVLVFFDDILIYSASWKDHLKHLESVLQTLKQHQLFARLSKCSFG 2367
Query: 1392 VRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLGRLNYISRFISHMTATCG 1451
LG VS G+ ++ KV+A+ + P P KQ+RGFLG Y RFI G
Sbjct: 2368 DTEVDYLGHKVSGLGVSMENTKVQAVLDWPTPNNVKQLRGFLGLTGYYRRFIKSYANIAG 2547
Query: 1452 PIFKLLRKNQPIVWNDECQEAFDSIKNYLLEPPILVPPVEGRPLIMYLAVFDESMGCVLG 1511
P+ LL+K+ +WN+E + AF +K + E P+L P +P I+ +G VLG
Sbjct: 2548 PLTDLLQKDS-FLWNNEAEAAFVKLKKAMTEAPVLSLPDFSQPFILETDASGIGVGAVLG 2724
Query: 1512 QQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALAWAAKRLRHYLVNHTTWLISRMDP 1571
Q H I Y SKK + + + A+ A + RHYL+ + + +
Sbjct: 2725 QNG------HPIAYFSKKLAPRMQKQSAYTRELLAITEALSKFRHYLLGNKFIIRTDQRS 2886
Query: 1572 IKYIFEKAAVTGKIARWQMLLSEYDIVFKTQ-KAIKGSILADHLA 1615
+K + +++ T + W YD FK + K K + AD L+
Sbjct: 2887 LKSLMDQSLQTPEQQAWLHKFLGYD--FKIEYKPGKDNQAADALS 3015
Score = 92.4 bits (228), Expect = 2e-18
Identities = 137/555 (24%), Positives = 228/555 (40%), Gaps = 14/555 (2%)
Frame = +1
Query: 1666 NAYGKGMGAVIVSPQGHHIPFTARILFECTNNMAEYEACIFGIEEAIDMRIKHLDIYGDS 1725
+A G G+GAV+ GH I + ++ L + Y + I EA+ + +H +
Sbjct: 2689 DASGIGVGAVL-GQNGHPIAYFSKKLAPRMQKQSAYTRELLAITEALS-KFRHYLLGNKF 2862
Query: 1726 ALVINQIKGEWETHHAKLIPYHD-YARRLLTYFTKVELHHIPRDENQMADALATLSSMFR 1784
+ +Q + + P + + L Y K+E + P +NQ ADAL S MF
Sbjct: 2863 IIRTDQRSLKSLMDQSLQTPEQQAWLHKFLGYDFKIE--YKPGKDNQAADAL---SRMFM 3027
Query: 1785 VNHWNDVPIIKGQRLERPSHVFAIGDVIDQAGENVVGYRPWYYDIKQFLLSREYPPGASK 1844
+ W++ P +F +++ ++ +KQ L Y GA
Sbjct: 3028 LA-WSE-----------PHSIF-----LEELRARLISDP----HLKQ--LMETYKQGAD- 3135
Query: 1845 QDKKTLRRLAGRFLLDGDILYKRNYDMVLLRCVDEHEAEQLVHDVHDGTFGTHATGHTMS 1904
A + + +LY + D V++ +E +++ + H G HA G T +
Sbjct: 3136 ---------ASHYTVREGLLYWK--DRVVIPA-EEEIVNKILQEYHSSPIGGHA-GITRT 3276
Query: 1905 RKLLRAGYYWMAMEHDCYQYARKCHKCQIYADKIHVPPHALNVMSSPWPFSMW---GIDM 1961
L+A +YW M+ D Y +KC CQ +P L + P P +W +D
Sbjct: 3277 LARLKAQFYWPKMQEDVKAYIQKCLICQQAKSNNTLPAGLLQPL--PIPQQVWEDVAMDF 3450
Query: 1962 IGRIEPKASNGHRFILVAIDYFTKWVEAASY-TNVTKQVVAKFIKNNIICRYGVPSKIIT 2020
I + S G I+V ID TK+ + +VVA+ ++I+ +G+P I++
Sbjct: 3451 ITGLPN--SFGLSVIMVVIDRLTKYAHFIPLKADYNSKVVAEAFMSHIVKLHGIPRSIVS 3624
Query: 2021 DNGTNLNNNVVQALCEEFKIEHHN---SSPYRPQMNGAVEAANKNIKRIVQKMVTTY-KD 2076
D + Q L FK++ SS Y PQ +G E NK ++ ++ + K
Sbjct: 3625 DRDRVFTSTFWQHL---FKLQGTTLAMSSAYHPQSDGQSEVLNKCLEMYLRCFTYEHPKG 3795
Query: 2077 WHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSEAEWCQS 2136
W + LP+A Y T S G TPF +YG E P+L + AE +
Sbjct: 3796 WVKALPWAEFWYNTAYHMSLGMTPFRALYGREP--------PTLTRQACSIDDPAEVREQ 3951
Query: 2137 RYDQLNLIEEKRMDAMARGHSYQARMKAAFDKKVNPREFKVGELVL-----KRRISQQPD 2191
D+ L+ + +++ Q MK DKK F++G+ VL R+ S
Sbjct: 3952 LTDRDALLAKLKINLTRA----QQVMKRQADKKRLDVSFQIGDEVLVKLQPYRQHSAVLR 4119
Query: 2192 PRGKWTPNYEGPYVV 2206
K + Y GP+ V
Sbjct: 4120 KNQKLSMRYFGPFKV 4164
>NP334778 reverse transcriptase [Glycine max]
Length = 431
Score = 219 bits (559), Expect = 8e-57
Identities = 101/143 (70%), Positives = 117/143 (81%)
Frame = +3
Query: 1250 RMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSF 1309
RMCVD+RDLN+ASPKDNFPLPHID+L+ N A +FSFMDGFSGYNQIKM+PED EKT+F
Sbjct: 3 RMCVDYRDLNRASPKDNFPLPHIDILMANMASFALFSFMDGFSGYNQIKMAPEDMEKTTF 182
Query: 1310 ITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSADEEQHVEYL 1369
IT WGTFCYKVM FGL N GATY R M LF DM+HKE+E YVD+MI KS EE+H+ L
Sbjct: 183 ITLWGTFCYKVMSFGLKNFGATYHRAMVALFQDMMHKEIEAYVDEMIAKSRMEEEHLVNL 362
Query: 1370 TKMFERLRKYKLRLNPNKCTFGV 1392
+F +LRKY+LRLNP KC FG+
Sbjct: 363 QNLFGQLRKYRLRLNPRKCVFGL 431
>TC233837 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, partial (6%)
Length = 402
Score = 204 bits (520), Expect = 3e-52
Identities = 96/133 (72%), Positives = 113/133 (84%)
Frame = +2
Query: 1285 FSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMI 1344
FSFMDGFSGYNQI M+ ED EKT+F+T WGTF Y+VM FGL N GATYQR M LFHDM+
Sbjct: 2 FSFMDGFSGYNQI*MAREDVEKTTFVTLWGTFSYRVMAFGLKNTGATYQRAMVALFHDMM 181
Query: 1345 HKEVEVYVDDMIVKSADEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQ 1404
HKE+EVYVDDMI KS E +H+ L K+F RL+KY+L+LNP KCTFGV+SGKLLGFIVSQ
Sbjct: 182 HKEIEVYVDDMIAKSRTETEHLVNLCKLFGRLQKYQLKLNPTKCTFGVKSGKLLGFIVSQ 361
Query: 1405 KGIEVDPDKVRAI 1417
KGIE+DP+KV+A+
Sbjct: 362 KGIEIDPEKVKAL 400
>BQ628592
Length = 423
Score = 106 bits (264), Expect(2) = 2e-47
Identities = 48/76 (63%), Positives = 58/76 (76%)
Frame = -1
Query: 1457 LRKNQPIVWNDECQEAFDSIKNYLLEPPILVPPVEGRPLIMYLAVFDESMGCVLGQQDET 1516
L KNQ I+WN QEAF+ IK L P +L+PPV GRP ++Y+ + DESMGCVL Q D++
Sbjct: 423 LPKNQAILWNSNYQEAFEKIKQSLANPSVLMPPVTGRPFLLYMTMLDESMGCVLVQHDDS 244
Query: 1517 GKKEHAIYYLSKKFTD 1532
GKKE AIYYLSKKFTD
Sbjct: 243 GKKEQAIYYLSKKFTD 196
Score = 104 bits (259), Expect(2) = 2e-47
Identities = 43/61 (70%), Positives = 56/61 (91%)
Frame = -3
Query: 1537 YTMLEKTCCALAWAAKRLRHYLVNHTTWLISRMDPIKYIFEKAAVTGKIARWQMLLSEYD 1596
Y+MLE+TCC L WA+ RLR Y+++HTTWLIS+MDP+KYIFEK A+TG+IARWQ+LLSE++
Sbjct: 184 YSMLERTCCTLVWASHRLRQYMLSHTTWLISKMDPVKYIFEKPALTGRIARWQVLLSEFN 5
Query: 1597 I 1597
I
Sbjct: 4 I 2
>NP395547 reverse transcriptase [Glycine max]
Length = 762
Score = 188 bits (477), Expect = 3e-47
Identities = 96/252 (38%), Positives = 144/252 (57%), Gaps = 18/252 (7%)
Frame = +1
Query: 1214 IKSEVQKQIDAGFLMTVEYPEWVANIVPVPKKDGKV------------------RMCVDF 1255
++ EV K ++AG + + WV+ + VPKK G RMC+D+
Sbjct: 1 VRKEVFKLLEAGLIYPISDSSWVSPVQVVPKKGGMTVVKNDRNELIPTRRVTRWRMCIDY 180
Query: 1256 RDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGT 1315
R LN+A+ KD++PLP +D ++ A+ + F+DG+SGYNQI + P+D+EKT+F P+
Sbjct: 181 RKLNEATRKDHYPLPFMDQMLKRLARQSFYRFLDGYSGYNQIAVDPQDQEKTAFTCPFSV 360
Query: 1316 FCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSADEEQHVEYLTKMFER 1375
F Y+ MPFGL NA T+QR M +F DM+ K +EV++DD A + L K+ +R
Sbjct: 361 FAYRRMPFGLCNASTTFQRCMMAIFDDMVEKCIEVFMDDFSFFGASFGNCLANLEKVLQR 540
Query: 1376 LRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLGR 1435
K L LN KC F V+ G +LG +S++GIEV +K+ I ++P P K + FLG
Sbjct: 541 CEKSNLVLNWEKCHFMVQEGIVLGHKISKRGIEVVKEKLDVIDKLPPPVNVKGIHSFLGH 720
Query: 1436 LNYISRFISHMT 1447
+ + RFI T
Sbjct: 721 VGFYRRFIKDFT 756
>NP395548 reverse transcriptase [Glycine max]
Length = 762
Score = 178 bits (451), Expect = 3e-44
Identities = 90/252 (35%), Positives = 142/252 (55%), Gaps = 18/252 (7%)
Frame = +1
Query: 1214 IKSEVQKQIDAGFLMTVEYPEWVANIVPVPKKDGKV------------------RMCVDF 1255
++ EV K ++ G + + WV+ ++ V KK+G ++C+D+
Sbjct: 1 VRKEVLKLLEVGLIYPISDSAWVSPVLVVSKKEGMTVIRNEKNDLIPTRTVTSWKLCIDY 180
Query: 1256 RDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGT 1315
R LN+A+ KD+FPLP +D +++ A + F+D + GYNQI + P+D+EK +F P+G
Sbjct: 181 RKLNEATRKDHFPLPFMDQMLERLAGHAYYCFLDAYFGYNQIVVDPKDQEKMAFTCPFGV 360
Query: 1316 FCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSADEEQHVEYLTKMFER 1375
F Y+ +PFGL NA T+Q M +F D++ K +EV++DD V E ++ L + +R
Sbjct: 361 FAYRRIPFGLCNAPTTFQMCMLAIFADIVEKSIEVFMDDFSVFVPSLESCLKKLEMVLQR 540
Query: 1376 LRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLGR 1435
+ L LN KC F VR G +LG +S +GIEVD K+ I ++P P K +R FLG+
Sbjct: 541 CVETNLVLNWEKCHFMVREGIVLGHKISTRGIEVDQTKIDVIEKLPPPSNVKGIRSFLGQ 720
Query: 1436 LNYISRFISHMT 1447
+ RFI T
Sbjct: 721 ARFYRRFIKDFT 756
>TC219643 weakly similar to UP|Q6WAY7 (Q6WAY7) Gag/pol polyprotein (Fragment),
partial (8%)
Length = 1320
Score = 170 bits (430), Expect = 7e-42
Identities = 81/158 (51%), Positives = 112/158 (70%), Gaps = 4/158 (2%)
Frame = +3
Query: 1090 TVPPSFDFP--VYEAEDEEGDNI--PYEITRLLEQEKKAIQPHQEEIELINIGTEENKRE 1145
T PS DF + + EDE +++ P E+ R++ E + + PHQEE EL+++G+ KRE
Sbjct: 846 TWDPSIDFEQKMNQTEDEGNEDVGLPPELERMVAHEDQEMGPHQEETELVDLGSGSGKRE 1025
Query: 1146 IKIGATLEEGVKQKIIQLLREYPDIFAWSYEDMPGLDPMIVEHRIPTKPECPPVRQKLRR 1205
+KIG + +++++I LLR+Y DIFAWSY+DMPGL IV+HR+P PEC PV+QKLRR
Sbjct: 1026 VKIGTGITAPIREELIILLRDYQDIFAWSYQDMPGLSSDIVQHRLPLNPECSPVKQKLRR 1205
Query: 1206 THPDMALKIKSEVQKQIDAGFLMTVEYPEWVANIVPVP 1243
P+ +LKIK EV+K DAGFL YP+WVANIVP+P
Sbjct: 1206 MKPETSLKIKEEVKK*FDAGFLAVARYPKWVANIVPIP 1319
Score = 69.3 bits (168), Expect = 2e-11
Identities = 49/125 (39%), Positives = 59/125 (47%), Gaps = 15/125 (12%)
Frame = +1
Query: 915 PWIHDAGAVTSTLHQKLKFVKNGKLVTIHGEEAYLVSQLSSFSCVEAGSAE-GTAFQGLT 973
PWIH G V STLHQKLKFV G LV + GEE LVS SS VEA TAFQ
Sbjct: 1 PWIHSVGVVPSTLHQKLKFVVEGHLVIVSGEEDILVSCPSSMPYVEAAEESLETAFQSFE 180
Query: 974 IESAESKEAGAAMASLKD-----AQRAIQEGQTAGWG---------KMIQLCENKRKEGL 1019
+ S S ++ L D A+ + G G G +I N+ K GL
Sbjct: 181 VVSISSVDSLFGQPCLSDAAVMMARVMLGNGYEPGMGLGKDNGGITSLINTQGNRGKYGL 360
Query: 1020 GFSPS 1024
G+ P+
Sbjct: 361 GYKPT 375
>TC232528 weakly similar to UP|Q6WAY5 (Q6WAY5) Gag/pol polyprotein
(Fragment), partial (3%)
Length = 449
Score = 155 bits (391), Expect = 2e-37
Identities = 72/137 (52%), Positives = 105/137 (76%)
Frame = +1
Query: 756 SKISVLSLLLSSEAHRNTLLKVLEQAYVDHEVTVDRFGGIVGNITACNNLWFSEEELPEV 815
+++S+L LL+SSE HR L+KVL +A+V +++V+ FGG+V NITA N L F+EEE+P
Sbjct: 31 ARVSLLELLMSSEPHRALLVKVLNEAHVAQDISVEGFGGLVNNITANNYLAFAEEEIPAE 210
Query: 816 GKSHNLALHISLNCKSDMISNVLVDTGSSLNVMPKTTLDQLSYRGTPLRRSTFLVKAFDG 875
G+ HN ALH+S+ C +++ VL+D G SLNVMPK+TLD+L + + L+ S+ +V+AFDG
Sbjct: 211 GRGHNKALHVSVKCMDHIVAKVLIDNGYSLNVMPKSTLDKLPFNASHLKPSSMVVRAFDG 390
Query: 876 SRKNVLGEIDLPITIGP 892
+R+ V GEIDLP+ IGP
Sbjct: 391 TRREVRGEIDLPVQIGP 441
>BI498328
Length = 335
Score = 119 bits (297), Expect = 2e-26
Identities = 63/117 (53%), Positives = 78/117 (65%)
Frame = +1
Query: 1601 TQKAIKGSILADHLAYQPLDDYQLIEFDFPDEEIMYLKSKDCEEPLINEGPDPNSKWGLV 1660
TQKA+KGS LAD+LA PL Y+ + +FPDE+IM L + IN KW +
Sbjct: 4 TQKAVKGSALADYLAQ*PLQGYRPMHPEFPDEDIMALFEEKRTHEDIN-------KWIVC 162
Query: 1661 FDGAVNAYGKGMGAVIVSPQGHHIPFTARILFECTNNMAEYEACIFGIEEAIDMRIK 1717
FDGA NA G G+GAV+VSP IPFTAR+ F+CTNNMAEYEAC G++ AID +K
Sbjct: 163 FDGASNALGHGVGAVLVSPDDQCIPFTARLGFDCTNNMAEYEACALGVQAAIDFDVK 333
>AW184779
Length = 432
Score = 109 bits (272), Expect = 2e-23
Identities = 47/74 (63%), Positives = 59/74 (79%)
Frame = +1
Query: 1859 LDGDILYKRNYDMVLLRCVDEHEAEQLVHDVHDGTFGTHATGHTMSRKLLRAGYYWMAME 1918
L +ILYKRN+DMVLLRCVD EAEQ++ +VH+G+FGTHA H M++K+LR GYYW+ ME
Sbjct: 1 LSRNILYKRNHDMVLLRCVDAREAEQMLVEVHEGSFGTHANIHAMAQKILRVGYYWLTME 180
Query: 1919 HDCYQYARKCHKCQ 1932
DC + KCHKCQ
Sbjct: 181 SDCCIHVWKCHKCQ 222
Score = 84.7 bits (208), Expect = 4e-16
Identities = 50/114 (43%), Positives = 60/114 (51%), Gaps = 22/114 (19%)
Frame = +3
Query: 1911 GYYWMAMEHDCY------------QYA-RKCHKCQIYADKIHVPPHALNVMSSPWPF--- 1954
G W +H C+ Y R H C + VP +M +P P+
Sbjct: 99 GILWNTCQHTCHGPEDSESGVLLAHYGERLLHPC---VEMP*VPVRPSLIMLTPHPYL*M 269
Query: 1955 ------SMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQVVAK 2002
MWGID+IG IEPKASNGH FILVAIDYFTKWVEA SY +VT+ VV +
Sbjct: 270 SWQHLGHMWGIDVIGAIEPKASNGHHFILVAIDYFTKWVEAVSYASVTRSVVIR 431
>BG839293
Length = 781
Score = 108 bits (270), Expect = 3e-23
Identities = 48/110 (43%), Positives = 76/110 (68%), Gaps = 2/110 (1%)
Frame = +1
Query: 1094 SFDFPVYEAEDEEGDNI--PYEITRLLEQEKKAIQPHQEEIELINIGTEENKREIKIGAT 1151
+F+ + EDE +++ P E+ R++ E + + PHQEE EL+++G KRE+KIG
Sbjct: 400 NFEQETSQTEDEGNEDVGLPPELERMVAHEDQEMGPHQEETELVDLGIGSGKREVKIGTG 579
Query: 1152 LEEGVKQKIIQLLREYPDIFAWSYEDMPGLDPMIVEHRIPTKPECPPVRQ 1201
+ +++++I LL++Y DIFAWSY+DMPGL IV+H++P PEC PV+Q
Sbjct: 580 ITAPIREELIILLKDYQDIFAWSYQDMPGLSSDIVQHQLPLNPECSPVKQ 729
>BI316922
Length = 405
Score = 99.8 bits (247), Expect = 1e-20
Identities = 46/135 (34%), Positives = 80/135 (59%)
Frame = +3
Query: 1888 DVHDGTFGTHATGHTMSRKLLRAGYYWMAMEHDCYQYARKCHKCQIYADKIHVPPHALNV 1947
++H G G H+ M+ ++LR GYY M C +Y +KC +C + + H+ L+
Sbjct: 3 EMHRGICGMHSKSQLMTTRVLRVGYY**TMRKYCTEYVKKCEEC*KFGNISHLLVEELHN 182
Query: 1948 MSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNN 2007
+ +PWPF++ G+D++ R P + +++LV ID FTKW+E ++ V KF+ N
Sbjct: 183 IVAPWPFAI*GVDIL-RPFPLSKRQVKYLLVGIDQFTKWIETEHIAIISIANVRKFV*RN 359
Query: 2008 IICRYGVPSKIITDN 2022
I+C +G+P+ +I+DN
Sbjct: 360 IVC*FGIPNTLISDN 404
>CF922341
Length = 675
Score = 99.4 bits (246), Expect = 2e-20
Identities = 63/175 (36%), Positives = 87/175 (49%), Gaps = 10/175 (5%)
Frame = +1
Query: 87 IPGANLTPVSTALTQAATTVTEPIVNAVPL--FVHANAHHGSI----ATTGNMEERMEEL 140
IP NL T LT A V +PL + +H + +TT M E+
Sbjct: 130 IPHHNLADFETCLTYATEGQA---VGGIPLRNTLEGPQYHPQLHLLHSTTSKNPHVMAEM 300
Query: 141 AK--ELRREIKANRGNGDSI--KTQDLCLVSKVDVPKKFKVPEFDKYNGLTCPQNHIVKY 196
K L ++A G D ++L LV + P KFKV +FDKY G TCP+NH+ Y
Sbjct: 301 GKLDHLEEGLRAIEGGEDYAFANLEELFLVPNIITPPKFKVLDFDKYKGTTCPKNHLKMY 480
Query: 197 VRKMGNYKDNDSLMIHYFQDSLMEDAAEWYTSLSKDDVHTFDELAAAFKSHYGFN 251
+KMG Y ++ L+IH FQ+SL A WYT+L VH++ +L AF Y +N
Sbjct: 481 CQKMGAYAKDEELLIHSFQESLTGVAVTWYTNLEPSRVHSWKDLMVAFVRQYQYN 645
>BE804087
Length = 160
Score = 82.8 bits (203), Expect = 2e-15
Identities = 38/53 (71%), Positives = 44/53 (82%)
Frame = +2
Query: 2057 EAANKNIKRIVQKMVTTYKDWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEA 2109
EAANKNIK+ + KM +YKDWHEM +ALH YRT VR+STGATP+SLVYG EA
Sbjct: 2 EAANKNIKKNI*KMTVSYKDWHEMFSFALHMYRTLVRTSTGATPYSLVYGKEA 160
>BI321666
Length = 430
Score = 80.1 bits (196), Expect = 1e-14
Identities = 44/129 (34%), Positives = 71/129 (54%), Gaps = 3/129 (2%)
Frame = +2
Query: 1903 MSRKLLRAGYYWMAMEHDCYQYARKCHKCQ---IYADKIHVPPHALNVMSSPWPFSMWGI 1959
+S +L++ +Y ++ D Y +A+ C+KCQ + + +P H + + F WGI
Sbjct: 2 ISTNVLQSRFYLPSIFKDAYVHAQSCNKCQRTRSVSKRNELPLHTILEVEI---FDYWGI 172
Query: 1960 DMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRYGVPSKII 2019
D +G P SN +ILV +DY +KWVEA + ++V KF+K I R GVP +I
Sbjct: 173 DFVGPFPPSFSN--EYILVVVDYVSKWVEAVACQKSDAKIVIKFLKKQIFSRLGVPWVLI 346
Query: 2020 TDNGTNLNN 2028
+ G++L N
Sbjct: 347 DNGGSHLCN 373
>TC211067 similar to UP|Q9LQH2 (Q9LQH2) F15O4.13, partial (9%)
Length = 589
Score = 63.2 bits (152), Expect(2) = 1e-14
Identities = 42/171 (24%), Positives = 74/171 (42%)
Frame = +1
Query: 1418 REMPAPQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQEAFDSIK 1477
R P ++ +R F G ++ RF+ + + P+ +L++KN W ++ ++AF +K
Sbjct: 88 RMAPTLKSVGDIRSFHGLASFYRRFVPNFSTVASPLNELVKKNMAFTWGEKQEQAFALLK 267
Query: 1478 NYLLEPPILVPPVEGRPLIMYLAVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRY 1537
L + P+L P + + + VL Q H I Y S+K Y
Sbjct: 268 EKLTKAPVLALPDFSKTFELECDASGVGVRAVLLQGG------HPIAYFSEKLHSATLNY 429
Query: 1538 TMLEKTCCALAWAAKRLRHYLVNHTTWLISRMDPIKYIFEKAAVTGKIARW 1588
+K AL A + H+LV + S +KYI K+ + + A+W
Sbjct: 430 PTYDKELYALIRAPQTWEHFLVCKEFVIHSDHQSLKYIRGKSKLNKRHAKW 582
Score = 36.6 bits (83), Expect(2) = 1e-14
Identities = 12/25 (48%), Positives = 21/25 (84%)
Frame = +2
Query: 1399 GFIVSQKGIEVDPDKVRAIREMPAP 1423
GF+V + G+++DP+K++AI+E P P
Sbjct: 29 GFVVGRNGVQMDPEKIKAIQEWPPP 103
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.318 0.135 0.404
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 96,500,840
Number of Sequences: 63676
Number of extensions: 1401086
Number of successful extensions: 9002
Number of sequences better than 10.0: 143
Number of HSP's better than 10.0 without gapping: 8047
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 8755
length of query: 2239
length of database: 12,639,632
effective HSP length: 112
effective length of query: 2127
effective length of database: 5,507,920
effective search space: 11715345840
effective search space used: 11715345840
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 67 (30.4 bits)
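Note: the figures in the statistics block above are related by simple arithmetic. The sketch below recomputes some of them from the printed values, using the standard Karlin-Altschul conversion from a raw score S to bits, (Lambda*S - ln K) / ln 2. All variable and function names are illustrative assumptions. Dividing the nucleotide letter count by 3 is an inference from the printed "length of database" (12,639,632 = 37,918,896 / 3), not something the report states. Because Lambda and K are printed rounded, recomputed bit scores can differ slightly from the printed ones for large raw scores.

```python
import math

# Values copied from the report.
UNGAPPED = (0.318, 0.135)    # Lambda, K
GAPPED = (0.267, 0.0410)     # Lambda, K
QUERY_LEN = 2239
HSP_LEN = 112                # "effective HSP length"
DB_LETTERS = 37_918_896
DB_SEQS = 63_676


def bit_score(raw, lam, k):
    """Convert a raw score to bits: (Lambda*S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


def dropoff_bits(raw, lam):
    """Convert an X dropoff value to bits: Lambda*X / ln 2."""
    return lam * raw / math.log(2)


# Effective lengths: subtract the effective HSP length once from the query
# and once per database sequence (assumes database length in residues is
# the nucleotide count divided by 3, as the printed figures suggest).
eff_query = QUERY_LEN - HSP_LEN
eff_db = DB_LETTERS // 3 - DB_SEQS * HSP_LEN
search_space = eff_query * eff_db

if __name__ == "__main__":
    print("S1 bits:", round(bit_score(41, *UNGAPPED), 1))
    print("S2 bits:", round(bit_score(67, *GAPPED), 1))
    print("X1 bits:", round(dropoff_bits(16, UNGAPPED[0]), 1))
    print("effective search space:", search_space)
```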
Medicago: description of AC122728.10