
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC127428.20 + phase: 0 /pseudo
(1425 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 63 5e-22
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 61 2e-16
TC211311 weakly similar to UP|O24587 (O24587) Pol protein, parti... 75 6e-14
TC232995 66 1e-10
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 42 2e-09
BM143109 55 3e-07
BF596801 weakly similar to GP|29423270|g gag-pol polyprotein {Gl... 48 3e-05
AI959950 33 8e-05
AI966222 36 0.099
TC213445 35 0.18
AI855982 35 0.31
BI469652 weakly similar to GP|18149115|dbj| reverse transcriptas... 32 2.0
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 62.8 bits (151), Expect(2) = 5e-22
Identities = 107/382 (28%), Positives = 167/382 (43%), Gaps = 1/382 (0%)
Frame = +2
Query: 773 KSEVIMVKNLKTSHLNLFVKNMELSMNSLLLELL-NKMGL*KERIELYKKWLEP*FMKTI 831
+S V M ++LKT+ L L +++ S+ S L L NKM K + L KK L FM
Sbjct: 2453 ESGVTMAESLKTASL-LNSAHLKASLMSSLQPLHHNKMA*LKGKTGLCKKLLGSCFMPKN 2629
Query: 832 *LNISGQKQLILHVMSKIESISDLYWRKLHMNSLREEDPISLTFINLDVHVTF*IPKTI* 891
ISG K H S ES + + M S + +S T +L+VHVTF ++
Sbjct: 2630 FPIISGLKP*TQHATSTTESHLEEGLQPHCMKSGKGGSQLSSTSTSLEVHVTFWQIESKG 2809
Query: 892 RNLMPRLKEESF*VTLKGQRHTKCTIQKHIVLKNQCT*SLMIESLEVKLQSKVKVLQVNK 951
+PR+ +E TL+ H + +I + + N LMI + + SK
Sbjct: 2810 ERWIPRVMQEYSWDTLQTAEHIEYSIPEPEL*WNPSMWLLMI*LQQERRMSKK------- 2968
Query: 952 ILKMHQNLI*HSTLKRVQKLNQIQKFNPHQKLNQYQKHLMKMLLKIFQKTLSKQFHQSLN 1011
+ H+ +* LK VQK+ QK L+ LL++ Q +++ L
Sbjct: 2969 -MSEHRETM*QIQLK-VQKM---------------QKTLI--LLQMNQTSINLTRDPPLE 3091
Query: 1012 TNHHILKS*SLETKKVPEEQDHISDKKSL**DCYQ*LNLKQLMKHSQMMNGY*QCRKS*I 1071
+ +S* E + QD + S L+ + +H M +G C+K+
Sbjct: 3092 SRRCTPRS*L*EIQTEESLQDQGRLRLSPIHVLSPKLSPRM*KRH*LMSSGSMLCKKNWS 3271
Query: 1072 SFKEPMCGI*YPNPSRRTLLEQNGYSETS*MRKVK*PETKPDLLHKDIVNKKALIILKLL 1131
+ K G *+ +P L +G S T M+KV *PET+PDLL K + K ++KL
Sbjct: 3272 NSKGMKFGS*FLDPRELM*LAPSGSSRTKPMKKVL*PETRPDLLLKATLRLKV*TLMKLS 3451
Query: 1132 LQLQDWKQSGYFYLCNQSWNNN 1153
L D S + + S N++
Sbjct: 3452 PLLLDLSPSDCYLV*LASSNSS 3517
Score = 61.6 bits (148), Expect(2) = 5e-22
Identities = 69/268 (25%), Positives = 122/268 (44%)
Frame = +3
Query: 1154 ISNRC*ECFS*WCH*RRSICQTTSWF*GS*IS*PCL*A*EITIWLETSSQSLL**TE*FL 1213
+ + C E S W RS+C S S C+ A E ++W+E SS+SL+*
Sbjct: 3519 VPDGCEERVSEWIPE*RSLCGAAKGICRSNSSRSCIQAQEGSLWIEASSKSLV*KANRVP 3698
Query: 1214 NQK*F*KRSG*YNSLQKDS*ERHSDCANIC**YNIWFY*CISLQEIL*VNAGRI*NEHDG 1273
*+ * +SL + + D +IC**+ +W +A I*+E
Sbjct: 3699 YSARV*EGRN*QDSLCQTRC*KLDDSTDIC**HCVWRDVE*DASTFCPTDAI*I*DESCW 3878
Query: 1274 RTKVLSRNSNQSKQRWSLCSSNKIHKGASEEVQTRRLQSDEHSNASYLHLKQRR*WNKSR 1333
R + S +++ R + + ++ K +EV + Q +++ L +R W++
Sbjct: 3879 RADLFSGTPSEADGRLHIPLTKQVCKEHCQEVWDGKCQP*KNTCTYSLEAVKR*SWHQC* 4058
Query: 1334 PEAV*RYDWFFVIFHCI*TIYFVQCVSVCKISIRS*RISLNCC*KNL*VFERNNYSWTPL 1393
++V ++DW IF+ T + + +CKIS +S* SL +N + + + W +
Sbjct: 4059 SKSVQKHDWELTIFNS*QT*HHLCSRCLCKISSQS*DKSLESSKENSEICKWHQ*LWDYV 4238
Query: 1394 *EIH*LSVDWIL*C*LCW*YD*KKINQW 1421
+ + W+L*C*L W *+K + W
Sbjct: 4239 LSLFRFNAGWVL*C*LGWKCR*QKKHFW 4322
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 60.8 bits (146), Expect(2) = 2e-16
Identities = 131/502 (26%), Positives = 206/502 (40%), Gaps = 1/502 (0%)
Frame = +2
Query: 643 IKNGYGIEGWDMLTGD*SLSLASYNLLKACLTLIINQMHSVVHARKEKLSKPLLN*KTLY 702
+K+ YGI+ D+ T + + LL+A + SVV+ R E S+ +
Sbjct: 2060 MKSEYGIKDLDICT*EA*RKSLTKVLLEAFPI*K*KKAESVVNVRLESKSRCPTRSFNIR 2239
Query: 703 QPPDP*NYSILIFLDQFLLHPCMGVNMD*SLLMITADGLG*NS*RAKNMHMKCLAASALK 762
P NY I L M L MI+ D G* +K +
Sbjct: 2240 PLPGCWNYFTWI*WGLCRLKVLEERGMPMLLWMISPDLPG*TLSERNQKPLKYSKS*V*D 2419
Query: 763 YSLKKN*KF*KSEVIMVKNLKTSHLNLFVKNMELSMNSLLLELL-NKMGL*KERIELYKK 821
+ +K +S V M +NLKT+ +L +++ S+ S L L N+MG + + L K+
Sbjct: 2420 FKERKTVSSRESGVTMAENLKTAG-SLNSAHLKASLMSSLQPLHHNRMG*LRGKTGLCKR 2596
Query: 822 WLEP*FMKTI*LNISGQKQLILHVMSKIESISDLYWRKLHMNSLREEDPISLTFINLDVH 881
L FM ISG K H S ES + + M S + +S T +L+VH
Sbjct: 2597 LLGSCFMPKNFPIISGLKP*TQHATSTTESH*EEGLQPPCMKSGKGGSHLSSTSTSLEVH 2776
Query: 882 VTF*IPKTI*RNLMPRLKEESF*VTLKGQRHTKCTIQKHIVLKNQCT*SLMIESLEVKLQ 941
VT ++ +PR+ +E TL+ H + +I + N LMI + +
Sbjct: 2777 VTSWQIESKEERWIPRVMQEYSWDTLQTAEHIEYSIPEPEQ*WNPSMWLLMICLQQERRM 2956
Query: 942 SKVKVLQVNKILKMHQNLI*HSTLKRVQKLNQIQKFNPHQKLNQYQKHLMKMLLKIFQKT 1001
SK + H+ +* LK V+K+ QK L+ LL++ Q +
Sbjct: 2957 SKK--------MSEHRETM*QMQLK-VEKM---------------QKTLI--LLQMNQTS 3058
Query: 1002 LSKQFHQSLNTNHHILKS*SLETKKVPEEQDHISDKKSL**DCYQ*LNLKQLMKHSQMMN 1061
+ L + +S* E + QD + S L+ + +H QM +
Sbjct: 3059 TNPTRDPPLESRRCTPRS*L*EIQTEGSLQDQGRLRSSQTHVLSPKLSPRM*KRH*QMSS 3238
Query: 1062 GY*QCRKS*ISFKEPMCGI*YPNPSRRTLLEQNGYSETS*MRKVK*PETKPDLLHKDIVN 1121
G C+K+ + K G *+ L +G S T M+KV *PET+PD L K +
Sbjct: 3239 GSMLCKKNWSNSKGMKSGS*FLGLRELM*LAPSGSSRTKPMKKVS*PETRPDWLLKATLR 3418
Query: 1122 KKALIILKLLLQLQDWKQSGYF 1143
K +++LL QL D S Y+
Sbjct: 3419 LKV*TLMRLLPQLLDLSPSDYY 3484
Score = 44.7 bits (104), Expect(2) = 2e-16
Identities = 65/268 (24%), Positives = 117/268 (43%)
Frame = +3
Query: 1154 ISNRC*ECFS*WCH*RRSICQTTSWF*GS*IS*PCL*A*EITIWLETSSQSLL**TE*FL 1213
+ + C E S W RS+C S C+ A E ++W+E SS+SL+*
Sbjct: 3516 VPDGCEERISEWIPE*RSLCGAAKGICRPDSSRSCIQAQEGSLWIEASSKSLV*KANRVP 3695
Query: 1214 NQK*F*KRSG*YNSLQKDS*ERHSDCANIC**YNIWFY*CISLQEIL*VNAGRI*NEHDG 1273
*+ * + L + + DC +IC**+ +W +A I*+E
Sbjct: 3696 YSARV*EGRN*QDPLCQTRC*KLDDCTDIC**HCVWRDVE*DASTFCSTDAI*I*DESCW 3875
Query: 1274 RTKVLSRNSNQSKQRWSLCSSNKIHKGASEEVQTRRLQSDEHSNASYLHLKQRR*WNKSR 1333
R + S S+++ + + ++ K +EV QS + + L + ++
Sbjct: 3876 RADLFSGTSSEADGGLHIPLTKQVCKEHCQEVWDGECQS*KDTCTYSLEAVKG*SRHQC* 4055
Query: 1334 PEAV*RYDWFFVIFHCI*TIYFVQCVSVCKISIRS*RISLNCC*KNL*VFERNNYSWTPL 1393
++V ++D IF+ T + + +CKIS +S SL+ +N + + + W +
Sbjct: 4056 SKSVQKHDRELTIFNS*QTRHHLCSRCLCKISSQSQDKSLDSSKENSEICKWH**LWDYV 4235
Query: 1394 *EIH*LSVDWIL*C*LCW*YD*KKINQW 1421
+ + W+L*C*L W *+K + W
Sbjct: 4236 LSLFKSNAGWVL*C*LGWKCR*QKKHFW 4319
>TC211311 weakly similar to UP|O24587 (O24587) Pol protein, partial (15%)
Length = 1213
Score = 75.5 bits (184), Expect(2) = 6e-14
Identities = 65/171 (38%), Positives = 92/171 (53%)
Frame = +2
Query: 1242 IC**YNIWFY*CISLQEIL*VNAGRI*NEHDGRTKVLSRNSNQSKQRWSLCSSNKIHKGA 1301
+C**+N+W +Q + *VN G I*NE++ KV R SN S+ W S +I+K
Sbjct: 503 LC**HNLWCNLKKDVQGVF*VNEGWI*NEYER*AKVPPRTSNHSESLWDFYPSREIYKVP 682
Query: 1302 SEEVQTRRLQSDEHSNASYLHLKQRR*WNKSRPEAV*RYDWFFVIFHCI*TIYFVQCVSV 1361
S++VQ Q+ + AS+ + Q + V* YD FF+IF+ *T Y V + +
Sbjct: 683 SKKVQNG*SQTYGNPYASFHNH*QG*ER*SYFIKGV*WYD*FFIIFNF**TRYCVCRLPL 862
Query: 1362 CKISIRS*RISLNCC*KNL*VFERNNYSWTPL*EIH*LSVDWIL*C*LCW* 1412
CKIS+ S S C *K+L + N S + +*E *+ IL*C CW*
Sbjct: 863 CKISVLSKNFSCYCS*KDLKISCWNY*SLSMV*EKV*V*SFRIL*CLFCW* 1015
Score = 21.6 bits (44), Expect(2) = 6e-14
Identities = 13/37 (35%), Positives = 22/37 (59%)
Frame = +1
Query: 1196 IWLETSSQSLL**TE*FLNQK*F*KRSG*YNSLQKDS 1232
+W ETSS+SL+* + + K +R+ ++QK S
Sbjct: 364 LWFETSSKSLV*KAKFISSFKWIHQRNNGPRTIQKGS 474
>TC232995
Length = 1009
Score = 65.9 bits (159), Expect = 1e-10
Identities = 55/168 (32%), Positives = 90/168 (52%)
Frame = +1
Query: 1175 TTSWF*GS*IS*PCL*A*EITIWLETSSQSLL**TE*FLNQK*F*KRSG*YNSLQKDS*E 1234
TT WF* * + PCL* + ++W ETS ++* +*F + K +R Y+ + K+
Sbjct: 7 TTPWF*NF**TKPCL*ITKGSLWFETSP*GMV*TIK*FSS*KRILQR*SGYHIIHKEKA* 186
Query: 1235 RHSDCANIC**YNIWFY*CISLQEIL*VNAGRI*NEHDGRTKVLSRNSNQSKQRWSLCSS 1294
+ +NIC**YN W +* +Q + A I*N +DGRTKVLS +NQ+ + S
Sbjct: 187 *YFVGSNIC**YNFWIH**FIVQGVFP*YAK*I*NVNDGRTKVLSGITNQANSIRYIHQS 366
Query: 1295 NKIHKGASEEVQTRRLQSDEHSNASYLHLKQRR*WNKSRPEAV*RYDW 1342
+I +G +++ ++ +++ L L+ R W+ R + + R W
Sbjct: 367 IQILQGIDQKIWDG*CKTHVYTDEH*LLLR*R*IWSVYRHKTISRCYW 510
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 41.6 bits (96), Expect(2) = 2e-09
Identities = 30/73 (41%), Positives = 40/73 (54%)
Frame = -1
Query: 1066 CRKS*ISFKEPMCGI*YPNPSRRTLLEQNGYSETS*MRKVK*PETKPDLLHKDIVNKKAL 1125
C+K+*I+ KE MCG * N EQNG+ E +*M E + D K I+ K+
Sbjct: 454 CKKN*INLKEIMCGN**KNLKIILS*EQNGFLEIN*MNMA*LLEIRLD**QKGIIKKRE* 275
Query: 1126 IILKLLLQLQDWK 1138
+ K +LQLQD K
Sbjct: 274 TMKKHMLQLQD*K 236
Score = 40.0 bits (92), Expect(2) = 2e-09
Identities = 24/59 (40%), Positives = 34/59 (56%)
Frame = -2
Query: 1143 FYLCNQSWNNNISNRC*ECFS*WCH*RRSICQTTSWF*GS*IS*PCL*A*EITIWLETS 1201
F +C + +SN C*+CFS W + RRSIC TT * + CL* + ++W +TS
Sbjct: 222 FSICIHNEF*TLSNGC*KCFSKWFNSRRSIC*TTPRL*NPG*TNSCL*IAKGSLWFKTS 46
>BM143109
Length = 415
Score = 54.7 bits (130), Expect = 3e-07
Identities = 44/113 (38%), Positives = 62/113 (53%)
Frame = +3
Query: 1185 S*PCL*A*EITIWLETSSQSLL**TE*FLNQK*F*KRSG*YNSLQKDS*ERHSDCANIC* 1244
+* CL* + IW++TS L+* E* ++ F KR G*Y E ++ +IC*
Sbjct: 33 A*SCL*TEKGFIWIKTSP*GLV*TFE*ISFRQGFFKR*G*Y*PFYLKEIE*YTLSTDIC* 212
Query: 1245 *YNIWFY*CISLQEIL*VNAGRI*NEHDGRTKVLSRNSNQSKQRWSLCSSNKI 1297
*Y WF * SLQ+I A I*N +D K+ S +NQ+ + W++ S KI
Sbjct: 213 *YYFWFN**FSLQKIFSRYAK*I*NVNDA*VKLFSWTTNQANKEWNIYQSIKI 371
>BF596801 weakly similar to GP|29423270|g gag-pol polyprotein {Glycine max},
partial (7%)
Length = 336
Score = 48.1 bits (113), Expect = 3e-05
Identities = 39/97 (40%), Positives = 51/97 (52%)
Frame = +1
Query: 1048 LNLKQLMKHSQMMNGY*QCRKS*ISFKEPMCGI*YPNPSRRTLLEQNGYSETS*MRKVK* 1107
+NLK K M+ G C+K+* + KE M G * N LLEQNG+ E +*M V
Sbjct: 1 VNLKI*KKP**MIIGSLSCKKN*TNLKETMYGN**KNLKIILLLEQNGFLEIN*MNMV*L 180
Query: 1108 PETKPDLLHKDIVNKKALIILKLLLQLQDWKQSGYFY 1144
E KP KDI+ K+ + K +L LQD K F+
Sbjct: 181 LEIKPG**RKDIIKKRE*TMKKHMLLLQD*KPLECFW 291
>AI959950
Length = 466
Score = 33.5 bits (75), Expect(2) = 8e-05
Identities = 31/82 (37%), Positives = 40/82 (47%), Gaps = 1/82 (1%)
Frame = -2
Query: 1065 QCRKS*ISFKEPMCGI*YPNPSRRTLLEQNGYSETS*MRKVK*PETKPDLLHKDIVNKKA 1124
+C+K+ ISFK M R LE NGY T+* R V+ +TK D L K N+K
Sbjct: 390 RCKKNLISFKRIMSRSSLNYQKERR*LE*NGYFVTN*TRMVRL*DTKQD*LLKVTHNRKV 211
Query: 1125 LIILKLLLQLQDWK-QSGYFYL 1145
K L L K + YF+L
Sbjct: 210 *TTQKPLHLLHV*K*YASYFHL 145
Score = 32.3 bits (72), Expect(2) = 8e-05
Identities = 14/37 (37%), Positives = 22/37 (58%)
Frame = -3
Query: 1144 YLCNQSWNNNISNRC*ECFS*WCH*RRSICQTTSWF* 1180
++CN +SN C +C S W + + S+C TT+W *
Sbjct: 152 FICNL**YEVVSNGCKKCISKWLNPKGSLC*TTAWI* 42
>AI966222
Length = 430
Score = 36.2 bits (82), Expect(2) = 0.099
Identities = 32/88 (36%), Positives = 45/88 (50%)
Frame = +2
Query: 821 KWLEP*FMKTI*LNISGQKQLILHVMSKIESISDLYWRKLHMNSLREEDPISLTFINLDV 880
+WLEP M T L+ S K IL+ + + + I D + L MN R+E+P FI L V
Sbjct: 2 RWLEPR*MIT*PLSTSRLK**ILYAIFRTKFI*DPS*KGLPMNYGRDENPTYHIFILLGV 181
Query: 881 HVTF*IPKTI*RNLMPRLKEESF*VTLK 908
V+ * + I* L ++ E TLK
Sbjct: 182 SVSL*TQRII*EKLTQKVTVEYLLHTLK 265
Score = 18.9 bits (37), Expect(2) = 0.099
Identities = 13/39 (33%), Positives = 20/39 (50%)
Frame = +3
Query: 911 RHTKCTIQKHIVLKNQCT*SLMIESLEVKLQSKVKVLQV 949
RH++CT + +LK * L SL S + +LQ+
Sbjct: 273 RHSECTTPEL*LLKKLSI*DLAKISLIKNY*S*MSLLQI 389
>TC213445
Length = 705
Score = 35.4 bits (80), Expect = 0.18
Identities = 22/44 (50%), Positives = 26/44 (59%)
Frame = +1
Query: 1340 YDWFFVIFHCI*TIYFVQCVSVCKISIRS*RISLNCC*KNL*VF 1383
YD F +F T Y V C+ VCKIS +S RIS C *KN +F
Sbjct: 196 YDRIFSLFINKQTSYNV*CLYVCKISSKSQRISPKCH*KNNEIF 327
>AI855982
Length = 484
Score = 34.7 bits (78), Expect = 0.31
Identities = 27/58 (46%), Positives = 32/58 (54%)
Frame = +3
Query: 1045 YQ*LNLKQLMKHSQMMNGY*QCRKS*ISFKEPMCGI*YPNPSRRTLLEQNGYSETS*M 1102
Y *LNLK K M+ G C+K+*I+ KE MCG * N EQNG E +*M
Sbjct: 87 YL*LNLKI*KKP**MITG*LPCKKN*INLKEIMCGN**KNLIIILSYEQNGSLEIN*M 260
>BI469652 weakly similar to GP|18149115|dbj| reverse transcriptase {Silene
noctiflora}, partial (60%)
Length = 427
Score = 32.0 bits (71), Expect = 2.0
Identities = 26/69 (37%), Positives = 36/69 (51%)
Frame = +2
Query: 1144 YLCNQSWNNNISNRC*ECFS*WCH*RRSICQTTSWF*GS*IS*PCL*A*EITIWLETSSQ 1203
+LC+ + + SN *+ F +*R S+CQTTS S P + IW ET +
Sbjct: 221 FLCSS*KHKSFSNGY*KWFFK*LY*RGSVCQTTSKLCRPYTSRPYFQTSKGFIWSETGTL 400
Query: 1204 SLL**TE*F 1212
L+**TE F
Sbjct: 401 CLV**TELF 427
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.377 0.169 0.647
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 59,779,878
Number of Sequences: 63676
Number of extensions: 793824
Number of successful extensions: 12039
Number of sequences better than 10.0: 24
Number of HSP's better than 10.0 without gapping: 3543
Number of HSP's successfully gapped in prelim test: 450
Number of HSP's that attempted gapping in prelim test: 8319
Number of HSP's gapped (non-prelim): 4426
length of query: 1425
length of database: 12,639,632
effective HSP length: 109
effective length of query: 1316
effective length of database: 5,698,948
effective search space: 7499815568
effective search space used: 7499815568
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 13 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 35 (21.6 bits)
S2: 65 (29.6 bits)
Medicago: description of AC127428.20