
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC148359.6 + phase: 0 /pseudo
(2066 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
CF922488 112 1e-24
NP334778 reverse transcriptase [Glycine max] 74 7e-13
BQ628592 42 3e-06
TC233837 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, part... 49 2e-05
TC223727 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, part... 48 2e-05
BI498328 34 0.58
BI969837 33 1.3
CA953394 33 1.3
TC217499 similar to UP|Q6NM71 (Q6NM71) At3g56510, partial (63%) 33 1.7
TC213554 weakly similar to UP|Q8LLE6 (Q8LLE6) RNA-binding protei... 33 1.7
BU927182 32 2.2
TC222660 homologue to UP|Q6XCA4 (Q6XCA4) GPR34-like GPCR (Fragme... 32 2.2
TC219203 similar to UP|Q6JCR5 (Q6JCR5) Cytochrome b (Fragment), ... 32 2.9
TC223864 weakly similar to UP|Q9LIG9 (Q9LIG9) Cytochrome P450-li... 32 3.7
CF921229 32 3.7
TC207027 weakly similar to UP|Q9LW95 (Q9LW95) KED, partial (17%) 32 3.7
TC222466 31 4.9
TC227843 weakly similar to GB|AAM70534.1|21700821|AY124825 AT5g6... 30 8.3
TC212032 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, part... 30 8.3
>CF922488
Length = 741
Score = 112 bits (281), Expect = 1e-24
Identities = 81/222 (36%), Positives = 116/222 (51%)
Frame = +2
Query: 1177 KEGW*SSDVC*LQRPEQSQPKRQLSITSY*CAC**YCSI*GVLLHGRFFRL*SDQDVS*R 1236
+ GW +V L R + SQ K +S T+Y + *+ +LLHG FRL* D+D +
Sbjct: 8 ERGWEGVNVRGLSRSKLSQSKG*VSFTAYQRSRG*HDQFFPILLHGWIFRL*PDKDSTRG 187
Query: 1237 *RKDIFYHSMGYFLLQGDAVRPNQCWCYLSKGNDYSVS*HDSQRS*GICG*HDCEVSR*G 1296
KD F++SMG LL G V +CW + G+ + HD+Q G+ G*HD E+ G
Sbjct: 188 YGKDNFHYSMGNLLL*GYVVWAEECWGNIPAGHGGIILGHDAQGDRGLHG*HDREIKNGG 367
Query: 1297 AAC*IFNKDV*KVKKIQAAIESQQMYIRCQVRKVIRLHCQPKGH*SRS**SPGHPRNASS 1356
F K V* IQ ES+++Y+ +++KV RL+ + + P + +
Sbjct: 368 GTPCQFAKVV*ATT*IQVETESRKVYV*GKIQKVARLY**LERNRGGFEQGESDP*DGQA 547
Query: 1357 TDREASQRFLGTFELHLPIHISYDCNLRADLQAT*KEPACRM 1398
T REAS RFLG ELH IHI +C+LRA +E C+M
Sbjct: 548 TYREASPRFLGEVELHRKIHIIVNCHLRASFHIVVQESVCQM 673
>NP334778 reverse transcriptase [Glycine max]
Length = 431
Score = 73.9 bits (180), Expect = 7e-13
Identities = 53/141 (37%), Positives = 73/141 (51%)
Frame = +2
Query: 1184 DVC*LQRPEQSQPKRQLSITSY*CAC**YCSI*GVLLHGRFFRL*SDQDVS*R*RKDIFY 1243
DVC L R + SQ K S T++ + *+ +L HG F L*SD+D + KD F+
Sbjct: 5 DVCGLSRSKPSQSKG*FSFTAHRHSHG*HGQFCPILFHGWIFGL*SDKDGTRGYGKDNFH 184
Query: 1244 HSMGYFLLQGDAVRPNQCWCYLSKGNDYSVS*HDSQRS*GICG*HDCEVSR*GAAC*IFN 1303
+SMG LL GD V + W L G+ + HD+Q G+CG*+D E+ G F
Sbjct: 185 YSMGNLLL*GDVVWAEEFWGNLPSGHGGIIPGHDAQGDRGLCG*NDREIKNGGGTPSQFA 364
Query: 1304 KDV*KVKKIQAAIESQQMYIR 1324
K V IQ ESQ++ +R
Sbjct: 365 KFVWATT*IQVETESQEVCVR 427
>BQ628592
Length = 423
Score = 41.6 bits (96), Expect(2) = 3e-06
Identities = 28/74 (37%), Positives = 39/74 (51%)
Frame = -3
Query: 1393 EPACRME**MPRSF**HQELPAGTTYPCSTRGRKAFDYVFGSV**IHGMRAWSTR*NWKE 1452
EP +E *+PR +Q P + +T RK F V V ++GMR S R W++
Sbjct: 415 EPGDPVEQ*LPRGLRKNQTEPRESLGAHATCNRKTFSPVHDYVRRVYGMRVGSARRLWEK 236
Query: 1453 RACYLLSKQEVHRL 1466
+LL KQEV+RL
Sbjct: 235 GTSHLLFKQEVYRL 194
Score = 29.6 bits (65), Expect(2) = 3e-06
Identities = 14/48 (29%), Positives = 29/48 (60%)
Frame = -2
Query: 1468 NSVYNA*EDLLCFSLGCQTPASLFGESYNLVDIQNGSDKVYL*ESCCH 1515
+ + N +D+L +G + ++ +SY++ QNGS +++L*E+ H
Sbjct: 191 DELLNVGKDVLYLGMGVTSS*AVHAQSYHVAYFQNGSCEIHL*EAGPH 48
>TC233837 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, partial (6%)
Length = 402
Score = 49.3 bits (116), Expect = 2e-05
Identities = 44/131 (33%), Positives = 67/131 (50%)
Frame = +1
Query: 1219 LLHGRFFRL*SDQDVS*R*RKDIFYHSMGYFLLQGDAVRPNQCWCYLSKGNDYSVS*HDS 1278
L++G F +*S+ D +*R R+D F H MG L D +R + W LS + V *+D+
Sbjct: 4 LIYGWFLGV*SNIDGT*RCREDHFRHPMGDIQL*SDGLRAEKYWGNLSACHGGVVP*YDA 183
Query: 1279 QRS*GICG*HDCEVSR*GAAC*IFNKDV*KVKKIQAAIESQQMYIRCQVRKVIRLHCQPK 1338
+ G+ *HDC+VS * V KV +I + QM+I +V + R++ + +
Sbjct: 184 *GNRGLRR*HDCQVSD*DRTPCQSV*AVWKVAEIPIKTKPNQMHIWGEVGEAARIYRKSE 363
Query: 1339 GH*SRS**SPG 1349
RS S G
Sbjct: 364 RDRDRSRESEG 396
>TC223727 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, partial (9%)
Length = 843
Score = 47.8 bits (112), Expect(2) = 2e-05
Identities = 26/49 (53%), Positives = 30/49 (61%)
Frame = +2
Query: 1843 EQVITGWPWSMIVTNTPESATNVKSMLIRSMCLRMLSMLCHPHGRPSCG 1891
EQVITG PW +IV + +ATNVK I SM R+L M C P G CG
Sbjct: 392 EQVITGLPWKVIVVSM*GNATNVKRSQIMSMPHRIL*MSCPPLGLSPCG 538
Score = 20.8 bits (42), Expect(2) = 2e-05
Identities = 17/55 (30%), Positives = 27/55 (48%)
Frame = +3
Query: 1756 *TLVL*HQAVLAEP*VSAGCFQTR*EDPEKIGR*IPVRWRYSVQEEL*HGIVKMC 1810
* LV HQA+ + + A + *ED EK+G +++V E+ H +C
Sbjct: 132 *ALVFRHQAICHKQRIPARDCRQ**EDIEKVGGRFLHERKHTV*EKPRHETSAVC 296
>BI498328
Length = 335
Score = 34.3 bits (77), Expect = 0.58
Identities = 25/64 (39%), Positives = 32/64 (49%)
Frame = +3
Query: 1584 SRSQ*QVGFSL*WCCQCLW*GNWGSHCIPAGASHSFYRQNLVRMYQQYG*V*SMYLWDRG 1643
+R Q+ L W QC + GS CIP + +SF+ +YQQYG V SM RG
Sbjct: 132 ARRHKQMDCLLRWGIQCFGPRSRGSPCIPG*SVYSFHG*TRF*LYQQYGRVRSMR--PRG 305
Query: 1644 SN*H 1647
S H
Sbjct: 306 SGGH 317
>BI969837
Length = 754
Score = 33.1 bits (74), Expect = 1.3
Identities = 23/65 (35%), Positives = 35/65 (53%), Gaps = 3/65 (4%)
Frame = -3
Query: 1981 FTFGQILFFIQILFFL--HFGPSIEVPKL-HFCPNFFFYFISSFFNFAV*SSSSYFCRKV 2037
F+F + F + ++FL HF V K HFC +FFF+F+ SF +F+ +F V
Sbjct: 683 FSFFYLSLFFKKVYFLTHHFSFXXYVQKTYHFC*SFFFFFL-SFISFS--DYPCHFIHGV 513
Query: 2038 QSSFF 2042
+FF
Sbjct: 512 HLTFF 498
>CA953394
Length = 420
Score = 33.1 bits (74), Expect = 1.3
Identities = 15/47 (31%), Positives = 25/47 (52%)
Frame = -2
Query: 876 LLMLSLKKPLDLVSGLYLLHQEALRQIGMPLTFLQSCMFLSNVSCCF 922
+L++SL L +S L+ A G+P+ FL S ++SCC+
Sbjct: 380 VLVVSLSSCLSFLSALFFSSFIAASMEGLPMNFLNSSSLFQSISCCW 240
>TC217499 similar to UP|Q6NM71 (Q6NM71) At3g56510, partial (63%)
Length = 1019
Score = 32.7 bits (73), Expect = 1.7
Identities = 17/42 (40%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Frame = -3
Query: 2013 FFFYFISSFFNFAV*SSSSYFCRKVQ-SSFFQVQGRAISIQI 2053
FFF+F S FFNFA SS + + SSF + + +S+Q+
Sbjct: 138 FFFFFFSFFFNFAFSSSGAVSVASISASSFSESEAFIVSLQL 13
>TC213554 weakly similar to UP|Q8LLE6 (Q8LLE6) RNA-binding protein AKIP1,
partial (18%)
Length = 451
Score = 32.7 bits (73), Expect = 1.7
Identities = 20/53 (37%), Positives = 29/53 (53%)
Frame = -1
Query: 1972 VLSVLFLFHFTFGQILFFIQILFFLHFGPSIEVPKLHFCPNFFFYFISSFFNF 2024
VL ++FL F F +LF + FFL F + + L+F + FF F+S F F
Sbjct: 319 VLVLVFLVFFFFFLVLFVLLFFFFL-FDLFLYLFFLNFLFDLFFVFLSGIFGF 164
>BU927182
Length = 447
Score = 32.3 bits (72), Expect = 2.2
Identities = 25/66 (37%), Positives = 32/66 (47%), Gaps = 1/66 (1%)
Frame = -2
Query: 1969 F*SVLSVLFLFHFTFGQILFFIQILFF-LHFGPSIEVPKLHFCPNFFFYFISSFFNFAV* 2027
F S S+LF F F ILF +LF + F + + F NFF F S FFNF +
Sbjct: 338 FLSFTSILFNFFLLFASILFNFFLLFASILFNFFLPFTSILF--NFFLSFTSIFFNFFLP 165
Query: 2028 SSSSYF 2033
+S F
Sbjct: 164 FTSFLF 147
>TC222660 homologue to UP|Q6XCA4 (Q6XCA4) GPR34-like GPCR (Fragment), partial
(5%)
Length = 751
Score = 32.3 bits (72), Expect = 2.2
Identities = 24/80 (30%), Positives = 37/80 (46%), Gaps = 2/80 (2%)
Frame = -2
Query: 1948 FHIKFWHGEVTLKYLI*TIGSF*SVLSVLFLFHFTFGQILFFIQILFFLHFGPSI-EVP- 2005
F +++W + KY L + + F F L F++I FF S+ P
Sbjct: 621 FQVRYWGQRSSSKYYYNYCFC*RGSLISFWFWSFNFW-FLVFVKIAFFFFSKYSLFSAP* 445
Query: 2006 KLHFCPNFFFYFISSFFNFA 2025
+L C +FFFYF+ FF F+
Sbjct: 444 QLLACSSFFFYFLCQFFIFS 385
>TC219203 similar to UP|Q6JCR5 (Q6JCR5) Cytochrome b (Fragment), partial (9%)
Length = 623
Score = 32.0 bits (71), Expect = 2.9
Identities = 21/65 (32%), Positives = 33/65 (50%)
Frame = +2
Query: 1969 F*SVLSVLFLFHFTFGQILFFIQILFFLHFGPSIEVPKLHFCPNFFFYFISSFFNFAV*S 2028
F S+L LFL TF +++F + ++FL P V F F F+F+S ++ F
Sbjct: 14 FLSLLVWLFLVSLTF-RVVFLVLSVYFLFSSPFCAVGFFFFLFAFLFFFLSFYWFFLCFL 190
Query: 2029 SSSYF 2033
S +F
Sbjct: 191 LSFFF 205
>TC223864 weakly similar to UP|Q9LIG9 (Q9LIG9) Cytochrome P450-like protein,
partial (5%)
Length = 561
Score = 31.6 bits (70), Expect = 3.7
Identities = 21/52 (40%), Positives = 28/52 (53%)
Frame = -3
Query: 1973 LSVLFLFHFTFGQILFFIQILFFLHFGPSIEVPKLHFCPNFFFYFISSFFNF 2024
L ++F+F ++F FFI LFFL F P + L NF F F S F+F
Sbjct: 343 LFLIFIFFYSFA--FFFISTLFFL-FSPFM----LSLFINFVFLFFSLLFSF 209
>CF921229
Length = 680
Score = 31.6 bits (70), Expect = 3.7
Identities = 22/60 (36%), Positives = 31/60 (51%), Gaps = 6/60 (10%)
Frame = +1
Query: 1971 SVLSVLFLFHFTFGQI---LFFIQILFFLHFGPSIEVPKLHFCPNF-FFYFISSF--FNF 2024
++L+ F FHFTF + +F LFF HF + HF +F F+F+ F FNF
Sbjct: 301 NLLTFFFFFHFTFLLLFPHFYFHFFLFFCHFCFLLLFDHFHFFLSFGHFHFLLLFRHFNF 480
>TC207027 weakly similar to UP|Q9LW95 (Q9LW95) KED, partial (17%)
Length = 1005
Score = 31.6 bits (70), Expect = 3.7
Identities = 28/76 (36%), Positives = 36/76 (46%), Gaps = 4/76 (5%)
Frame = -3
Query: 1971 SVLSVLFLFHFTFGQILFFIQILFFLHFGPSIEVPKLHFCPNFFFYFISS----FFNFAV 2026
S+LS + F F+F F FFL F S+ +P L F FFF F SS F+F +
Sbjct: 268 SILS--YFFFFSFFSSSSFPSSSFFLIFSASV-LPSLSFFSFFFFSFFSSSSSPSFSFPL 98
Query: 2027 *SSSSYFCRKVQSSFF 2042
S+S SFF
Sbjct: 97 TFSTSVLSPLSLFSFF 50
>TC222466
Length = 278
Score = 31.2 bits (69), Expect = 4.9
Identities = 26/76 (34%), Positives = 32/76 (41%), Gaps = 2/76 (2%)
Frame = -1
Query: 1969 F*SVLSVLFLFHFTFG--QILFFIQILFFLHFGPSIEVPKLHFCPNFFFYFISSFFNFAV 2026
F S LS FL F+F LFF + FF FG FF+FI SFF +
Sbjct: 254 FPSFLSFFFLASFSFFLYSFLFFFPVFFF*FFG--------------FFFFILSFFTLYL 117
Query: 2027 *SSSSYFCRKVQSSFF 2042
+F + SS F
Sbjct: 116 ---PPFFASLLLSSLF 78
>TC227843 weakly similar to GB|AAM70534.1|21700821|AY124825 AT5g62930/MQB2_230
{Arabidopsis thaliana;} , partial (64%)
Length = 884
Score = 30.4 bits (67), Expect = 8.3
Identities = 20/75 (26%), Positives = 32/75 (42%), Gaps = 2/75 (2%)
Frame = -3
Query: 1988 FFIQILFFLHFGPSIEVPKLHFCPNFFFYFISSFFNFAV*SSSSYFCRKVQ--SSFFQVQ 2045
F + + FLHFGP I++P HF + N V +SYF + S +
Sbjct: 303 FCLPAIRFLHFGP*IDIPHAHFL---------CYLNTGVCILTSYFICSFRYFGSILTIY 151
Query: 2046 GRAISIQIQVSHFHW 2060
G +S ++ + W
Sbjct: 150 GSCVSQASFLAKWRW 106
>TC212032 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, partial (3%)
Length = 803
Score = 30.4 bits (67), Expect = 8.3
Identities = 14/32 (43%), Positives = 23/32 (71%)
Frame = +3
Query: 1606 WGSHCIPAGASHSFYRQNLVRMYQQYG*V*SM 1637
WG+ + + ++F+ Q VR++QQYG*V*S+
Sbjct: 99 WGNIGLSRQSMYTFHNQAGVRLHQQYG*V*SV 194
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.364 0.161 0.615
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 101,268,008
Number of Sequences: 63676
Number of extensions: 1576257
Number of successful extensions: 27064
Number of sequences better than 10.0: 40
Number of HSP's better than 10.0 without gapping: 11102
Number of HSP's successfully gapped in prelim test: 1183
Number of HSP's that attempted gapping in prelim test: 14576
Number of HSP's gapped (non-prelim): 14985
length of query: 2066
length of database: 12,639,632
effective HSP length: 112
effective length of query: 1954
effective length of database: 5,507,920
effective search space: 10762475680
effective search space used: 10762475680
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 36 (21.5 bits)
S2: 66 (30.0 bits)
Medicago: description of AC148359.6