
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC139747.6 + phase: 0 /pseudo
(1770 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
                                                                     Score  E
Sequences producing significant alignments:                         (bits)  Value
TC223727 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, part...    108  3e-23
AW184779                                                                42  9e-07
TC224482 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, part...     42  4e-06
TC212032 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, part...     43  0.001
BQ628592                                                                30  0.034
TC231625 similar to UP|Q84XK6 (Q84XK6) Peroxisomal targeting sig...     36  0.17
TC226052 similar to UP|Q9SHY3 (Q9SHY3) F1E22.9 (At1g65720/F1E22_...     36  0.17
AI959820 similar to GP|14335118|gb| At1g30200/F12P21_1 {Arabidop...     36  0.17
TC207354 weakly similar to UP|WDR5_HUMAN (P61964) WD-repeat prot...     35  0.38
TC216633 weakly similar to UP|Q6ZQ33 (Q6ZQ33) MKIAA0857 protein ...     35  0.38
TC216000 similar to UP|O22253 (O22253) Photolyase/blue-light rec...     34  0.50
AW397672                                                                34  0.65
AW101289                                                                34  0.65
TC215681 similar to UP|Q9LPR4 (Q9LPR4) F15H18.3, partial (79%)          30  1.00
TC209595 homologue to GB|AAL62011.1|18252261|AY072620 AT5g08330/...     33  1.1
TC210407 similar to UP|O49656 (O49656) Predicted protein, partia...     33  1.1
TC230247 weakly similar to GB|AAP88326.1|32815835|BT009692 At1g6...     33  1.4
TC220939 similar to UP|Q75W15 (Q75W15) 3-deoxy-D-arabino heptulo...     33  1.4
CO981213                                                                33  1.4
BI788366 similar to GP|16604384|gb| AT4g26620/T15N24_70 {Arabido...     33  1.4
>TC223727 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, partial (9%)
Length = 843
Score = 108 bits (269), Expect = 3e-23
Identities = 75/230 (32%), Positives = 124/230 (53%)
Frame = +3
Query: 1364 AMVS*YPSFPSNSKVPTWGIQQGQENIEKIVKPFLP*RRRFV*KEL*WGPT*MCG*A*SR 1423
A+V + + ++P +Q E+IEK+ FL R+ V*++ + +CG +
Sbjct: 135 ALVFRHQAICHKQRIPARDCRQ**EDIEKVGGRFLHERKHTV*EKPRHETSAVCGCQGGK 314
Query: 1424 KIDARDS*RLFRNSLMWARHGEEDIESWVLLDNNAR*LLQSCQEMPQMSNLC*QDSYTTI 1483
D * L N+ A +G+ED +S +LL + +*LL C+EMPQMS++ T
Sbjct: 315 SHDRGSP*GLVWNARQRACYGQEDPKSRLLLAYHGK*LLCPCEEMPQMSSVRR*CQCPTA 494
Query: 1484 YAQCHLFPVALLYVGH*HDWSD*TKSFQWASLHISGYRLFHQVG*SSILRQCDQAGSGQI 1543
++CH+ P+A L+VG+ * + +W+SLH RLFHQVG S L + + GQ+
Sbjct: 495 SSECHVLPLAFLHVGNRCHRGH*AQGLEWSSLHPRSDRLFHQVGRGSFLYRRHEGCGGQV 674
Query: 1544 YQEQHHQPLRCSQQNHHRQWHKSE*QHDERVVR*LQDSTSQFFSLQTSDE 1593
+ E+ H P+ +++++H Q H+ +*Q D +R + + SQ L DE
Sbjct: 675 H*ERDHLPIWFAKEDYHGQRHQPK*QDDGGNLRGV*NPASQSHXLPAKDE 824
>AW184779
Length = 432
Score = 41.6 bits (96), Expect(2) = 9e-07
Identities = 21/41 (51%), Positives = 25/41 (60%)
Frame = +2
Query: 1496 YVGH*HDWSD*TKSFQWASLHISGYRLFHQVG*SSILRQCD 1536
YVGH D S * + F+W SLH S L HQ+G SS + CD
Sbjct: 290 YVGHRRDRSH*AQGFKWTSLHFSRN*LLHQMGRSSFVC*CD 412
Score = 31.2 bits (69), Expect(2) = 9e-07
Identities = 18/42 (42%), Positives = 22/42 (51%)
Frame = +3
Query: 1428 RDS*RLFRNSLMWARHGEEDIESWVLLDNNAR*LLQSCQEMP 1469
R +* + N+ HG ED ES VLL + LL C EMP
Sbjct: 87 RGA*GILWNTCQHTCHGPEDSESGVLLAHYGERLLHPCVEMP 212
>TC224482 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, partial (6%)
Length = 669
Score = 42.4 bits (98), Expect(2) = 4e-06
Identities = 30/92 (32%), Positives = 41/92 (43%)
Frame = +3
Query: 1608 GLA*NVTLRFVWIPYLSANLDRGNPFLFGIWFGGCTTCRG*DSFFKSLDGS*IV*S*MVP 1667
GLA + +R +P SAN++ GN L GIW GGC T G K L I +
Sbjct: 57 GLARDAPIRVTRLPDFSANVNWGNAVLIGIWDGGCVTV*GRSPVIKDLGRIRIKGIRVGS 236
Query: 1668 EQVRPVEFD*GKTYGCFVSWTVISVKDETSIR 1699
+ + *G SW ++ K+E IR
Sbjct: 237 NAL*SAQPH*G*ALNGHESWALVPAKNEECIR 332
Score = 28.1 bits (61), Expect(2) = 4e-06
Identities = 23/65 (35%), Positives = 31/65 (47%)
Frame = +2
Query: 1706 ANSRKEILCSNVSNPFNQILGANGRQTMKVPTW*RELSLVVL*FLQIWMERSYLVL*IRM 1765
A+S +E LC ++ + NG +T K *R L L WM +SYL *I M
Sbjct: 350 ASSMRETLC*RKCPMLSRTIEGNGPRTTKGLLL*RGLFPEEPWCLPTWMVKSYLHP*ILM 529
Query: 1766 QSRNI 1770
S +I
Sbjct: 530 SSSDI 544
>TC212032 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, partial (3%)
Length = 803
Score = 42.7 bits (99), Expect = 0.001
Identities = 57/205 (27%), Positives = 86/205 (41%), Gaps = 8/205 (3%)
Frame = +1
Query: 1258 GSHRFEDQENCHLWRFRSCD*PDQRRMGNSSSWIDSLQRLCKAFADFLQQSRVTSCAPR* 1317
GS+ + Q +W + D P +R M N D+L L + FL + S R
Sbjct: 211 GSN*LQCQATQGVWGLSTGDSPAERGMRN*RPQPDTLPSLYQGIGWFL**DLLPSRCLRG 390
Query: 1318 ESNGRCFSYSILNDQCEWSQYCASNQCP--------ISRPTCLCVCS*SN***QAMVS*Y 1369
+SNGRC C +S + +N +S T + A+V *Y
Sbjct: 391 KSNGRC--------ACHFSVHVPANTAWGPTVH*V*VSWQTRTLLSGGRGTGR*ALVF*Y 546
Query: 1370 PSFPSNSKVPTWGIQQGQENIEKIVKPFLP*RRRFV*KEL*WGPT*MCG*A*SRKIDARD 1429
+ +VPT G +Q Q+ E+I L R+ V ++ * G + SRK
Sbjct: 547 QAIR*KQRVPTGGFRQRQKEKEEIGSRLLHERKHTVQEKP*HGSATLRER*GSRKHAGGG 726
Query: 1430 S*RLFRNSLMWARHGEEDIESWVLL 1454
* LF + WARHG +D++ +LL
Sbjct: 727 P*GLFWYAQ*WARHG*KDLKGGLLL 801
>BQ628592
Length = 423
Score = 29.6 bits (65), Expect(2) = 0.034
Identities = 22/67 (32%), Positives = 34/67 (49%)
Frame = -3
Query: 1013 VE**LSEGF*SNQRISVRTSNSCSSS*RKTFDHVFNSTGRFHGLCFGSTR*NRKERARYL 1072
VE *L G NQ + + ++ RKTF V + R +G+ GS R ++ +L
Sbjct: 400 VEQ*LPRGLRKNQTEPRESLGAHATCNRKTFSPVHDYVRRVYGMRVGSARRLWEKGTSHL 221
Query: 1073 LFE*EVH 1079
LF+ EV+
Sbjct: 220 LFKQEVY 200
Score = 27.3 bits (59), Expect(2) = 0.034
Identities = 12/38 (31%), Positives = 26/38 (67%)
Frame = -2
Query: 1090 ENLLCFSLGCQASPSLYD*SYNLVDIQDGSYQVHI*EA 1127
+++L +G +S +++ SY++ Q+GS ++H+*EA
Sbjct: 170 KDVLYLGMGVTSS*AVHAQSYHVAYFQNGSCEIHL*EA 57
>TC231625 similar to UP|Q84XK6 (Q84XK6) Peroxisomal targeting signal type 2
receptor, partial (33%)
Length = 441
Score = 35.8 bits (81), Expect = 0.17
Identities = 31/87 (35%), Positives = 37/87 (41%), Gaps = 10/87 (11%)
Frame = +3
Query: 9 PPTTIPTKTTATTPSSGSSQSAESKATI*PLANDLWS--IAPFFTCTESGPNNTTSSHSR 66
PP T + TA + SS S +S S + WS P + T GPN TT S S
Sbjct: 195 PPRTSASWATAASTSSTSPRSLPSPSLS-------WSPTTPPTASTTWPGPNPTTPSSSP 353
Query: 67 PSPTL--------VPSGPSLYLPSRGT 85
PSPT P P+ PSR T
Sbjct: 354 PSPTAPLSSTTWPSPQPPTPSAPSRNT 434
>TC226052 similar to UP|Q9SHY3 (Q9SHY3) F1E22.9 (At1g65720/F1E22_13), partial
(39%)
Length = 941
Score = 35.8 bits (81), Expect = 0.17
Identities = 27/86 (31%), Positives = 35/86 (40%)
Frame = +1
Query: 3 PSTTILPPTTIPTKTTATTPSSGSSQSAESKATI*PLANDLWSIAPFFTCTESGPNNTTS 62
P TI PP+ +T++ +PS SS SA + + P CT SGP+
Sbjct: 391 PPRTISPPSATAPRTSSASPSPSSSVSAAAPSPPPP-------------CTSSGPS---- 519
Query: 63 SHSRPSPTLVPSGPSLYLPSRGTRPR 88
SP P PS P TR R
Sbjct: 520 -----SPRATPPPPSTTSPPTKTRSR 582
>AI959820 similar to GP|14335118|gb| At1g30200/F12P21_1 {Arabidopsis
thaliana}, partial (19%)
Length = 342
Score = 35.8 bits (81), Expect = 0.17
Identities = 28/85 (32%), Positives = 35/85 (40%)
Frame = +2
Query: 3 PSTTILPPTTIPTKTTATTPSSGSSQSAESKATI*PLANDLWSIAPFFTCTESGPNNTTS 62
P+TT LPP PT A + +S +S SAES P + + S PN
Sbjct: 107 PTTTPLPPPPPPTSPAAPSGTSSASSSAES*NPSKP------------SVSSSAPNAPPP 250
Query: 63 SHSRPSPTLVPSGPSLYLPSRGTRP 87
SRP P P PS P+ P
Sbjct: 251 PPSRPLPPPPPPPPSPSEPTTTPTP 325
>TC207354 weakly similar to UP|WDR5_HUMAN (P61964) WD-repeat protein 5,
partial (57%)
Length = 1124
Score = 34.7 bits (78), Expect = 0.38
Identities = 27/85 (31%), Positives = 38/85 (43%)
Frame = +2
Query: 3 PSTTILPPTTIPTKTTATTPSSGSSQSAESKATI*PLANDLWSIAPFFTCTESGPNNTTS 62
P+T+ PPTT P + + TP S ++ S S AT P F+ + S PN TS
Sbjct: 299 PTTSAPPPTTAP--SASGTPPSAAAASRSSAATTTP-----------FSASTSTPNRATS 439
Query: 63 SHSRPSPTLVPSGPSLYLPSRGTRP 87
P P+ PS + P + P
Sbjct: 440 C---PVPSTRPSRFGM*RPGNASTP 505
>TC216633 weakly similar to UP|Q6ZQ33 (Q6ZQ33) MKIAA0857 protein (Fragment),
partial (3%)
Length = 1119
Score = 34.7 bits (78), Expect = 0.38
Identities = 29/87 (33%), Positives = 38/87 (43%), Gaps = 1/87 (1%)
Frame = +3
Query: 2 IPSTTILPPTTIPTKTTATTPSSGSSQSAESKATI*PLANDLWSIAPFFTCTESGPNNTT 61
+P T +PP T + TTP +S SA S PL + PF T T + + T
Sbjct: 258 LPCPTTVPPPRRKTLSIITTPPLPTSSSASSSTL--PLRSIPPPPTPFPTPTSTSTSIPT 431
Query: 62 SSHS-RPSPTLVPSGPSLYLPSRGTRP 87
SS + SP P+G S R T P
Sbjct: 432 SSAAPNTSPPSSPTGGSATSTRRSTHP 512
>TC216000 similar to UP|O22253 (O22253) Photolyase/blue-light receptor
(Photolyase/blue light photoreceptor PHR2), partial
(84%)
Length = 1641
Score = 34.3 bits (77), Expect = 0.50
Identities = 33/104 (31%), Positives = 43/104 (40%), Gaps = 4/104 (3%)
Frame = +3
Query: 1 TIPSTT-ILPPTTIPTKTTATTPSSGSSQSAESKATI*PLANDLWSIAPFFTCTESGPNN 59
T P+T PP+ P + + T S+ S+ SA T P +P T S P
Sbjct: 219 TAPATLPTPPPSAAPPSSGSATTSASSTMSASPPPTTTP--------SPSSPSTASTPPT 374
Query: 60 TTSSH---SRPSPTLVPSGPSLYLPSRGTRPRCGALFCS*ERGS 100
T S H +RP+PT PS S P+ R A S GS
Sbjct: 375 TASRHPASTRPAPTAPPSS-STPSPTSAAASRRAAPISSCASGS 503
Score = 30.4 bits (67), Expect = 7.2
Identities = 30/91 (32%), Positives = 34/91 (36%), Gaps = 7/91 (7%)
Frame = +3
Query: 3 PSTTILPPTT-----IPTKTTATTPSSGSSQSAESKATI*PLANDLWSIAPFFTCTESGP 57
PST PPTT T+ T P S S+ S S A AP +C P
Sbjct: 348 PSTASTPPTTASRHPASTRPAPTAPPSSSTPSPTSAAA-------SRRAAPISSCASGSP 506
Query: 58 NNTTSSHSRPS-PT-LVPSGPSLYLPSRGTR 86
S RPS PT P+ SL R R
Sbjct: 507 RRCWWSSPRPSVPTPCTPTARSLTTRRRRRR 599
>AW397672
Length = 418
Score = 33.9 bits (76), Expect = 0.65
Identities = 22/70 (31%), Positives = 34/70 (48%), Gaps = 2/70 (2%)
Frame = +3
Query: 3 PSTTILPPTTIPTKTTATTPSSGSSQSAESKATI*PLANDLWSIAPFFT--CTESGPNNT 60
PS TI P+ +T++ +PS SS +W+ AP T CT +GP++T
Sbjct: 138 PSCTISSPSATAPRTSSASPSPCSS---------------VWAAAPSPTPPCTSTGPSST 272
Query: 61 TSSHSRPSPT 70
++ PS T
Sbjct: 273 RTTPPAPSTT 302
>AW101289
Length = 412
Score = 33.9 bits (76), Expect = 0.65
Identities = 31/85 (36%), Positives = 40/85 (46%), Gaps = 6/85 (7%)
Frame = +3
Query: 9 PPTTIPTKTTATTPSSGSSQSAESKATI*PLANDLWSIAPFFTCTESG----PNNTTSSH 64
PP T P+ T ATT S SS A SK++ W+ F + T P+ TT++
Sbjct: 30 PPWTAPSTTPATT-SPTSSPPASSKSS-------TWTPTSFSSTTSQNSPPPPSETTTTP 185
Query: 65 SRPSP-TLVP-SGPSLYLPSRGTRP 87
S P P T P S P+ PS T P
Sbjct: 186 SSPPPNTATPTSAPTSPHPSGPTPP 260
>TC215681 similar to UP|Q9LPR4 (Q9LPR4) F15H18.3, partial (79%)
Length = 2427
Score = 30.0 bits (66), Expect(2) = 1.00
Identities = 14/19 (73%), Positives = 15/19 (78%)
Frame = +1
Query: 9 PPTTIPTKTTATTPSSGSS 27
PP T PTKTTATT SS S+
Sbjct: 76 PPLTAPTKTTATTHSSFSA 132
Score = 21.6 bits (44), Expect(2) = 1.00
Identities = 13/44 (29%), Positives = 20/44 (44%)
Frame = +3
Query: 50 FTCTESGPNNTTSSHSRPSPTLVPSGPSLYLPSRGTRPRCGALF 93
F+C++S P PSP+ P Y+P+R P +F
Sbjct: 180 FSCSQSEP-------PPPSPSASPRRRPPYIPNRIPDPSYVRIF 290
>TC209595 homologue to GB|AAL62011.1|18252261|AY072620 AT5g08330/F8L15_60
{Arabidopsis thaliana;} , partial (56%)
Length = 869
Score = 33.1 bits (74), Expect = 1.1
Identities = 32/107 (29%), Positives = 42/107 (38%), Gaps = 24/107 (22%)
Frame = +2
Query: 9 PPTTIPTKTTATT-----PSSGSSQSAESKATI*PLANDLWSIAPF-------------- 49
PP + ++ ++ T PSSGSS ++ P L +P
Sbjct: 161 PPASSSSRASSATSPTARPSSGSSARRNPPSSPLPAPAPLRRASPLSPTTSPSSLPLPPS 340
Query: 50 -----FTCTESGPNNTTSSHSRPSPTLVPSGPSLYLPSRGTRPRCGA 91
F T + P TT S S P P PSGP YLP+ T R GA
Sbjct: 341 SSANAFALTTTPPPKTTPSRSSPPPRRPPSGP--YLPA-PTSARSGA 472
>TC210407 similar to UP|O49656 (O49656) Predicted protein, partial (33%)
Length = 1010
Score = 33.1 bits (74), Expect = 1.1
Identities = 30/91 (32%), Positives = 41/91 (44%), Gaps = 10/91 (10%)
Frame = +3
Query: 3 PSTTILPPTTIPTKTTATTPSSGSSQSAESKATI*PLANDLWSIAPFFTCTESGPNN--- 59
PS + PP + T +TATT S SS S S + P ++ S +P T S +
Sbjct: 240 PSGSTTPPPSPITSSTATTFCSSSSSSPSSPS---PSSSSNSSASPSSLPTRSNQKSACP 410
Query: 60 -------TTSSHSRPSPTLVPSGPSLYLPSR 83
T +S + S + PS SL LPSR
Sbjct: 411 WPKPSSATKTSCACSSSSSAPSNSSLTLPSR 503
>TC230247 weakly similar to GB|AAP88326.1|32815835|BT009692 At1g61870
{Arabidopsis thaliana;} , partial (56%)
Length = 1291
Score = 32.7 bits (73), Expect = 1.4
Identities = 27/74 (36%), Positives = 35/74 (46%), Gaps = 1/74 (1%)
Frame = +1
Query: 2 IPSTTILPPTTIPTKTTATTPSSGSSQSAESKATI*PLANDLWSIA-PFFTCTESGPNNT 60
+PS + PPTT P A+ PSS S+ A + AT S A P + T S P+
Sbjct: 229 LPSPSSPPPTTSP----ASAPSSTISKPAPTSATKNSSPTPSSSTARPICSITPSAPSPK 396
Query: 61 TSSHSRPSPTLVPS 74
TS PS +PS
Sbjct: 397 TSPPLAPSKP*IPS 438
>TC220939 similar to UP|Q75W15 (Q75W15) 3-deoxy-D-arabino
heptulosonate-7-phosphate synthase, partial (13%)
Length = 442
Score = 32.7 bits (73), Expect = 1.4
Identities = 30/93 (32%), Positives = 42/93 (44%), Gaps = 6/93 (6%)
Frame = +3
Query: 2 IPSTTILPPTTIPTKTTATTPSSGSSQSAESKATI*PLANDLWSI-APFFTCTESGPNNT 60
+PS+ PP+ T ++ T+PS + + P N W+ +P C S PN
Sbjct: 168 LPSSPSTPPSPPKTPSSPTSPSPKPNNLHPRPRPVQP--NGPWTAGSPRRPC--SCPNTP 335
Query: 61 TSSHSRPS-----PTLVPSGPSLYLPSRGTRPR 88
T SRPS P+L S P+ SR T PR
Sbjct: 336 TRRISRPSSAPSTPSLPSSSPARPGHSRSTSPR 434
>CO981213
Length = 710
Score = 32.7 bits (73), Expect = 1.4
Identities = 29/86 (33%), Positives = 45/86 (51%), Gaps = 3/86 (3%)
Frame = +1
Query: 4 STTILPPTTIPTKTTATTPSSGSSQSAESKATI*PLANDLWSIAPFFTCTESGPNN-TTS 62
STT+LPP + PT +++TT ++ SS S ++ A+ S + + T S P+ T+S
Sbjct: 322 STTLLPPPSAPT-SSSTTATAASSPSTPPQSP----ASTSSSTSKSTSLTSSTPSA*TSS 486
Query: 63 SHSRPSPTLVP--SGPSLYLPSRGTR 86
+PS +L P S P PS R
Sbjct: 487 PKPKPSSSLEPFNSPPQFAPPSPNWR 564
>BI788366 similar to GP|16604384|gb| AT4g26620/T15N24_70 {Arabidopsis
thaliana}, partial (13%)
Length = 421
Score = 32.7 bits (73), Expect = 1.4
Identities = 25/81 (30%), Positives = 32/81 (38%)
Frame = +1
Query: 9 PPTTIPTKTTATTPSSGSSQSAESKATI*PLANDLWSIAPFFTCTESGPNNTTSSHSRPS 68
PP P KTT +T S G + ++ + D S A TC P S H P
Sbjct: 70 PPPPSPPKTTLSTASLGRRCTRKTSPALSTPTTDTCSSATRATC----PGRHASRHPTPI 237
Query: 69 PTLVPSGPSLYLPSRGTRPRC 89
P+ S P PS+ R C
Sbjct: 238 PSPSASPP----PSKPARTTC 288
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.359 0.158 0.619
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 93,596,434
Number of Sequences: 63676
Number of extensions: 1614312
Number of successful extensions: 21299
Number of sequences better than 10.0: 70
Number of HSP's better than 10.0 without gapping: 11538
Number of HSP's successfully gapped in prelim test: 848
Number of HSP's that attempted gapping in prelim test: 8487
Number of HSP's gapped (non-prelim): 14368
length of query: 1770
length of database: 12,639,632
effective HSP length: 111
effective length of query: 1659
effective length of database: 5,571,596
effective search space: 9243277764
effective search space used: 9243277764
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (21.8 bits)
S2: 66 (30.0 bits)