
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC147714.4 - phase: 0 /pseudo
(1920 letters)
Database: LJGI
28,460 sequences; 14,692,800 total letters
Searching..................................................done
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
BI417841                                                             71   1e-12
BP065719                                                             65   1e-10
TC11573 similar to UP|Q9M6N4 (Q9M6N4) Pol protein integrase regi...  62   6e-10
TC16320 weakly similar to UP|Q84TE0 (Q84TE0) At5g51080, partial ...  58   2e-08
AV427422                                                             53   4e-07
BF177840                                                             47   4e-05
TC14274 weakly similar to GB|AAF79559.1|8778551|AC022464 F22G5.3...  36   0.065
BP079571                                                             33   0.32
AV409036                                                             32   0.94
TC18087 similar to UP|Q9LF46 (Q9LF46) 2-hydroxyphytanoyl-CoA lya...  32   0.94
BG662122                                                             32   0.94
TC15802 similar to UP|Q9XIV1 (Q9XIV1) mRNA expressed in cucumber...  31   2.1
TC19101 weakly similar to UP|Q93WC9 (Q93WC9) Adenylate isopenten...  30   3.6
TC13054 similar to UP|NO93_SOYBN (Q02921) Early nodulin 93 (N-93...  30   3.6
BP030435                                                             30   4.7
TC9635                                                               30   4.7
TC15341 similar to UP|Q9M0H8 (Q9M0H8) Predicted proline-rich pro...  29   7.9
TC15253 weakly similar to UP|Q9ZT16 (Q9ZT16) Arabinogalactan-pro...  29   7.9
AV778839                                                             29   7.9
>BI417841
Length = 617
Score = 71.2 bits (173), Expect = 1e-12
Identities = 40/142 (28%), Positives = 68/142 (47%), Gaps = 2/142 (1%)
Frame = +1
Query: 1330 GEGPDPNSKWGLVFDGAA--NAYGKGIGAVIVSPQGHHIPFTAQILFECTNNMVEYEACI 1387
G + N L FDG++ N G GAV+ + G + + + + TNN EY I
Sbjct: 109 GRQSNANRSCTLEFDGSSKGNPGSAGAGAVLRAEDGSKV-YLREGVGNQTNNQAEYRGLI 285
Query: 1388 FGIEEAIGMRIKHLDIYGDSALIINQIKGEWETHHAKLIPYRDYARRLLTYFTKVELHHI 1447
G++ A +H+++ GDS L+ Q++G W+ + + + A+ L + F +++H+
Sbjct: 286 LGLKHAHEQGYQHINVKGDSQLVCKQVEGSWKARNPNIASLCNEAKELKSKFQSFDINHV 465
Query: 1448 PRDENQMADALATLSSMFRVNH 1469
PR N AD A L H
Sbjct: 466 PRQYNSEADVQANLGVNLPAGH 531
>BP065719
Length = 567
Score = 65.1 bits (157), Expect = 1e-10
Identities = 42/175 (24%), Positives = 86/175 (49%), Gaps = 2/175 (1%)
Frame = +3
Query: 125 QHGSIPVTKTMEEMMEELAKELRHEIKANRGNADSFKTQDLCLVSKVDVPKKFKIPDFDR 184
+H + + + ++E+L + H N F + + V + ++P+ +K+P F +
Sbjct: 24 RHQNAGGQQNVAAVVEQLLNQ--HGFNVGFANRPHFVSAFIEEVLESELPRGWKVPKFTK 197
Query: 185 YNGLTCPQN--HIIKYVRKMGNYKDNDSLMIHCFQDSLMEDAAEWYTSLSKNDIHTFDEL 242
++G + HI +Y + G+ N++L + F SL ++A W+T+L+ +HT+ +L
Sbjct: 198 FSGDSGESTVEHIARYQIEAGDLAINENLKMKYFPSSLTKNAFTWFTTLAPRSVHTWAQL 377
Query: 243 AAAFKSHYGFNTRLKPNREFLRSLSQKKEESFREYAQRWRGAAARITPALDEEEM 297
F + F K + + L S+ +K ES +Y R+R +R + E E+
Sbjct: 378 ERIFHEQF-FRGECKVSXKDLASVKRKPAESIDDYLNRFRMLKSRCFTHVSEHEL 539
>TC11573 similar to UP|Q9M6N4 (Q9M6N4) Pol protein integrase region
(Fragment), partial (10%)
Length = 572
Score = 62.4 bits (150), Expect = 6e-10
Identities = 36/96 (37%), Positives = 49/96 (50%), Gaps = 1/96 (1%)
Frame = +2
Query: 1703 DNGTNLNNNVVQALCEEFKIEHHNSSPYRPQINGVVEAANKNI-KRIVQKMVTTYRDWHE 1761
+NG + Q C+ I+ SS PQ NG EAANK I K I +++ W +
Sbjct: 8 ENGIQFTSKQTQDFCDGMGIQMRFSSVKHPQTNGQTEAANKVILKGIKRRLYEAEGRWID 187
Query: 1762 MLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEV 1797
LP L Y T +SS TPF L YG + +LP+E+
Sbjct: 188 ELPIVLWSYNTMPQSSIKETPF*LTYGADTMLPVEI 295
>TC16320 weakly similar to UP|Q84TE0 (Q84TE0) At5g51080, partial (11%)
Length = 632
Score = 57.8 bits (138), Expect = 2e-08
Identities = 28/91 (30%), Positives = 44/91 (47%)
Frame = +2
Query: 1383 YEACIFGIEEAIGMRIKHLDIYGDSALIINQIKGEWETHHAKLIPYRDYARRLLTYFTKV 1442
Y I G++ AI KH+ + GDS L+ NQ++G W+ + + A+ L F
Sbjct: 2 YRGLILGLKHAIKEGYKHIQVKGDSMLVCNQVQGLWKIKNQNIASLCSEAKELKNKFLSF 181
Query: 1443 ELHHIPRDENQMADALATLSSMFRVNHWNDV 1473
+++HIPR+ N AD A R +V
Sbjct: 182 KINHIPREYNSEADVQANFGISLRAGQVEEV 274
>AV427422
Length = 417
Score = 53.1 bits (126), Expect = 4e-07
Identities = 32/133 (24%), Positives = 63/133 (47%), Gaps = 2/133 (1%)
Frame = +1
Query: 1652 SNGHRFILVAIDYFTKWVEAASYTN-VTKQVVAKFIKNNIICRYGVPSKIITDNGTNLNN 1710
S G+ +LV +D +K+ + T +V+A ++ +GVP I++D +
Sbjct: 19 SKGYEAVLVVVDRLSKFSHFVPLKHPYTAKVIADIFVREVVRLHGVPLSIVSDRDPLFMS 198
Query: 1711 NVVQALCEEFKIEHHNSSPYRPQINGVVEAANKNIKRIVQKMVTTY-RDWHEMLPYALHG 1769
N + L + + S+ Y P+ +G E N+ ++ ++ + + W +P+A +
Sbjct: 199 NFWKELFKMQGTKLKMSTAYHPESDGQTEVVNRCLETYLRCFIADQPKSWAHWVPWAEYW 378
Query: 1770 YRTTVRSSTGATP 1782
Y T+ STG TP
Sbjct: 379 YNTSYHVSTGQTP 417
>BF177840
Length = 410
Score = 46.6 bits (109), Expect = 4e-05
Identities = 25/101 (24%), Positives = 49/101 (47%), Gaps = 1/101 (0%)
Frame = +2
Query: 1698 SKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQINGVVEAANKNIKRIVQKMVT-TY 1756
+ I++D T ++ + L + + S+ PQ +G E NK + +++ ++
Sbjct: 11 TSIVSDRDTKFISHFWRTLWGKVGTKLLYSTTCHPQTDGQTEVVNKTLSTLLRSVLERNL 190
Query: 1757 RDWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEV 1797
+ W LP+ Y V S+T +PF +VYG + PL++
Sbjct: 191 KMWETWLPHIEFAYNRVVHSTTKHSPFEIVYGYNPLTPLDL 313
>TC14274 weakly similar to GB|AAF79559.1|8778551|AC022464 F22G5.35
{Arabidopsis thaliana;} , partial (16%)
Length = 1208
Score = 35.8 bits (81), Expect = 0.065
Identities = 28/85 (32%), Positives = 43/85 (49%), Gaps = 4/85 (4%)
Frame = +3
Query: 37 PIATAIPV-TTSMLPTASADARFAMPAGFPYGLPPFFTPSTAAGTSGTANNVLIPATNAV 95
P A++ P TTSM+P ASA++ + P P ST++ S A N I T A
Sbjct: 330 PSASSSPTSTTSMIPPASANSPSSAPPAAP------TARSTSSSASLVAPNPSITPTQAP 491
Query: 96 SINATLPQTT---AAVTEPLVHAIP 117
+ TL T +++TE ++H +P
Sbjct: 492 TPTLTLTLTVTLRSSITEDILHHLP 566
>BP079571
Length = 414
Score = 33.5 bits (75), Expect = 0.32
Identities = 29/86 (33%), Positives = 38/86 (43%), Gaps = 13/86 (15%)
Frame = +3
Query: 389 MPPSYPYQKQQLPVQ-------QQQQNQ--QARPTFPPIPMLYAELLPTLLHRGHCTTRQ 439
+PPSYP QK LP+Q Q+ Q Q P FP + +PT TT Q
Sbjct: 141 IPPSYPQQKATLPLQIFTCTFTQRLQGIP*QKCPGFPQWKQCVSPSIPT------TTT*Q 302
Query: 440 GK----PPPDPLPPRFRSDLKCDFHQ 461
+ P P PPR S+L+ H+
Sbjct: 303 QRHHHNP*PSDQPPRIESNLRRRLHR 380
>AV409036
Length = 429
Score = 32.0 bits (71), Expect = 0.94
Identities = 19/64 (29%), Positives = 28/64 (43%)
Frame = +2
Query: 70 PFFTPSTAAGTSGTANNVLIPATNAVSINATLPQTTAAVTEPLVHAIPQGVNINTQHGSI 129
P TPST + + T L PAT+A + T P+ ++ T P A N ++
Sbjct: 185 PAVTPSTPSSATTTTTTSLSPATSAKTAAVTGPKAASSATSPSAVAAGSPPNAPVSPKTL 364
Query: 130 PVTK 133
P K
Sbjct: 365 PRRK 376
>TC18087 similar to UP|Q9LF46 (Q9LF46) 2-hydroxyphytanoyl-CoA lyase-like
protein, partial (30%)
Length = 571
Score = 32.0 bits (71), Expect = 0.94
Identities = 19/73 (26%), Positives = 36/73 (49%), Gaps = 3/73 (4%)
Frame = +2
Query: 27 AAQEQSATAIPIATAIPVTTSMLPTASADARFAMPAGFPYGL---PPFFTPSTAAGTSGT 83
+A S T+ P+AT P+T ++ P ++ + F +PA L P+ TP + +
Sbjct: 167 SASSPSTTSSPLATPPPLTATLPPAPASTSLFPVPAASTASLASPTPWLTPGLLS*SPAP 346
Query: 84 ANNVLIPATNAVS 96
A +++ A + S
Sbjct: 347 AIRLMLAAVTSRS 385
>BG662122
Length = 386
Score = 32.0 bits (71), Expect = 0.94
Identities = 18/56 (32%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Frame = +2
Query: 434 HCTTRQGKPPPDPLPPRFRSDLK--CDFHQGALGHDVEGCYALKYIVKKLIDQGKL 487
H + G+ P PPR D C++ + + HD++ C+ LK ++KLI G+L
Sbjct: 146 HMVGKSGQS*P---PPRRGIDTTK*CEYRRSVV-HDIDDCFTLKREIEKLIKMGRL 301
>TC15802 similar to UP|Q9XIV1 (Q9XIV1) mRNA expressed in cucumber hypocotyls
(Arabinogalactan protein), partial (26%)
Length = 795
Score = 30.8 bits (68), Expect = 2.1
Identities = 29/85 (34%), Positives = 34/85 (39%)
Frame = +2
Query: 27 AAQEQSATAIPIATAIPVTTSMLPTASADARFAMPAGFPYGLPPFFTPSTAAGTSGTANN 86
AA +ATA AT PVT P PA P PP P+ +S A +
Sbjct: 218 AASPSTATAPAPATTTPVTPVTSPP---------PAAVPVASPP---PAAVPVSSPPAKS 361
Query: 87 VLIPATNAVSINATLPQTTAAVTEP 111
PA AV + A P TT V P
Sbjct: 362 PPAPAPTAVPVAA--PVTTPEVPAP 430
>TC19101 weakly similar to UP|Q93WC9 (Q93WC9) Adenylate
isopentenyltransferase (Cytokinin synthase)
(AT3g63110/T20O10_210) , partial (37%)
Length = 880
Score = 30.0 bits (66), Expect = 3.6
Identities = 18/54 (33%), Positives = 23/54 (42%), Gaps = 9/54 (16%)
Frame = +2
Query: 401 PVQQQQQNQQARPTFPPIP---------MLYAELLPTLLHRGHCTTRQGKPPPD 445
P + QQN Q R T+ P P L+ P L HR H + +PP D
Sbjct: 575 PRHRHQQNLQGRTTWNPPPPPRNSKP*HRLHRRRFP*LFHRRH*RNHKPRPPSD 736
>TC13054 similar to UP|NO93_SOYBN (Q02921) Early nodulin 93 (N-93), partial
(93%)
Length = 558
Score = 30.0 bits (66), Expect = 3.6
Identities = 22/53 (41%), Positives = 28/53 (52%), Gaps = 2/53 (3%)
Frame = +1
Query: 5 RTELATLREELAKANDVMTALLAAQEQSATAIPIATAIPVTTS--MLPTASAD 55
RT L +L + LA A + A ++A IATAIP TS MLP A A+
Sbjct: 85 RTSLPSLDQRLAIAKRCSHEGVMAGARAAVVASIATAIPTLTSVRMLPWARAN 243
>BP030435
Length = 533
Score = 29.6 bits (65), Expect = 4.7
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 5/59 (8%)
Frame = +1
Query: 435 CTTRQGKPP--PDPLPPRFRSDLK---CDFHQGALGHDVEGCYALKYIVKKLIDQGKLT 488
C + KP +PL R + C +HQ A+GH E C L+ V KLI GK T
Sbjct: 22 CLDKSSKPEILREPLATRISGEPGGE*CVYHQ-AMGHITEECRTLQREVGKLIATGKPT 195
>TC9635
Length = 585
Score = 29.6 bits (65), Expect = 4.7
Identities = 14/29 (48%), Positives = 18/29 (61%)
Frame = +2
Query: 442 PPPDPLPPRFRSDLKCDFHQGALGHDVEG 470
PPP PLP S LK +++ G GHD +G
Sbjct: 164 PPPPPLP--HMSHLKQNYYSGEPGHDGQG 244
>TC15341 similar to UP|Q9M0H8 (Q9M0H8) Predicted proline-rich protein,
partial (33%)
Length = 1002
Score = 28.9 bits (63), Expect = 7.9
Identities = 27/85 (31%), Positives = 36/85 (41%), Gaps = 8/85 (9%)
Frame = +1
Query: 376 QSMATVAPINATQM------PP--SYPYQKQQLPVQQQQQNQQARPTFPPIPMLYAELLP 427
Q M+ VAP T PP YP +Q QQQQQ QQ P L ++ P
Sbjct: 769 QDMSRVAPQPCTNXSGXSPPPPVQQYPQYQQPQQQQQQQQQQQQWP-----QQLPQQVQP 933
Query: 428 TLLHRGHCTTRQGKPPPDPLPPRFR 452
T + Q +PP P+ P ++
Sbjct: 934 T--QPPSMQSPQIRPPSSPVYPPYQ 1002
>TC15253 weakly similar to UP|Q9ZT16 (Q9ZT16) Arabinogalactan-protein
(ATAGP4) (AT5G10430/F12B17_220), partial (42%)
Length = 575
Score = 28.9 bits (63), Expect = 7.9
Identities = 24/75 (32%), Positives = 28/75 (37%), Gaps = 5/75 (6%)
Frame = +1
Query: 391 PSYPYQKQQLPVQQQQQNQQARPTFPPIPMLYAELLPTLLHRGHCTTRQGKPPP-----D 445
P + Q+Q LP QQ +QQ P PP+ LLP PPP
Sbjct: 187 PHHHRQQQHLPHLQQLHHQQL-PLLPPLLHQLPPLLP--------------PPPLQLPLQ 321
Query: 446 PLPPRFRSDLKCDFH 460
PLPP L H
Sbjct: 322 PLPPPVHPPLPLPLH 366
>AV778839
Length = 538
Score = 28.9 bits (63), Expect = 7.9
Identities = 19/57 (33%), Positives = 28/57 (48%), Gaps = 2/57 (3%)
Frame = -3
Query: 1470 WNDVPIIKVQRLERPSHVFTIGDVIDQAGENMVDNKPWYYDIKQCLLS--REYPPGA 1524
WN PI + R S + D+ID +N++DN +KQ +S +E P GA
Sbjct: 524 WNPCPIGMLPRAVGSSGCLPVLDIIDYVKQNLIDN------VKQNPVSERKEMPHGA 372
Database: LJGI
Posted date: Jul 30, 2004 11:16 AM
Number of letters in database: 14,692,800
Number of sequences in database: 28,460
Lambda K H
0.321 0.136 0.408
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 31,973,232
Number of Sequences: 28460
Number of extensions: 463379
Number of successful extensions: 2734
Number of sequences better than 10.0: 38
Number of HSP's better than 10.0 without gapping: 2609
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 2706
length of query: 1920
length of database: 4,897,600
effective HSP length: 104
effective length of query: 1816
effective length of database: 1,937,760
effective search space: 3518972160
effective search space used: 3518972160
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 62 (28.5 bits)
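Note on the statistics above: the effective lengths, bit scores and Expect values in this report appear to be tied together by the usual Karlin-Altschul relations. The sketch below is not part of the BLAST output. It assumes that TBLASTN divides the nucleotide database length by 3, that the effective lengths are obtained by subtracting the effective HSP length once per sequence, and that the gapped Lambda and K are the ones used for the reported scores. The function names are hypothetical and chosen only for illustration.

```python
import math

# Gapped Karlin-Altschul parameters as printed in the report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def effective_search_space(query_len, db_letters, n_seqs, hsp_len):
    """Return (effective query length, effective db length, search space).

    Assumption: the nucleotide database length is divided by 3 for a
    translated search, and hsp_len is subtracted once per sequence.
    """
    db_len = db_letters // 3
    eff_query = query_len - hsp_len
    eff_db = db_len - n_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(1920, 14_692_800, 28_460, 104)
    print(eff_q, eff_db, space)
    bits = bit_score(173)  # raw score of the top hit, BI417841
    print(round(bits, 1), f"{expect_value(bits, space):.0e}")
```

Under these assumptions the computed values agree with the figures printed in the report (effective search space 3518972160, and 71.2 bits with an Expect near 1e-12 for the top hit), which suggests the assumptions hold for this run, though it does not prove them.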
Medicago: description of AC147714.4