
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144404.22 + phase: 0
(471 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At1g67230 unknown protein 34 0.15
At1g21200 unknown protein 33 0.34
At1g22000 hypothetical protein 33 0.44
At5g22310 unknown protein 32 0.99
At1g08000 GATA transcription factor 3, putative 32 0.99
At1g02970 unknown protein 32 0.99
At3g54870 kinesin-like protein 31 1.3
At1g22060 hypothetical protein 31 1.3
At3g19890 hypothetical protein 31 1.7
At1g13220 putative nuclear matrix constituent protein 30 2.2
At5g45510 unknown protein 30 2.9
At4g23800 98b like protein 30 2.9
At4g12380 hypothetical protein 30 2.9
At1g68790 putative nuclear matrix constituent protein 1 (NMCP1) 30 2.9
At1g08070 unknown protein 30 3.8
At5g37350 unknown protein 29 4.9
At4g14330 kinesin like protein 29 4.9
At1g79280 hypothetical protein 29 4.9
At5g24880 glutamic acid-rich protein 29 6.4
At5g23490 putative protein 29 6.4
>At1g67230 unknown protein
Length = 1132
Score = 34.3 bits (77), Expect = 0.15
Identities = 23/88 (26%), Positives = 45/88 (51%), Gaps = 7/88 (7%)
Query: 381 FDTEKECQEKIAELTRESYMWERKY-----RKKE--QEYDTLKNLLDQQIQANRQEKEEN 433
FD + EKI+EL +E + ++ KKE +Y+ L+ ++ + +QE+ +
Sbjct: 46 FDDPRILPEKISELEKELFEYQHSMGLLLIEKKEWSSQYEALQQAFEEVNECLKQERNAH 105
Query: 434 IRLRAALQKKEDFLDKVCPGRKKRRMDL 461
+ A ++K+E+ L K K+ +DL
Sbjct: 106 LIAIADVEKREEGLRKALGIEKQCALDL 133
>At1g21200 unknown protein
Length = 443
Score = 33.1 bits (74), Expect = 0.34
Identities = 24/113 (21%), Positives = 48/113 (42%), Gaps = 29/113 (25%)
Query: 332 DKGQLGKKTGAVHESYTQWVIDRAVQIRMPYKISRFVSAITPAPPLPMTFDTEKECQEKI 391
D Q G ++G QW+ R +Q+ + + + Q ++
Sbjct: 355 DVNQGGAESGRAGSVQKQWMESRTLQLE----------------------EQKLQIQVEL 392
Query: 392 AELTRESYMWERKYRKKEQEYDTLKNLLDQQIQANRQEKEENIRLRAALQKKE 444
EL ++ + W+R +K++QE L++ N + K EN R+ L+++E
Sbjct: 393 LELEKQRFRWQRFSKKRDQE-------LERMRMENERMKLENDRMGLELKQRE 438
>At1g22000 hypothetical protein
Length = 727
Score = 32.7 bits (73), Expect = 0.44
Identities = 25/66 (37%), Positives = 39/66 (58%), Gaps = 5/66 (7%)
Query: 385 KECQEKIAEL--TRESYMWERKYRKKEQEYDTLKNLLDQQIQANRQEKEENIRLRAALQK 442
KE ++KI EL R+S ++++ R+K YD +K LD + + KEE +L A LQ+
Sbjct: 612 KELEDKILELEADRQSVIYDK--REKTTAYDMMKAELDCSLLSLECCKEEKQKLEAILQQ 669
Query: 443 -KEDFL 447
KE+ L
Sbjct: 670 CKEESL 675
>At5g22310 unknown protein
Length = 481
Score = 31.6 bits (70), Expect = 0.99
Identities = 22/71 (30%), Positives = 39/71 (53%), Gaps = 5/71 (7%)
Query: 382 DTEKECQEKIAELTRESYMWERKYRKKEQEYD-TLKNLLDQQIQANRQEKEENIRLRAAL 440
D E+E + ++ E +E M ERK R++ ++ + L L + + R+ KEE R + A
Sbjct: 236 DEEEEEKRRLIESLQEEAMVERKLRRRTEKMNRRLGRELTEAKETERKMKEEMKREKRA- 294
Query: 441 QKKEDFLDKVC 451
+D L++VC
Sbjct: 295 ---KDVLEEVC 302
>At1g08000 GATA transcription factor 3, putative
Length = 308
Score = 31.6 bits (70), Expect = 0.99
Identities = 53/228 (23%), Positives = 89/228 (38%), Gaps = 38/228 (16%)
Query: 224 TPHFTTNPENLLWSKRLMSLTPAEVVWYDRVYDKGTIIDSCGKFANVPLLGMEGGISYNP 283
TPH ++ P ++ S SLTP V+ + Y + +S + P+ GM P
Sbjct: 102 TPHQSSAPPDIKVSYLFQSLTPVSVL--ENSYGSLSTQNSGSQRLAFPVKGMRSK-RRRP 158
Query: 284 TLARRQFGYPMEKKPLSIYLENVY--YFNTEDSTGMREQVVRAWHTIRRRDKGQLGKKTG 341
T R + +P E + S E+V Y+++E + ++ HT + K G
Sbjct: 159 TTVRLSYLFPFEPRK-STPGESVTEGYYSSEQHAKKKRKIHLITHT--ESSTLESSKSDG 215
Query: 342 AVH-----ESYT--QW--------VIDRAVQIRMPYKISRFVSAITPAPPLPMTFDTEKE 386
V E+ T QW + A +R +K R V PA
Sbjct: 216 IVRICTHCETITTPQWRQGPSGPKTLCNACGVR--FKSGRLVPEYRPASSPTFIPSVHSN 273
Query: 387 CQEKIAELTRESYMWERKYRKKEQEYDTLKNLLDQQIQANRQEKEENI 434
KI E+ RKK+ E+DT +++ IQ +Q +++ +
Sbjct: 274 SHRKIIEM-----------RKKDDEFDT--SMIRSDIQKVKQGRKKMV 308
>At1g02970 unknown protein
Length = 500
Score = 31.6 bits (70), Expect = 0.99
Identities = 17/63 (26%), Positives = 32/63 (49%), Gaps = 4/63 (6%)
Query: 372 TPAPPLPMTFDT--EKECQEKIAELTRESYMWERKYRKKEQEYDTL--KNLLDQQIQANR 427
+P P P+ +T K C+++ + W K+R EQE D + ++ ++QAN+
Sbjct: 123 SPCPRSPVKLNTVKSKRCRQESFTGNHSNSTWSSKHRVDEQENDDIDTDEVMGDKLQANQ 182
Query: 428 QEK 430
E+
Sbjct: 183 TER 185
>At3g54870 kinesin-like protein
Length = 1070
Score = 31.2 bits (69), Expect = 1.3
Identities = 23/105 (21%), Positives = 51/105 (47%), Gaps = 2/105 (1%)
Query: 353 DRAVQIRMPYKISRFVSAITPAPPLPMTFDTEKECQEKIAELTRESYMWERKYRKKEQEY 412
D+A+Q+ M K ++ A + DT + ++KIAEL + + + E +
Sbjct: 559 DKAIQLEMKLKNTKQQQLENSAYEAKLA-DTSQVYEKKIAELVQRVEDEQARSTNAEHQL 617
Query: 413 DTLKNLLDQQIQANRQEKEENIRLRAAL-QKKEDFLDKVCPGRKK 456
+KN+L +Q ++ ++++ N + + L + + K+ +KK
Sbjct: 618 TEMKNILSKQQKSIHEQEKGNYQYQRELAETTHTYESKIAELQKK 662
>At1g22060 hypothetical protein
Length = 1999
Score = 31.2 bits (69), Expect = 1.3
Identities = 25/76 (32%), Positives = 41/76 (53%), Gaps = 4/76 (5%)
Query: 385 KECQEKIAELT--RESYMWERKYRKKEQEYDTLKNLLDQQIQANRQEKEENIRLRAALQK 442
KE + KI EL R+S ++++ R+K YD +K LD + + KEE +L A LQ+
Sbjct: 1636 KELEGKILELEADRQSVIYDK--REKTTAYDMMKAELDCSLLSLECCKEEKQKLEAILQQ 1693
Query: 443 KEDFLDKVCPGRKKRR 458
++ K+ + RR
Sbjct: 1694 CKEQSLKMSKELESRR 1709
>At3g19890 hypothetical protein
Length = 410
Score = 30.8 bits (68), Expect = 1.7
Identities = 26/104 (25%), Positives = 43/104 (41%), Gaps = 12/104 (11%)
Query: 309 FNTEDSTGMREQVVRAWHTIRR-RDKGQLGKKTGAVHESYTQWVIDRAVQIRMPYKISRF 367
FN + G R R +HT +KG L K+ V+ AV++R S
Sbjct: 317 FNLDKKRGKRNNKTRCYHTAYIIGEKGYLKKE-----------VLGEAVEVRKDVYRSAL 365
Query: 368 VSAITPAPPLPMTFDTEKECQEKIAELTRESYMWERKYRKKEQE 411
V + + P L E+E ++K + +RK RK++ +
Sbjct: 366 VCSSSYVPSLEKINQIEEEEEDKCKSIKMVDTKRQRKKRKRKSK 409
>At1g13220 putative nuclear matrix constituent protein
Length = 1128
Score = 30.4 bits (67), Expect = 2.2
Identities = 22/68 (32%), Positives = 35/68 (51%), Gaps = 8/68 (11%)
Query: 379 MTFDTEKECQEKIAELTRESYM--WERKYRKKEQEYDTLKNLLDQQIQANRQEKEENIRL 436
++F E+E E + RE Y+ WE+K + KE+ K L+Q R+EK I
Sbjct: 245 LSFTKERESYEGTFQKQRE-YLNEWEKKLQGKEESITEQKRNLNQ-----REEKVNEIEK 298
Query: 437 RAALQKKE 444
+ L++KE
Sbjct: 299 KLKLKEKE 306
>At5g45510 unknown protein
Length = 1222
Score = 30.0 bits (66), Expect = 2.9
Identities = 29/116 (25%), Positives = 48/116 (41%), Gaps = 3/116 (2%)
Query: 62 HCFTFQDYQILPTLEEYSCWINLPVLDKVPFSGLEETPKHSTIAATLHLETDEVKTHLIT 121
HC +D +L L +NL V+D SGL ++ A ++ +L+T
Sbjct: 727 HCLIIKDCPLLQDLPNIQELVNLEVVDVSGASGLRTCFDNADGAK--KNKSKNKNFYLLT 784
Query: 122 RGKFLGFSTDFLYERTTFFDKMGVAYAFNSILALLVYGLVLFPSLDNFVDIKAIQI 177
+ + L FS + ER F VA +S+ LL+ L + + +QI
Sbjct: 785 KLQHLDFSGSQI-ERLPIFQDSAVAAKLHSLTRLLLRNCSKLRRLPSLKPLSGLQI 839
>At4g23800 98b like protein
Length = 456
Score = 30.0 bits (66), Expect = 2.9
Identities = 16/76 (21%), Positives = 41/76 (53%), Gaps = 1/76 (1%)
Query: 382 DTEKECQEKIAELTRESYMWERKYRKKEQEYDTLKNLLDQQIQANRQEKEENIRLRAALQ 441
D +K EK+A+ +E+Y+ + K+ +E + L +++ + + K+E +++ +
Sbjct: 296 DKKKAPYEKVAKKNKETYLQAMEEYKRTKEEEALSQKKEEE-ELLKLHKQEALQMLKKKE 354
Query: 442 KKEDFLDKVCPGRKKR 457
K ++ + K +KK+
Sbjct: 355 KTDNLIKKEKATKKKK 370
>At4g12380 hypothetical protein
Length = 160
Score = 30.0 bits (66), Expect = 2.9
Identities = 15/47 (31%), Positives = 31/47 (65%), Gaps = 1/47 (2%)
Query: 402 ERKYRKKEQEYDTLKNLLDQQIQANRQEKEENIRLRAALQKKEDFLD 448
+RK ++ + E + ++N+ D +I ++ KEE RL++AL+ +E+ D
Sbjct: 12 KRKNKQMKTEIE-IRNVPDTEINQEQKNKEEEKRLKSALEGEEEGSD 57
>At1g68790 putative nuclear matrix constituent protein 1 (NMCP1)
Length = 1085
Score = 30.0 bits (66), Expect = 2.9
Identities = 25/89 (28%), Positives = 43/89 (48%), Gaps = 18/89 (20%)
Query: 384 EKECQEKIAELTRESYMWERKYRKKEQEYDTLKNLLDQQIQANRQE----------KEEN 433
E+E E L + + + + R+ E E + ++ LD++++ + E KEE
Sbjct: 343 EREQMEIGKLLDDQKAVLDSRRREFEMELEQMRRSLDEELEGKKAEIEQLQVEISHKEEK 402
Query: 434 IRLR-AALQKKEDFLDKVCPGRKKRRMDL 461
+ R AAL+KKE+ G KK+ DL
Sbjct: 403 LAKREAALEKKEE-------GVKKKEKDL 424
Score = 29.3 bits (64), Expect = 4.9
Identities = 22/92 (23%), Positives = 47/92 (50%), Gaps = 14/92 (15%)
Query: 384 EKEC----QEKIAELTRESYMWERKYRKKEQ----------EYDTLKNLLDQQIQANRQE 429
+KEC +++I E+ E+ E + R++ + E+ L++ L QQI +QE
Sbjct: 455 DKECLRKLKDEIEEIGTETTKQESRIREEHESLRITKEERVEFLRLQSELKQQIDKVKQE 514
Query: 430 KEENIRLRAALQKKEDFLDKVCPGRKKRRMDL 461
+E ++ R L++ ++ +K K+R ++
Sbjct: 515 EELLLKEREELKQDKERFEKEWEALDKKRANI 546
>At1g08070 unknown protein
Length = 741
Score = 29.6 bits (65), Expect = 3.8
Identities = 21/52 (40%), Positives = 27/52 (51%), Gaps = 5/52 (9%)
Query: 38 LGVVKTDID-EGLLNAFVQ--FYDPGYHCFTF--QDYQILPTLEEYSCWINL 84
+G+ DI GLL+A D G H F QDY++ P LE Y C I+L
Sbjct: 464 IGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDL 515
>At5g37350 unknown protein
Length = 531
Score = 29.3 bits (64), Expect = 4.9
Identities = 18/80 (22%), Positives = 37/80 (45%)
Query: 377 LPMTFDTEKECQEKIAELTRESYMWERKYRKKEQEYDTLKNLLDQQIQANRQEKEENIRL 436
+P + D + +A++T + Y+ D L + +QQI+ N +E+EE
Sbjct: 412 IPKSLDAVNNPEADVAKITSGQDTGDMLYQTITGLKDALPKVDEQQIEVNAEEEEEEEEE 471
Query: 437 RAALQKKEDFLDKVCPGRKK 456
+ + +E+ ++ P KK
Sbjct: 472 GSGEESEEESEKELGPEDKK 491
>At4g14330 kinesin like protein
Length = 959
Score = 29.3 bits (64), Expect = 4.9
Identities = 18/50 (36%), Positives = 29/50 (58%), Gaps = 4/50 (8%)
Query: 402 ERKYRKKEQEYDTLKNLLDQQ--IQANRQEKEENIRLRAALQKKEDFLDK 449
+++ +KKE+E L++LL Q+ N +E +E + R L K E LDK
Sbjct: 419 QKQLKKKEEEVAALRSLLTQREACATNEEEIKEKVNERTQLLKSE--LDK 466
>At1g79280 hypothetical protein
Length = 2111
Score = 29.3 bits (64), Expect = 4.9
Identities = 19/79 (24%), Positives = 38/79 (48%), Gaps = 10/79 (12%)
Query: 381 FDTEKECQEKIAELTRESYMWERKYRKKEQEYDTLKNLLDQQIQANRQEK---------- 430
F+ +K+ EK ++ M +RKY K++ E L +Q++ ++E
Sbjct: 1447 FNKQKQELEKNKKIHYTLNMTKRKYEKEKDELSKQNQSLAKQLEEAKEEAGKRTTTDAVV 1506
Query: 431 EENIRLRAALQKKEDFLDK 449
E++++ R +K+ LDK
Sbjct: 1507 EQSVKEREEKEKRIQILDK 1525
Score = 28.5 bits (62), Expect = 8.4
Identities = 36/145 (24%), Positives = 60/145 (40%), Gaps = 16/145 (11%)
Query: 325 WHTIRRRDKGQLGKKTGAVHESYTQWVIDRAVQIRMPYKISRFVSAITPAPPLPMTFDTE 384
WH R R+ L K G++ E + D A Y F T + + ++
Sbjct: 242 WHKERLRE---LETKIGSLQEDLSSCK-DAATTTEEQYTAELF----TANKLVDLYKESS 293
Query: 385 KECQEKIAELTRESYMWERKYRKKEQEYDTLKNLLDQQIQANRQEKEENIRLRAALQKKE 444
+E K EL E + + E Y K LD+++ + ++EN L+ L+K E
Sbjct: 294 EEWSRKAGELEGVIKALEARLSQVESSY---KERLDKEVSTKQLLEKENGDLKQKLEKCE 350
Query: 445 DFLDKVCPGRKKRRMDLFDGPHSDF 469
++K RK ++L P S+F
Sbjct: 351 AEIEKT---RKTDELNLI--PFSNF 370
>At5g24880 glutamic acid-rich protein
Length = 443
Score = 28.9 bits (63), Expect = 6.4
Identities = 17/60 (28%), Positives = 28/60 (46%), Gaps = 5/60 (8%)
Query: 376 PLPMTFDTEKECQEKIAELTRESYMWERKYRKKEQEYDTLKNLLDQQIQANRQEKEENIR 435
P P+ K+C +AEL + E +K +E N QAN+ E+EE+++
Sbjct: 252 PTPVASPVGKDCNAVVAELEEKLIKNEDDIEEKTEEMKEQDN-----NQANKSEEEEDVK 306
Score = 28.9 bits (63), Expect = 6.4
Identities = 22/82 (26%), Positives = 41/82 (49%), Gaps = 5/82 (6%)
Query: 378 PMTFDTEKECQEKIAELTRES---YMWERKYRKKEQEYDTLKNLLDQQIQANRQEKEENI 434
P DTE + E + E T+E E K R +E+E + K + + Q + E+EE
Sbjct: 315 PEKVDTESKEVESVEETTQEKEEEVKEEGKERVEEEEKE--KEKVKEDDQKEKVEEEEKE 372
Query: 435 RLRAALQKKEDFLDKVCPGRKK 456
+++ +K++ ++ G+KK
Sbjct: 373 KVKGDEEKEKVKEEESAEGKKK 394
>At5g23490 putative protein
Length = 421
Score = 28.9 bits (63), Expect = 6.4
Identities = 22/60 (36%), Positives = 33/60 (54%), Gaps = 4/60 (6%)
Query: 386 ECQEKIAELTRESYMWERKYRKKEQEYDTLK-NLLDQQIQA--NRQEK-EENIRLRAALQ 441
+C K A++ E Y+ E++ +D + +L+D +A RQE EENIRL ALQ
Sbjct: 226 DCSVKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDASSKALSYRQEIIEENIRLTYALQ 285
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.322 0.139 0.424
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,261,080
Number of Sequences: 26719
Number of extensions: 510910
Number of successful extensions: 1656
Number of sequences better than 10.0: 27
Number of HSP's better than 10.0 without gapping: 5
Number of HSP's successfully gapped in prelim test: 22
Number of HSP's that attempted gapping in prelim test: 1633
Number of HSP's gapped (non-prelim): 49
length of query: 471
length of database: 11,318,596
effective HSP length: 103
effective length of query: 368
effective length of database: 8,566,539
effective search space: 3152486352
effective search space used: 3152486352
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 62 (28.5 bits)
Medicago: description of AC144404.22