
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC138130.21 - phase: 0 /pseudo
(247 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At3g16310 unknown protein 374 e-104
At1g04150 putative phosphoribosylanthranilate transferase 36 0.017
At3g21810 unknown protein 33 0.18
At5g14540 unknown protein 32 0.31
At4g16790 glycoprotein homolog 32 0.41
At3g43640 putative protein 32 0.41
At1g13030 unknown protein 32 0.41
At4g00890 31 0.54
At3g07540 putative protein 31 0.54
At4g12780 auxilin-like protein 30 1.6
At4g23810 unknown protein 29 2.0
At4g17620 hypothetical protein 29 2.0
At5g38500 putative protein 29 2.7
At4g14900 hydroxyproline-rich glycoprotein homolog (Z97337.18) 29 2.7
At1g79480 hypothetical protein 29 2.7
At5g49080 putative protein 28 3.5
At5g13760 unknown protein 28 3.5
At2g34130 En/Spm-like transposon protein 28 3.5
At1g19100 hypothetical protein 28 4.5
At1g73100 putative protein 28 5.9
>At3g16310 unknown protein
Length = 329
Score = 374 bits (959), Expect = e-104
Identities = 181/248 (72%), Positives = 204/248 (81%), Gaps = 1/248 (0%)
Query: 1 MSTTAQRTPRSGRQSLFFQDLASPVNARRGKFSSPGQSGASSSLWRENFGNSDLPPPPFF 60
MS A RTP+SGRQSL FQDLASPV+ARRGKFSSPGQ+ A S+LWRENFG SDLPPPP +
Sbjct: 1 MSAAAHRTPKSGRQSLLFQDLASPVSARRGKFSSPGQAAAVSALWRENFGGSDLPPPPMY 60
Query: 61 TLEDRSDFSPESGILDYQMSPETKSDHKTPIQTPNREFSTPAKGKSEASTSYALRGVQQN 120
TL+DRSDFSPESGI DY SP+ KSD +TP Q+ + TP KGK EAS S++L QQ+
Sbjct: 61 TLDDRSDFSPESGIADYSASPDAKSDRRTPFQSSGKNIVTPGKGKLEASPSFSLLNAQQS 120
Query: 121 QQGSLSLNWWS-SPAKSGGEQEEKGRGSPVEGVVQPGALITLPPPPEVARPEVQRNSLPA 179
QQ S S +WWS S A S EQ++KG+GSPVEGVVQPGAL+TLPPP EVARPEVQR +P
Sbjct: 121 QQVSGSPSWWSQSKAGSSTEQDDKGKGSPVEGVVQPGALVTLPPPREVARPEVQRQIIPT 180
Query: 180 GNLNEEEWVTVYGFSPNDTNLVLREFEKCGEILKHVPGPRDANWMHILYQNRSAAQNALN 239
GNL+EEEWVTVYGFSP DTNLVLREFEKCG +LKHVPGPR+ANWMHILYQNRS A ALN
Sbjct: 181 GNLDEEEWVTVYGFSPGDTNLVLREFEKCGMVLKHVPGPRNANWMHILYQNRSDAHKALN 240
Query: 240 KNGIQING 247
K G+ ING
Sbjct: 241 KAGMMING 248
>At1g04150 putative phosphoribosylanthranilate transferase
Length = 1012
Score = 36.2 bits (82), Expect = 0.017
Identities = 29/110 (26%), Positives = 52/110 (46%), Gaps = 5/110 (4%)
Query: 60 FTLEDRSDFSPESGILDYQMSPETKSDHKTPIQTPNRE-FSTPAKGKSEASTSYALRGVQ 118
+TLE RS FS G + + T +++ ++ NR S +K S+S A++ Q
Sbjct: 115 YTLEKRSLFSSVRGEISVKHYMTTTAENGENVRRVNRSGGSKKSKKVQNVSSSMAIQQQQ 174
Query: 119 QNQQGSLSLNWWSSPAKSGGEQEEKGRGS----PVEGVVQPGALITLPPP 164
Q QQ +SL+ + + +Q +G+ P + ++P + LP P
Sbjct: 175 QQQQQQISLHNHNRGNQQQSQQNGQGQRMLPFYPHQSEIKPLVITALPSP 224
>At3g21810 unknown protein
Length = 388
Score = 32.7 bits (73), Expect = 0.18
Identities = 37/141 (26%), Positives = 49/141 (34%), Gaps = 22/141 (15%)
Query: 35 PGQSGASSSLWRENFGNSDLPPPPFFTLEDRSDFSPESGILDYQMSPETKSDHKTPIQTP 94
P + G E N PP + L +S ESG + + + S + +
Sbjct: 250 PVEDGLIGRGEEEKVENEKKRPPCWNMLSSKSYSEEESGAWNDEDTINRSSSKEDNWKRR 309
Query: 95 NREFSTPAKGKSEASTSYALRGVQQNQQGSLSLNWWSSPAKSGGEQEEKGRGSPVEGVVQ 154
T A K STS A R + A+S E E GSP
Sbjct: 310 RFSIGTSATDKVILSTSMAARE-------------FDDVAESEEENPEAANGSP------ 350
Query: 155 PGALITLPPPPEVARPEVQRN 175
LI+LPPPP VQR+
Sbjct: 351 ---LISLPPPPPFRDAHVQRD 368
>At5g14540 unknown protein
Length = 530
Score = 32.0 bits (71), Expect = 0.31
Identities = 58/217 (26%), Positives = 83/217 (37%), Gaps = 42/217 (19%)
Query: 20 DLASPVN---ARRGKFSSPGQSGASSSLWRENFGNSD-LPPPPFFTLEDRSDFSPESGIL 75
DLA VN AR S P ASS+ ++G+ D L P F +DR+ SPES I+
Sbjct: 56 DLAGSVNSTAARVWDASDPKPVSASSA---RSYGSMDSLEPSKLFAEKDRN--SPESAII 110
Query: 76 ---DYQMSP---------ETKSDHKTPIQTPNREFST------PAKGKSEASTSYALR-- 115
D M E S T ++T R+ + G S T LR
Sbjct: 111 SAIDRTMKAHADKLLHVMEGVSARLTQLETRTRDLENLVDDVKVSVGNSHGKTDGKLRQL 170
Query: 116 ---------GVQ--QNQQGSLSLNWWSSPAK-SGGEQEEKGRGSPVEGVVQPGALITLPP 163
GVQ +++Q + S + S Q+ + + VE QP A + PP
Sbjct: 171 ENIMLEVQNGVQLLKDKQEIVEAQLQLSKLQLSKVNQQPETHSTHVEPTAQPPASLPQPP 230
Query: 164 PPEVARPEVQRNSLPAGNLNEEEWVTVYGFSPNDTNL 200
A P + + LP + + +G SP L
Sbjct: 231 ASAAAPPSLTQQGLPPQQFIQPP-ASQHGLSPPSLQL 266
>At4g16790 glycoprotein homolog
Length = 473
Score = 31.6 bits (70), Expect = 0.41
Identities = 33/128 (25%), Positives = 45/128 (34%), Gaps = 23/128 (17%)
Query: 32 FSSPGQSGASSSLWRENFGNSDLPPPP------FFTLEDRSDFSPESGILDYQMSPETKS 85
FSSP +S +L E + PPPP F+ R D GI Y++ S
Sbjct: 262 FSSPRKSNPIPNLASEFHPSPPPPPPPPPPLPAFYNSSSRKD---HPGI--YRVERRESS 316
Query: 86 DHKT------------PIQTPNREFSTPAKGKSEASTSYALRGVQQNQQGSLSLNWWSSP 133
HKT P P E+ K S Q+ ++ + WWS P
Sbjct: 317 VHKTKFAGGEFHPPPPPPPPPPVEYYKSPPTKFRLSNERRKSSEQKMKRNAPKKVWWSDP 376
Query: 134 AKSGGEQE 141
EQ+
Sbjct: 377 IVESKEQD 384
>At3g43640 putative protein
Length = 428
Score = 31.6 bits (70), Expect = 0.41
Identities = 40/142 (28%), Positives = 60/142 (42%), Gaps = 22/142 (15%)
Query: 38 SGASSSLWRENFGNSDLPPPPFFTLEDRSDFSP--ESGILDYQMSPETKSDHKTPIQTPN 95
+G SS+ + SD PPPP + F P E GI MS +TKS P Q
Sbjct: 46 TGVSSAGQPSSATPSDPPPPPVAPM-----FIPKTEPGIC---MSKKTKSKVSAPTQRKR 97
Query: 96 REFSTPAKGKSEASTSYALRGVQQNQQGSLSLNWWSSPAKSGGEQEEKGRGSPVEGVVQP 155
+ + AK +S+A ALR + + + A + G+ E+ R S +Q
Sbjct: 98 QCTAASAKSRSKAPAP-ALRRLSSRKS--------NRSASAAGQTEDLVRDS---SDLQD 145
Query: 156 GALITLPPPPEVARPEVQRNSL 177
G + + PP ++R R L
Sbjct: 146 GDVTEIAPPLFLSRYRENRRLL 167
>At1g13030 unknown protein
Length = 608
Score = 31.6 bits (70), Expect = 0.41
Identities = 30/116 (25%), Positives = 48/116 (40%), Gaps = 5/116 (4%)
Query: 42 SSLWRENFGNSDLPPPPFFTLEDRSDFSPESGILDYQMSPETKSDHKTPIQTPNREFSTP 101
+SL++E+ G+ ++ + S +S + PE K P + N+E TP
Sbjct: 485 TSLYKED-GSLEIEFSALLDVRSVKTSSSDSAEVAKSALPEPDQSAKKPKLSANKELQTP 543
Query: 102 AKGKSEASTSYALRGVQQNQQGSLSL---NWWSSPAKSGGEQEEKG-RGSPVEGVV 153
AK E S L ++ +LS W + SGG K RGS + V+
Sbjct: 544 AKENGEVSPWEELSEALSAKKAALSQANNGWNKKGSSSGGSWSYKALRGSAMGPVM 599
>At4g00890
Length = 431
Score = 31.2 bits (69), Expect = 0.54
Identities = 44/199 (22%), Positives = 68/199 (34%), Gaps = 30/199 (15%)
Query: 10 RSGRQSLFFQDLASPVNARRGKFSSPGQSGASSSLWREN-----FGNSDLPPP---PFFT 61
R R+ +LA V R+ S P + +L + F P P P
Sbjct: 5 RQSRRRFRLGNLARLVQLRQSVSSQPPRRTVQETLSQSPPQSPLFHPQSPPEPKSLPLSP 64
Query: 62 LEDRSDFSPESGILDYQMSPETKSDHKTPIQTPNREFSTPAKGKSEASTSYALRGVQQNQ 121
L +S PES + MS K+ IQ+P + + + + S LR +
Sbjct: 65 LSPKSKEEPESQTMTPLMSKHVKTHRNLLIQSPPKSPPESTESQQQFLASVPLRPLTTEP 124
Query: 122 QGSLSLNWWSSPAKSGGEQEEKGRGSPVEGVVQ--------------PGALITLPPPPEV 167
+ LS S E+ + P+E +V+ + +PPP E
Sbjct: 125 KTPLS----PSSTLKATEESQSQPSPPLESIVETWFRPSPPTSTETGDETQLPIPPPQEA 180
Query: 168 ARPEVQRNSLPAGNLNEEE 186
P S P+ LN E
Sbjct: 181 KTPP----SSPSMMLNATE 195
>At3g07540 putative protein
Length = 841
Score = 31.2 bits (69), Expect = 0.54
Identities = 40/161 (24%), Positives = 61/161 (37%), Gaps = 23/161 (14%)
Query: 22 ASPVNARRGKFSSPGQSGASSSLWRENFGNSDLPPPPFFTLEDRSDFSPESGILDYQMSP 81
+SP R GQS SLW +N G F + SPE G++ +
Sbjct: 290 SSPERTVRNNKRYGGQSLRMFSLWNQNLG---------FPRISSASTSPERGMIRTPDAY 340
Query: 82 ETKSDHKTPIQTPNREF-----STPAK----GKSEASTSYALRGVQQNQQGSLSLNWWSS 132
S + + TP+R F S+P + ++ S + + ++++ S
Sbjct: 341 ARSSMYSSVSTTPDRFFRKVLDSSPPRWNDFSRNVKSLFLSSTSASPARDFCINISESSR 400
Query: 133 PAKSGGEQEEKGRGSPVEGVVQPGALITLPPP--PEVARPE 171
KS E+ E E A +TLPPP P A PE
Sbjct: 401 SLKSSWEKPELDTTQQRESA---AAAVTLPPPQRPPPAMPE 438
>At4g12780 auxilin-like protein
Length = 904
Score = 29.6 bits (65), Expect = 1.6
Identities = 36/168 (21%), Positives = 55/168 (32%), Gaps = 29/168 (17%)
Query: 32 FSSPGQSGASSSLWRENFGNSDLPPPPFFTLEDRSDFSPESGILDYQM---SPETKSDHK 88
F PG+ G L+++ F PPPP + SP + DY P++KS
Sbjct: 57 FDDPGRDG-DDLLFKDVFSG---PPPPKYGSSSGDSRSPSAPAFDYDAMFKEPKSKSASS 112
Query: 89 TPIQTPN-----------REFSTPAKGKSEASTSYALRGVQQNQQGSLSLNWWSSP---- 133
P+ E P+ A + + N SSP
Sbjct: 113 MPVYDKPVYDDEDVFESIPELKIPSTSSQSARFENVFSSISSSPTKHRKQN--SSPFDDL 170
Query: 134 -----AKSGGEQEEKGRGSPVEGVVQPGALITLPPPPEVARPEVQRNS 176
K G + + + +GS + + PG T PP + E S
Sbjct: 171 MGNNLGKKGADSDREEKGSSIFDDLIPGFGRTSSPPSKRTTSETTNQS 218
>At4g23810 unknown protein
Length = 324
Score = 29.3 bits (64), Expect = 2.0
Identities = 14/42 (33%), Positives = 23/42 (54%)
Query: 93 TPNREFSTPAKGKSEASTSYALRGVQQNQQGSLSLNWWSSPA 134
+P+ FS+PA +E + + V ++ L LNW SSP+
Sbjct: 37 SPSSSFSSPATAVAETNEILVKQIVSSYERSLLLLNWSSSPS 78
>At4g17620 hypothetical protein
Length = 1225
Score = 29.3 bits (64), Expect = 2.0
Identities = 21/70 (30%), Positives = 33/70 (47%), Gaps = 8/70 (11%)
Query: 34 SPGQSGASSSLWRENFGNSDLPPPPFFTLEDRSDFSPESGILDYQMSPETKSDHKTPIQT 93
S SG+ SS +E G+ + RS+ + ESG+ Y+ E + D K +
Sbjct: 757 SASDSGSGSSGGKEEHGDDKVE-------SYRSNDNGESGVYPYEEEEEEEEDEKDLFGS 809
Query: 94 PNREFS-TPA 102
N E++ TPA
Sbjct: 810 DNEEYTKTPA 819
>At5g38500 putative protein
Length = 411
Score = 28.9 bits (63), Expect = 2.7
Identities = 16/51 (31%), Positives = 26/51 (50%), Gaps = 1/51 (1%)
Query: 75 LDYQMSPETKSDHKTPIQTPNREFSTPAKGKSEASTSYALRGVQQNQQGSL 125
LDY K++ K + PN + S+P+ +E TS R V+Q + G +
Sbjct: 182 LDYNTVESEKTETKV-LPNPNYQSSSPSSCLTENDTSRKRRAVEQRKSGKV 231
>At4g14900 hydroxyproline-rich glycoprotein homolog (Z97337.18)
Length = 532
Score = 28.9 bits (63), Expect = 2.7
Identities = 21/68 (30%), Positives = 31/68 (44%), Gaps = 4/68 (5%)
Query: 24 PVNAR-RGKFSSPGQSGASSSLWRENFGNSDLPPPPFFTLEDRSDFSPESGILDYQMSPE 82
P N R R +S P + + N S PPPP + +S SP+ G+ Y SP
Sbjct: 414 PANKRTRASYSGPMPPAKAGRI--TNAYVSSFPPPPPTFIRSQSH-SPQYGVPAYTTSPP 470
Query: 83 TKSDHKTP 90
T +++P
Sbjct: 471 TIYSNRSP 478
>At1g79480 hypothetical protein
Length = 356
Score = 28.9 bits (63), Expect = 2.7
Identities = 27/123 (21%), Positives = 42/123 (33%), Gaps = 6/123 (4%)
Query: 50 GNSDLPPPPFFTLEDRSDFSPESGILDYQMSPETKSDHKTPIQTPNREFSTPAKGKSEAS 109
G LP PP + S+ +P +P P PN + P +S ++
Sbjct: 82 GPITLPNPPDSSSNPNSNPNPPES----SSNPNPPDSSSNPNSNPNPPVTVPNPPESSSN 137
Query: 110 TSY--ALRGVQQNQQGSLSLNWWSSPAKSGGEQEEKGRGSPVEGVVQPGALITLPPPPEV 167
+ + N S + + P E +P E P IT+P PPE
Sbjct: 138 PNPPDSSSNPNSNPNPPESSSNPNPPVTVPNPPESSSNPNPPESSSNPNPPITIPYPPES 197
Query: 168 ARP 170
+ P
Sbjct: 198 SSP 200
>At5g49080 putative protein
Length = 609
Score = 28.5 bits (62), Expect = 3.5
Identities = 16/53 (30%), Positives = 25/53 (46%), Gaps = 8/53 (15%)
Query: 55 PPPPFFTLEDRSDFSPESGILDYQ------MSPETKSDHKTPIQTPNREFSTP 101
PPPP++T + D+ +Y SP K D+K+P P +S+P
Sbjct: 82 PPPPYYTPSPKVDYKSPPPPYEYSSPPPPYYSPSPKIDYKSP--PPPYVYSSP 132
>At5g13760 unknown protein
Length = 569
Score = 28.5 bits (62), Expect = 3.5
Identities = 15/53 (28%), Positives = 24/53 (44%), Gaps = 17/53 (32%)
Query: 135 KSGGEQEEKGRG-----------------SPVEGVVQPGALITLPPPPEVARP 170
+ GGE+E K + +P+ +V G+ T PPPP +A+P
Sbjct: 28 EGGGEEESKDKDEISHHRFLASLNRLNPTNPLRIIVNGGSRFTTPPPPNLAQP 80
>At2g34130 En/Spm-like transposon protein
Length = 459
Score = 28.5 bits (62), Expect = 3.5
Identities = 20/66 (30%), Positives = 27/66 (40%), Gaps = 9/66 (13%)
Query: 118 QQNQQGSLSLNWWSSPAKSGGEQEEKGRGSPVEGVVQPGALITLPPPPEVARPEVQRNSL 177
+Q Q+ L +N SP G +G GS VQP PP +P V + +
Sbjct: 11 RQQQRDPLHVN---SPLSGGNISGVQGSGSGSGSTVQPS------PPTPTTQPSVNHSDV 61
Query: 178 PAGNLN 183
A LN
Sbjct: 62 QADRLN 67
>At1g19100 hypothetical protein
Length = 542
Score = 28.1 bits (61), Expect = 4.5
Identities = 21/83 (25%), Positives = 34/83 (40%), Gaps = 3/83 (3%)
Query: 98 FSTPAKGKSEASTSYALRGVQQNQQGSLSLNWWSSPAKSGGEQEEKGRGSPVEGVVQPGA 157
F P K + + L+ ++ N+ +++ +WS G Q K + VQP
Sbjct: 339 FVEPTHNKQDFEKTVLLQKLE-NRLKEMTVEYWSCHCVLIGYQVNKKPRLQIPQKVQPAG 397
Query: 158 LITLPPPP--EVARPEVQRNSLP 178
L PPP + P+ SLP
Sbjct: 398 RQALSPPPGFQAVFPQGNTTSLP 420
>At1g73100 putative protein
Length = 669
Score = 27.7 bits (60), Expect = 5.9
Identities = 20/65 (30%), Positives = 28/65 (42%), Gaps = 4/65 (6%)
Query: 57 PPFF-TLEDRSDFSPESGILDYQMSPETKSDHKTPIQT---PNREFSTPAKGKSEASTSY 112
PPF + + S F P +P+ TPI + P R + TP K +S+S
Sbjct: 47 PPFGPSSSEYSSFFPFGAQQPTHDTPDLNQTQNTPIPSFVPPLRSYRTPTKTNGPSSSSG 106
Query: 113 ALRGV 117
RGV
Sbjct: 107 TKRGV 111
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.310 0.129 0.385
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,699,881
Number of Sequences: 26719
Number of extensions: 333413
Number of successful extensions: 789
Number of sequences better than 10.0: 29
Number of HSP's better than 10.0 without gapping: 6
Number of HSP's successfully gapped in prelim test: 24
Number of HSP's that attempted gapping in prelim test: 773
Number of HSP's gapped (non-prelim): 36
length of query: 247
length of database: 11,318,596
effective HSP length: 97
effective length of query: 150
effective length of database: 8,726,853
effective search space: 1309027950
effective search space used: 1309027950
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 59 (27.3 bits)
Medicago: description of AC138130.21