
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC135101.6 + phase: 0 /pseudo
(1281 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At2g15650 putative retroelement pol polyprotein 67 5e-11
At3g60170 putative protein 62 2e-09
At3g61330 copia-type polyprotein 59 1e-08
At3g59720 copia-type reverse transcriptase-like protein 59 1e-08
At1g58140 hypothetical protein 59 1e-08
At1g48710 hypothetical protein 59 1e-08
At1g36110 hypothetical protein 51 5e-06
At3g25450 hypothetical protein 47 5e-05
At1g32590 hypothetical protein, 5' partial 46 2e-04
At3g20980 unknown protein 45 3e-04
At3g20975 unknown protein, 3' partial 43 0.001
At4g04560 putative transposon protein 42 0.003
At4g04420 putative transposon protein 40 0.011
At3g21010 unknown protein 40 0.011
At2g05930 copia-like retroelement pol polyprotein 40 0.011
At3g21000 unknown protein 39 0.015
At3g21020 unknown protein 38 0.043
At3g20990 unknown protein 38 0.043
At2g05390 putative retroelement pol polyprotein 38 0.043
At3g21040 hypothetical protein 37 0.073
>At2g15650 putative retroelement pol polyprotein
Length = 1347
Score = 67.4 bits (163), Expect = 5e-11
Identities = 33/109 (30%), Positives = 54/109 (49%), Gaps = 3/109 (2%)
Query: 7 PLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLP--NNPTMAQLK-YHKERKTKKA 63
P+FDG Y+ W++K+ +W +EE V P+ P A+ K +E T
Sbjct: 10 PIFDGEKYDFWSIKMATIFRTRKLWSVVEEGVPVEPVQAEETPETARAKTLREEAVTNDT 69
Query: 64 KAKSCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNL 112
A L V + +F+RI + K WD LK+EY G ++R +++ +L
Sbjct: 70 MALQILQTAVTDQIFSRIAAASSSKEAWDVLKDEYQGSPQVRLVKLQSL 118
>At3g60170 putative protein
Length = 1339
Score = 62.0 bits (149), Expect = 2e-09
Identities = 33/110 (30%), Positives = 57/110 (51%), Gaps = 2/110 (1%)
Query: 1 FSKVAPPLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTM-AQLKYHKERK 59
F + A P FDG Y+ W++ +E +L + ++W +EE + P AQ +E K
Sbjct: 7 FVQPAIPRFDGY-YDFWSMTMENFLRSRELWRLVEEGIPAIVVGTTPVSEAQRSAVEEAK 65
Query: 60 TKKAKAKSCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQM 109
K K K+ LF + + I+ T KAIW+ +K++Y G +++ Q+
Sbjct: 66 LKDLKVKNFLFQAIDREILETILDKSTSKAIWESMKKKYQGSTKVKRAQL 115
>At3g61330 copia-type polyprotein
Length = 1352
Score = 59.3 bits (142), Expect = 1e-08
Identities = 25/106 (23%), Positives = 58/106 (54%)
Query: 7 PLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKKAKAK 66
P+ +NY+ W+++++A L A DVWE +E+ + P + + Q ++ + + KA
Sbjct: 11 PVLTKSNYDNWSLRMKAILGAHDVWEIVEKGFIEPENEGSLSQTQKDGLRDSRKRDKKAL 70
Query: 67 SCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNL 112
++ G+ E F +++ + K W+ L+ Y G ++++ +++ L
Sbjct: 71 CLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKVRLQTL 116
>At3g59720 copia-type reverse transcriptase-like protein
Length = 1272
Score = 59.3 bits (142), Expect = 1e-08
Identities = 25/106 (23%), Positives = 58/106 (54%)
Query: 7 PLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKKAKAK 66
P+ +NY+ W+++++A L A DVWE +E+ + P + + Q ++ + + KA
Sbjct: 11 PVLTKSNYDNWSLRMKAILGAHDVWEIVEKGFIEPENEGSLSQTQKDGLRDSRKRDKKAL 70
Query: 67 SCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNL 112
++ G+ E F +++ + K W+ L+ Y G ++++ +++ L
Sbjct: 71 CLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKVRLQTL 116
>At1g58140 hypothetical protein
Length = 1320
Score = 59.3 bits (142), Expect = 1e-08
Identities = 25/106 (23%), Positives = 58/106 (54%)
Query: 7 PLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKKAKAK 66
P+ +NY+ W+++++A L A DVWE +E+ + P + + Q ++ + + KA
Sbjct: 11 PVLTKSNYDNWSLRMKAILGAHDVWEIVEKGFIEPENEGSLSQTQKDGLRDSRKRDKKAL 70
Query: 67 SCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNL 112
++ G+ E F +++ + K W+ L+ Y G ++++ +++ L
Sbjct: 71 CLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKVRLQTL 116
>At1g48710 hypothetical protein
Length = 1352
Score = 59.3 bits (142), Expect = 1e-08
Identities = 25/106 (23%), Positives = 58/106 (54%)
Query: 7 PLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKKAKAK 66
P+ +NY+ W+++++A L A DVWE +E+ + P + + Q ++ + + KA
Sbjct: 11 PVLTKSNYDNWSLRMKAILGAHDVWEIVEKGFIEPENEGSLSQTQKDGLRDSRKRDKKAL 70
Query: 67 SCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNL 112
++ G+ E F +++ + K W+ L+ Y G ++++ +++ L
Sbjct: 71 CLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKVRLQTL 116
>At1g36110 hypothetical protein
Length = 745
Score = 50.8 bits (120), Expect = 5e-06
Identities = 26/108 (24%), Positives = 50/108 (46%), Gaps = 18/108 (16%)
Query: 7 PLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKKAKAK 66
P+ NY +WA++++ +WE IE P + + K A+
Sbjct: 20 PMLTSTNYTVWAIRMQIARSVNKMWETIE-----PGIDDG-------------YKNTMAR 61
Query: 67 SCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNLMS 114
+F + E++ ++ TL T K +WD +K Y G +R++ ++ LM+
Sbjct: 62 GLIFQSIPESLTLQVGTLATAKLVWDSIKTRYVGADRVKEARLQTLMA 109
>At3g25450 hypothetical protein
Length = 1343
Score = 47.4 bits (111), Expect = 5e-05
Identities = 27/108 (25%), Positives = 50/108 (46%), Gaps = 18/108 (16%)
Query: 7 PLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKKAKAK 66
P+ + NY +W +++EA L +W IE P A + K A+
Sbjct: 23 PMLNSVNYTVWTMRMEAVLRVHKLWGTIE-----------PGSAD-------EEKNDMAR 64
Query: 67 SCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNLMS 114
+ LF + E++ ++ KT A+W+ +K G ER++ ++ LM+
Sbjct: 65 ALLFQSIPESLILQVGKQKTSSAVWEAIKSRNLGAERVKEARLQTLMA 112
>At1g32590 hypothetical protein, 5' partial
Length = 1263
Score = 45.8 bits (107), Expect = 2e-04
Identities = 22/65 (33%), Positives = 38/65 (57%)
Query: 48 TMAQLKYHKERKTKKAKAKSCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSI 107
T AQ E+ K K K+ LFA + +T+ I+ +T K +W+ +K +Y G++R++S
Sbjct: 11 TGAQRTELAEKTVKDHKVKNYLFASIDKTILKTILQKETSKDLWESMKRKYQGNDRVQSA 70
Query: 108 QMLNL 112
Q+ L
Sbjct: 71 QLQRL 75
>At3g20980 unknown protein
Length = 405
Score = 45.1 bits (105), Expect = 3e-04
Identities = 27/96 (28%), Positives = 49/96 (50%), Gaps = 9/96 (9%)
Query: 10 DGNNYELWAVKIEAYLEALDVWEAIEEDYEVPP-LPNNPTMA------QLKYHKERKTKK 62
D NYE+WA ++ L +W+ ++ Y +PP L P +A L +++ K
Sbjct: 15 DELNYEIWAPIMKTSLAEKGLWDIVK--YGIPPDLSKIPELATKIRTQDLSIYRDSAVKD 72
Query: 63 AKAKSCLFAGVLETVFTRIMTLKTPKAIWDYLKEEY 98
KA L + + ++VF + + + K +WD +KE+Y
Sbjct: 73 TKALHLLQSFLPDSVFRKTLETTSAKHLWDLVKEDY 108
>At3g20975 unknown protein, 3' partial
Length = 728
Score = 43.1 bits (100), Expect = 0.001
Identities = 26/94 (27%), Positives = 45/94 (47%), Gaps = 9/94 (9%)
Query: 10 DGNNYELWAVKIEAYLEALDVWEAIEEDYEVPP-------LPNNPTMAQLKYHKERKTKK 62
D NYE+WA ++ L +W ++ Y +PP L L +++ K
Sbjct: 375 DDLNYEIWAHIMKTSLVEKGLWYVVK--YGIPPDLSKIRELATKIQAQDLSIYRDYVAKD 432
Query: 63 AKAKSCLFAGVLETVFTRIMTLKTPKAIWDYLKE 96
+KA L + + ++VF + + + T K +WD LKE
Sbjct: 433 SKALQILQSSLPDSVFRKTLEVTTAKDLWDLLKE 466
Score = 37.0 bits (84), Expect = 0.073
Identities = 24/94 (25%), Positives = 45/94 (47%), Gaps = 9/94 (9%)
Query: 10 DGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNN-PTMA------QLKYHKERKTKK 62
D N E+W+ ++ L +W+ + VPP P+ P +A +L +E +
Sbjct: 623 DDLNMEVWSAMLKTILTEEGLWDVVANG--VPPDPSKIPKLAATIQAEELCKWRELARED 680
Query: 63 AKAKSCLFAGVLETVFTRIMTLKTPKAIWDYLKE 96
KA L + + ++VF + ++ + K +WD L E
Sbjct: 681 MKALQILQSSLTDSVFRKTLSATSAKDLWDMLNE 714
>At4g04560 putative transposon protein
Length = 590
Score = 41.6 bits (96), Expect = 0.003
Identities = 23/83 (27%), Positives = 39/83 (46%), Gaps = 3/83 (3%)
Query: 2 SKVAPPLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPL--PNNPTMAQLKYHKERK 59
S + P+F+ NY W +K++ + +WE ++E PP ++P Q K E
Sbjct: 3 SHIVTPIFNKENYGFWRIKMKTIFQTKKLWEIVDEGVPKPPAEGDHSPEAVQQKTRCEAA 62
Query: 60 T-KKAKAKSCLFAGVLETVFTRI 81
+ K A L V +++F RI
Sbjct: 63 SLKDLTALQILQTAVSDSIFPRI 85
Score = 38.9 bits (89), Expect = 0.019
Identities = 27/125 (21%), Positives = 61/125 (48%), Gaps = 12/125 (9%)
Query: 1 FSKVAPPL-FDGNNYELWAVKIEAYLEALDV--WEAIEEDYEVPPLPNNP-----TMAQL 52
F + PL D +Y W V I+ ++++D+ W A+E+ + +PP + + ++
Sbjct: 339 FIAIPKPLKLDAEHYGYWKVLIKRSIQSIDMDAWFAVEDGW-MPPTTKDAKRDIVSKSRT 397
Query: 53 KYHKERKTK---KAKAKSCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQM 109
++ + KT ++A S +F +L FT++ + K +W+ L+ + ++ ++
Sbjct: 398 EWIADEKTAANHNSQALSVIFGSLLRNKFTQVQGCLSAKEVWEILQVSFECTNNVKRTRL 457
Query: 110 LNLMS 114
L S
Sbjct: 458 DMLAS 462
>At4g04420 putative transposon protein
Length = 1008
Score = 39.7 bits (91), Expect = 0.011
Identities = 28/107 (26%), Positives = 46/107 (42%), Gaps = 9/107 (8%)
Query: 8 LFDGNNYELWAVKIEAYLEAL--DVWEAIEEDYEVPPLPNNP------TMAQLKYHKERK 59
+ + NY W VK+ A + L + W A ++ P + T Q +E K
Sbjct: 15 MLEKGNYGHWKVKMRALIRGLGKEAWIATSIGWKAPVIKGEDGEDVLKTKDQWNDAEEAK 74
Query: 60 TK-KAKAKSCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIR 105
K ++A S +F V + F RI ++ K WD L + Y G ++
Sbjct: 75 AKANSRALSLIFNFVNQNQFKRIQNCESAKEAWDKLAKAYEGTSSVK 121
>At3g21010 unknown protein
Length = 438
Score = 39.7 bits (91), Expect = 0.011
Identities = 25/106 (23%), Positives = 50/106 (46%), Gaps = 9/106 (8%)
Query: 10 DGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNN-PTMA------QLKYHKERKTKK 62
D NYE+WA + L +W+ +E VPP P+ P +A + ++ K
Sbjct: 13 DDLNYEIWAPIAKTTLTEKGLWDVVENG--VPPDPSKVPELAATIQAEEFSKWRDLAEKD 70
Query: 63 AKAKSCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQ 108
KA L + + ++ F + ++ + K +WD L++ ++R ++
Sbjct: 71 MKALQILQSSLTDSAFRKTLSASSAKDLWDSLEKGNNEQAKLRRLE 116
>At2g05930 copia-like retroelement pol polyprotein
Length = 916
Score = 39.7 bits (91), Expect = 0.011
Identities = 25/107 (23%), Positives = 45/107 (41%), Gaps = 9/107 (8%)
Query: 8 LFDGNNYELWAVKIEAYLEAL--DVWEAIEEDYEVPPLPNNPTMAQLKYHKE-------R 58
+ + NY W VK+ A + L + W A ++ P + LK + +
Sbjct: 27 ILEKGNYGHWKVKMRALIRGLGKEAWIATSIGWKAPVIKGEDGEDVLKTEDQWNDAEEAK 86
Query: 59 KTKKAKAKSCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIR 105
T ++A S +F V + F +I ++ K WD L + Y G ++
Sbjct: 87 ATANSRALSLIFNSVNQNQFKQIQNCESAKEAWDKLAKAYEGTSSVK 133
>At3g21000 unknown protein
Length = 405
Score = 39.3 bits (90), Expect = 0.015
Identities = 28/114 (24%), Positives = 56/114 (48%), Gaps = 12/114 (10%)
Query: 10 DGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMA------QLKYHKERKTKKA 63
D +YE+WA ++ L +W+ + P NP +A +L ++ K A
Sbjct: 13 DQFDYEIWAPITKSTLIEQGLWDVVVNGVPQDP-SKNPELAATIQPEELSKWRDFVVKDA 71
Query: 64 KAKSCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDER--IRSIQMLNLMSL 115
KA L + + ++VF + ++ + K +WD L++ G+E+ IR ++ + + L
Sbjct: 72 KALQILQSSLTDSVFRKTLSASSAKDVWDLLRK---GNEQATIRRLEQVTIRRL 122
>At3g21020 unknown protein
Length = 310
Score = 37.7 bits (86), Expect = 0.043
Identities = 25/106 (23%), Positives = 49/106 (45%), Gaps = 9/106 (8%)
Query: 10 DGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNN-PTMA------QLKYHKERKTKK 62
D NYE+WA + L +W+ +E VPP P+ P +A +L ++ K
Sbjct: 13 DDLNYEIWARIAKTTLTEKGLWDVVENG--VPPDPSKVPELAAKIQPEELSKWRDLAVKD 70
Query: 63 AKAKSCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQ 108
KA L + + ++ + + + K +WD L++ ++R ++
Sbjct: 71 MKALQNLQSSLTDSALRKTLYASSAKDVWDLLEKGNNEQAKLRRLE 116
>At3g20990 unknown protein
Length = 379
Score = 37.7 bits (86), Expect = 0.043
Identities = 26/107 (24%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 10 DGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNN-PTMA------QLKYHKERKTKK 62
D +YE+W+ +A L ++W+ +E VPP P+ P +A +L ++ K
Sbjct: 13 DDFDYEIWSSVTKATLTEKELWDVVENG--VPPDPSKIPELATKIQPEELSRWRDLAVKD 70
Query: 63 AKAKSCLFAGVL-ETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQ 108
KA L + L ++ F + ++ + K +WD L++ ++R ++
Sbjct: 71 MKALQILQSSSLTDSAFRKTLSASSAKDLWDSLEKGNNEQAKLRRLE 117
>At2g05390 putative retroelement pol polyprotein
Length = 1307
Score = 37.7 bits (86), Expect = 0.043
Identities = 26/94 (27%), Positives = 42/94 (44%), Gaps = 18/94 (19%)
Query: 21 IEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKKAKAKSCLFAGVLETVFTR 80
+EA L VWE I+ P + M K A++ LF V E+ +
Sbjct: 1 MEATLRVHKVWETID--------PGSDDME----------KNDMARALLFQSVPESTILQ 42
Query: 81 IMTLKTPKAIWDYLKEEYAGDERIRSIQMLNLMS 114
+ KT KA+W+ +K G ER++ ++ LM+
Sbjct: 43 VGKHKTSKAMWEAIKTRNLGAERVKEAKLQTLMA 76
>At3g21040 hypothetical protein
Length = 534
Score = 37.0 bits (84), Expect = 0.073
Identities = 22/98 (22%), Positives = 44/98 (44%), Gaps = 5/98 (5%)
Query: 10 DGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYH-----KERKTKKAK 64
D +YE W+ E L VW+ ++ P + A +K + R + K
Sbjct: 11 DDFDYEHWSPIAETRLVENGVWDVVQNGVSPNPTTDPEVAATIKVEDLTQWRNRMIEDNK 70
Query: 65 AKSCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDE 102
A + + + ++VF + +++ + K +WD LK+ +E
Sbjct: 71 ALKIMQSALPDSVFRKTISVASSKELWDLLKKGNDNEE 108
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.361 0.159 0.550
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 22,186,126
Number of Sequences: 26719
Number of extensions: 759420
Number of successful extensions: 3191
Number of sequences better than 10.0: 31
Number of HSP's better than 10.0 without gapping: 18
Number of HSP's successfully gapped in prelim test: 13
Number of HSP's that attempted gapping in prelim test: 3136
Number of HSP's gapped (non-prelim): 52
length of query: 1281
length of database: 11,318,596
effective HSP length: 111
effective length of query: 1170
effective length of database: 8,352,787
effective search space: 9772760790
effective search space used: 9772760790
T: 11
A: 40
X1: 14 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (21.9 bits)
S2: 66 (30.0 bits)
Medicago: description of AC135101.6