
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC142394.3 - phase: 0
(321 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
pir||A96682 protein F1E22.12 [imported] - Arabidopsis thaliana g... 176 1e-42
ref|NP_850866.1| reverse transcriptase-related [Arabidopsis thal... 160 4e-38
ref|NP_680680.1| reverse transcriptase-related [Arabidopsis thal... 158 2e-37
ref|NP_680357.1| RNase H domain-containing protein [Arabidopsis ... 154 4e-36
gb|AAC26674.1| putative non-LTR retroelement reverse transcripta... 153 6e-36
gb|AAC63844.1| putative non-LTR retroelement reverse transcripta... 148 2e-34
dbj|BAB09192.1| non-LTR retroelement reverse transcriptase-like ... 146 7e-34
dbj|BAB09815.1| non-LTR retroelement reverse transcriptase-like ... 144 3e-33
gb|AAD22368.1| putative non-LTR retroelement reverse transcripta... 144 3e-33
gb|AAD21515.1| putative reverse transcriptase [Arabidopsis thali... 134 4e-30
ref|NP_680382.1| hypothetical protein [Arabidopsis thaliana] 128 2e-28
gb|AAV68820.1| hypothetical protein AT1G17390 [Arabidopsis thali... 126 1e-27
gb|AAF79490.1| F1L3.4 [Arabidopsis thaliana] gi|25402823|pir||D8... 118 2e-25
emb|CAB78008.1| putative protein [Arabidopsis thaliana] gi|73210... 115 1e-24
pir||B96652 protein F23N19.5 [imported] - Arabidopsis thaliana g... 109 9e-23
ref|NP_198321.2| hypothetical protein [Arabidopsis thaliana] 108 3e-22
ref|NP_680149.1| reverse transcriptase-related [Arabidopsis thal... 104 4e-21
ref|NP_680665.1| hypothetical protein [Arabidopsis thaliana] 104 4e-21
ref|NP_175165.1| expressed protein [Arabidopsis thaliana] 91 5e-17
gb|AAG28895.1| F12A21.24 [Arabidopsis thaliana] 87 7e-16
>pir||A96682 protein F1E22.12 [imported] - Arabidopsis thaliana
gi|6686397|gb|AAF23831.1| F1E22.12 [Arabidopsis
thaliana]
Length = 1055
Score = 176 bits (445), Expect = 1e-42
Identities = 100/293 (34%), Positives = 145/293 (49%), Gaps = 3/293 (1%)
Query: 17 LWNIRAPERIKHFMSMLYHGRLSTNLRKHKMGIG-NPMCRFCHDEIESEIHVLRDCPKAT 75
LW +R PER+K F+ ++ + + T +H+ + + +C+ C +ES +HVLRDCP
Sbjct: 400 LWKVRVPERVKTFLWLVGNQAVMTEEERHRRHLSASNVCQVCKGGVESMLHVLRDCPAQL 459
Query: 76 ALWLCVVDNAARTSFFEGDLSSWIKFNLFTNVYWNNNVEWRDFWATACHTIWNWRNKETH 135
+W+ VV + FF L W+ NL + + W +A W WR
Sbjct: 460 GIWVRVVPQRRQQGFFSKSLFEWLYDNLGDRSGCED-IPWSTIFAVIIWWGWKWRCGNIF 518
Query: 136 NDNYQRPLHAKNIIMIYVNDYHNAIAKFVIVTSQPRHLEEVGWKEPAIRWVKINTDGACK 195
+N + K + V Y ++ +QPR +GW P + WVK+NTDGA +
Sbjct: 519 GENTKCRDRVKFVKEWAVEVYRAHSGNVLVGITQPRVERMIGWVSPCVGWVKVNTDGASR 578
Query: 196 DG-SIAGCGGLIRGSEGEWLAGFSKFLGKCDAFIAELWGVLEGLRCAKRMGFTAVELNVD 254
+A GG++R G W GFS +G+C A AELWGV GL A VEL VD
Sbjct: 579 GNPGLASAGGVLRDCTGAWCGGFSLNIGRCSAPQAELWGVYYGLYFAWEKKVPRVELEVD 638
Query: 255 SLVVVNIITSERESNASGRSLVQKIRKLLQMEWKVKVKHSYREANRCADALAN 307
S V+V + + + LV+ LQ +W V++ H YREANR AD LAN
Sbjct: 639 SEVIVGFLKTGISDSHPLSFLVRLCHGFLQKDWLVRIVHVYREANRLADGLAN 691
>ref|NP_850866.1| reverse transcriptase-related [Arabidopsis thaliana]
Length = 411
Score = 160 bits (406), Expect = 4e-38
Identities = 98/310 (31%), Positives = 154/310 (49%), Gaps = 10/310 (3%)
Query: 16 KLWNIRAPERIKHFMSMLYHGRLSTNLRKHKMGI-GNPMCRFCHDEIESEIHVLRDCPKA 74
++W I+ PER++ F +L + + TN +H+ + + +C C +ES +H+LRDCP
Sbjct: 82 RVWRIKVPERVRVFFWLLGNQAIMTNEERHRRHLCASDLCEVCKGGVESAMHILRDCPTM 141
Query: 75 TALWLCVVDNAARTSFFEGDLSSWIKFNLFTNVYWNNNVEWRDFWATACHTIWNWRNKET 134
+WL +V R FF+ L W+ NL + + V W +A W WR
Sbjct: 142 EGIWLRIVPPRKRHEFFKKSLFEWVYGNLGDDAVVDG-VPWSVMFAMGVWWGWKWRCGNV 200
Query: 135 HNDNYQ---RPLHAKNIIMIYVNDYHNAIAKFVIVTSQPRHLEEVGWKEPAIRWVKINTD 191
+N + R KN+ V H + A ++ L +GW P + WVK+NTD
Sbjct: 201 FGENRKCRDRVQFVKNVAK-EVFQAHGSRAGNDRGRNRVERL--IGWGSPGVGWVKLNTD 257
Query: 192 GACKDG-SIAGCGGLIRGSEGEWLAGFSKFLGKCDAFIAELWGVLEGLRCAKRMGFTAVE 250
GA + +A GG++R EG W GF+ +G+C A +AELW V GL A + +E
Sbjct: 258 GASRGNPGLATAGGVLRDEEGNWRGGFAANIGRCSAPLAELWEVYYGLYFAWAIRIPQLE 317
Query: 251 LNVDSLVVVNIITSERESNASGRSLVQKIRKLLQMEWKVKVKHSYREANRCADALANIGC 310
L VDS +VV + + LV+ L +W+V++ H YREAN + LAN
Sbjct: 318 LEVDSEIVVGFLKTGISDTHPLFFLVRLCYGFLSKDWRVQITHVYREANH-LNGLANYAF 376
Query: 311 IMGNEMMFYE 320
+ +++ FY+
Sbjct: 377 SLPSDLHFYD 386
>ref|NP_680680.1| reverse transcriptase-related [Arabidopsis thaliana]
Length = 396
Score = 158 bits (399), Expect = 2e-37
Identities = 98/304 (32%), Positives = 150/304 (49%), Gaps = 13/304 (4%)
Query: 11 DEIWLKLWNIRAPERIKHFMSMLYHGRLSTNLRKHKMGIGNP-MCRFCHDEIESEIHVLR 69
+ + ++W + APER++ F+ ++ + + TN+ + + +G+ +C+ C E+ IHVLR
Sbjct: 61 ERFFSRIWRVLAPERVRVFLWLVGNQAILTNVERARRHMGDTDVCQVCRGAKETIIHVLR 120
Query: 70 DCPKATALWLCVVDNAARTSFFEGDLSSWIKFNLFTNVYWNNNVEWRDFWATACHTIWNW 129
DCP + +WL +V R FF L W+ NL + W +A + W W
Sbjct: 121 DCPAMSGIWLRLVPARERERFFTRSLLEWLYENLHQKEDPSRG-GWPTLFAMSIWWGWKW 179
Query: 130 RNKETHNDNYQRPLHAKNIIMIYVNDYHNAIAKFVIVTSQPRHLEE-----VGWKEPAIR 184
R + RP + + +V D + + + R+L E + WK+P+ R
Sbjct: 180 RCMNVFGE--LRPCRDR---VRFVRDMAAEVGAAYLKVNAGRNLGERVERQIVWKKPSER 234
Query: 185 WVKINTDGACKDG-SIAGCGGLIRGSEGEWLAGFSKFLGKCDAFIAELWGVLEGLRCAKR 243
W +NTDGA A G++R EG WL GF+ +G C A +AELWGV GL A
Sbjct: 235 WATVNTDGASHGNPGFATARGVVRDGEGSWLGGFALNIGVCSAPLAELWGVYYGLVTAWE 294
Query: 244 MGFTAVELNVDSLVVVNIITSERESNASGRSLVQKIRKLLQMEWKVKVKHSYREANRCAD 303
G V L V+S +VV + S LV+ + + +W V+V H YREANR AD
Sbjct: 295 RGVRRVVLEVESKLVVGFLQSGIRDTHPLAFLVRLCQGFIARDWLVRVTHVYREANRLAD 354
Query: 304 ALAN 307
LAN
Sbjct: 355 GLAN 358
>ref|NP_680357.1| RNase H domain-containing protein [Arabidopsis thaliana]
Length = 633
Score = 154 bits (388), Expect = 4e-36
Identities = 91/301 (30%), Positives = 147/301 (48%), Gaps = 17/301 (5%)
Query: 16 KLWNIRAPERIKHFMSMLYHGRLSTNLRKHKMGIG-NPMCRFCHDEIESEIHVLRDCPKA 74
++W + ER++ F+ ++ H + T++ + + + + +C+ C E+ +HVLRDCP
Sbjct: 290 RVWRVTTTERVRVFIWLVVHQVIMTDVERRRRHLSASGVCQVCKGGDETILHVLRDCPSI 349
Query: 75 TALWLCVVDNAARTSFFEGDLSSWIKFNLFTNVYWNNNVEWRDFWATACHTIWNWRNKET 134
+W +V T+FF ++ W+ NL ++V W +A W WR
Sbjct: 350 AGIWGRLVPRGKITAFFASNILDWVYQNL-SDVTEIRGCPWATLFAIVVWWAWKWRCGNV 408
Query: 135 HNDNYQRPLH-------AKNIIMIYVNDYHNAIAKFVIVTSQPRHLEEVGWKEPAIRWVK 187
+N + A+ I + ++N A+ + S + W P+ W K
Sbjct: 409 FGENGRCRDRVRFVVDQAREIWIAHLNLRRGAMRGSEVEMS-------IKWTPPSTGWFK 461
Query: 188 INTDGACKDG-SIAGCGGLIRGSEGEWLAGFSKFLGKCDAFIAELWGVLEGLRCAKRMGF 246
+NTDGA + +A GG++R EG+W GF +G C A +AELWGV GL A G
Sbjct: 462 LNTDGASRGNPGLATAGGVVRDGEGQWCVGFVLNIGICSAPLAELWGVYYGLHIAWERGI 521
Query: 247 TAVELNVDSLVVVNIITSERESNASGRSLVQKIRKLLQMEWKVKVKHSYREANRCADALA 306
+EL VDS +VV + + E + LV+ + +W V++ H YREANR AD LA
Sbjct: 522 RRLELEVDSTLVVGFLQAGIEDSHPLSFLVRLCYGFISRDWIVRISHVYREANRLADGLA 581
Query: 307 N 307
N
Sbjct: 582 N 582
>gb|AAC26674.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana] gi|25411326|pir||C84488 hypothetical protein
At2g07730 [imported] - Arabidopsis thaliana
Length = 970
Score = 153 bits (387), Expect = 6e-36
Identities = 100/301 (33%), Positives = 146/301 (48%), Gaps = 23/301 (7%)
Query: 16 KLWNIRAPERIKHFMSMLYHGRLSTNL---RKHKMGIGNPMCRFCHDEIESEIHVLRDCP 72
++W + APER++ F+ ++ H + TN+ R+H I C C+ ES +HVLRDCP
Sbjct: 646 QIWKLVAPERVRVFIWLVSHMVIMTNVERVRRHLSDIAT--CSVCNGADESILHVLRDCP 703
Query: 73 KATALWLCVVDNAARTSFFEGDLSSWIKFNLFTNVYWNNNVEWRDFWATACHTIWNWRNK 132
T +W ++ + FF W LFTN+ +W ++ W WR
Sbjct: 704 AMTPIWQRLLPQRRQNEFFSQ--FEW----LFTNLDPAKG-DWPTLFSMGIWWAWKWRCG 756
Query: 133 ETHNDNYQRPLHAKNIIMIYVNDYHNAIAKFVIVT-----SQPRHLEEVGWKEPAIRWVK 187
+ + K ++ D + K + T + R + WK P+ RWVK
Sbjct: 757 DVFGERKLCRDRLK-----FIKDIAEEVRKAHVGTLNNHVKRARVERMIRWKAPSDRWVK 811
Query: 188 INTDGACKDGS-IAGCGGLIRGSEGEWLAGFSKFLGKCDAFIAELWGVLEGLRCAKRMGF 246
+ TDGA + +A G I +GEWL GF+ +G CDA +AELWG GL A GF
Sbjct: 812 LTTDGASRGHQGLAAASGAILNLQGEWLGGFALNIGSCDAPLAELWGAYYGLLIAWDKGF 871
Query: 247 TAVELNVDSLVVVNIITSERESNASGRSLVQKIRKLLQMEWKVKVKHSYREANRCADALA 306
VELN+DS +VV +++ LV+ + +W V+V H YREANR AD LA
Sbjct: 872 RRVELNLDSELVVGFLSTGISKAHPLSFLVRLCQGFFTRDWLVRVSHVYREANRLADGLA 931
Query: 307 N 307
N
Sbjct: 932 N 932
>gb|AAC63844.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana] gi|25408124|pir||C84716 hypothetical protein
At2g31080 [imported] - Arabidopsis thaliana
Length = 1231
Score = 148 bits (373), Expect = 2e-34
Identities = 97/300 (32%), Positives = 147/300 (48%), Gaps = 19/300 (6%)
Query: 16 KLWNIRAPERIKHFMSMLYHGRLSTNLRKHKMGIG-NPMCRFCHDEIESEIHVLRDCPKA 74
++W + PER++ F+ ++ + TN+ + + + N +C C+ E+ +HVLRDCP
Sbjct: 905 RIWKLITPERVRVFIWLVSQNVIMTNVERVRRHLSENAICSVCNGAEETILHVLRDCPAM 964
Query: 75 TALWLCVVDNAARTSFFEGDLSSWIKFNLFTNVYWNNNVEWRDFWATACHTIWNWRNKET 134
+W ++ FF L W LFTN+ + W + W WR +
Sbjct: 965 EPIWRRLLPLRRHHEFFSQSLLEW----LFTNMDPVKGI-WPTLFGMGIWWAWKWRCCDV 1019
Query: 135 HNDNYQRPLHAKNIIMIYVNDYHNAIAKFVI--VTSQPRHLEE---VGWKEPAIRWVKIN 189
+ K ++ D + + + V ++P + + W+ P+ WVKI
Sbjct: 1020 FGERKICRDRLK-----FIKDMAEEVRRVHVGAVGNRPNGVRVERMIRWQVPSDGWVKIT 1074
Query: 190 TDGACKDG-SIAGCGGLIRGSEGEWLAGFSKFLGKCDAFIAELWGVLEGLRCAKRMGFTA 248
TDGA + +A GG IR +GEWL GF+ +G C A +AELWG GL A GF
Sbjct: 1075 TDGASRGNHGLAAAGGAIRNGQGEWLGGFALNIGSCAAPLAELWGAYYGLLIAWDKGFRR 1134
Query: 249 VELNVDSLVVVNIITSERESNASGRS-LVQKIRKLLQMEWKVKVKHSYREANRCADALAN 307
VEL++D +VV + S SNA S LV+ + +W V+V H YREANR AD LAN
Sbjct: 1135 VELDLDCKLVVGFL-STGVSNAHPLSFLVRLCQGFFTRDWLVRVSHVYREANRLADGLAN 1193
>dbj|BAB09192.1| non-LTR retroelement reverse transcriptase-like protein
[Arabidopsis thaliana]
Length = 308
Score = 146 bits (369), Expect = 7e-34
Identities = 84/248 (33%), Positives = 119/248 (47%), Gaps = 2/248 (0%)
Query: 61 IESEIHVLRDCPKATALWLCVVDNAARTSFFEGDLSSWIKFNLFTNVYWNNNVEWRDFWA 120
+ES +HV RDCP +W+ V + FF L W+ NL + + W +A
Sbjct: 24 VESVLHVFRDCPAQLGIWVRFVPRRRQQGFFSKSLFEWLYDNLCDRSSCED-IPWSTIFA 82
Query: 121 TACHTIWNWRNKETHNDNYQRPLHAKNIIMIYVNDYHNAIAKFVIVTSQPRHLEEVGWKE 180
W WR +N + K + V Y + ++ ++QPR +GW
Sbjct: 83 VIIWWGWKWRCSNIFGENTKCRDRVKFVKEWVVEVYRAHLGNALVGSTQPRVERLIGWVL 142
Query: 181 PAIRWVKINTDGACKDG-SIAGCGGLIRGSEGEWLAGFSKFLGKCDAFIAELWGVLEGLR 239
P + WVK+NTDGA + +A GG++R EG W GFS +G+C A AELWGV GL
Sbjct: 143 PCVGWVKVNTDGASRGNPGLASAGGVLRDCEGAWCGGFSLNIGRCSAQHAELWGVYYGLY 202
Query: 240 CAKRMGFTAVELNVDSLVVVNIITSERESNASGRSLVQKIRKLLQMEWKVKVKHSYREAN 299
A VEL VDS +V + + + LV+ LQ +W V++ + YREAN
Sbjct: 203 FAWEKKVPRVELEVDSEAIVGFLKTGISDSHPLSFLVRLCHNFLQKDWLVRIVYVYREAN 262
Query: 300 RCADALAN 307
AD LAN
Sbjct: 263 CLADGLAN 270
>dbj|BAB09815.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
thaliana]
Length = 676
Score = 144 bits (363), Expect = 3e-33
Identities = 95/298 (31%), Positives = 140/298 (46%), Gaps = 11/298 (3%)
Query: 16 KLWNIRAPERIKHFMSMLYHGRLSTNLRKHKMGIGNP-MCRFCHDEIESEIHVLRDCPKA 74
++W + PER + F+ ++ + + TN + + + + +C C ES IHVLRDCP
Sbjct: 346 RVWRVMVPERARIFLWLVGNQVVLTNAERVRRHMADSDVCPLCKGASESLIHVLRDCPAM 405
Query: 75 TALWLCVVDNAARTSFFEGDLSSWIKFNLFTNVYWNNNVEWRDFWATACHTIWNWRNKET 134
+W+ VV + FFE L W+ NL + W +A W WR
Sbjct: 406 MGIWMRVVPVMEQRRFFETSLLEWMYGNLKERSD-SERRSWPTLFALTVWWGWKWRCGYV 464
Query: 135 HNDNYQRPLHAKNIIMIYVNDYHNAIAKFVIVTSQPRH--LEE--VGWKEPAIRWVKINT 190
++ + ++ + + A + R L E + W++PA WV +NT
Sbjct: 465 FGEDSR----CRDRVKFLKSAVAEVEAAHLAANGDAREDVLVERMIAWRKPAEGWVTMNT 520
Query: 191 DGACKDG-SIAGCGGLIRGSEGEWLAGFSKFLGKCDAFIAELWGVLEGLRCAKRMGFTAV 249
DGA A GG+IR G WL GF+ +G C A +AELWGV GL A G+ V
Sbjct: 521 DGASHGNPGQATAGGVIRDEHGSWLVGFALNIGVCSAPLAELWGVYYGLVVAWERGWRRV 580
Query: 250 ELNVDSLVVVNIITSERESNASGRSLVQKIRKLLQMEWKVKVKHSYREANRCADALAN 307
L VDS +VV + S + LV+ + +W V++ H YREANR AD LAN
Sbjct: 581 RLEVDSALVVGFLQSGIGDSHPLAFLVRLCHGFISKDWIVRITHVYREANRLADGLAN 638
>gb|AAD22368.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana] gi|25412100|pir||F84611 hypothetical protein
At2g22350 [imported] - Arabidopsis thaliana
gi|15227192|ref|NP_179822.1| RNase H domain-containing
protein [Arabidopsis thaliana]
Length = 321
Score = 144 bits (363), Expect = 3e-33
Identities = 96/293 (32%), Positives = 138/293 (46%), Gaps = 18/293 (6%)
Query: 22 APERIKHFMSMLYHGRLSTNLRKHKMGIGNP-MCRFCHDEIESEIHVLRDCPKATALWLC 80
APER++ F+ ++ + TN+ +++ + + +C+ C E+ +HVLRDCP +W
Sbjct: 2 APERVRVFLWLVVQQVIITNVERYRRHLSDTRVCQICQGGEETILHVLRDCPAMAGIWSR 61
Query: 81 VVDNAARTSFFEGDLSSWIKFNLFTNVYWNNNVEWRDFWATACHTIWNWRNKETHNDNYQ 140
+V FF L WI NL W +W W WR N +
Sbjct: 62 LVPRDQIRQFFTASLLEWIYKNLRERGSWPTVFVMAVWWG------WKWRCGNIFGGNGK 115
Query: 141 RPLHAKNIIMIYVNDYHN--AIAKFVIVTSQPR--HLEE-VGWKEPAIRWVKINTDGACK 195
K ++ D AIA + ++ R +E V W P WVK+NTDGA +
Sbjct: 116 CRDRVK-----FIKDLAEEVAIANAFVKGNEVRVSRVERLVSWVSPEDGWVKLNTDGASR 170
Query: 196 DG-SIAGCGGLIRGSEGEWLAGFSKFLGKCDAFIAELWGVLEGLRCAKRMGFTAVELNVD 254
A GG++R G W+ GF+ +G C A +AELWGV GL A G VEL VD
Sbjct: 171 GNPGFATAGGVLRDHNGAWIGGFAVNIGVCSAPLAELWGVYYGLFIAWGRGARRVELEVD 230
Query: 255 SLVVVNIITSERESNASGRSLVQKIRKLLQMEWKVKVKHSYREANRCADALAN 307
S +VV +T+ + L++ L W V++ H YREANR AD LAN
Sbjct: 231 SKMVVGFLTTGIADSHPLSFLLRLCYDFLSKGWIVRISHVYREANRLADGLAN 283
>gb|AAD21515.1| putative reverse transcriptase [Arabidopsis thaliana]
gi|20197456|gb|AAM15081.1| putative reverse
transcriptase [Arabidopsis thaliana]
gi|25407930|pir||H84677 hypothetical protein At2g27870
[imported] - Arabidopsis thaliana
gi|15226268|ref|NP_180354.1| hypothetical protein
[Arabidopsis thaliana]
Length = 314
Score = 134 bits (337), Expect = 4e-30
Identities = 86/279 (30%), Positives = 128/279 (45%), Gaps = 16/279 (5%)
Query: 38 LSTNLRKHKMGIGNP-MCRFCHDEIESEIHVLRDCPKATALWLCVVDNAARTSFFEGDLS 96
L TN + + + + +C+ C ++ IH+LRDCP +W+ +V R FF L
Sbjct: 5 LMTNAERRRRHLSDSDICQICKGAEKTIIHILRDCPAMEGIWIRLVPAGKRREFFTQSLL 64
Query: 97 SWIKFNLFTNVYWNNNVEWRDFWATACHTIWNWRNKE---THNDNYQRPLHAKNIIMIYV 153
W+ NL + W +A + W WR + R K++
Sbjct: 65 EWLFANLGDRRKTCEST-WSTLFALSIWWAWKWRCGNIFGVQDKCRDRVRFLKDLAR--- 120
Query: 154 NDYHNAIAKFVIVTSQPRHLEEV----GWKEPAIRWVKINTDGACKDG-SIAGCGGLIRG 208
++A ++ T H E V W +P W K+NTDGA + +A GG++R
Sbjct: 121 ---ETSMAHVIVRTLSGGHGERVERLIAWSKPEEGWWKLNTDGASRGNPGLASAGGVLRD 177
Query: 209 SEGEWLAGFSKFLGKCDAFIAELWGVLEGLRCAKRMGFTAVELNVDSLVVVNIITSERES 268
EG W GF+ +G C A +AELWGV GL A T +E+ VDS +VV +
Sbjct: 178 EEGAWRGGFALNIGVCSAPLAELWGVYYGLYIAWERRVTRLEIEVDSEIVVGFLKIGINE 237
Query: 269 NASGRSLVQKIRKLLQMEWKVKVKHSYREANRCADALAN 307
LV+ + +W+V++ H YREANR AD LAN
Sbjct: 238 VHPLSFLVRLCHDFISRDWRVRISHVYREANRLADGLAN 276
>ref|NP_680382.1| hypothetical protein [Arabidopsis thaliana]
Length = 258
Score = 128 bits (322), Expect = 2e-28
Identities = 73/230 (31%), Positives = 107/230 (45%), Gaps = 2/230 (0%)
Query: 61 IESEIHVLRDCPKATALWLCVVDNAARTSFFEGDLSSWIKFNLFTNVYWNNNVEWRDFWA 120
+ES +HV RDCP +W+ V + FF L W+ NL + + W +A
Sbjct: 24 VESVLHVFRDCPAQLGIWVRFVPRRRQQGFFSKSLFEWLYDNLCDRSSCED-IPWSTIFA 82
Query: 121 TACHTIWNWRNKETHNDNYQRPLHAKNIIMIYVNDYHNAIAKFVIVTSQPRHLEEVGWKE 180
W WR +N + K + V Y + ++ ++QPR +GW
Sbjct: 83 VIIWWGWKWRCSNIFGENTKCRDRVKFVKEWVVEVYRAHLGNALVGSTQPRVERLIGWVL 142
Query: 181 PAIRWVKINTDGACKDG-SIAGCGGLIRGSEGEWLAGFSKFLGKCDAFIAELWGVLEGLR 239
P + WVK+NTDGA + +A GG++R EG W GFS +G+C A AELWGV GL
Sbjct: 143 PCVGWVKVNTDGASRGNPGLASAGGVLRDCEGAWCGGFSLNIGRCSAQHAELWGVYYGLY 202
Query: 240 CAKRMGFTAVELNVDSLVVVNIITSERESNASGRSLVQKIRKLLQMEWKV 289
A VEL VDS +V + + + LV+ LQ +W++
Sbjct: 203 FAWEKKVPRVELEVDSEAIVGFLKTGISDSHPLSFLVRLCHNFLQKDWRI 252
>gb|AAV68820.1| hypothetical protein AT1G17390 [Arabidopsis thaliana]
gi|18394475|ref|NP_564020.1| hypothetical protein
[Arabidopsis thaliana] gi|9665118|gb|AAF97302.1|
Hypothetical protein [Arabidopsis thaliana]
Length = 272
Score = 126 bits (316), Expect = 1e-27
Identities = 74/232 (31%), Positives = 111/232 (46%), Gaps = 2/232 (0%)
Query: 77 LWLCVVDNAARTSFFEGDLSSWIKFNLFTNVYWNNNVEWRDFWATACHTIWNWRNKETHN 136
+W ++ +SFF L WI NL + N W ++ A W WR
Sbjct: 4 IWTRLLPVRRLSSFFSKSLLEWIYANLGEEIEING-CPWAVTFSQAIWWGWKWRYGNIFG 62
Query: 137 DNYQRPLHAKNIIMIYVNDYHNAIAKFVIVTSQPRHLEEVGWKEPAIRWVKINTDGACKD 196
+N + + I ++ + + K + T R + W P + W K+NTDGA +
Sbjct: 63 ENKKCRDRVRFIKDRALDVWKAHVHKMGVTTRTAREERLIAWSPPRVGWFKLNTDGASRG 122
Query: 197 GS-IAGCGGLIRGSEGEWLAGFSKFLGKCDAFIAELWGVLEGLRCAKRMGFTAVELNVDS 255
+A GG++R +G W GFS +G C A +AELWG GL A G T +E+ +DS
Sbjct: 123 NPRLATAGGVVRDGDGNWCYGFSLNIGICSAPLAELWGAYYGLNIAWERGVTQLEMEIDS 182
Query: 256 LVVVNIITSERESNASGRSLVQKIRKLLQMEWKVKVKHSYREANRCADALAN 307
+VV + + + + LV+ LL +W V++ H YREANR AD LAN
Sbjct: 183 EMVVGFLRTGIDDSHPLSFLVRLCHGLLSKDWSVRISHVYREANRLADGLAN 234
>gb|AAF79490.1| F1L3.4 [Arabidopsis thaliana] gi|25402823|pir||D86310 protein
F1L3.4 [imported] - Arabidopsis thaliana
Length = 253
Score = 118 bits (296), Expect = 2e-25
Identities = 69/211 (32%), Positives = 102/211 (47%), Gaps = 2/211 (0%)
Query: 98 WIKFNLFTNVYWNNNVEWRDFWATACHTIWNWRNKETHNDNYQRPLHAKNIIMIYVNDYH 157
WI NL + N W ++ A W WR +N + + I ++ +
Sbjct: 6 WIYANLGEEIEING-CPWAVTFSQAIWWGWKWRYGNIFGENKKCRDRVRFIKDRALDVWK 64
Query: 158 NAIAKFVIVTSQPRHLEEVGWKEPAIRWVKINTDGACKDGS-IAGCGGLIRGSEGEWLAG 216
+ K + T R + W P + W K+NTDGA + +A GG++R +G W G
Sbjct: 65 AHVHKMGVTTRTAREERLIAWSPPRVGWFKLNTDGASRGNPRLATAGGVVRDGDGNWCYG 124
Query: 217 FSKFLGKCDAFIAELWGVLEGLRCAKRMGFTAVELNVDSLVVVNIITSERESNASGRSLV 276
FS +G C A +AELWG GL A G T +E+ +DS +VV + + + + LV
Sbjct: 125 FSLNIGICSAPLAELWGAYYGLNIAWERGVTQLEMEIDSEMVVGFLRTGIDDSHPLSFLV 184
Query: 277 QKIRKLLQMEWKVKVKHSYREANRCADALAN 307
+ LL +W V++ H YREANR AD LAN
Sbjct: 185 RLCHGLLSKDWSVRISHVYREANRLADGLAN 215
>emb|CAB78008.1| putative protein [Arabidopsis thaliana] gi|7321072|emb|CAB82119.1|
putative protein [Arabidopsis thaliana]
gi|25407453|pir||H85088 hypothetical protein AT4g08830
[imported] - Arabidopsis thaliana
Length = 947
Score = 115 bits (289), Expect = 1e-24
Identities = 85/260 (32%), Positives = 116/260 (43%), Gaps = 37/260 (14%)
Query: 16 KLWNIRAPERIKHFMSMLYHGRLSTNLRKHKMGIGNP-MCRFCHDEIESEIHVLRDCPKA 74
+LW + A ER+K F L+H IG+ +C+ C E+ +HVL+DCP
Sbjct: 695 RLWRVVALERVKTF---LWH-------------IGDTSVCQVCKGGDETILHVLKDCPSI 738
Query: 75 TALWLCVVDNAARTSFFEGDLSSWIKFNLFTNVYWNNNVEWRDFWATACHTI----WNWR 130
+W +V FF G L W+ NL N E WAT + W WR
Sbjct: 739 AGIWRRLVQVQRSYDFFNGSLFGWLYVNLGMK-----NAETGYAWATLFAIVVWWSWKWR 793
Query: 131 NKETHNDNYQRPLHAKNIIMIYVNDYHNAIAKFVIVTSQPRHLEE-----VGWKEPAIRW 185
+ + K + D ++ + SQ L V WK P W
Sbjct: 794 CGYVFGEVGKCRDRVK-----FFRDLAAEVSHAHAIHSQNGGLRTRVERLVAWKPPDGEW 848
Query: 186 VKINTDGACKDG-SIAGCGGLIRGSEGEWLAGFSKFLGKCDAFIAELWGVLEGLRCAKRM 244
VK+NTDGA + +A GG++R G W GF+ +G C A +AELWGV GL A
Sbjct: 849 VKLNTDGASRGNLGLATTGGVLRDGIGHWCGGFALDIGVCSAPLAELWGVYYGLYMAWER 908
Query: 245 GFTAVELNVDSLVVVNIITS 264
FT VEL VDS +VV +T+
Sbjct: 909 RFTRVELEVDSELVVGFLTT 928
>pir||B96652 protein F23N19.5 [imported] - Arabidopsis thaliana
gi|6630448|gb|AAF19536.1| F23N19.5 [Arabidopsis
thaliana]
Length = 233
Score = 109 bits (273), Expect = 9e-23
Identities = 70/191 (36%), Positives = 94/191 (48%), Gaps = 11/191 (5%)
Query: 123 CHTIWNWRNKETHNDNYQRPLHAKNIIMIYVNDYHNAIAKFVIV-----TSQPRHLEEVG 177
C W WR +N + K V D +AK S+ R +V
Sbjct: 10 CWWGWKWRCGNVFGENRKCRDRVK-----LVKDIAQEVAKANNCGSGSNNSRSRMERQVR 64
Query: 178 WKEPAIRWVKINTDGACKDG-SIAGCGGLIRGSEGEWLAGFSKFLGKCDAFIAELWGVLE 236
W +P++ W K+NTDGA +A GG +R GEW GF+ +G+C A +AELWGV
Sbjct: 65 WSKPSLGWCKLNTDGASHGNPGLATAGGALRNEYGEWCFGFALNIGRCLAPLAELWGVYY 124
Query: 237 GLRCAKRMGFTAVELNVDSLVVVNIITSERESNASGRSLVQKIRKLLQMEWKVKVKHSYR 296
GL A G T +EL VDS +VV + + S+ LV+ L +W V++ H YR
Sbjct: 125 GLFMAWDRGITRLELEVDSEMVVGFLRTGIGSSHPLSFLVRMCHGFLSRDWIVRIGHVYR 184
Query: 297 EANRCADALAN 307
EANR AD LAN
Sbjct: 185 EANRLADGLAN 195
>ref|NP_198321.2| hypothetical protein [Arabidopsis thaliana]
Length = 175
Score = 108 bits (269), Expect = 3e-22
Identities = 58/134 (43%), Positives = 78/134 (57%), Gaps = 1/134 (0%)
Query: 175 EVGWKEPAIRWVKINTDGACKDG-SIAGCGGLIRGSEGEWLAGFSKFLGKCDAFIAELWG 233
+V W +P++ W K+NTDGA +A GG +R GEW GF+ +G+C A +AELWG
Sbjct: 4 QVRWSKPSLGWCKLNTDGASHGNPGLAIAGGALRNEYGEWCFGFALNIGRCSAPLAELWG 63
Query: 234 VLEGLRCAKRMGFTAVELNVDSLVVVNIITSERESNASGRSLVQKIRKLLQMEWKVKVKH 293
V GL A G T +EL VDS +VV + + S+ LV+ L +W V++ H
Sbjct: 64 VYYGLFMAWDRGITRLELEVDSEMVVGFLRTGIGSSHPLSFLVRMCHGFLSRDWIVRIGH 123
Query: 294 SYREANRCADALAN 307
YREANR AD LAN
Sbjct: 124 VYREANRLADELAN 137
>ref|NP_680149.1| reverse transcriptase-related [Arabidopsis thaliana]
Length = 594
Score = 104 bits (259), Expect = 4e-21
Identities = 69/229 (30%), Positives = 105/229 (45%), Gaps = 11/229 (4%)
Query: 16 KLWNIRAPERIKHFMSMLYHGRLSTNLRKHKMGIGNP-MCRFCHDEIESEIHVLRDCPKA 74
++W + PER + F+ ++ + + TN + + + + +C C ES IHVLRDCP
Sbjct: 333 RVWRVMVPERARIFLWLVGNQVVLTNAERVRRHMADSDVCPLCKGASESLIHVLRDCPAM 392
Query: 75 TALWLCVVDNAARTSFFEGDLSSWIKFNLFTNVYWNNNVEWRDFWATACHTIWNWRNKET 134
+W+ VV + FFE L W+ NL + W +A W WR
Sbjct: 393 MGIWMRVVPVMEQRRFFETSLLEWMYGNLKERSD-SERRSWPTLFALTVWWGWKWRCGYV 451
Query: 135 HNDNYQRPLHAKNIIMIYVNDYHNAIAKFVIVTSQPRH--LEE--VGWKEPAIRWVKINT 190
++ + ++ + + A + R L E + W++PA WV +NT
Sbjct: 452 FGEDSR----CRDRVKFLKSAVAEVEAAHLAANGDAREDVLVERMIAWRKPAEGWVTMNT 507
Query: 191 DGACKDG-SIAGCGGLIRGSEGEWLAGFSKFLGKCDAFIAELWGVLEGL 238
DGA A GG+IR G WL GF+ +G C A +AELWGV GL
Sbjct: 508 DGASHGNPGQATAGGVIRDEHGSWLVGFALNIGVCSAPLAELWGVYYGL 556
>ref|NP_680665.1| hypothetical protein [Arabidopsis thaliana]
Length = 214
Score = 104 bits (259), Expect = 4e-21
Identities = 62/154 (40%), Positives = 83/154 (53%), Gaps = 6/154 (3%)
Query: 160 IAKFVIVTSQPRHL---EE--VGWKEPAIRWVKINTDGACKDG-SIAGCGGLIRGSEGEW 213
+AK V PR L EE + W + W K+NTDGA + +A GG++R S GEW
Sbjct: 23 VAKAYAVVGVPRRLVGREERLISWSRLSEGWCKLNTDGASRGNPGLAAAGGVLRESNGEW 82
Query: 214 LAGFSKFLGKCDAFIAELWGVLEGLRCAKRMGFTAVELNVDSLVVVNIITSERESNASGR 273
GF+ +G C A +AELWGV GL A + +EL VDS VVV + + +
Sbjct: 83 RRGFAINIGICSAPLAELWGVYYGLFMAWECKVSQLELEVDSEVVVGFLRTGISESHPLS 142
Query: 274 SLVQKIRKLLQMEWKVKVKHSYREANRCADALAN 307
LV+ + +W V++ H YREANR AD LAN
Sbjct: 143 FLVRMCYGFISRDWIVRISHKYREANRLADGLAN 176
>ref|NP_175165.1| expressed protein [Arabidopsis thaliana]
Length = 259
Score = 90.9 bits (224), Expect = 5e-17
Identities = 49/123 (39%), Positives = 71/123 (56%), Gaps = 1/123 (0%)
Query: 186 VKINTDGACKDG-SIAGCGGLIRGSEGEWLAGFSKFLGKCDAFIAELWGVLEGLRCAKRM 244
+KINTDGA + +A GG+++ +EG W GFS +G+ A +AELWG GL A
Sbjct: 101 LKINTDGASRGNPGLATAGGVLQDNEGRWCGGFSLNIGRSSAPMAELWGAYYGLYLAWER 160
Query: 245 GFTAVELNVDSLVVVNIITSERESNASGRSLVQKIRKLLQMEWKVKVKHSYREANRCADA 304
+ +EL VDS +VV + + + LV+ + +W+V++ H YREANR AD
Sbjct: 161 KSSHIELEVDSEIVVGFLKTGISDHHPLSFLVRLCHGFISKDWRVRIFHVYREANRFADG 220
Query: 305 LAN 307
LAN
Sbjct: 221 LAN 223
>gb|AAG28895.1| F12A21.24 [Arabidopsis thaliana]
Length = 803
Score = 87.0 bits (214), Expect = 7e-16
Identities = 47/109 (43%), Positives = 60/109 (54%)
Query: 199 IAGCGGLIRGSEGEWLAGFSKFLGKCDAFIAELWGVLEGLRCAKRMGFTAVELNVDSLVV 258
+A GG+IR G W GF+ +G+C A +AELWGV G A T VEL VDS +V
Sbjct: 657 LATAGGVIRDGAGNWCGGFALNIGRCSAPLAELWGVYYGFYLAWTKALTRVELEVDSELV 716
Query: 259 VNIITSERESNASGRSLVQKIRKLLQMEWKVKVKHSYREANRCADALAN 307
V + + LV+ LL +W V++ H YREANR AD LAN
Sbjct: 717 VGFLKTGIGDQHPLSFLVRLCHGLLSKDWIVRITHVYREANRLADGLAN 765
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.323 0.137 0.450
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 581,128,687
Number of Sequences: 2540612
Number of extensions: 23860653
Number of successful extensions: 47526
Number of sequences better than 10.0: 550
Number of HSP's better than 10.0 without gapping: 62
Number of HSP's successfully gapped in prelim test: 488
Number of HSP's that attempted gapping in prelim test: 46990
Number of HSP's gapped (non-prelim): 609
length of query: 321
length of database: 863,360,394
effective HSP length: 128
effective length of query: 193
effective length of database: 538,162,058
effective search space: 103865277194
effective search space used: 103865277194
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 75 (33.5 bits)
Medicago: description of AC142394.3