
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC122729.1 - phase: 0 /pseudo
(119 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Sequences producing significant alignments:    Score (bits)    E Value
gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsi... 50 2e-05
gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsi... 47 8e-05
emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana] gi... 46 2e-04
gb|AAD23679.1| putative retroelement pol polyprotein [Arabidopsi... 45 4e-04
dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsi... 44 7e-04
emb|CAE04890.2| OSJNBa0042I15.12 [Oryza sativa (japonica cultiva... 44 0.001
emb|CAD41329.2| OJ991113_30.13 [Oryza sativa (japonica cultivar-... 44 0.001
ref|XP_470528.1| Putative retrovirus-related pol polyprotein [Or... 43 0.002
dbj|BAA97536.1| retroelement pol polyprotein-like [Arabidopsis t... 42 0.003
emb|CAB78169.1| putative retrotransposon [Arabidopsis thaliana] ... 42 0.005
gb|AAP54315.1| putative polyprotein [Oryza sativa (japonica cult... 40 0.017
ref|XP_475663.1| putative polyprotein [Oryza sativa (japonica cu... 39 0.022
emb|CAE01742.2| OSJNBb0056F09.5 [Oryza sativa (japonica cultivar... 39 0.029
gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-... 39 0.029
gb|AAP53866.1| putative polyprotein [Oryza sativa (japonica cult... 39 0.038
ref|NP_916849.1| retrovirus-related pol polyprotein from transpo... 38 0.050
emb|CAA32025.1| unnamed protein product [Nicotiana tabacum] gi|1... 38 0.065
ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cu... 37 0.085
emb|CAB77906.1| putative polyprotein [Arabidopsis thaliana] gi|4... 37 0.15
ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cu... 36 0.19
>gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301696|pir||F84486 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1356
Score = 49.7 bits (117), Expect = 2e-05
Identities = 38/117 (32%), Positives = 62/117 (52%), Gaps = 15/117 (12%)
Query: 4 EKMIEMADKATSLIVLCLVDKVSIDVKKETTMHGGHVGKVGVIVYDQVFDS*TIPKTKFL 63
E + E KA S IVL + D+V +KKE+T ++ D+++ S +P +
Sbjct: 65 EALEEKKKKARSAIVLSVTDRVLRKIKKEST------AAAMLLALDKLYMSKALPNRIYP 118
Query: 64 LIK-YEVK*SHH-------EEFNKILDNLENIEVHLEDEDKVILLL-CIPKSFESFR 111
K Y K S + +EF +I+ +LEN+ V + DED+ ILLL +PK+F+ +
Sbjct: 119 KQKLYSFKMSENLSVEGNIDEFLQIITDLENMNVIISDEDQAILLLTALPKAFDQLK 175
>gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301702|pir||E84601 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1333
Score = 47.4 bits (111), Expect = 8e-05
Identities = 40/120 (33%), Positives = 63/120 (52%), Gaps = 15/120 (12%)
Query: 1 MSQEKMIEMADKATSLIVLCLVDKVSIDVKKETTMHGGHVGKVGVIVYDQVFDS*TIPKT 60
+ +E + E KA S IVL + D+V +KKE + +GV+ D+++ S +P
Sbjct: 62 LRRELLEEKRRKARSAIVLSVTDRVLRKIKKEQSA----AAMLGVL--DKLYMSKALPNR 115
Query: 61 KFLLIK-YEVK*SHH-------EEFNKILDNLENIEVHLEDEDKVILLL-CIPKSFESFR 111
+ K Y K S + +EF +I+ +LEN V + DED+ ILLL +PK F+ R
Sbjct: 116 IYQKQKLYSFKMSENLSIEGNIDEFLRIIADLENTNVLVSDEDQAILLLMSLPKPFDQLR 175
>emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana]
gi|11278366|pir||T47492 copia-like polyprotein -
Arabidopsis thaliana
Length = 1363
Score = 45.8 bits (107), Expect = 2e-04
Identities = 39/117 (33%), Positives = 59/117 (50%), Gaps = 15/117 (12%)
Query: 4 EKMIEMADKATSLIVLCLVDKVSIDVKKETTMHGGHVGKVGVIVYDQVFDS*TIPKTKFL 63
E E KA S IVL + D+V +KKET+ + D+++ S +P +L
Sbjct: 65 EAFEEKKRKARSTIVLSVSDRVLRKIKKETS------AAAMLEALDRLYMSKALPNRIYL 118
Query: 64 LIK-YEVK*SHH-------EEFNKILDNLENIEVHLEDEDKVILLL-CIPKSFESFR 111
K Y K S + +EF I+ +LEN+ V + DED+ ILLL +PK F+ +
Sbjct: 119 KQKLYSFKMSENLSIEGNIDEFLHIVADLENLNVLVSDEDQAILLLMSLPKPFDQLK 175
>gb|AAD23679.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25412027|pir||G84599 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 838
Score = 45.1 bits (105), Expect = 4e-04
Identities = 36/127 (28%), Positives = 68/127 (53%), Gaps = 16/127 (12%)
Query: 2 SQEKMI-EMADKATSLIVLCLVDKVSIDVKKETTMHGGHVGKVGVIVYDQVFDS*TIPKT 60
+++K++ E KA S ++L L + V V KE T G + V D++F + ++P
Sbjct: 59 TEDKVLKEKRGKARSTVILSLGNHVLRKVIKEKTAAGM------IRVLDKLFMAKSLPNR 112
Query: 61 KFL---LIKYEVK*S-----HHEEFNKILDNLENIEVHLEDEDK-VILLLCIPKSFESFR 111
+L L Y++ S + +F K++ +LEN++V + DED+ ++LL+ +PK F+ +
Sbjct: 113 IYLKQRLYGYKMSDSMTIEENVNDFFKLISDLENVKVSVPDEDQAIVLLMSLPKQFDQLK 172
Query: 112 RPFSMAK 118
K
Sbjct: 173 DTLKYGK 179
>dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsis thaliana]
Length = 1342
Score = 44.3 bits (103), Expect = 7e-04
Identities = 35/120 (29%), Positives = 61/120 (50%), Gaps = 15/120 (12%)
Query: 8 EMADKATSLIVLCLVDKVSIDVKKETTMHGGHVGKVGVIVYDQVFDS*TIPKTKFLLIK- 66
E KA S I+L L + V V K+ T G + V DQ+F + ++P +L +
Sbjct: 74 EKRGKARSTIILSLGNNVLRKVIKQKTAAGM------IKVLDQLFMAKSLPNRIYLKQRL 127
Query: 67 YEVK*SHH-------EEFNKILDNLENIEVHLEDEDK-VILLLCIPKSFESFRRPFSMAK 118
Y K S + +F K++ +LEN++V + DED+ ++LL+ +P+ F+ + K
Sbjct: 128 YGYKMSENMTMEENVNDFFKLISDLENVKVVVPDEDQAIVLLMSLPRQFDQLKETLKYCK 187
>emb|CAE04890.2| OSJNBa0042I15.12 [Oryza sativa (japonica cultivar-group)]
Length = 432
Score = 43.9 bits (102), Expect = 0.001
Identities = 36/124 (29%), Positives = 65/124 (52%), Gaps = 21/124 (16%)
Query: 1 MSQEKMIEMADKATSLIVLCLVDKVSIDVKKETTMHGGHVGKVGVIVYDQV---FDS*TI 57
M + EM +A + I CL D++ V +ET+ I+++++ F S T+
Sbjct: 83 MEDDDCEEMQLQAAATIRRCLSDQIMYHVMEETSPK---------IIWEKLEAQFMSKTL 133
Query: 58 PKTKFLLIK-YEVK*-------SHHEEFNKILDNLENIEVHLEDEDKVILLLC-IPKSFE 108
++L K Y +K +H FN+++ +L ++V ++DEDK I+LLC +P+S+E
Sbjct: 134 TNKRYLKQKLYGLKMQEGADLTAHVNVFNQLVTDLVKMDVEVDDEDKAIVLLCSLPESYE 193
Query: 109 SFRR 112
R
Sbjct: 194 HVMR 197
>emb|CAD41329.2| OJ991113_30.13 [Oryza sativa (japonica cultivar-group)]
gi|50925386|ref|XP_472962.1| OJ991113_30.13 [Oryza
sativa (japonica cultivar-group)]
Length = 353
Score = 43.9 bits (102), Expect = 0.001
Identities = 34/115 (29%), Positives = 55/115 (47%), Gaps = 3/115 (2%)
Query: 1 MSQEKMIEMADKATSLIVLCLVDKVSIDVKKETTMHGGHVGKVGVIVYDQVFDS*TIPKT 60
M +K EM +A + I L L+D V V E T + + + + +
Sbjct: 1 MDADKWDEMKAQAAATIRLSLLDSVMYQVMDEKTPKEIWDKLASLYMSKSLTSKLYLKQQ 60
Query: 61 KFLLIKYEVK*--SHHEEFNKILDNLENIEVHLEDEDKVILLLC-IPKSFESFRR 112
+ L E H + FN+++ +L ++V L+DEDK I+LLC +P S+E RR
Sbjct: 61 SYGLQMQEESDLRKHVDIFNQLVVDLRKLDVKLDDEDKAIILLCSLPPSYEHLRR 115
>ref|XP_470528.1| Putative retrovirus-related pol polyprotein [Oryza sativa (japonica
cultivar-group)] gi|27436748|gb|AAO13467.1| Putative
retrovirus-related pol polyprotein [Oryza sativa
(japonica cultivar-group)]
Length = 556
Score = 43.1 bits (100), Expect = 0.002
Identities = 35/112 (31%), Positives = 57/112 (50%), Gaps = 5/112 (4%)
Query: 1 MSQEKMIEMADKATSLIVLCLVDKVSIDVKKETTMHGGHVGKVGVIVYDQVFDS*TIPKT 60
M K +EM +AT++I L L D V V E T K+ + + S K
Sbjct: 179 MDVGKWVEMKAQATAIIRLSLSDFVMYQVMDEKTPKEIW-DKLASLYMSKSLTSKLYLKQ 237
Query: 61 KFLLIKYEVK*S---HHEEFNKILDNLENIEVHLEDEDKVILLLC-IPKSFE 108
+ ++ + + H + FN+++ +L ++V L+DEDK I+LLC +P SFE
Sbjct: 238 QLYGLQMQEESDLRKHVDVFNQLVVDLSKLDVKLDDEDKAIILLCSLPPSFE 289
>dbj|BAA97536.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
Length = 1338
Score = 42.4 bits (98), Expect = 0.003
Identities = 33/113 (29%), Positives = 60/113 (52%), Gaps = 15/113 (13%)
Query: 8 EMADKATSLIVLCLVDKVSIDVKKETTMHGGHVGKVGVIVYDQVFDS*TIPKTKFLLIK- 66
E +KA S+I+L + D + ++ E T G + V D+++ S + L K
Sbjct: 83 EKENKARSVIILSVADNILRRIRTEETAAGM------ISVLDKLYLSDPLSSRISLKRKL 136
Query: 67 YEVK*SHH-------EEFNKILDNLENIEVHLEDEDKV-ILLLCIPKSFESFR 111
+E K S + E+F +I+++LE ++V++ DEDK +LLL +P+ E +
Sbjct: 137 FEFKMSENKAVEENIEDFFRIVEDLEKLDVYVSDEDKAFMLLLSLPRKLEQLK 189
>emb|CAB78169.1| putative retrotransposon [Arabidopsis thaliana]
gi|4539406|emb|CAB40039.1| putative retrotransposon
[Arabidopsis thaliana] gi|7444416|pir||T04181
hypothetical protein F7L13.40 - Arabidopsis thaliana
Length = 1230
Score = 41.6 bits (96), Expect = 0.005
Identities = 37/115 (32%), Positives = 56/115 (48%), Gaps = 15/115 (13%)
Query: 6 MIEMADKATSLIVLCLVDKVSIDVKKETTMHGGHVGKVGVIVYDQVFDS*TIPKTKFLLI 65
M E KA S IVL + D+V KKE T + D+++ S +P +L
Sbjct: 65 MEEKRQKARSTIVLSVSDQVLRKSKKEKTAPSM------LEALDKLYMSKALPNRIYLKQ 118
Query: 66 K-YEVK*SHH-------EEFNKILDNLENIEVHLEDEDKVILLL-CIPKSFESFR 111
K Y K + +EF +++ +LEN V + DED+ ILLL +PK F+ +
Sbjct: 119 KLYSYKMQENLSVEGNIDEFLRLIADLENTNVLVSDEDQAILLLMSLPKQFDQLK 173
>gb|AAP54315.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|37535452|ref|NP_922028.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
gi|22094359|gb|AAM91886.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1280
Score = 39.7 bits (91), Expect = 0.017
Identities = 36/104 (34%), Positives = 54/104 (51%), Gaps = 5/104 (4%)
Query: 12 KATSLIVLCLVDKVSIDVKKETTMHGGHVGKVGVIVYDQVFDS*TIPKTKFLLIKYEVK* 71
KA S I L L + + +V KE T G + K+ I + S K K L K +
Sbjct: 92 KAMSYIHLHLSNNILQEVLKEETAAGLWL-KLEQICMTKDLTSKMHLKQKLFLHKLQDDG 150
Query: 72 S---HHEEFNKILDNLENIEVHLEDEDKVILLLC-IPKSFESFR 111
S H F +I+ +LE+IEV ++ED ++LLC +P S+ +FR
Sbjct: 151 SVMDHLSTFKEIVADLESIEVKYDEEDLGLILLCSLPSSYANFR 194
>ref|XP_475663.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|48475188|gb|AAT44257.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1211
Score = 39.3 bits (90), Expect = 0.022
Identities = 35/104 (33%), Positives = 53/104 (50%), Gaps = 5/104 (4%)
Query: 12 KATSLIVLCLVDKVSIDVKKETTMHGGHVGKVGVIVYDQVFDS*TIPKTKFLLIKYEVK* 71
K S I L L + + +V KE T G + K+ I + S K K L K +
Sbjct: 57 KTMSYIHLHLSNNILQEVLKEETAAGLWL-KLEQICMTKDLTSKMHLKQKLFLHKLQDDG 115
Query: 72 S---HHEEFNKILDNLENIEVHLEDEDKVILLLC-IPKSFESFR 111
S H F KI+ +LE++EV ++ED ++LLC +P S+ +FR
Sbjct: 116 SVMDHLSAFKKIVADLESMEVKYDEEDLCLILLCSLPSSYANFR 159
>emb|CAE01742.2| OSJNBb0056F09.5 [Oryza sativa (japonica cultivar-group)]
gi|50922273|ref|XP_471497.1| OSJNBb0056F09.5 [Oryza
sativa (japonica cultivar-group)]
Length = 371
Score = 38.9 bits (89), Expect = 0.029
Identities = 17/37 (45%), Positives = 28/37 (74%), Gaps = 1/37 (2%)
Query: 73 HHEEFNKILDNLENIEVHLEDEDKVILLLC-IPKSFE 108
H + FN+++ +L ++V L+DEDK I+LLC +P S+E
Sbjct: 142 HVDVFNQLVVDLSKLDVKLDDEDKAIILLCSLPPSYE 178
>gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza
sativa (japonica cultivar-group)]
Length = 1373
Score = 38.9 bits (89), Expect = 0.029
Identities = 19/41 (46%), Positives = 29/41 (70%), Gaps = 1/41 (2%)
Query: 72 SHHEEFNKILDNLENIEVHLEDEDKVILLLC-IPKSFESFR 111
+H EF KI+ +L ++EV +DED +LLLC +P S+ +FR
Sbjct: 117 THMAEFKKIVADLVSMEVKYDDEDLGLLLLCSLPNSYANFR 157
>gb|AAP53866.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|37534554|ref|NP_921579.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 382
Score = 38.5 bits (88), Expect = 0.038
Identities = 17/37 (45%), Positives = 28/37 (74%), Gaps = 1/37 (2%)
Query: 73 HHEEFNKILDNLENIEVHLEDEDKVILLLC-IPKSFE 108
H + FN+++ +L ++V L+DEDK I+LLC +P S+E
Sbjct: 122 HVDVFNQLVVDLSKLDVKLDDEDKAIILLCSLPLSYE 158
>ref|NP_916849.1| retrovirus-related pol polyprotein from transposon TNT 1-94-like
[Oryza sativa (japonica cultivar-group)]
Length = 425
Score = 38.1 bits (87), Expect = 0.050
Identities = 17/37 (45%), Positives = 28/37 (74%), Gaps = 1/37 (2%)
Query: 73 HHEEFNKILDNLENIEVHLEDEDKVILLLC-IPKSFE 108
H + FN+++ +L ++V+L DEDK I+LLC +P S+E
Sbjct: 122 HVDVFNQLVVDLSKLDVNLYDEDKAIILLCSLPPSYE 158
>emb|CAA32025.1| unnamed protein product [Nicotiana tabacum]
gi|130582|sp|P10978|POLX_TOBAC Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
; Reverse transcriptase ; Endonuclease]
Length = 1328
Score = 37.7 bits (86), Expect = 0.065
Identities = 32/112 (28%), Positives = 53/112 (46%), Gaps = 3/112 (2%)
Query: 1 MSQEKMIEMADKATSLIVLCLVDKVSIDVKKETTMHGGHVGKVGVIVYDQVFDS*TIPKT 60
M E ++ ++A S I L L D V ++ E T G + + + + + K
Sbjct: 47 MKAEDWADLDERAASAIRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQ 106
Query: 61 KFLLIKYEVK*--SHHEEFNKILDNLENIEVHLEDEDKVILLL-CIPKSFES 109
+ L E SH FN ++ L N+ V +E+EDK ILLL +P S+++
Sbjct: 107 LYALHMSEGTNFLSHLNVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDN 158
>ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|48475213|gb|AAT44282.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1243
Score = 37.4 bits (85), Expect = 0.085
Identities = 34/104 (32%), Positives = 54/104 (51%), Gaps = 5/104 (4%)
Query: 12 KATSLIVLCLVDKVSIDVKKETTMHGGHVGKVGVIVYDQVFDS*TIPKTKFLLIKYEVK* 71
KA S I L L + + +V KE T G + K+ I + S K K L K +
Sbjct: 57 KAISYIHLHLSNNILQEVLKEETAAGLWL-KLEQICMTKDLTSKMHLKQKLFLHKLQDDE 115
Query: 72 S---HHEEFNKILDNLENIEVHLEDEDKVILLLC-IPKSFESFR 111
S H F +I+ +LE++EV +++D ++LLC +P S+ +FR
Sbjct: 116 SVMDHLSAFKEIVADLESMEVKYDEDDLGLILLCSLPSSYANFR 159
>emb|CAB77906.1| putative polyprotein [Arabidopsis thaliana]
gi|4982475|gb|AAD36943.1| putative polyprotein
[Arabidopsis thaliana] gi|25407267|pir||D85055 probable
polyprotein [imported] - Arabidopsis thaliana
Length = 778
Score = 36.6 bits (83), Expect = 0.15
Identities = 26/72 (36%), Positives = 41/72 (56%), Gaps = 9/72 (12%)
Query: 49 DQVFDS*TIPKTKFLLIK-YEVK*SHH-------EEFNKILDNLENIEVHLEDEDKVILL 100
D+++ S +P +L K Y K S + +EF I+ +LEN+ V + DED+ ILL
Sbjct: 73 DRLYMSKALPNQIYLKQKLYRFKMSENLSMEGNIDEFLHIVADLENLNVLVSDEDQTILL 132
Query: 101 L-CIPKSFESFR 111
L +PKSF+ +
Sbjct: 133 LMSLPKSFDQLK 144
>ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|53370655|gb|AAU89150.1| integrase core domain
containing protein [Oryza sativa (japonica
cultivar-group)] gi|40538906|gb|AAR87163.1| putative
polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1322
Score = 36.2 bits (82), Expect = 0.19
Identities = 17/48 (35%), Positives = 31/48 (64%), Gaps = 1/48 (2%)
Query: 72 SHHEEFNKILDNLENIEVHLEDEDKVILLLC-IPKSFESFRRPFSMAK 118
+H F +I+ +L ++EV +DED +LLLC +P S+ +FR +++
Sbjct: 121 NHISVFKEIVADLVSMEVQFDDEDLGLLLLCSLPSSYANFRHTILLSR 168
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.330 0.144 0.415
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 175,904,003
Number of Sequences: 2540612
Number of extensions: 6134527
Number of successful extensions: 16847
Number of sequences better than 10.0: 32
Number of HSP's better than 10.0 without gapping: 16
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 16814
Number of HSP's gapped (non-prelim): 33
length of query: 119
length of database: 863,360,394
effective HSP length: 95
effective length of query: 24
effective length of database: 622,002,254
effective search space: 14928054096
effective search space used: 14928054096
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.8 bits)
S2: 68 (30.8 bits)
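
The statistics above are internally consistent, and the arithmetic linking them can be checked with a short sketch. It assumes the standard Karlin-Altschul normalization, bits = (Lambda * S - ln K) / ln 2, and E = (effective search space) * 2^(-bits). All constants are copied from this report; the function names are illustrative and not part of BLAST. Lambda and K are printed to only three significant digits, so the results are approximate.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 119
DB_LETTERS = 863_360_394
DB_SEQS = 2_540_612
HSP_LEN = 95                    # "effective HSP length"
LAMBDA_U, K_U = 0.330, 0.144    # ungapped Lambda, K
LAMBDA_G, K_G = 0.267, 0.0410   # gapped Lambda, K


def bit_score(raw, lam, k):
    """Normalize a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


def dropoff_bits(raw, lam):
    """Convert an X drop-off to bits; it is a score difference, so no K term."""
    return lam * raw / math.log(2)


# Effective lengths: subtract the effective HSP length from the query
# and once per sequence from the database.
eff_query = QUERY_LEN - HSP_LEN
eff_db = DB_LETTERS - DB_SEQS * HSP_LEN
search_space = eff_query * eff_db


def expect(raw, lam=LAMBDA_G, k=K_G):
    """Approximate E-value of a gapped alignment with the given raw score."""
    return search_space * 2 ** -bit_score(raw, lam, k)


if __name__ == "__main__":
    print("effective query length:", eff_query)
    print("effective database length:", eff_db)
    print("effective search space:", search_space)
    print("top hit bits:", round(bit_score(117, LAMBDA_G, K_G), 1))
    print("top hit E-value: %.1e" % expect(117))
```

With the gapped parameters, the raw score 117 of the first hit comes out near the reported 49.7 bits, and its E-value has the same order of magnitude as the reported 2e-05. The X1 and S1 lines use the ungapped Lambda and K, while X2, X3 and S2 use the gapped pair.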