
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0102b.4
(278 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
emb|CAB78007.1| putative protein [Arabidopsis thaliana] gi|73210... 42 0.015
gb|AAB84339.1| putative Ta11-like non-LTR retroelement protein [... 36 1.0
ref|NP_598559.1| hypothetical protein LOC97827 [Mus musculus] gi... 35 1.8
ref|XP_343087.2| PREDICTED: similar to Expressed sequence C85658... 35 1.8
gb|AAP49302.1| acetylcholinesterase [Rhipicephalus sanguineus] 35 1.8
gb|AAP49301.1| acetylcholinesterase [Rhipicephalus sanguineus] 35 1.8
ref|NP_060669.1| hypothetical protein LOC55218 [Homo sapiens] gi... 35 3.1
ref|XP_510030.1| PREDICTED: chromosome 14 open reading frame 114... 35 3.1
emb|CAD39094.2| hypothetical protein [Homo sapiens] 35 3.1
gb|AAS53782.1| AFR411Cp [Ashbya gossypii ATCC 10895] gi|45198929... 34 4.0
gb|AAF18653.1| hypothetical protein [Arabidopsis thaliana] gi|15... 34 4.0
ref|XP_547872.1| PREDICTED: similar to Protein C14orf114 [Canis ... 34 5.2
gb|AAD37019.2| putative non-LTR retrolelement reverse transcript... 33 6.8
gb|EAL30092.1| GA14442-PA [Drosophila pseudoobscura] 33 6.8
pir||E84487 hypothetical protein At2g07670 [imported] - Arabidop... 33 6.8
ref|YP_080733.1| ribonuclease R [Bacillus licheniformis ATCC 145... 33 8.9
gb|AAK78180.1| Uncharacterized membrane protein, ortholog YYAS B... 33 8.9
>emb|CAB78007.1| putative protein [Arabidopsis thaliana] gi|7321071|emb|CAB82118.1|
putative protein [Arabidopsis thaliana]
gi|15236612|ref|NP_192622.1| hypothetical protein
[Arabidopsis thaliana] gi|25371845|pir||G85088
hypothetical protein AT4g08820 [imported] - Arabidopsis
thaliana
Length = 404
Score = 42.4 bits (98), Expect = 0.015
Identities = 44/195 (22%), Positives = 83/195 (42%), Gaps = 21/195 (10%)
Query: 100 LKPWSSSFAP-----INRVTWVRCTGIPLHMWTMDCFKHLLLQVGDVIKIDEDTSEFVVL 154
++ WS +F P + WVR T IP++ + + + +G ++K+D +T F
Sbjct: 94 VQDWSPNFDPLRDDIVTTPVWVRLTNIPVNYYHRCLLEEIARGLGKLLKVDLNTITFGRG 153
Query: 155 EFSRLLVRTTTLNFITLARKILIKGTTYTISI--IEEGEALLHPNCCCREESSVEDDVFS 212
F+R+ + L +LI G Y ++ +EEG ++ + R E + VF+
Sbjct: 154 RFARVCIEVNLAK--PLKGTVLINGDRYFVAYEEVEEGFTVVRQS-GIRVEKPAQKMVFA 210
Query: 213 GWSLNGVTVTP----------ELSESGGEDDDGLWPVETKSQHDLQGSPNHSLGDNKNPR 262
S G + E+S G ++ + E L+GS + KN R
Sbjct: 211 AGSSGGRSNRSLRELPKNQGVEISNRFGGLEEDMVSAEIGEVAILEGSNKENKYKGKNMR 270
Query: 263 SNTGAVNRLLSSRLN 277
+G V ++ +++N
Sbjct: 271 KESG-VTQVKEAQVN 284
>gb|AAB84339.1| putative Ta11-like non-LTR retroelement protein [Arabidopsis
thaliana] gi|7487606|pir||T00813 probable Ta11-like
non-LTR retroelement protein [imported] - Arabidopsis
thaliana gi|15227381|ref|NP_181688.1| hypothetical
protein [Arabidopsis thaliana]
Length = 418
Score = 36.2 bits (82), Expect = 1.0
Identities = 33/154 (21%), Positives = 66/154 (42%), Gaps = 12/154 (7%)
Query: 67 LRDDRVLLTNHQESNLVELLQGSKGWRSEFVDNLKPWSSS-------FAPINRVTWVRCT 119
L +R E +L+E+L+ + E+ ++ W F P+ W+R
Sbjct: 72 LSQERFQFFFKSEDDLLEILKTGVWTQDEWCVVMERWIEKSTEEYLMFLPV----WMRLR 127
Query: 120 GIPLHMWTMDCFKHLLLQVGDVIKIDEDTSEFVVLEFSRLLVRTTTLNFITLARKILIKG 179
IP++ +T D K + VG V+K++ D + ++ R+ V N + ++I +
Sbjct: 128 NIPVNYYTQDTIKKIASCVGKVLKVELDLEKSQAQDYIRVQVIIDVRNPLRNFKEIQLP- 186
Query: 180 TTYTISIIEEGEALLHPNCCCREESSVEDDVFSG 213
T +S+ + E + C+ + + D SG
Sbjct: 187 TGEIVSVTFDYERIRKRCFLCQRLTHEKGDCDSG 220
>ref|NP_598559.1| hypothetical protein LOC97827 [Mus musculus]
gi|17391198|gb|AAH18508.1| Expressed sequence C85658
[Mus musculus] gi|37999725|sp|Q8VEG4|CN114_MOUSE Protein
C14orf114 homolog
Length = 496
Score = 35.4 bits (80), Expect = 1.8
Identities = 30/98 (30%), Positives = 44/98 (44%), Gaps = 5/98 (5%)
Query: 7 EGSKVDRGRMTALELWSGPEIPKEEWMLRSLVREMKNLELIPS--LREAFMVEGHITIQV 64
E +V G L S P KEE L +RE N ++I L EA +E I +
Sbjct: 373 ERRQVRSGARALLNAESLPAHRKEE--LLHALREFYNTDIITEEMLHEAASLETRIYNE- 429
Query: 65 RYLRDDRVLLTNHQESNLVELLQGSKGWRSEFVDNLKP 102
Y+ ++ H E L L+Q WR F+D+++P
Sbjct: 430 SYIPHGLKVVQRHTEGGLRSLMQLESRWRQHFLDSMQP 467
>ref|XP_343087.2| PREDICTED: similar to Expressed sequence C85658 [Rattus norvegicus]
Length = 648
Score = 35.4 bits (80), Expect = 1.8
Identities = 29/98 (29%), Positives = 45/98 (45%), Gaps = 5/98 (5%)
Query: 7 EGSKVDRGRMTALELWSGPEIPKEEWMLRSLVREMKNLELIPS--LREAFMVEGHITIQV 64
E +V G L S P KEE L +RE N +++ L+EA +E I +
Sbjct: 525 ERRQVRSGARALLNAESLPAHRKEE--LLHALREFYNTDIVTEEMLQEAASLETRIYNE- 581
Query: 65 RYLRDDRVLLTNHQESNLVELLQGSKGWRSEFVDNLKP 102
Y+ ++ H E L L+Q WR F+D+++P
Sbjct: 582 SYIPHGLKVVQRHTEGGLRSLMQLESRWRQHFLDSMQP 619
>gb|AAP49302.1| acetylcholinesterase [Rhipicephalus sanguineus]
Length = 587
Score = 35.4 bits (80), Expect = 1.8
Identities = 20/67 (29%), Positives = 28/67 (40%)
Query: 190 GEALLHPNCCCREESSVEDDVFSGWSLNGVTVTPELSESGGEDDDGLWPVETKSQHDLQG 249
GE L +C E+ + + W+ T P LSE G + WP T S+ L
Sbjct: 496 GEPLNDTHCYSEEDKVLSRRIMRYWANFAKTGNPNLSEGGSSESITNWPERTDSKRHLVL 555
Query: 250 SPNHSLG 256
N S+G
Sbjct: 556 DVNESVG 562
>gb|AAP49301.1| acetylcholinesterase [Rhipicephalus sanguineus]
Length = 593
Score = 35.4 bits (80), Expect = 1.8
Identities = 20/67 (29%), Positives = 28/67 (40%)
Query: 190 GEALLHPNCCCREESSVEDDVFSGWSLNGVTVTPELSESGGEDDDGLWPVETKSQHDLQG 249
GE L +C E+ + + W+ T P LSE G + WP T S+ L
Sbjct: 502 GEPLNDTHCYSEEDKVLSRRIMRYWANFAKTGNPNLSEGGSSESITNWPERTDSKRHLVL 561
Query: 250 SPNHSLG 256
N S+G
Sbjct: 562 DVNESVG 568
>ref|NP_060669.1| hypothetical protein LOC55218 [Homo sapiens]
gi|7022954|dbj|BAA91781.1| unnamed protein product [Homo
sapiens] gi|12805017|gb|AAH01962.1| Chromosome 14 open
reading frame 114 [Homo sapiens]
gi|37999798|sp|Q9NVH0|CN114_HUMAN Protein C14orf114
Length = 496
Score = 34.7 bits (78), Expect = 3.1
Identities = 28/98 (28%), Positives = 46/98 (46%), Gaps = 5/98 (5%)
Query: 7 EGSKVDRGRMTALELWSGPEIPKEEWMLRSLVREMKNLELIPS--LREAFMVEGHITIQV 64
E +V G L S P KEE L +RE N +++ L+EA +E I+ +
Sbjct: 373 ERRQVRSGARALLNAESLPTQRKEE--LLQALREFYNTDVVTEEMLQEAASLETRISNE- 429
Query: 65 RYLRDDRVLLTNHQESNLVELLQGSKGWRSEFVDNLKP 102
Y+ ++ H + L L+Q WR F+D+++P
Sbjct: 430 NYVPHGLKVVQCHSQGGLRSLMQLESRWRQHFLDSMQP 467
>ref|XP_510030.1| PREDICTED: chromosome 14 open reading frame 114 [Pan troglodytes]
Length = 631
Score = 34.7 bits (78), Expect = 3.1
Identities = 28/98 (28%), Positives = 46/98 (46%), Gaps = 5/98 (5%)
Query: 7 EGSKVDRGRMTALELWSGPEIPKEEWMLRSLVREMKNLELIPS--LREAFMVEGHITIQV 64
E +V G L S P KEE L +RE N +++ L+EA +E I+ +
Sbjct: 508 ERRQVRSGARALLNAESLPAHRKEE--LLQALREFYNTDVVTEEMLQEAASLETRISNE- 564
Query: 65 RYLRDDRVLLTNHQESNLVELLQGSKGWRSEFVDNLKP 102
Y+ ++ H + L L+Q WR F+D+++P
Sbjct: 565 NYVPHGLKVVQCHSQGGLRSLMQLESRWRQHFLDSMQP 602
>emb|CAD39094.2| hypothetical protein [Homo sapiens]
Length = 496
Score = 34.7 bits (78), Expect = 3.1
Identities = 28/98 (28%), Positives = 46/98 (46%), Gaps = 5/98 (5%)
Query: 7 EGSKVDRGRMTALELWSGPEIPKEEWMLRSLVREMKNLELIPS--LREAFMVEGHITIQV 64
E +V G L S P KEE L +RE N +++ L+EA +E I+ +
Sbjct: 373 ERRQVRSGARALLNAESLPTQRKEE--LLQALREFYNTDVVTEEMLQEAASLETRISNE- 429
Query: 65 RYLRDDRVLLTNHQESNLVELLQGSKGWRSEFVDNLKP 102
Y+ ++ H + L L+Q WR F+D+++P
Sbjct: 430 NYVPHGLKVVQCHSQGGLRSLMQLESRWRQHFLDSMQP 467
>gb|AAS53782.1| AFR411Cp [Ashbya gossypii ATCC 10895] gi|45198929|ref|NP_985958.1|
AFR411Cp [Eremothecium gossypii]
Length = 283
Score = 34.3 bits (77), Expect = 4.0
Identities = 30/121 (24%), Positives = 54/121 (43%), Gaps = 13/121 (10%)
Query: 159 LLVRTTTLNFITLARK----ILIKGTTYTISIIE-EGEALLHPNCCCREESS---VEDDV 210
L+ ++ L+ + LAR+ I++ G T+++ EG+ ++P C S+ ++DD
Sbjct: 101 LVPKSDPLSLLILARQLDVDIMLWGGTHSVEAYTLEGKFFINPGSCTGAFSTDWPLQDDA 160
Query: 211 FSGWS-----LNGVTVTPELSESGGEDDDGLWPVETKSQHDLQGSPNHSLGDNKNPRSNT 265
G LNG P+ +E D++GL + S + + LG N R
Sbjct: 161 LQGEGSPSAKLNGAATAPQPTEGEAGDENGLQQELSTSSRTPREAEARVLGSPNNRRELE 220
Query: 266 G 266
G
Sbjct: 221 G 221
>gb|AAF18653.1| hypothetical protein [Arabidopsis thaliana]
gi|15226168|ref|NP_178215.1| hypothetical protein
[Arabidopsis thaliana] gi|25410873|pir||A84420
hypothetical protein At2g01050 [imported] - Arabidopsis
thaliana
Length = 515
Score = 34.3 bits (77), Expect = 4.0
Identities = 25/104 (24%), Positives = 45/104 (43%), Gaps = 9/104 (8%)
Query: 100 LKPWSSSFAP-----INRVTWVRCTGIPLHMWTMDCFKHLLLQVGDVIKIDEDTSEFVVL 154
++ WSS F P + WVR + IP + + + +G +K+D +T F
Sbjct: 147 VQDWSSRFDPLRDDIVTTPVWVRLSNIPYNYYHRCLLMEIARGLGRPLKVDMNTINFDKG 206
Query: 155 EFSRLLVRTTTLNFITLARKILIKGTTYTISIIEEGEALLHPNC 198
F+R+ + L +LI G Y ++ EG + + +C
Sbjct: 207 RFARVCIEVNLAK--PLKGTVLINGDRYFVAY--EGLSKICSSC 246
>ref|XP_547872.1| PREDICTED: similar to Protein C14orf114 [Canis familiaris]
Length = 624
Score = 33.9 bits (76), Expect = 5.2
Identities = 28/98 (28%), Positives = 45/98 (45%), Gaps = 5/98 (5%)
Query: 7 EGSKVDRGRMTALELWSGPEIPKEEWMLRSLVREMKNLELIPS--LREAFMVEGHITIQV 64
E +V G L S P KEE L +RE N + + L+EA +E I+ +
Sbjct: 501 ERRQVRSGARALLNAESLPAHRKEE--LLQALREFYNTDTVTDEMLQEAASLETRISNE- 557
Query: 65 RYLRDDRVLLTNHQESNLVELLQGSKGWRSEFVDNLKP 102
Y+ ++ H + L L+Q WR F+D+++P
Sbjct: 558 NYVPHGLKVVQCHSQGGLRSLMQLESRWRQHFLDSMQP 595
>gb|AAD37019.2| putative non-LTR retrolelement reverse transcriptase [Arabidopsis
thaliana]
Length = 855
Score = 33.5 bits (75), Expect = 6.8
Identities = 26/104 (25%), Positives = 44/104 (42%), Gaps = 9/104 (8%)
Query: 100 LKPWSSSFAPI-NRVT----WVRCTGIPLHMWTMDCFKHLLLQVGDVIKIDEDTSEFVVL 154
++ WS F P+ + +T WVR IPL ++ + +G +K+D T
Sbjct: 122 VQAWSPEFDPLRDEITTTPIWVRLMNIPLSLYHTSILMGIAGSLGKPVKVDMTTLHVERA 181
Query: 155 EFSRLLVRTTTLNFITLARKILIKGTTYTISIIEEGEALLHPNC 198
F+R+ + L +L+ G Y +S EG A + C
Sbjct: 182 RFARMCIEVDLAK--PLKGTLLLNGERYFVSY--EGLANICSRC 221
>gb|EAL30092.1| GA14442-PA [Drosophila pseudoobscura]
Length = 1158
Score = 33.5 bits (75), Expect = 6.8
Identities = 26/71 (36%), Positives = 34/71 (47%), Gaps = 4/71 (5%)
Query: 201 REESSVEDDVFSGWSLNGVTVTPELSESGGEDDDGLWPVETKSQHDLQGSPNHSLGDNKN 260
R ++S S S +GV P E G E D+ PV K+ H L+GS L N N
Sbjct: 527 RVKASPSPRTASPASSDGVRPLPTTDEDGDEVDENKTPVNKKT-HALKGS---LLPPNTN 582
Query: 261 PRSNTGAVNRL 271
P +NT + RL
Sbjct: 583 PNANTLSSTRL 593
>pir||E84487 hypothetical protein At2g07670 [imported] - Arabidopsis thaliana
Length = 913
Score = 33.5 bits (75), Expect = 6.8
Identities = 26/104 (25%), Positives = 44/104 (42%), Gaps = 9/104 (8%)
Query: 100 LKPWSSSFAPI-NRVT----WVRCTGIPLHMWTMDCFKHLLLQVGDVIKIDEDTSEFVVL 154
++ WS F P+ + +T WVR IPL ++ + +G +K+D T
Sbjct: 122 VQAWSPEFDPLRDEITTTPIWVRLMNIPLSLYHTSILMGIAGSLGKPVKVDMTTLHVERA 181
Query: 155 EFSRLLVRTTTLNFITLARKILIKGTTYTISIIEEGEALLHPNC 198
F+R+ + L +L+ G Y +S EG A + C
Sbjct: 182 RFARMCIEVDLAK--PLKGTLLLNGERYFVSY--EGLANICSRC 221
>ref|YP_080733.1| ribonuclease R [Bacillus licheniformis ATCC 14580]
gi|52005153|gb|AAU25095.1| ribonuclease R [Bacillus
licheniformis ATCC 14580] gi|52787326|ref|YP_093155.1|
Rnr [Bacillus licheniformis ATCC 14580]
gi|52349828|gb|AAU42462.1| Rnr [Bacillus licheniformis
DSM 13]
Length = 767
Score = 33.1 bits (74), Expect = 8.9
Identities = 36/107 (33%), Positives = 52/107 (47%), Gaps = 15/107 (14%)
Query: 135 LLQVGDVIKIDEDTSEFVVLEFSRLLVRTTTLNFITLARKILIKGTTYTISIIEEGEALL 194
+L++ D + E V LE L+VRT + + + LIKG +S +G A +
Sbjct: 30 MLEITDSDEYKELVKALVTLEEKGLVVRTRSNRYGLPEKMNLIKG---KVSAHAKGFAFV 86
Query: 195 HPNCCCREESSVEDDVF-----SGWSLNGVTVTPELS-ESGGEDDDG 235
P E+SS+ DDVF ++NG TV LS ESGG +G
Sbjct: 87 LP-----EDSSL-DDVFIPPSELNTAMNGDTVLVRLSTESGGTKKEG 127
>gb|AAK78180.1| Uncharacterized membrane protein, ortholog YYAS B.subtilis
[Clostridium acetobutylicum ATCC 824]
gi|15893491|ref|NP_346840.1| Uncharacterized membrane
protein, ortholog YYAS B.subtilis [Clostridium
acetobutylicum ATCC 824] gi|25495808|pir||A96924
uncharacterized membrane protein, ortholog YYAS B.
subtilis [imported] - Clostridium acetobutylicum
Length = 227
Score = 33.1 bits (74), Expect = 8.9
Identities = 21/80 (26%), Positives = 40/80 (49%), Gaps = 1/80 (1%)
Query: 108 APINRVTWVRCTGIPLHMWTMDCFKHLLLQVGDVIKIDEDTSEFVVLEFSRLLVRTTTLN 167
+PI+ V + G PL M T ++ L VG VI + +D + +++ LV ++
Sbjct: 35 SPISSVPYTLSMGFPLTMGTFTFILNMFLIVGQVIFLGKDFKKVQLMQIPISLVFGYFID 94
Query: 168 FITLARKILIKGTTYTISII 187
F T++ ++ TTY ++
Sbjct: 95 F-TMSLLSVLNPTTYVFKLV 113
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.317 0.135 0.406
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 478,557,140
Number of Sequences: 2540612
Number of extensions: 19491805
Number of successful extensions: 51257
Number of sequences better than 10.0: 17
Number of HSP's better than 10.0 without gapping: 2
Number of HSP's successfully gapped in prelim test: 15
Number of HSP's that attempted gapping in prelim test: 51255
Number of HSP's gapped (non-prelim): 17
length of query: 278
length of database: 863,360,394
effective HSP length: 126
effective length of query: 152
effective length of database: 543,243,282
effective search space: 82572978864
effective search space used: 82572978864
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 74 (33.1 bits)
Lotus: description of TM0102b.4