Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC004643A_C01 KMC004643A_c01
(1006 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_186869.2| unknown protein; protein id: At3g02200.1, suppo... 326 3e-88
emb|CAD39334.1| OSJNBa0094O15.2 [Oryza sativa (japonica cultivar... 314 1e-84
ref|NP_197065.1| putative protein; protein id: At5g15610.1, supp... 312 6e-84
gb|AAF02117.1|AC009755_10 unknown protein [Arabidopsis thaliana]... 261 1e-68
gb|EAA00285.1| agCP10016 [Anopheles gambiae str. PEST] 119 7e-26
>ref|NP_186869.2| unknown protein; protein id: At3g02200.1, supported by cDNA:
gi_17065291, supported by cDNA: gi_20259989 [Arabidopsis
thaliana] gi|17065292|gb|AAL32800.1| Unknown protein
[Arabidopsis thaliana] gi|20259990|gb|AAM13342.1| unknown
protein [Arabidopsis thaliana]
Length = 417
Score = 326 bits (836), Expect = 3e-88
Identities = 161/249 (64%), Positives = 206/249 (82%), Gaps = 2/249 (0%)
Frame = -3
Query: 1004 VTEYSYL-LSKDDSF*KMEN-GIPEQRELFLTVSNILKENKSMSKDSFKFLTNYLVTFNG 831
VTEY K D+F K N I +QRELFL ++N+LKENKS+ +S KFLT YL TF+
Sbjct: 151 VTEYIVSSFKKIDNFLKEWNIDIKDQRELFLAIANVLKENKSLVNESLKFLTKYLATFSN 210
Query: 830 EDANVLSEAKEEAVRAIVDFVRAPDVFQCDLLDFPAVGQLEKDAKYALLYQLLKIFLSQR 651
EDA VL EAKEEAVRA+++FV+A +FQCDLLD PAV QLEKDAKYA +YQLLKIFL+QR
Sbjct: 211 EDAQVLDEAKEEAVRAVIEFVKASSIFQCDLLDLPAVAQLEKDAKYAPVYQLLKIFLTQR 270
Query: 650 LDAYIEYHTANSALLKSYGLVHEDCISKMRLMSLVDLSSDASGQIPYEVIRDTLRINDDE 471
L+AY E+ ANS L+SYGL +EDC++KMRL+SLVDL+SD SG+IPY I+DTL++N+ +
Sbjct: 271 LNAYTEFQNANSGFLQSYGLSNEDCVTKMRLLSLVDLASDESGKIPYTSIKDTLQVNEQD 330
Query: 470 VELWVVKAITAKLIDCKMDQINQVVVVSHHTDRVFGQHQWQTLRTKLASWRGNISNVIST 291
VELW+VKAITAKLI+CKMDQ+NQV++VS ++R FG QWQ+LRTKLA+W+ NIS++I+T
Sbjct: 331 VELWIVKAITAKLIECKMDQMNQVLIVSRSSEREFGTKQWQSLRTKLATWKDNISSIITT 390
Query: 290 IQANKVTED 264
I++NKVTE+
Sbjct: 391 IESNKVTEE 399
>emb|CAD39334.1| OSJNBa0094O15.2 [Oryza sativa (japonica cultivar-group)]
Length = 416
Score = 314 bits (805), Expect = 1e-84
Identities = 148/234 (63%), Positives = 193/234 (82%), Gaps = 2/234 (0%)
Frame = -3
Query: 947 GIPEQRELFLTVSNILKENKSMSKDSFKFLTNYLVTFNG--EDANVLSEAKEEAVRAIVD 774
G EQR+LFL + ILK+ K M+K+ F FL YL TF+G +DA+ + +AKEEAV AI++
Sbjct: 176 GKVEQRDLFLAAARILKDQKGMNKEYFNFLNKYLATFDGSADDADAIGDAKEEAVAAIIE 235
Query: 773 FVRAPDVFQCDLLDFPAVGQLEKDAKYALLYQLLKIFLSQRLDAYIEYHTANSALLKSYG 594
FV++ D++QCDLL+ PAV QLEKD KY L+Y+LLKIFL+QRLD+Y+E+ +ANSALLK YG
Sbjct: 236 FVKSSDLYQCDLLNMPAVAQLEKDEKYQLVYELLKIFLTQRLDSYLEFQSANSALLKGYG 295
Query: 593 LVHEDCISKMRLMSLVDLSSDASGQIPYEVIRDTLRINDDEVELWVVKAITAKLIDCKMD 414
LVHEDCI+KMRLMSL+DLSS +G+IPY I D L+INDDEVE W+VKAI+ K++DCK+D
Sbjct: 296 LVHEDCITKMRLMSLLDLSSRCAGEIPYHAIIDALKINDDEVEYWIVKAISCKILDCKVD 355
Query: 413 QINQVVVVSHHTDRVFGQHQWQTLRTKLASWRGNISNVISTIQANKVTEDWVPG 252
Q+NQV++VS HT+R+FG QWQ+LR+KL WRGNI++ I+TIQANKVT+D G
Sbjct: 356 QLNQVIIVSRHTERIFGMPQWQSLRSKLGVWRGNIASAINTIQANKVTDDGSQG 409
>ref|NP_197065.1| putative protein; protein id: At5g15610.1, supported by cDNA:
gi_20466755 [Arabidopsis thaliana]
gi|11358128|pir||T51539 hypothetical protein T20K14_220 -
Arabidopsis thaliana gi|9755816|emb|CAC01760.1| putative
protein [Arabidopsis thaliana] gi|20466756|gb|AAM20695.1|
putative protein [Arabidopsis thaliana]
gi|23198264|gb|AAN15659.1| putative protein [Arabidopsis
thaliana]
Length = 442
Score = 312 bits (799), Expect = 6e-84
Identities = 163/278 (58%), Positives = 204/278 (72%), Gaps = 31/278 (11%)
Frame = -3
Query: 1004 VTEYSY-LLSKDDSF*KMEN-GIPEQRELFLTVSNILKENKSMSKDSFKFLTNYLVTFNG 831
VTEY K DSF K N I +QRELFL ++ +L+ENKS +K+S +F+TNYL TF+
Sbjct: 151 VTEYIVPSFKKIDSFLKEWNIDIKDQRELFLAIAKVLRENKSFAKESLQFVTNYLATFSN 210
Query: 830 EDANVLSEAKEEAVRAIVDFVRAPDVFQCDLLDFPAVGQLEKDAKYALLYQLLKIFLSQR 651
ED VLSEAKEEAVRA+++FV+AP +FQCDLLD PAV QLEKD A +YQLLKIFL+QR
Sbjct: 211 EDTQVLSEAKEEAVRAVIEFVKAPSIFQCDLLDHPAVAQLEKDPNNAPVYQLLKIFLTQR 270
Query: 650 LDAYIEYHTANSALLKSYGLVHEDCISKMRLMSLVDLSSDASGQIPYEVIRDTLRINDDE 471
LDAY+E+ ANS L++YGLV EDC++KMRL+SLVDL+SD SG+IPY I++TL++ND+E
Sbjct: 271 LDAYMEFQNANSGFLQTYGLVEEDCVAKMRLLSLVDLASDDSGKIPYASIKNTLQVNDEE 330
Query: 470 VELWVVKAITAKLIDCKMDQINQVVVV-----------------------------SHHT 378
VELWVVKAITAKL+ CKMDQ+NQVV+V S
Sbjct: 331 VELWVVKAITAKLVACKMDQMNQVVIVRQVSNLLLFRFICVNPKSHILVLHICSCISRCA 390
Query: 377 DRVFGQHQWQTLRTKLASWRGNISNVISTIQANKVTED 264
+R FGQ QWQ+LRTKLA+WR N+ NVISTI++NK TE+
Sbjct: 391 EREFGQKQWQSLRTKLAAWRDNVRNVISTIESNKATEE 428
>gb|AAF02117.1|AC009755_10 unknown protein [Arabidopsis thaliana]
gi|6513915|gb|AAF14819.1|AC011664_1 unknown protein
[Arabidopsis thaliana]
Length = 400
Score = 261 bits (667), Expect = 1e-68
Identities = 140/249 (56%), Positives = 182/249 (72%), Gaps = 2/249 (0%)
Frame = -3
Query: 1004 VTEYSYL-LSKDDSF*KMEN-GIPEQRELFLTVSNILKENKSMSKDSFKFLTNYLVTFNG 831
VTEY K D+F K N I +QRELFL ++N+LKENKS+ +S KFLT YL TF+
Sbjct: 151 VTEYIVSSFKKIDNFLKEWNIDIKDQRELFLAIANVLKENKSLVNESLKFLTKYLATFSN 210
Query: 830 EDANVLSEAKEEAVRAIVDFVRAPDVFQCDLLDFPAVGQLEKDAKYALLYQLLKIFLSQR 651
EDA VL EAKEEAVRA+++FV+A +FQCDLLD PAV QLEKDAKYA +YQLLKIFL+QR
Sbjct: 211 EDAQVLDEAKEEAVRAVIEFVKASSIFQCDLLDLPAVAQLEKDAKYAPVYQLLKIFLTQR 270
Query: 650 LDAYIEYHTANSALLKSYGLVHEDCISKMRLMSLVDLSSDASGQIPYEVIRDTLRINDDE 471
L+AY E+ ANS L+SYGL +EDC++KMRL+SLVDL+SD SG+IPY I+DTL++N+ +
Sbjct: 271 LNAYTEFQNANSGFLQSYGLSNEDCVTKMRLLSLVDLASDESGKIPYTSIKDTLQVNEQD 330
Query: 470 VELWVVKAITAKLIDCKMDQINQVVVVSHHTDRVFGQHQWQTLRTKLASWRGNISNVIST 291
VELW+VKAITAKLI+ + +N + G Q+LR + ++I+T
Sbjct: 331 VELWIVKAITAKLIESALPNVN--------LGQSNGNLSEQSLR---------LGSIITT 373
Query: 290 IQANKVTED 264
I++NKVTE+
Sbjct: 374 IESNKVTEE 382
>gb|EAA00285.1| agCP10016 [Anopheles gambiae str. PEST]
Length = 394
Score = 119 bits (298), Expect = 7e-26
Identities = 73/223 (32%), Positives = 135/223 (59%), Gaps = 4/223 (1%)
Frame = -3
Query: 938 EQRELFLTVSNILKENKSMSKDSFKFLTNYLVTFNGEDANVLSEAKEEAVRAIVDFVRAP 759
+ ++L+ + ++LK+ S S+ + K + L T+ E+A S A+E+A++ IV + P
Sbjct: 173 QMQKLYRLLHDVLKD--SNSELASKVMIELLGTYTAENA---SYAREDAMKCIVTALADP 227
Query: 758 DVFQCD-LLDFPAVGQLEKDAKYALLYQLLKIFLSQRLDAYIEYHTANSALLKSYGLVHE 582
+ F D LL V LE + L++ LL +F+S++L +Y+E++ + + S GL HE
Sbjct: 228 NTFLLDPLLSLKPVRFLEGE----LIHDLLSVFVSEKLPSYLEFYKNHKEFVNSQGLNHE 283
Query: 581 DCISKMRLMSLVDLSSDASGQIPYEVIRDTLRINDDEVELWVVKAITAKLIDCKMDQINQ 402
I KMRL+S + L+ +++ ++ ++ ++D L+I ++EVE ++++ + KL+ +MDQ +
Sbjct: 284 QNIKKMRLLSFMQLA-ESNSEMTFQQLQDELQIKEEEVEPFIIEVLKTKLVRARMDQRAR 342
Query: 401 VVVVSHHTDRVFGQHQWQTLRTKLASWRGNISNV---ISTIQA 282
V +S R FG+ QWQ LR L SW+ N++ V I+T+ A
Sbjct: 343 KVHISSTMHRTFGRPQWQQLRDLLLSWKSNLTLVQENINTVSA 385
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 819,429,837
Number of Sequences: 1393205
Number of extensions: 17869952
Number of successful extensions: 48432
Number of sequences better than 10.0: 62
Number of HSP's better than 10.0 without gapping: 45687
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 48269
length of database: 448,689,247
effective HSP length: 124
effective length of database: 275,931,827
effective search space used: 57945683670
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)