Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC000937A_C01 KMC000937A_c01
(882 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_196024.1| expressed protein; protein id: At5g04040.1 [Ara... 491 e-141
ref|NP_191273.1| expressed protein; protein id: At3g57140.1 [Ara... 493 e-141
dbj|BAB61223.1| contains EST AU057376(S21389)~similar to Arabido... 460 e-132
gb|EAA29514.1| hypothetical protein [Neurospora crassa] 194 1e-48
emb|CAD60564.1| unnamed protein product [Podospora anserina] 191 1e-47
>ref|NP_196024.1| expressed protein; protein id: At5g04040.1 [Arabidopsis thaliana]
gi|11282314|pir||T48431 hypothetical protein F8F6.250 -
Arabidopsis thaliana gi|7406414|emb|CAB85524.1| putative
protein [Arabidopsis thaliana]
gi|22531263|gb|AAM97135.1| putative protein [Arabidopsis
thaliana]
Length = 825
Score = 491 bits (1264), Expect(2) = e-141
Identities = 240/274 (87%), Positives = 262/274 (95%), Gaps = 2/274 (0%)
Frame = -3
Query: 880 ELHKGRLQVPRLIKEYIDEVSTQLRMVCDSDSQELLLEEKLAFMHETRHAFGRTALLLSG 701
ELHKGRLQVPR IKEYIDEVSTQLRMVC+SDS+EL LEEKL+FMHETRHAFGRTALLLSG
Sbjct: 177 ELHKGRLQVPRHIKEYIDEVSTQLRMVCNSDSEELSLEEKLSFMHETRHAFGRTALLLSG 236
Query: 700 GASLGASHVGVVKTLVEHKLLPRVIAGSSVGSIMCAVVATRSWPELQSFFEDSWHSMQFF 521
GASLGA HVGVV+TLVEHKLLPR+IAGSSVGSI+CAVVA+RSWPELQSFFE+S HS+QFF
Sbjct: 237 GASLGAFHVGVVRTLVEHKLLPRIIAGSSVGSIICAVVASRSWPELQSFFENSLHSLQFF 296
Query: 520 DQMGGIFTVVKRVATRGAVHEIRQLQIMLRHLTNNLTFQEAYDMTGRVLGITVCSPRKHE 341
DQ+GG+F++VKRV T+GA+H+IRQLQ MLR+LT+NLTFQEAYDMTGR+LGITVCSPRKHE
Sbjct: 297 DQLGGVFSIVKRVMTQGALHDIRQLQCMLRNLTSNLTFQEAYDMTGRILGITVCSPRKHE 356
Query: 340 PPRCLNYLTSPHVVIWSAVTASCAFPGLFEAQELMAKNRSGEIVPYHPPFNLGPEEG--S 167
PPRCLNYLTSPHVVIWSAVTASCAFPGLFEAQELMAK+RSGEIVPYHPPFNL PE G S
Sbjct: 357 PPRCLNYLTSPHVVIWSAVTASCAFPGLFEAQELMAKDRSGEIVPYHPPFNLDPEVGTKS 416
Query: 166 TPARRWRDGSLEIDLPMMQLKELFNVNHFIVSQA 65
+ RRWRDGSLE+DLPMMQLKELFNVNHFIVSQA
Sbjct: 417 SSGRRWRDGSLEVDLPMMQLKELFNVNHFIVSQA 450
Score = 33.9 bits (76), Expect(2) = e-141
Identities = 14/24 (58%), Positives = 17/24 (70%)
Frame = -2
Query: 83 FYSQPGHPHIAPLLRLKEFIRNYG 12
F +PHIAPLLRLK+ +R YG
Sbjct: 445 FIVSQANPHIAPLLRLKDLVRAYG 468
>ref|NP_191273.1| expressed protein; protein id: At3g57140.1 [Arabidopsis thaliana]
gi|11282313|pir||T47774 hypothetical protein F24I3.220 -
Arabidopsis thaliana gi|6911884|emb|CAB72184.1| putative
protein [Arabidopsis thaliana]
gi|26450904|dbj|BAC42559.1| unknown protein [Arabidopsis
thaliana] gi|29029050|gb|AAO64904.1| At3g57140
[Arabidopsis thaliana]
Length = 801
Score = 493 bits (1270), Expect(2) = e-141
Identities = 239/272 (87%), Positives = 256/272 (93%)
Frame = -3
Query: 880 ELHKGRLQVPRLIKEYIDEVSTQLRMVCDSDSQELLLEEKLAFMHETRHAFGRTALLLSG 701
ELHKGRL VPRLIKEYIDEVSTQLRMVCD D++EL LEEKL+FMHETRHA+GRTALLLSG
Sbjct: 178 ELHKGRLHVPRLIKEYIDEVSTQLRMVCDMDTEELSLEEKLSFMHETRHAYGRTALLLSG 237
Query: 700 GASLGASHVGVVKTLVEHKLLPRVIAGSSVGSIMCAVVATRSWPELQSFFEDSWHSMQFF 521
GASLGA H+GVVKTLVEHKLLPR+IAGSSVGS+MCAVV TRSWPELQSFFE SWH++QFF
Sbjct: 238 GASLGAFHLGVVKTLVEHKLLPRIIAGSSVGSVMCAVVGTRSWPELQSFFEGSWHALQFF 297
Query: 520 DQMGGIFTVVKRVATRGAVHEIRQLQIMLRHLTNNLTFQEAYDMTGRVLGITVCSPRKHE 341
DQMGGIFT VKRV T+GAVHEIR LQ LR+LTNNLTFQEAYD+TGR+LGITVCS RKHE
Sbjct: 298 DQMGGIFTTVKRVMTQGAVHEIRHLQWKLRNLTNNLTFQEAYDITGRILGITVCSLRKHE 357
Query: 340 PPRCLNYLTSPHVVIWSAVTASCAFPGLFEAQELMAKNRSGEIVPYHPPFNLGPEEGSTP 161
PPRCLNYLTSPHVVIWSAVTASCAFPGLFEAQELMAK+R+GEIVPYHPPFNL PEEGS
Sbjct: 358 PPRCLNYLTSPHVVIWSAVTASCAFPGLFEAQELMAKDRTGEIVPYHPPFNLDPEEGSAS 417
Query: 160 ARRWRDGSLEIDLPMMQLKELFNVNHFIVSQA 65
RRWRDGSLE+DLPM+QLKELFNVNHFIVSQA
Sbjct: 418 VRRWRDGSLEMDLPMIQLKELFNVNHFIVSQA 449
Score = 31.6 bits (70), Expect(2) = e-141
Identities = 13/24 (54%), Positives = 16/24 (66%)
Frame = -2
Query: 83 FYSQPGHPHIAPLLRLKEFIRNYG 12
F +PHIAP LR+KEF+R G
Sbjct: 444 FIVSQANPHIAPFLRMKEFVRACG 467
>dbj|BAB61223.1| contains EST AU057376(S21389)~similar to Arabidopsis thaliana
chromosome 5, F8F6.250~unknown protein [Oryza sativa
(japonica cultivar-group)] gi|20804680|dbj|BAB92368.1|
P0512C01.22 [Oryza sativa (japonica cultivar-group)]
Length = 1044
Score = 460 bits (1183), Expect(2) = e-132
Identities = 228/274 (83%), Positives = 252/274 (91%), Gaps = 2/274 (0%)
Frame = -3
Query: 880 ELHKGRLQVPRLIKEYIDEVSTQLRMVCDSDSQELLLEEKLAFMHETRHAFGRTALLLSG 701
ELHKGRLQVP+LIKEYI+EVSTQL+MVC+SDS +L LEEKLAFMHETRHAFGRTALLLSG
Sbjct: 179 ELHKGRLQVPKLIKEYIEEVSTQLKMVCNSDSDDLPLEEKLAFMHETRHAFGRTALLLSG 238
Query: 700 GASLGASHVGVVKTLVEHKLLPRVIAGSSVGSIMCAVVATRSWPELQSFFEDSWHSMQFF 521
GASLG HVGVVKTLVEHKLLPR+I+GSSVGSIMC++VATRSWPEL+SFFE+ WHS++FF
Sbjct: 239 GASLGCFHVGVVKTLVEHKLLPRIISGSSVGSIMCSIVATRSWPELESFFEE-WHSLKFF 297
Query: 520 DQMGGIFTVVKRVATRGAVHEIRQLQIMLRHLTNNLTFQEAYDMTGRVLGITVCSPRKHE 341
DQMGGIF VVKR+ T GAVH+IR LQ +LR+LT+NLTFQEAYDMTGR+L +TVCSPRKHE
Sbjct: 298 DQMGGIFPVVKRILTHGAVHDIRHLQTLLRNLTSNLTFQEAYDMTGRILVVTVCSPRKHE 357
Query: 340 PPRCLNYLTSPHVVIWSAVTASCAFPGLFEAQELMAKNRSGEIVPYHPPFNLGPEE--GS 167
PPRCLNYLTSPHV+IWSAVTASCAFPGLFEAQELMAK+R GE VP+H PF LG EE G+
Sbjct: 358 PPRCLNYLTSPHVLIWSAVTASCAFPGLFEAQELMAKDRFGETVPFHAPFLLGLEERVGA 417
Query: 166 TPARRWRDGSLEIDLPMMQLKELFNVNHFIVSQA 65
T RRWRDGSLE DLPM QLKELFNVNHFIVSQA
Sbjct: 418 T-TRRWRDGSLESDLPMKQLKELFNVNHFIVSQA 450
Score = 35.4 bits (80), Expect(2) = e-132
Identities = 16/24 (66%), Positives = 17/24 (70%)
Frame = -2
Query: 83 FYSQPGHPHIAPLLRLKEFIRNYG 12
F +PHIAPLLRLKE IR YG
Sbjct: 445 FIVSQANPHIAPLLRLKEIIRAYG 468
>gb|EAA29514.1| hypothetical protein [Neurospora crassa]
Length = 802
Score = 194 bits (493), Expect = 1e-48
Identities = 113/268 (42%), Positives = 168/268 (62%), Gaps = 7/268 (2%)
Frame = -3
Query: 850 RLIKEYIDEVSTQLRMVCDSDSQELLLE----EKLAFMHETRHAFGRTALLLSGGASLGA 683
+LI++Y+D + + D +Q L + + L M R +FGR+ALLLSGGA+ G
Sbjct: 182 KLIEDYVDSAVKTIGALMDQSTQTLPADMETKDLLEGMLFARQSFGRSALLLSGGATFGM 241
Query: 682 SHVGVVKTLVEHKLLPRVIAGSSVGSIMCAVVATRSWPELQSFFED-SWHSMQFFDQMG- 509
SH+GV+K+L E LLPR+I+G+S GSI+C+V+ TR E+ + + F
Sbjct: 242 SHIGVIKSLFEANLLPRIISGASAGSIVCSVLCTRKDEEVPDLIRTFPYGDLDVFKGPND 301
Query: 508 GIFTVVKRVATRGAVHEIRQLQIMLRHLTNNLTFQEAYDMTGRVLGITVCSPRKHEPPRC 329
GI ++R+ T+G+ +I L ++R + +LTFQEAY+ T R+ I V + +E PR
Sbjct: 302 GISDSLRRLLTQGSWADITNLTRVMRSMLGDLTFQEAYNRTRRICNICVSTASIYELPRL 361
Query: 328 LNYLTSPHVVIWSAVTASCAFPGLFEAQELMAKN-RSGEIVPYHPPFNLGPEEGSTPARR 152
LNY+T+P+V+IWSAV ASC+ P +F+A L+ K+ +G VP++P TP +R
Sbjct: 362 LNYITAPNVMIWSAVAASCSVPLVFQAAPLLVKDPATGAHVPWNP----------TP-QR 410
Query: 151 WRDGSLEIDLPMMQLKELFNVNHFIVSQ 68
W DGS++ DLPM +L E+FNVNHFIVSQ
Sbjct: 411 WIDGSVDNDLPMTRLAEMFNVNHFIVSQ 438
>emb|CAD60564.1| unnamed protein product [Podospora anserina]
Length = 824
Score = 191 bits (486), Expect = 1e-47
Identities = 111/267 (41%), Positives = 165/267 (61%), Gaps = 7/267 (2%)
Frame = -3
Query: 847 LIKEYIDEVSTQLRMVCDSDSQELLL----EEKLAFMHETRHAFGRTALLLSGGASLGAS 680
LI+ Y+D + + + + + ++ L M R +FGR+ALLLSGGA+ G S
Sbjct: 180 LIERYVDSAVKTIEALVEKSAYSIPAGMETQDLLEGMLYARQSFGRSALLLSGGATFGMS 239
Query: 679 HVGVVKTLVEHKLLPRVIAGSSVGSIMCAVVATRSWPELQSFFED-SWHSMQFFD-QMGG 506
H+GV+K L E KLLPR+I+G+S GSI+CAV+ TR E+ + E + + F+ + G
Sbjct: 240 HIGVLKALYESKLLPRIISGASAGSIVCAVLCTRKDEEIPALVEAFPYGDLGVFEGEKDG 299
Query: 505 IFTVVKRVATRGAVHEIRQLQIMLRHLTNNLTFQEAYDMTGRVLGITVCSPRKHEPPRCL 326
+ ++R+ T G +I L ++R ++TFQEAY+ T R+ I V S +E PR L
Sbjct: 300 LSDHIRRLLTEGCWADISNLTRVMRSWLGDVTFQEAYNRTRRICNICVSSASIYELPRLL 359
Query: 325 NYLTSPHVVIWSAVTASCAFPGLFEAQELMAKN-RSGEIVPYHPPFNLGPEEGSTPARRW 149
NY+T+P+V+IWSAV ASC+ P +F+A L+ K+ +G VP++P TP + W
Sbjct: 360 NYITAPNVMIWSAVAASCSVPLVFQAASLLVKDPATGAHVPWNP----------TP-QHW 408
Query: 148 RDGSLEIDLPMMQLKELFNVNHFIVSQ 68
DGS++ DLPM +L E+FNVNHFIVSQ
Sbjct: 409 IDGSVDNDLPMTRLAEMFNVNHFIVSQ 435
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 778,501,928
Number of Sequences: 1393205
Number of extensions: 17022800
Number of successful extensions: 42290
Number of sequences better than 10.0: 159
Number of HSP's better than 10.0 without gapping: 40504
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 42229
length of database: 448,689,247
effective HSP length: 122
effective length of database: 278,718,237
effective search space used: 47660818527
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)