
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0103.7
(411 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
NONA_DROME (Q04047) No-on-transient A protein 37 0.070
GR2B_ARATH (Q38896) Glycine-rich protein 2b (AtGRP2b) 36 0.16
FBRL_CAEEL (Q22053) Fibrillarin 35 0.46
ROU_HUMAN (Q00839) Heterogenous nuclear ribonucleoprotein U (hnR... 34 0.60
GRP2_NICSY (P27484) Glycine-rich protein 2 34 0.60
CC39_CAEEL (Q09455) Cuticle collagen 39 precursor 33 1.0
CA35_HUMAN (P25940) Collagen alpha 3(V) chain precursor 33 1.0
GLH2_CAEEL (Q966L9) ATP-dependent RNA helicase glh-2 (EC 3.6.1.-... 33 1.3
TXA1_ACTEQ (Q9NJQ2) Neurotoxin Ae I precursor (Toxin Ae I) (AeNa) 33 1.7
CC19_CAEEL (P18835) Cuticle collagen 19 precursor 32 2.3
NSP1_YEAST (P14907) Nucleoporin NSP1 (Nuclear pore protein NSP1)... 32 3.9
CA13_MOUSE (P08121) Collagen alpha 1(III) chain precursor 32 3.9
SSB_COREF (Q8FLP9) Single-strand binding protein (SSB) (Helix-de... 31 5.0
CSP_PLAFW (P08307) Circumsporozoite protein precursor (CS) 31 5.0
CA25_HUMAN (P05997) Collagen alpha 2(V) chain precursor 31 5.0
C43B_MOUSE (Q9EQG9) Goodpasture antigen-binding protein (EC 2.7.... 31 5.0
GAG_JSRV (P31622) Gag polyprotein [Contains: Core protein p10; C... 31 6.6
CA21_RANCA (O42350) Collagen alpha 2(I) chain precursor 31 6.6
CA13_CHICK (P12105) Collagen alpha 1(III) chain precursor (Fragm... 31 6.6
U269_DEIRA (Q9RXP1) Protein DR0269 30 8.6
>NONA_DROME (Q04047) No-on-transient A protein
Length = 700
Score = 37.4 bits (85), Expect = 0.070
Identities = 39/145 (26%), Positives = 55/145 (37%), Gaps = 26/145 (17%)
Query: 222 KGLGVNKAPKKKE-------EKRKPYF*PQDQGRNQF-GRTWNPTGGFGFQGRPG----- 268
K LG + A K+ + EK++ + P Q +NQ + TGG G G P
Sbjct: 28 KNLGKHNAQKQNDSADGGPAEKKQRFGGPNAQNQNQNQNQNGGVTGGGGAVGGPNQNKNF 87
Query: 269 GFNNTPFCGRCRRNGHRAQECIVVLGGQSGV------QTGGSGNPN-------PTTTVGG 315
G N F G RN +RA G + +T + PN P T G
Sbjct: 88 GNNKGGFVGNRNRNNNRAGNQNRTFPGNNNSNQKPNNETSKADGPNALAKNNEPATAAAG 147
Query: 316 GSNRNANVPPRDNRRVGHPANKGKV 340
+ N N N+R G N+ +V
Sbjct: 148 QNQANQNANKGQNQRQGQNQNQNQV 172
>GR2B_ARATH (Q38896) Glycine-rich protein 2b (AtGRP2b)
Length = 201
Score = 36.2 bits (82), Expect = 0.16
Identities = 24/58 (41%), Positives = 28/58 (47%), Gaps = 4/58 (6%)
Query: 259 GGFGFQGRPGGFNNTPFCGRCRRNGHRAQECIVVLGGQSGVQTGGSGNPNPTTTVGGG 316
GG G GR GG + C +C GH A+EC GG SG GG G + GGG
Sbjct: 122 GGRGSGGRGGGGGDNS-CFKCGEPGHMARECSQGGGGYSG---GGGGGRYGSGGGGGG 175
>FBRL_CAEEL (Q22053) Fibrillarin
Length = 352
Score = 34.7 bits (78), Expect = 0.46
Identities = 21/52 (40%), Positives = 26/52 (49%), Gaps = 4/52 (7%)
Query: 248 RNQFGRTWNPTGGFGFQGRPGGFNNTPFCGRCRRNGHRAQECIVV----LGG 295
R FG +P GGFG +G P G +P GR G R + +VV LGG
Sbjct: 75 RGGFGGRGSPRGGFGGRGSPRGGRGSPRGGRGGAGGMRGGKTVVVEPHRLGG 126
>ROU_HUMAN (Q00839) Heterogenous nuclear ribonucleoprotein U (hnRNP
U) (Scaffold attachment factor A) (SAF-A) (pp120)
Length = 824
Score = 34.3 bits (77), Expect = 0.60
Identities = 35/128 (27%), Positives = 48/128 (37%), Gaps = 7/128 (5%)
Query: 211 EENEKT*KEYLKGLGVNKAPKKKEEKRKPYF*PQDQGRNQFGRTWNPTGGFGFQGRPGGF 270
EE +K ++Y + P+KK+ G+NQF R G G R G F
Sbjct: 659 EEAQKLLEQYKEESKKALPPEKKQNTGSKKSNKNKSGKNQFNRGGGHRGRGGLNMRGGNF 718
Query: 271 NNTPFCGRCRRNGHRAQECIVVLGGQSGVQTGGSGNPNPTTTVGGGSNRNANVPPRDNRR 330
R G+ + + GG G +GG G P P V G +N R N
Sbjct: 719 RGG---APGNRGGYNRRGNMPQRGGGGG-GSGGIGYPYPRAPVFPGRGSYSN---RGNYN 771
Query: 331 VGHPANKG 338
G N+G
Sbjct: 772 RGGMPNRG 779
>GRP2_NICSY (P27484) Glycine-rich protein 2
Length = 214
Score = 34.3 bits (77), Expect = 0.60
Identities = 18/47 (38%), Positives = 24/47 (50%), Gaps = 4/47 (8%)
Query: 259 GGFGFQGRPGGFNNTPFCGRCRRNGHRAQECIVVLGGQSGVQTGGSG 305
GG+G G GG C +C +GH A++C GG G + GG G
Sbjct: 146 GGYGGGGSGGGSG----CFKCGESGHFARDCSQSGGGGGGGRFGGGG 188
>CC39_CAEEL (Q09455) Cuticle collagen 39 precursor
Length = 323
Score = 33.5 bits (75), Expect = 1.0
Identities = 31/93 (33%), Positives = 34/93 (36%), Gaps = 13/93 (13%)
Query: 259 GGFGFQGRPGGFNNTPFCGRCRRNGHRAQECIVVLGGQSGVQTGGSGNPNPTTTVG---- 314
G G G+PGG N G H QECI G G G G P P G
Sbjct: 102 GDRGLDGQPGGAGNPGQPGVAGPKSHEQQECIKCPAGSPG-PAGAPGAPGPQGPNGQPGH 160
Query: 315 ---GGSNRNANV--PPRD---NRRVGHPANKGK 339
GGS A P D +VG P N G+
Sbjct: 161 PGQGGSQGPAGPRGPAGDAGAPGQVGAPGNPGQ 193
>CA35_HUMAN (P25940) Collagen alpha 3(V) chain precursor
Length = 1745
Score = 33.5 bits (75), Expect = 1.0
Identities = 29/109 (26%), Positives = 39/109 (35%), Gaps = 5/109 (4%)
Query: 257 PTGGFGFQGRPGGFNNTPFCGRCRRNGHRAQECIVVLGGQSGVQTGGSGNPNPTTTVGGG 316
P G G GRPG G G R + + L G+ G Q G G+ G
Sbjct: 522 PPGRVGKMGRPGADGARGLPGDTGPKGDRGFDGLPGLPGEKG-QRGDFGHVGQPGPPGED 580
Query: 317 SNRNANVPPRDNRRVGHPANKGKVFAMTMEEASESPELIKGMCYVNGFP 365
R A PP + G P +G + S P G+ ++G P
Sbjct: 581 GERGAEGPPGPTGQAGEPGPRG----LLGPRGSPGPTGRPGVTGIDGAP 625
>GLH2_CAEEL (Q966L9) ATP-dependent RNA helicase glh-2 (EC 3.6.1.-)
(Germline helicase-2)
Length = 974
Score = 33.1 bits (74), Expect = 1.3
Identities = 34/107 (31%), Positives = 42/107 (38%), Gaps = 19/107 (17%)
Query: 235 EKRKPYF*PQDQGRNQF--GRTWNPTGGFGFQGRPGGFNNTPF-----CGRCRRNGHRAQ 287
E+RKP +GRN F G GGFG G GF N C C+ GHR+
Sbjct: 413 EERKPR-----EGRNGFTSGFGGGNDGGFG-GGNAEGFGNNEERGPMKCFNCKGEGHRSA 466
Query: 288 ECI-----VVLGGQSGVQTGGSGNP-NPTTTVGGGSNRNANVPPRDN 328
EC G+ G ++ NP P G + VP DN
Sbjct: 467 ECPEPPRGCFNCGEQGHRSNECPNPAKPREGAEGEGPKATYVPVEDN 513
>TXA1_ACTEQ (Q9NJQ2) Neurotoxin Ae I precursor (Toxin Ae I) (AeNa)
Length = 82
Score = 32.7 bits (73), Expect = 1.7
Identities = 15/38 (39%), Positives = 20/38 (52%), Gaps = 3/38 (7%)
Query: 243 PQDQGRNQFGRTWNPTGGFGFQGRPGGFNNTPFCGRCR 280
P +G G W TGG+G G P G++ FCG+ R
Sbjct: 39 PSTRGNKLSGTIWMKTGGYGGNGCPKGWH---FCGKSR 73
>CC19_CAEEL (P18835) Cuticle collagen 19 precursor
Length = 289
Score = 32.3 bits (72), Expect = 2.3
Identities = 33/106 (31%), Positives = 39/106 (36%), Gaps = 14/106 (13%)
Query: 257 PTGGFGFQGRPGGFNNTPFCGRCRRNGHRAQECIVVLGGQSG--------VQTGGSGNPN 308
P GG G QG G GR G R + GQ G +TG +GNP
Sbjct: 175 PVGGPGEQGPQGDAGRPGAAGRPGPAGPRGEPGTEYKPGQPGRPGPQGPRGETGPAGNPG 234
Query: 309 -PTTTVGGGSNRNANVPPRDNRRVGHPANKGKVFAMTMEEASESPE 353
P G N NA P GHP G V E+A+ P+
Sbjct: 235 APGNDGEAGKNGNAGRPGPP----GHPGKNG-VPGQKGEDAAPGPD 275
>NSP1_YEAST (P14907) Nucleoporin NSP1 (Nuclear pore protein NSP1)
(Nucleoskeletal-like protein) (p110)
Length = 823
Score = 31.6 bits (70), Expect = 3.9
Identities = 20/64 (31%), Positives = 25/64 (38%), Gaps = 5/64 (7%)
Query: 259 GGFGFQGRPGGFNNTPFCGRCRRNGHRAQECIVVLGGQSGVQTGGSGNPNPTTTVGGGSN 318
G FG GFNN+ N + A I G + GN NPT+ V G +N
Sbjct: 34 GAFGTGQSTFGFNNS-----APNNTNNANSSITPAFGSNNTGNTAFGNSNPTSNVFGSNN 88
Query: 319 RNAN 322
N
Sbjct: 89 STTN 92
>CA13_MOUSE (P08121) Collagen alpha 1(III) chain precursor
Length = 1464
Score = 31.6 bits (70), Expect = 3.9
Identities = 30/100 (30%), Positives = 37/100 (37%), Gaps = 17/100 (17%)
Query: 257 PTGGFGFQGRPGGFNNTPFCGRCRRNGHRAQECIVVLGGQSGV--QTGGSGNPNPTTTVG 314
P G G G PG N G +NG R L G +G +TG G P PT G
Sbjct: 570 PRGQPGVMGFPGPKGND---GAPGKNGERGGPGGPGLPGPAGKNGETGPQGPPGPTGPAG 626
Query: 315 GGSNRN------------ANVPPRDNRRVGHPANKGKVFA 342
+ PP +N + G P KG+V A
Sbjct: 627 DKGDSGPPGPQGLQGIPGTGGPPGENGKPGEPGPKGEVGA 666
>SSB_COREF (Q8FLP9) Single-strand binding protein (SSB)
(Helix-destabilizing protein)
Length = 232
Score = 31.2 bits (69), Expect = 5.0
Identities = 31/101 (30%), Positives = 32/101 (30%), Gaps = 28/101 (27%)
Query: 246 QGRNQFGRTWNPTGGFGF--------QGRPGGFNNTPFCGRCRRNGHRAQECIVVLGGQS 297
QG NQ G GGFG QG GGF GG
Sbjct: 141 QGGNQGGNPGGGQGGFGQNTGGFGGQQGNQGGFGGGQ-------------------GGNP 181
Query: 298 GVQTGGSGNPNPTTTVGGGSNRNANVPPRDNRRVGHPANKG 338
G G SGN GGG N+ A P D PA G
Sbjct: 182 GGNQGQSGNNFNQGGFGGGGNQ-AAAPDNDPWNSAPPAGSG 221
>CSP_PLAFW (P08307) Circumsporozoite protein precursor (CS)
Length = 442
Score = 31.2 bits (69), Expect = 5.0
Identities = 24/92 (26%), Positives = 32/92 (34%), Gaps = 7/92 (7%)
Query: 245 DQGRNQFGRTWNPTGGFGFQGRPGGFNNTPFCGRCRRNGHRAQECIVVLGGQSGVQTGGS 304
D G N G N G GR G + +R+G+ + ++ G
Sbjct: 76 DDGDNDNGNNNNGNNNNGDNGREGKDED-------KRDGNNEDNEKLRKPKHKKLKQPGD 128
Query: 305 GNPNPTTTVGGGSNRNANVPPRDNRRVGHPAN 336
GNP+P N N NV P N AN
Sbjct: 129 GNPDPNANPNVDPNANPNVDPNANPNANPNAN 160
>CA25_HUMAN (P05997) Collagen alpha 2(V) chain precursor
Length = 1496
Score = 31.2 bits (69), Expect = 5.0
Identities = 33/114 (28%), Positives = 37/114 (31%), Gaps = 8/114 (7%)
Query: 233 KEEKRKPYF*PQDQGRNQFGRTWNPTGGFGFQGRPGGFNNTPFCGRCRRNGHRAQECIVV 292
KE P P GR P GG G +G PG G G Q IV
Sbjct: 934 KEGPPGPRGDPGSHGRVGVRGPAGPPGGPGDKGDPGEDGQPGPDGPPGPAGTTGQRGIVG 993
Query: 293 LGGQSGVQ--------TGGSGNPNPTTTVGGGSNRNANVPPRDNRRVGHPANKG 338
+ GQ G + G G PT G PP N VG P +G
Sbjct: 994 MPGQRGERGMPGLPGPAGTPGKVGPTGATGDKGPPGPVGPPGSNGPVGEPGPEG 1047
>C43B_MOUSE (Q9EQG9) Goodpasture antigen-binding protein (EC
2.7.1.37) (GPBP) (Collagen type IV alpha 3 binding
protein) (StAR-related lipid transfer protein 11)
(StARD11) (START domain-containing protein 11)
Length = 624
Score = 31.2 bits (69), Expect = 5.0
Identities = 29/132 (21%), Positives = 55/132 (40%), Gaps = 11/132 (8%)
Query: 37 EEIYHGLDKFLKRSPPKFEGGYNPDGAYEWVRELERIFETLVCAEPRKVAFASYLLSSEA 96
EE Y + + LK+ P +F G +G + E E F+ + A R+ S +
Sbjct: 288 EEAYKNVMEELKKKP-RFGGPDYEEGPNSLINE-EEFFDAVEAALDRQDKIEEQSQSEKV 345
Query: 97 RTWWTSVRGRITPEEGELIWEIFKNSFLEKYFPADAKCRKEMEFFELKQEAMSVGEYAAK 156
R W + + G+ + + F++K + + M +L + V ++++
Sbjct: 346 RLHWPT-----SLPSGDTFSSVGTHRFVQKPYSRSSS----MSSIDLVSASDDVHRFSSQ 396
Query: 157 FEELCQFHPRYS 168
EE+ Q H YS
Sbjct: 397 VEEMVQNHMNYS 408
>GAG_JSRV (P31622) Gag polyprotein [Contains: Core protein p10; Core
protein p18; Core protein p12; Core protein p27; Core
protein p14; Core protein p4]
Length = 612
Score = 30.8 bits (68), Expect = 6.6
Identities = 30/119 (25%), Positives = 46/119 (38%), Gaps = 25/119 (21%)
Query: 220 YLKGLGVNKAPKKKEEKRKPYF*PQDQGRNQFGRTWNPTGGFGFQGRPG----------- 268
Y++G+ + A + K K + Q Q RN+ G + G G+PG
Sbjct: 471 YMQGIAMAAALQGKSIKEVLF---QQQARNKKGLQKSGNSGCFVCGQPGHRAAVCPQKHQ 527
Query: 269 -GFNNTPFCGRCRRNGHRAQECIVVLGGQSGVQTGGSGNPNPTTTVGGGSNRNANVPPR 326
N C RC++ H A++C +T GNP P V G R + P+
Sbjct: 528 TSVNTPNLCPRCKKGKHWARDC--------RSKTDVQGNPLP--PVSGNWVRGQPLAPK 576
>CA21_RANCA (O42350) Collagen alpha 2(I) chain precursor
Length = 1355
Score = 30.8 bits (68), Expect = 6.6
Identities = 26/84 (30%), Positives = 31/84 (35%), Gaps = 4/84 (4%)
Query: 257 PTGGFGFQGRPGGFNNTPFCGRCRRNGHRAQECIVVLGGQSGVQTGGSGNPNPTTTVGGG 316
P G G +G G G R GH + G G +TG SG+ P VG
Sbjct: 989 PAGPQGVRGDKGEAGERGARGLDGRKGHNGLSGLPGPSGTPG-ETGPSGSVGP---VGPR 1044
Query: 317 SNRNANVPPRDNRRVGHPANKGKV 340
+ PP R GHP G V
Sbjct: 1045 GPSGPSGPPGKEGRSGHPGAMGPV 1068
>CA13_CHICK (P12105) Collagen alpha 1(III) chain precursor
(Fragments)
Length = 1262
Score = 30.8 bits (68), Expect = 6.6
Identities = 27/87 (31%), Positives = 32/87 (36%), Gaps = 14/87 (16%)
Query: 257 PTGGFGFQGRPG-----GFNNTPFCGRCRRNGHRAQECIVVLGGQSGVQTGGSGNPNPTT 311
P GG G G PG G TP G +NG + G G + G +G P
Sbjct: 408 PPGGRGLPGPPGTSGNPGAKGTP--GEPGKNGAKGDP------GPKG-ERGENGTPGARG 458
Query: 312 TVGGGSNRNANVPPRDNRRVGHPANKG 338
G R AN P N G P +G
Sbjct: 459 PPGEEGKRGANGEPGQNGVPGTPGERG 485
>U269_DEIRA (Q9RXP1) Protein DR0269
Length = 322
Score = 30.4 bits (67), Expect = 8.6
Identities = 26/88 (29%), Positives = 32/88 (35%), Gaps = 5/88 (5%)
Query: 244 QDQGRNQFGRT-WNPTGGFGFQGRPGGFNNTPFCGRCRRNGHRAQECIVVLGGQSGVQTG 302
+D+G FG GGF + GGF G R R + GG G G
Sbjct: 173 EDRGERSFGGDRGGDRGGFRPREDRGGFGGDRDRGGFRPREDRGERSF---GGDRGGDRG 229
Query: 303 GSGNPNPTTTVGG-GSNRNANVPPRDNR 329
G G P GG G PR++R
Sbjct: 230 GQGGFRPREDRGGFGDRDRGGFRPREDR 257
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.320 0.137 0.423
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 53,367,785
Number of Sequences: 164201
Number of extensions: 2567290
Number of successful extensions: 6222
Number of sequences better than 10.0: 24
Number of HSP's better than 10.0 without gapping: 3
Number of HSP's successfully gapped in prelim test: 21
Number of HSP's that attempted gapping in prelim test: 6142
Number of HSP's gapped (non-prelim): 98
length of query: 411
length of database: 59,974,054
effective HSP length: 113
effective length of query: 298
effective length of database: 41,419,341
effective search space: 12342963618
effective search space used: 12342963618
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 67 (30.4 bits)
Lotus: description of TM0103.7