
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC121245.10 - phase: 0
(300 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
CA21_MOUSE (Q01149) Collagen alpha 2(I) chain precursor 33 0.66
CA21_HUMAN (P08123) Collagen alpha 2(I) chain precursor 32 1.5
CA21_CANFA (O46392) Collagen alpha 2(I) chain precursor 32 1.5
CA21_BOVIN (P02465) Collagen alpha 2(I) chain precursor 32 1.9
CLPP_PINCO (P36387) ATP-dependent Clp protease proteolytic subun... 32 2.5
Y085_CHLTR (O84087) Hypothetical protein CT085 31 3.3
CA21_ONCMY (O93484) Collagen alpha 2(I) chain precursor 31 3.3
HRBL_HUMAN (O95081) HIV-1 Rev binding protein-like protein (Rev/... 31 4.3
GCSP_RHOFA (Q8G9M2) Glycine dehydrogenase [decarboxylating] (EC ... 31 4.3
CA14_MOUSE (P02463) Collagen alpha 1(IV) chain precursor 31 4.3
PRD1_HUMAN (O75626) PR-domain zinc finger protein 1 (Beta-interf... 30 5.6
NAPF_HAEIN (P44650) Ferredoxin-type protein napF homolog 30 5.6
GAG_HV1Y2 (P35962) Gag polyprotein [Contains: Core protein p17 (... 30 5.6
GAG_HV1W2 (P05889) Gag polyprotein [Contains: Core protein p17 (... 30 5.6
GAG_HV1PV (P03350) Gag polyprotein [Contains: Core protein p17 (... 30 5.6
GAG_HV1OY (P20889) Gag polyprotein [Contains: Core protein p17 (... 30 5.6
GAG_HV1MN (P05888) Gag polyprotein [Contains: Core protein p17 (... 30 5.6
GAG_HV1LW (Q70622) Gag polyprotein [Contains: Core protein p17 (... 30 5.6
GAG_HV1JR (P20873) Gag polyprotein [Contains: Core protein p17 (... 30 5.6
GAG_HV1J3 (P12494) Gag polyprotein [Contains: Core protein p17 (... 30 5.6
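The one-line hit summaries above follow a regular layout: entry name, accession in parentheses, a possibly truncated description, the bit score, and the E value. The following is a minimal sketch of reading one such line, not part of BLAST itself; the function name `parse_hit` and the returned dictionary keys are illustrative assumptions. It also assumes that very small E values may be printed without a leading digit (for example `e-102`), which the sketch normalizes before conversion.

```python
import re

# Entry name, (accession), description, integer bit score, E value.
# \s+ is used so the pattern works whether columns are separated by
# one space or by padding.
HIT_LINE = re.compile(r"^(\S+)\s+\((\w+)\)\s+(.+?)\s+(\d+)\s+(\S+)$")


def parse_hit(line):
    """Parse one summary line; return None if the line is not a hit line."""
    m = HIT_LINE.match(line.strip())
    if m is None:
        return None
    entry, accession, description, bits, evalue = m.groups()
    # Assumption: an E value such as "e-102" lacks its leading "1".
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return {
        "entry": entry,
        "accession": accession,
        "description": description,
        "bits": int(bits),
        "evalue": float(evalue),
    }


if __name__ == "__main__":
    print(parse_hit("CA21_MOUSE (Q01149) Collagen alpha 2(I) chain precursor 33 0.66"))
```

Lines that do not match the layout, such as the `>`-prefixed alignment headers below, yield `None` and can be skipped.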
>CA21_MOUSE (Q01149) Collagen alpha 2(I) chain precursor
Length = 1372
Score = 33.5 bits (75), Expect = 0.66
Identities = 24/80 (30%), Positives = 30/80 (37%), Gaps = 5/80 (6%)
Query: 191 PIYGPQTPPLVAPNGDVGVDGMVINLATLLAGTVTNPFNNGYFQGPAAAPLEAVTACTGV 250
P+ P P G+VG+ G+ + G NP NG A L V G+
Sbjct: 275 PVGNPGPAGPAGPRGEVGLPGL-----SGPVGPPGNPGTNGLTGAKGATGLPGVAGAPGL 329
Query: 251 FGSGAYPGYAGRVLVDGASG 270
G PG AG GA G
Sbjct: 330 PGPRGIPGPAGAAGATGARG 349
>CA21_HUMAN (P08123) Collagen alpha 2(I) chain precursor
Length = 1366
Score = 32.3 bits (72), Expect = 1.5
Identities = 25/77 (32%), Positives = 30/77 (38%), Gaps = 7/77 (9%)
Query: 194 GPQTPPLVAPNGDVGVDGMVINLATLLAGTVTNPFNNGYFQGPAAAPLEAVTACTGVFGS 253
GP P P G+VG+ G+ + G NP NG AA L V G+ G
Sbjct: 274 GPTGP--AGPRGEVGLPGL-----SGPVGPPGNPGANGLTGAKGAAGLPGVAGAPGLPGP 326
Query: 254 GAYPGYAGRVLVDGASG 270
PG G GA G
Sbjct: 327 RGIPGPVGAAGATGARG 343
>CA21_CANFA (O46392) Collagen alpha 2(I) chain precursor
Length = 1366
Score = 32.3 bits (72), Expect = 1.5
Identities = 24/80 (30%), Positives = 30/80 (37%), Gaps = 5/80 (6%)
Query: 191 PIYGPQTPPLVAPNGDVGVDGMVINLATLLAGTVTNPFNNGYFQGPAAAPLEAVTACTGV 250
P+ P P G+VG+ G+ + G NP NG AA L V G+
Sbjct: 269 PVGNPGPAGPAGPRGEVGLPGV-----SGPVGPPGNPGANGLTGAKGAAGLPGVAGAPGL 323
Query: 251 FGSGAYPGYAGRVLVDGASG 270
G PG G GA G
Sbjct: 324 PGPRGIPGPVGAAGATGARG 343
>CA21_BOVIN (P02465) Collagen alpha 2(I) chain precursor
Length = 1364
Score = 32.0 bits (71), Expect = 1.9
Identities = 24/80 (30%), Positives = 30/80 (37%), Gaps = 5/80 (6%)
Query: 191 PIYGPQTPPLVAPNGDVGVDGMVINLATLLAGTVTNPFNNGYFQGPAAAPLEAVTACTGV 250
P+ P P G+VG+ G+ + G NP NG AA L V G+
Sbjct: 267 PVGNPGPAGPAGPRGEVGLPGL-----SGPVGPPGNPGANGLPGAKGAAGLPGVAGAPGL 321
Query: 251 FGSGAYPGYAGRVLVDGASG 270
G PG G GA G
Sbjct: 322 PGPRGIPGPVGAAGATGARG 341
>CLPP_PINCO (P36387) ATP-dependent Clp protease proteolytic subunit
(EC 3.4.21.92) (Endopeptidase Clp)
Length = 205
Score = 31.6 bits (70), Expect = 2.5
Identities = 24/74 (32%), Positives = 33/74 (44%), Gaps = 1/74 (1%)
Query: 129 NELSSITVVLTAKDVNVEGFCMSRCGTHGSVRRGSGGARTPYIWVGNAETLCPGQCAWPF 188
N+L + V L+A+D N E F C GSV G G R V + T+C G A
Sbjct: 45 NQLMGLMVYLSAEDANKEIFSFINC-PGGSVIPGVGLYRMMQAIVPDVNTICMGVAASMG 103
Query: 189 HQPIYGPQTPPLVA 202
+ G + P +A
Sbjct: 104 SFILIGGEMPKRIA 117
>Y085_CHLTR (O84087) Hypothetical protein CT085
Length = 579
Score = 31.2 bits (69), Expect = 3.3
Identities = 29/121 (23%), Positives = 50/121 (40%), Gaps = 17/121 (14%)
Query: 57 FTPIQRSIIVDFINSLSTTGAALPSASAWWKTTEKYKVGSSALTVGKQFLHPAYTLGKNL 116
F + + I+ LS+ PS S+ WK +K G SAL + K+ L P+ L ++
Sbjct: 72 FPDLSSDLFEQIIHLLSSP----PSFSSLWKHRSLFKRGISALGMRKRHLRPSPFLYQDA 127
Query: 117 KGKDLLALATKFNE----LSSITVVLTAKDVN---------VEGFCMSRCGTHGSVRRGS 163
L + T + E ++ +V T N ++ F G H +++G
Sbjct: 128 PNLSQLPMLTSWPEDGGPFLTLPLVYTQSPENGVPNLGMYRMQRFDKETLGLHFQIQKGG 187
Query: 164 G 164
G
Sbjct: 188 G 188
>CA21_ONCMY (O93484) Collagen alpha 2(I) chain precursor
Length = 1356
Score = 31.2 bits (69), Expect = 3.3
Identities = 29/80 (36%), Positives = 32/80 (39%), Gaps = 3/80 (3%)
Query: 203 PNGDVGVDGMVINLATLLAGTVTNPFNNGYFQGPAAAPLEAVTACTGVFGSGAYPGYAGR 262
P G G G IN A G V NP NNG AA L V G G PG G
Sbjct: 272 PQGGRGEPG--INGAVGPVGPVGNPGNNGINGAKGAAGLPGVAGAPGFPGPRGGPGPQGP 329
Query: 263 VLVDGASGSSYNAHGANGRK 282
GA G + G +G+K
Sbjct: 330 QGSTGARGLGGDP-GPSGQK 348
>HRBL_HUMAN (O95081) HIV-1 Rev binding protein-like protein (Rev/Rex
activation domain binding protein-related) (RAB-R)
Length = 481
Score = 30.8 bits (68), Expect = 4.3
Identities = 23/74 (31%), Positives = 33/74 (44%), Gaps = 8/74 (10%)
Query: 182 GQCAWPFHQPIYGPQTPPLVAPN----GDVGVDGMVINLATLLAGTVTNPFNNGYFQGPA 237
G A F P++ PQTP + N GD+G + + AG TNPF GP+
Sbjct: 411 GAFASSFPAPLFPPQTPLVQQQNGSSFGDLGSAKLGQRPLSQPAGISTNPF----MTGPS 466
Query: 238 AAPLEAVTACTGVF 251
++P + T F
Sbjct: 467 SSPFASKPPTTNPF 480
>GCSP_RHOFA (Q8G9M2) Glycine dehydrogenase [decarboxylating] (EC
1.4.4.2) (Glycine decarboxylase) (Glycine cleavage
system P-protein) (Fragment)
Length = 949
Score = 30.8 bits (68), Expect = 4.3
Identities = 24/71 (33%), Positives = 33/71 (45%), Gaps = 16/71 (22%)
Query: 203 PNGDVGVDGMVINLA----TLLAGTVTNPFNNGYFQGPAAAPLEAVTACTGVFGSGAYPG 258
PNGDV VD + +A TL A +T P +G ++ E C V +G
Sbjct: 620 PNGDVDVDDLRAKIAEHADTLAAIMITYPSTHGVYEH------EISDICAAVHDAG---- 669
Query: 259 YAGRVLVDGAS 269
G+V VDGA+
Sbjct: 670 --GQVYVDGAN 678
>CA14_MOUSE (P02463) Collagen alpha 1(IV) chain precursor
Length = 1669
Score = 30.8 bits (68), Expect = 4.3
Identities = 27/86 (31%), Positives = 32/86 (36%), Gaps = 14/86 (16%)
Query: 181 PGQCAWPFHQPIYGPQTPPLVAPNGDVGVDGMVINLATLLAGTVTNPFNNGYFQGP---- 236
PG +P + GP PP V P G VG G AG P + G GP
Sbjct: 588 PGGVGFPGSRGDIGPPGPPGVGPIGPVGEKGQ--------AGFPGGPGSPG-LPGPKGEA 638
Query: 237 -AAAPLEAVTACTGVFGSGAYPGYAG 261
PL G+ GS +PG G
Sbjct: 639 GKVVPLPGPPGAAGLPGSPGFPGPQG 664
>PRD1_HUMAN (O75626) PR-domain zinc finger protein 1
(Beta-interferon gene positive-regulatory domain I
binding factor) (BLIMP-1) (Positive regulatory domain
I-binding factor 1) (PRDI-binding factor-1) (PRDI-BF1)
Length = 789
Score = 30.4 bits (67), Expect = 5.6
Identities = 19/43 (44%), Positives = 23/43 (53%), Gaps = 3/43 (6%)
Query: 244 VTACTGVFGSGAYPGYAGRVLVDGASGSSYNAHGANGRKYLLP 286
+ A G G G+YPGYA + A SYNAH K+LLP
Sbjct: 341 LNASYGTEGLGSYPGYAPLPHLPPAFIPSYNAHYP---KFLLP 380
>NAPF_HAEIN (P44650) Ferredoxin-type protein napF homolog
Length = 176
Score = 30.4 bits (67), Expect = 5.6
Identities = 29/106 (27%), Positives = 42/106 (39%), Gaps = 6/106 (5%)
Query: 147 GFCMSRCGTHGSVRRGSGGARTPYIWVGNAETLCPGQCAWPFHQPIYGPQTPPLVAPNGD 206
G C+S C T+ V+ G A P + N E G+C QPI+ P+ + D
Sbjct: 52 GDCLSVCETNILVK---GDAGFPEVRFDNGECTFCGKCVDACKQPIFYPRDQLPWSHKID 108
Query: 207 VGVDGMVINLATLLAGTVTNPFNNGYFQ---GPAAAPLEAVTACTG 249
+ V + ++ P N F+ G A PL AC G
Sbjct: 109 ISVSCLTLHRIECRTCQDNCPANAIRFKLQMGGVAQPLVNFDACNG 154
>GAG_HV1Y2 (P35962) Gag polyprotein [Contains: Core protein p17
(Matrix protein); Core protein p24 (Core antigen); Core
protein p2; Core protein p7 (Nucleocapsid protein); Core
protein p1; Core protein p6]
Length = 499
Score = 30.4 bits (67), Expect = 5.6
Identities = 17/35 (48%), Positives = 20/35 (56%), Gaps = 4/35 (11%)
Query: 235 GPAAAPLEAVTACTGVFGSGAYPGYAGRVLVDGAS 269
GPAA E +TAC GV G PG+ RVL + S
Sbjct: 337 GPAATLEEMMTACQGVGG----PGHKARVLAEAMS 367
>GAG_HV1W2 (P05889) Gag polyprotein [Contains: Core protein p17
(Matrix protein); Core protein p24 (Core antigen); Core
protein p2; Core protein p7 (Nucleocapsid protein)]
(Fragment)
Length = 388
Score = 30.4 bits (67), Expect = 5.6
Identities = 17/35 (48%), Positives = 20/35 (56%), Gaps = 4/35 (11%)
Query: 235 GPAAAPLEAVTACTGVFGSGAYPGYAGRVLVDGAS 269
GPAA E +TAC GV G PG+ RVL + S
Sbjct: 337 GPAATLEEMMTACQGVGG----PGHKARVLAEAMS 367
>GAG_HV1PV (P03350) Gag polyprotein [Contains: Core protein p17
(Matrix protein); Core protein p24 (Core antigen); Core
protein p2; Core protein p7 (Nucleocapsid protein); Core
protein p1; Core protein p6]
Length = 511
Score = 30.4 bits (67), Expect = 5.6
Identities = 17/35 (48%), Positives = 20/35 (56%), Gaps = 4/35 (11%)
Query: 235 GPAAAPLEAVTACTGVFGSGAYPGYAGRVLVDGAS 269
GPAA E +TAC GV G PG+ RVL + S
Sbjct: 337 GPAATLEEMMTACQGVGG----PGHKARVLAEAMS 367
>GAG_HV1OY (P20889) Gag polyprotein [Contains: Core protein p17
(Matrix protein); Core protein p24 (Core antigen); Core
protein p2; Core protein p7 (Nucleocapsid protein); Core
protein p1; Core protein p6]
Length = 498
Score = 30.4 bits (67), Expect = 5.6
Identities = 17/35 (48%), Positives = 20/35 (56%), Gaps = 4/35 (11%)
Query: 235 GPAAAPLEAVTACTGVFGSGAYPGYAGRVLVDGAS 269
GPAA E +TAC GV G PG+ RVL + S
Sbjct: 337 GPAATLEEMMTACQGVGG----PGHKARVLAEAMS 367
>GAG_HV1MN (P05888) Gag polyprotein [Contains: Core protein p17
(Matrix protein); Core protein p24 (Core antigen); Core
protein p2; Core protein p7 (Nucleocapsid protein); Core
protein p1; Core protein p6]
Length = 506
Score = 30.4 bits (67), Expect = 5.6
Identities = 17/35 (48%), Positives = 20/35 (56%), Gaps = 4/35 (11%)
Query: 235 GPAAAPLEAVTACTGVFGSGAYPGYAGRVLVDGAS 269
GPAA E +TAC GV G PG+ RVL + S
Sbjct: 340 GPAATLEEMMTACQGVGG----PGHKARVLAEAMS 370
>GAG_HV1LW (Q70622) Gag polyprotein [Contains: Core protein p17
(Matrix protein); Core protein p24 (Core antigen); Core
protein p2; Core protein p7 (Nucleocapsid protein); Core
protein p1; Core protein p6]
Length = 499
Score = 30.4 bits (67), Expect = 5.6
Identities = 17/35 (48%), Positives = 20/35 (56%), Gaps = 4/35 (11%)
Query: 235 GPAAAPLEAVTACTGVFGSGAYPGYAGRVLVDGAS 269
GPAA E +TAC GV G PG+ RVL + S
Sbjct: 337 GPAATLEEMMTACQGVGG----PGHKARVLAEAMS 367
>GAG_HV1JR (P20873) Gag polyprotein [Contains: Core protein p17
(Matrix protein); Core protein p24 (Core antigen); Core
protein p2; Core protein p7 (Nucleocapsid protein); Core
protein p1; Core protein p6]
Length = 503
Score = 30.4 bits (67), Expect = 5.6
Identities = 17/35 (48%), Positives = 20/35 (56%), Gaps = 4/35 (11%)
Query: 235 GPAAAPLEAVTACTGVFGSGAYPGYAGRVLVDGAS 269
GPAA E +TAC GV G PG+ RVL + S
Sbjct: 337 GPAATLEEMMTACQGVGG----PGHKARVLAEAMS 367
>GAG_HV1J3 (P12494) Gag polyprotein [Contains: Core protein p17
(Matrix protein); Core protein p24 (Core antigen); Core
protein p2; Core protein p7 (Nucleocapsid protein); Core
protein p1; Core protein p6]
Length = 499
Score = 30.4 bits (67), Expect = 5.6
Identities = 17/35 (48%), Positives = 20/35 (56%), Gaps = 4/35 (11%)
Query: 235 GPAAAPLEAVTACTGVFGSGAYPGYAGRVLVDGAS 269
GPAA E +TAC GV G PG+ RVL + S
Sbjct: 337 GPAATLEEMMTACQGVGG----PGHKARVLAEAMS 367
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.319 0.136 0.422
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 37,521,198
Number of Sequences: 164201
Number of extensions: 1636623
Number of successful extensions: 3856
Number of sequences better than 10.0: 30
Number of HSP's better than 10.0 without gapping: 1
Number of HSP's successfully gapped in prelim test: 29
Number of HSP's that attempted gapping in prelim test: 3834
Number of HSP's gapped (non-prelim): 50
length of query: 300
length of database: 59,974,054
effective HSP length: 110
effective length of query: 190
effective length of database: 41,911,944
effective search space: 7963269360
effective search space used: 7963269360
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 65 (29.6 bits)
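The statistics above are internally consistent and can be re-derived from one another. The sketch below uses the standard Karlin-Altschul relations, bit score S' = (lambda * S - ln K) / ln 2 and E = (effective search space) * 2^(-S'), with the gapped lambda and K reported above. Variable and function names are illustrative, and small differences in the last digit are expected because the report prints rounded parameters.

```python
import math

# Values copied from the report above.
QUERY_LEN = 300
DB_LETTERS = 59_974_054
DB_SEQS = 164_201
HSP_LEN = 110            # effective HSP length
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410

# Effective lengths: subtract the effective HSP length from the query
# and from every database sequence.
eff_query = QUERY_LEN - HSP_LEN
eff_db = DB_LETTERS - DB_SEQS * HSP_LEN
search_space = eff_query * eff_db


def bit_score(raw_score):
    """Convert a raw alignment score to bits using the gapped parameters."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect(bits, space=search_space):
    """Expected number of chance hits with at least this bit score."""
    return space * 2.0 ** (-bits)


if __name__ == "__main__":
    print(eff_query, eff_db, search_space)
    print(round(bit_score(75), 1), round(expect(33.5), 2))
```

For the top hit (CA21_MOUSE, raw score 75) this reproduces the reported 33.5 bits and an Expect of about 0.66.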
Medicago: description of AC121245.10