[1]
>UniRef100_Q7XZ52 Putative elongation factor 3 (Fragment) n=1 Tax=Griffithsia
japonica RepID=Q7XZ52_GRIJA
Length = 194
Score = 63.9 bits (154), Expect = 5e-09
Identities = 46/131 (35%), Positives = 69/131 (52%), Gaps = 2/131 (1%)
Frame = +1
Query: 31 STDRAADAKA-VAEEATTVGVFAGDS-LVATLKATLADSAKSKGPAREATCLLVSALVAK 204
+TDR AK V + + V + S LV+ ++ L+ S K +REA L+V L+AK
Sbjct: 31 ATDRVDAAKTFVDSQCSHVSSLSPSSGLVSAVEKLLSSSDKKGAASREAALLVVCELLAK 90
Query: 205 LGAPSLPFLAGLVTDMIQLLADKGGKGVIAAATKACEDLTTPCSAQAKKMVILPQLVTAL 384
+ P+L+ L+ ++ L+ADK K V AA KA + AK+ V + +LV A+
Sbjct: 91 HQMAASPYLSSLLPAILTLMADKHSKHVQNAAVKAGTAIVDVLGPVAKRGVAVEKLVGAI 150
Query: 385 GQDMKWQTQAG 417
KWQTQ G
Sbjct: 151 DVSAKWQTQGG 161
[2]
>UniRef100_UPI000023EBCD hypothetical protein FG00434.1 n=1 Tax=Gibberella zeae PH-1
RepID=UPI000023EBCD
Length = 1272
Score = 60.8 bits (146), Expect = 4e-08
Identities = 55/174 (31%), Positives = 64/174 (36%), Gaps = 35/174 (20%)
Frame = -2
Query: 424 RGRQPASATSCPGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPC----QPAA 257
R R P P SP G P+S P G P P P Q RP HP +P
Sbjct: 629 RSRSPLGQPPLGPP--SPYGRG-PNSRPGTSVGRRSPMPPGMPPQQRPLHPQNGNRRPDM 685
Query: 256 G-------SCRSPGQPGRAATERPAWPRGPTQEGRLP-----RGQGPCSW---------- 143
G R PG GR RP+ PRGP +G P RG GP
Sbjct: 686 GPDGRRPSDTRRPGPDGR----RPSDPRGPGFDGHRPSEPRGRGNGPPPGHGHPPPGPYG 741
Query: 142 ---------PSPPGSPSGWRPASHRQRHPPSSPPPPPPSRRQRGRSTRRRQQAG 8
P PPG P + P R PP+ PP PP+ Q G+ R+ G
Sbjct: 742 NYRPGPGHPPGPPGPPGSYGPPGSRPNGPPNGPPNGPPNGAQ-GQMAHRKPVPG 794
[3]
>UniRef100_UPI0000250733 proline-rich proteoglycan 2 n=1 Tax=Rattus norvegicus
RepID=UPI0000250733
Length = 295
Score = 60.8 bits (146), Expect = 4e-08
Identities = 55/165 (33%), Positives = 64/165 (38%), Gaps = 25/165 (15%)
Frame = -2
Query: 424 RGRQPASATSCP---GPRRSPAAGG*PSSWPARCT---GSSGPRTPWWPRQ*RPCHPCQP 263
R QP S P GP++ P G P P R GP P P+Q P P P
Sbjct: 108 RPPQPGSPQGPPPPGGPQQRPPQGPPPQGGPQRPPQPGSPQGPPPPGGPQQRPPQGP--P 165
Query: 262 AAGSCRSPGQPGRAATERPAWPRGPTQEGRLPRG---QGPCSWPSPPGSPSGWRPASHRQ 92
G + P QPG + + P P GP Q R P+G QG P PGSP G P Q
Sbjct: 166 PQGGPQRPPQPG--SPQGPPPPGGPQQ--RAPQGPPPQGGPQRPPQPGSPQGPPPPGGPQ 221
Query: 91 RHPPSSPP----------------PPPPSRRQRGRSTRRRQQAGP 5
+ PP PP PPPP Q+ Q GP
Sbjct: 222 QRPPQGPPPQGGPQRPPQPGSPQGPPPPGGPQQRPPQGPPPQGGP 266
[4]
>UniRef100_P10165 Proline-rich proteoglycan 2 n=1 Tax=Rattus norvegicus
RepID=PRPG2_RAT
Length = 295
Score = 60.8 bits (146), Expect = 4e-08
Identities = 55/165 (33%), Positives = 64/165 (38%), Gaps = 25/165 (15%)
Frame = -2
Query: 424 RGRQPASATSCP---GPRRSPAAGG*PSSWPARCT---GSSGPRTPWWPRQ*RPCHPCQP 263
R QP S P GP++ P G P P R GP P P+Q P P P
Sbjct: 108 RPPQPGSPQGPPPPGGPQQRPPQGPPPQGGPQRPPQPGSPQGPPPPGGPQQRPPQGP--P 165
Query: 262 AAGSCRSPGQPGRAATERPAWPRGPTQEGRLPRG---QGPCSWPSPPGSPSGWRPASHRQ 92
G + P QPG + + P P GP Q R P+G QG P PGSP G P Q
Sbjct: 166 PQGGPQRPPQPG--SPQGPPPPGGPQQ--RAPQGPPPQGGPQRPPQPGSPQGPPPPGGPQ 221
Query: 91 RHPPSSPP----------------PPPPSRRQRGRSTRRRQQAGP 5
+ PP PP PPPP Q+ Q GP
Sbjct: 222 QRPPQGPPPQGGPQRPPQPGSPQGPPPPGGPQQRPPQGPPPQGGP 266
[5]
>UniRef100_Q0RFJ9 Putative uncharacterized protein n=1 Tax=Frankia alni ACN14a
RepID=Q0RFJ9_FRAAA
Length = 483
Score = 60.5 bits (145), Expect = 6e-08
Identities = 48/149 (32%), Positives = 61/149 (40%), Gaps = 22/149 (14%)
Frame = -2
Query: 427 ARGRQPASATSCPGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSC 248
A G +P S P P++ P AG + W G G P W + A
Sbjct: 130 AAGPRPGGEQSWPPPQQQPGAGS--TGWGQPGAGQPGAEQPGWGQA-----GGGQQASDQ 182
Query: 247 RSPGQPG--RAATERPAW--PRGPTQEGRLPRG----------QGPCSWPSPPGS----- 125
+S GQPG + TE+ W P G Q G P G QGP +P GS
Sbjct: 183 QSWGQPGGGQPGTEQQGWGQPSGWPQAGYPPGGTGAYQGGPAYQGPAGYPGAQGSYQQNP 242
Query: 124 PSGWRPASHRQRH---PPSSPPPPPPSRR 47
P GW+P + Q+ +PPPPPP RR
Sbjct: 243 PGGWQPGAAWQQGGGWQQGAPPPPPPRRR 271
[6]
>UniRef100_C1N2U9 Predicted protein n=1 Tax=Micromonas pusilla CCMP1545
RepID=C1N2U9_9CHLO
Length = 1032
Score = 59.7 bits (143), Expect = 1e-07
Identities = 38/127 (29%), Positives = 58/127 (45%)
Frame = +1
Query: 52 AKAVAEEATTVGVFAGDSLVATLKATLADSAKSKGPAREATCLLVSALVAKLGAPSLPFL 231
A +AE+ + + VA L L + +K ARE C+ + + + + L
Sbjct: 17 ASQIAEQVKSSPAGMNPADVAALSDALKEGSKGTAAAREGACIAIDTIASVAKTTAEHQL 76
Query: 232 AGLVTDMIQLLADKGGKGVIAAATKACEDLTTPCSAQAKKMVILPQLVTALGQDMKWQTQ 411
V D+++ ADK K V +AA A L SA +LP L+TA+ KWQT
Sbjct: 77 MPFVADLVRCCADKHSKEVQSAAAAATLTLAKTSSAYGLD-AVLPSLLTAMDPKEKWQTM 135
Query: 412 AGALELI 432
GAL ++
Sbjct: 136 VGALNMV 142
[7]
>UniRef100_UPI0001AE6A93 UPI0001AE6A93 related cluster n=1 Tax=Homo sapiens
RepID=UPI0001AE6A93
Length = 183
Score = 59.3 bits (142), Expect = 1e-07
Identities = 50/145 (34%), Positives = 59/145 (40%), Gaps = 6/145 (4%)
Frame = -2
Query: 421 GRQPASATSCPG-PRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*R--PCHPCQPAAGS 251
G QP PG P+ P GG S P G R P Q + P HP +P
Sbjct: 44 GNQPQRPPPPPGKPQGPPPQGGNQSQGPPPPPGKPEGRPPQGGNQSQGPPPHPGKPE--- 100
Query: 250 CRSPGQPGRAATERPAWPRGPTQEGRLPRGQGPCSWPSPPGSPSGWRPAS---HRQRHPP 80
R P Q G + +P P P QEG P+G P PPG P G PA + + PP
Sbjct: 101 -RPPPQGGNQSQGKPQGP--PQQEGNKPQG------PPPPGKPQGPPPAGGNPQQPQAPP 151
Query: 79 SSPPPPPPSRRQRGRSTRRRQQAGP 5
+ P PP Q GR R Q P
Sbjct: 152 AGKPQGPPPPPQGGRPPRPAQGQQP 176
[8]
>UniRef100_UPI000013DBDC proline-rich protein BstNI subfamily 4 precursor n=1 Tax=Homo
sapiens RepID=UPI000013DBDC
Length = 247
Score = 58.9 bits (141), Expect = 2e-07
Identities = 52/149 (34%), Positives = 59/149 (39%), Gaps = 10/149 (6%)
Frame = -2
Query: 421 GRQPASATSCPG-PRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*R--PCHPCQPAAGS 251
G Q PG P R P GG S P G P Q + P HP +P G
Sbjct: 107 GNQSQGTPPPPGKPERPPPQGGNQSHRPPPPPGKPERPPPQGGNQSQGPPPHPGKPE-GP 165
Query: 250 CRSPGQPGRAATERPAWPRGPTQ-EGRLPRGQGPCSWPSPPGSPSGWRPASHRQRHPPSS 74
G R+A P P+GP Q EG P+G P PPG P G PA + P +
Sbjct: 166 PPQEGNKSRSARSPPGKPQGPPQQEGNKPQG------PPPPGKPQGPPPAGGNPQQPQAP 219
Query: 73 P------PPPPPSRRQRGRSTRRRQQAGP 5
P PPPPP Q GR R Q P
Sbjct: 220 PAGKPQGPPPPP---QGGRPPRPAQGQQP 245
[9]
>UniRef100_C3YTC4 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae
RepID=C3YTC4_BRAFL
Length = 340
Score = 58.9 bits (141), Expect = 2e-07
Identities = 43/140 (30%), Positives = 54/140 (38%), Gaps = 9/140 (6%)
Frame = -2
Query: 412 PASATSCPGPRRSPAAGG*--PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCRSP 239
P + T GP P G P P G GP P PR G+ P
Sbjct: 195 PGNLTGVVGPPGLPGPPGPIGPPGLPGSAGGPPGPPGPIGPRG------VSGPKGNQGQP 248
Query: 238 GQPGRAATERPAWPRGPTQEGRLPRG-------QGPCSWPSPPGSPSGWRPASHRQRHPP 80
G G++ T+ P RGP + R PRG QGP WP PPG P G +
Sbjct: 249 GPEGQSGTQGPPGRRGPKGD-RGPRGPEGQSGLQGPPGWPGPPGGPPGPSGPKGEKGDKG 307
Query: 79 SSPPPPPPSRRQRGRSTRRR 20
PP PP ++ + + RR
Sbjct: 308 KKGPPGPPGKKGKSKREARR 327
[10]
>UniRef100_UPI0000EB29B4 UPI0000EB29B4 related cluster n=1 Tax=Canis lupus familiaris
RepID=UPI0000EB29B4
Length = 457
Score = 58.5 bits (140), Expect = 2e-07
Identities = 46/142 (32%), Positives = 55/142 (38%), Gaps = 12/142 (8%)
Frame = -2
Query: 424 RGRQPASATSCPGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPC----QPAA 257
RG PA P P +P P+ P G S P PR RP + P A
Sbjct: 167 RGPPPAGRQPFPCPSPAP-----PTPPPCHPVGGSVPAPGTAPRTRRPANSALRGSPPGA 221
Query: 256 GSCRSPGQPGRAATERPAWPRGPTQEGRLPRGQGPCSWPSPPGSPSGWRPASHRQR---- 89
P PGR P PRG GR PR P + P +PS P HR +
Sbjct: 222 RLPEPPPTPGRTPPRYPQGPRGAPAPGR-PRFTEPRAAPLGSQAPSVPEPGGHRPQPEGI 280
Query: 88 ----HPPSSPPPPPPSRRQRGR 35
PP +PP PPP R+ G+
Sbjct: 281 AAGSSPPPAPPTPPPRPREHGK 302
[11]
>UniRef100_C0P9U0 Putative uncharacterized protein n=1 Tax=Zea mays
RepID=C0P9U0_MAIZE
Length = 316
Score = 58.5 bits (140), Expect = 2e-07
Identities = 45/129 (34%), Positives = 56/129 (43%), Gaps = 2/129 (1%)
Frame = -2
Query: 391 PGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCRSPGQPGRAATE 212
P PRR P P WP C G+ G R+ P Q P P + SC + PG ++
Sbjct: 58 PPPRRPP-----PRLWPPSCRGARGTRSRRCPGQPPPPRPPR----SCAAARAPGPSSPR 108
Query: 211 RPAWPRGPTQEGRLPRGQGPCSW-PSPPGSPSGWRPASHRQRHP-PSSPPPPPPSRRQRG 38
R + GR R PC+ PSPP S RP R P P +R +R
Sbjct: 109 ASRTGRRRRRRGRPSRRGAPCACAPSPPSCTSPSRPGRSRTLRPRPRLHRTACRTRHRRR 168
Query: 37 RSTRRRQQA 11
R TRRRQ+A
Sbjct: 169 RRTRRRQRA 177
[12]
>UniRef100_Q5CVD5 Putative uncharacterized protein n=1 Tax=Cryptosporidium parvum
Iowa II RepID=Q5CVD5_CRYPV
Length = 546
Score = 58.2 bits (139), Expect = 3e-07
Identities = 43/136 (31%), Positives = 55/136 (40%)
Frame = -2
Query: 412 PASATSCPGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCRSPGQ 233
P+ A+ P P + P A P P+ GS + P P P P P S S G
Sbjct: 416 PSPASKGPPPPKGPPAPKGPPGPPSESEGSPASKGP--PPSKGPPAPKGPPGPSSESEGS 473
Query: 232 PGRAATERPAWPRGPTQEGRLPRGQGPCSWPSPPGSPSGWRPASHRQRHPPSSPPPPPPS 53
P P P GP + P +GP P+P G PS PAS + PP PPPP
Sbjct: 474 PATKGPPAPKGPPGPPESEGSPASKGP---PAPKGPPS---PAS-KGPPPPKGPPPPSSK 526
Query: 52 RRQRGRSTRRRQQAGP 5
G+ ++A P
Sbjct: 527 GPPTGKGPSLPKKAPP 542
Score = 53.9 bits (128), Expect = 5e-06
Identities = 47/142 (33%), Positives = 59/142 (41%), Gaps = 12/142 (8%)
Frame = -2
Query: 418 RQPASATSCPGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*R-PCHPCQPAAGSCRS 242
+ P ++ P P+ SPA G P+ S GP P P + P P + GS S
Sbjct: 389 KDPPASKGPPPPKGSPAPKGPPAPKGPPSPASKGPPPPKGPPAPKGPPGPPSESEGSPAS 448
Query: 241 PGQPGRAATERPAWPRGPTQEGR-LPRGQGPCSWPSPPGSP-SGWRPASH---RQRHPPS 77
G P P P GP+ E P +GP + PPG P S PAS + PPS
Sbjct: 449 KGPPPSKGPPAPKGPPGPSSESEGSPATKGPPAPKGPPGPPESEGSPASKGPPAPKGPPS 508
Query: 76 ----SPPPP--PPSRRQRGRST 29
PPPP PP +G T
Sbjct: 509 PASKGPPPPKGPPPPSSKGPPT 530
[13]
>UniRef100_B4L4Y6 GI21630 n=1 Tax=Drosophila mojavensis RepID=B4L4Y6_DROMO
Length = 537
Score = 57.0 bits (136), Expect = 6e-07
Identities = 41/137 (29%), Positives = 52/137 (37%), Gaps = 12/137 (8%)
Frame = -2
Query: 397 SCPGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCRSPGQPGRAA 218
S P P G P SWP GS WP RP HP +P P P R
Sbjct: 205 SVPSVPSVPVYPGRPGSWPGSWPGS-------WPSPWRPNHPIRPV-----HPRPPIRPV 252
Query: 217 TERPAWPRGPTQEGRLPRGQGPCSWPSPPGSP---------SGWRPASHRQRHP---PSS 74
+ P WP+ P+Q G G S +P G+P +GWRP P P+S
Sbjct: 253 PQHPFWPQRPSQPG---NSNGSNSGNTPSGNPFWPNWLDWVNGWRPTKKPTTAPTVAPTS 309
Query: 73 PPPPPPSRRQRGRSTRR 23
P P + + S +
Sbjct: 310 APTESPKKPETNESVEQ 326
[14]
>UniRef100_C4E3E6 RNA polymerase sigma factor, sigma-70 family n=1
Tax=Streptosporangium roseum DSM 43021
RepID=C4E3E6_STRRS
Length = 628
Score = 56.6 bits (135), Expect = 8e-07
Identities = 44/131 (33%), Positives = 54/131 (41%), Gaps = 10/131 (7%)
Frame = -2
Query: 415 QPASATSCPGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCRSPG 236
+P PGP +P + G P P T +GP P WP++ P P G+ P
Sbjct: 375 EPMPDRRVPGPVPAPTSTGGPPDRPGGPT--AGPAGPSWPQEPAPVLSGPPRPGAWERPA 432
Query: 235 QPGRAATERPAWPRGPTQEGRLPRGQGPCSWPSP--------PGSPSGWRPA--SHRQRH 86
P ERP PRG Q R PR P P P P PS RP S R
Sbjct: 433 APRLGTWERPGPPRG-HQGIRPPRRCRPTPGPPPAAPRPVPTPAVPSPARPTPPSTTARP 491
Query: 85 PPSSPPPPPPS 53
P++P PP P+
Sbjct: 492 APTAPKPPRPA 502
[15]
>UniRef100_P05142 Proline-rich protein HaeIII subfamily 1 n=1 Tax=Mus musculus
RepID=PRH1_MOUSE
Length = 261
Score = 56.2 bits (134), Expect = 1e-06
Identities = 48/146 (32%), Positives = 54/146 (36%), Gaps = 8/146 (5%)
Frame = -2
Query: 418 RQPASATSCPGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCRSP 239
R P GP++ P G P P + GP P P+ P P PA R P
Sbjct: 103 RPPQGPPPPGGPQQRPPQGPPPPGGP-QPRPPQGPPPPGGPQLRPPQGPPPPAGPQPRPP 161
Query: 238 GQPGRAATERPAWPRGPTQEGRLPRGQGPCSWPSPPGSPS-----GWRPASHRQRHPPSS 74
P A +P P+GP G PR P P P G P G P Q PP
Sbjct: 162 QGPPPPAGPQPRPPQGPPPTGPQPR---PTQGPPPTGGPQQRPPQGPPPPGGPQPRPPQG 218
Query: 73 PPP---PPPSRRQRGRSTRRRQQAGP 5
PPP P PS Q T QQ P
Sbjct: 219 PPPPGGPQPSPTQGPPPTGGPQQTPP 244
[16]
>UniRef100_UPI0000E23028 PREDICTED: hypothetical protein n=1 Tax=Pan troglodytes
RepID=UPI0000E23028
Length = 205
Score = 55.8 bits (133), Expect = 1e-06
Identities = 53/152 (34%), Positives = 60/152 (39%), Gaps = 13/152 (8%)
Frame = -2
Query: 421 GRQPASATSCPG-PRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*R--PCHPCQPAAGS 251
G Q PG P P GG S P G P Q + P HP +P
Sbjct: 65 GNQSQGPPPHPGKPEGPPPQGGNQSQGPPPHPGKPERPPPQGGNQSQGPPPHPGKPE--- 121
Query: 250 CRSPGQPG---RAATERPAWPRGPTQ-EGRLPRGQGPCSWPSPPGSPSGWRPASHRQRHP 83
R P Q G R+A P P+GP Q EG P+G P PPG P G PA + P
Sbjct: 122 -RPPPQEGNKSRSARSPPGKPQGPPQQEGNKPQG------PPPPGKPQGPPPAGGNPQQP 174
Query: 82 PSSP------PPPPPSRRQRGRSTRRRQQAGP 5
+ P PPPPP Q GR R Q P
Sbjct: 175 QAPPAGKPQGPPPPP---QGGRPPRPAQGQQP 203
[17]
>UniRef100_UPI0000E21343 PREDICTED: collagen, type XXVIII n=1 Tax=Pan troglodytes
RepID=UPI0000E21343
Length = 1125
Score = 55.8 bits (133), Expect = 1e-06
Identities = 38/95 (40%), Positives = 46/95 (48%), Gaps = 4/95 (4%)
Frame = +3
Query: 120 EGDPGGLGQEQGPCPRG--NLPSCVGPRGQAGRSVAALPGWPGDRHDPA-AG*QGWQGRH 290
+G G GQ P P G P +GP G G S+ PG GDR P G +G G
Sbjct: 556 KGSKGNQGQRGLPGPEGPKGEPGIMGPFGMPGTSIPGPPGPKGDRGGPGIPGFKGEPGLS 615
Query: 291 CRGHQGVRGPDDPVQRAG-QEDGHPPAAGDRLGPG 392
RG +GV+GP PV G + DG+P G R PG
Sbjct: 616 IRGPKGVQGPRGPVGAPGLKGDGYPGVPGPRGLPG 650
[18]
>UniRef100_UPI0000D9A853 PREDICTED: similar to procollagen, type VI, alpha 2 n=1 Tax=Macaca
mulatta RepID=UPI0000D9A853
Length = 1123
Score = 55.8 bits (133), Expect = 1e-06
Identities = 38/95 (40%), Positives = 46/95 (48%), Gaps = 4/95 (4%)
Frame = +3
Query: 120 EGDPGGLGQEQGPCPRG--NLPSCVGPRGQAGRSVAALPGWPGDRHDPA-AG*QGWQGRH 290
+G G GQ P P G P +GP G G S+ PG GDR P G +G G
Sbjct: 554 KGSKGNQGQRGLPGPEGPKGEPGIMGPFGMPGTSIPGPPGPKGDRGGPGIPGFKGEPGLS 613
Query: 291 CRGHQGVRGPDDPVQRAG-QEDGHPPAAGDRLGPG 392
RG +GV+GP PV G + DG+P G R PG
Sbjct: 614 IRGPKGVQGPRGPVGAPGLKGDGYPGVPGPRGLPG 648
[19]
>UniRef100_UPI00015DEFD0 proline rich protein HaeIII subfamily 1 n=1 Tax=Mus musculus
RepID=UPI00015DEFD0
Length = 261
Score = 55.8 bits (133), Expect = 1e-06
Identities = 48/146 (32%), Positives = 53/146 (36%), Gaps = 8/146 (5%)
Frame = -2
Query: 418 RQPASATSCPGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCRSP 239
R P GP+ P G P P + GP P P+ P P PA R P
Sbjct: 103 RPPQGPPPPGGPQHRPPQGPPPPGGP-QPRPPQGPPPPGGPQLRPPQGPPPPAGPQPRPP 161
Query: 238 GQPGRAATERPAWPRGPTQEGRLPRGQGPCSWPSPPGSPS-----GWRPASHRQRHPPSS 74
P A +P P+GP G PR P P P G P G P Q PP
Sbjct: 162 QGPPPPAGPQPRPPQGPPTTGPQPR---PTQGPPPTGGPQQRPPQGPPPPGGPQPRPPQG 218
Query: 73 PPP---PPPSRRQRGRSTRRRQQAGP 5
PPP P PS Q T QQ P
Sbjct: 219 PPPPGGPQPSPTQGPPPTGGPQQTPP 244
[20]
>UniRef100_UPI00015E0452 collagen, type XXVIII precursor n=1 Tax=Homo sapiens
RepID=UPI00015E0452
Length = 1125
Score = 55.8 bits (133), Expect = 1e-06
Identities = 38/95 (40%), Positives = 46/95 (48%), Gaps = 4/95 (4%)
Frame = +3
Query: 120 EGDPGGLGQEQGPCPRG--NLPSCVGPRGQAGRSVAALPGWPGDRHDPA-AG*QGWQGRH 290
+G G GQ P P G P +GP G G S+ PG GDR P G +G G
Sbjct: 556 KGSKGNQGQRGLPGPEGPKGEPGIMGPFGMPGTSIPGPPGPKGDRGGPGIPGFKGEPGLS 615
Query: 291 CRGHQGVRGPDDPVQRAG-QEDGHPPAAGDRLGPG 392
RG +GV+GP PV G + DG+P G R PG
Sbjct: 616 IRGPKGVQGPRGPVGAPGLKGDGYPGVPGPRGLPG 650
[21]
>UniRef100_C3YWB8 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae
RepID=C3YWB8_BRAFL
Length = 488
Score = 55.8 bits (133), Expect = 1e-06
Identities = 41/113 (36%), Positives = 46/113 (40%)
Frame = -2
Query: 391 PGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCRSPGQPGRAATE 212
PGP P G P+ P G GP P P P PC P+ +PG PG
Sbjct: 295 PGPPGPPGPPGPPTGPPGPPPGPPGPPGPPGP----PGPPCGPSGPPPGAPGPPGPPPGP 350
Query: 211 RPAWPRGPTQEGRLPRGQGPCSWPSPPGSPSGWRPASHRQRHPPSSPPPPPPS 53
PA P GP G P GP PPG P+G P P PPP PP+
Sbjct: 351 -PAGP-GPPPPGPAPGPPGP-----PPGPPAGPGPPPPGPAPGPPGPPPGPPA 396
Score = 54.7 bits (130), Expect = 3e-06
Identities = 43/135 (31%), Positives = 47/135 (34%), Gaps = 12/135 (8%)
Frame = -2
Query: 424 RGRQPASATSCPGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCR 245
+G+ S S PGP P G P P P P P P P G+
Sbjct: 231 KGKAKTSGPSSPGPDAPPPPGAPPPPGPGAPPPPGAPPPPGPGAPPPPGAPPPPGPGAPP 290
Query: 244 SPGQPGRAA--------TERPAWPRGPTQEGRLPRGQGPCSWPSPPGSPSGWRPASHRQR 89
PG PG T P P GP P GP P PP PSG P +
Sbjct: 291 PPGPPGPPGPPGPPGPPTGPPGPPPGP------PGPPGPPGPPGPPCGPSGPPPGAPGPP 344
Query: 88 HPPSSPP----PPPP 56
PP PP PPPP
Sbjct: 345 GPPPGPPAGPGPPPP 359
[22]
>UniRef100_C3XQ94 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae
RepID=C3XQ94_BRAFL
Length = 513
Score = 55.8 bits (133), Expect = 1e-06
Identities = 41/113 (36%), Positives = 46/113 (40%)
Frame = -2
Query: 391 PGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCRSPGQPGRAATE 212
PGP P G P+ P G GP P P P PC P+ +PG PG
Sbjct: 320 PGPPGPPGPPGPPTGPPGPPPGPPGPPGPPGP----PGPPCGPSGPPPGAPGPPGPPPGP 375
Query: 211 RPAWPRGPTQEGRLPRGQGPCSWPSPPGSPSGWRPASHRQRHPPSSPPPPPPS 53
PA P GP G P GP PPG P+G P P PPP PP+
Sbjct: 376 -PAGP-GPPPPGPAPGPPGP-----PPGPPAGPGPPPPGPAPGPPGPPPGPPA 421
[23]
>UniRef100_B5MDS6 Putative uncharacterized protein COL28A1 n=1 Tax=Homo sapiens
RepID=B5MDS6_HUMAN
Length = 713
Score = 55.8 bits (133), Expect = 1e-06
Identities = 38/95 (40%), Positives = 46/95 (48%), Gaps = 4/95 (4%)
Frame = +3
Query: 120 EGDPGGLGQEQGPCPRG--NLPSCVGPRGQAGRSVAALPGWPGDRHDPA-AG*QGWQGRH 290
+G G GQ P P G P +GP G G S+ PG GDR P G +G G
Sbjct: 556 KGSKGNQGQRGLPGPEGPKGEPGIMGPFGMPGTSIPGPPGPKGDRGGPGIPGFKGEPGLS 615
Query: 291 CRGHQGVRGPDDPVQRAG-QEDGHPPAAGDRLGPG 392
RG +GV+GP PV G + DG+P G R PG
Sbjct: 616 IRGPKGVQGPRGPVGAPGLKGDGYPGVPGPRGLPG 650
[24]
>UniRef100_Q2UY09-2 Isoform 2 of Collagen alpha-1(XXVIII) chain n=1 Tax=Homo sapiens
RepID=Q2UY09-2
Length = 713
Score = 55.8 bits (133), Expect = 1e-06
Identities = 38/95 (40%), Positives = 46/95 (48%), Gaps = 4/95 (4%)
Frame = +3
Query: 120 EGDPGGLGQEQGPCPRG--NLPSCVGPRGQAGRSVAALPGWPGDRHDPA-AG*QGWQGRH 290
+G G GQ P P G P +GP G G S+ PG GDR P G +G G
Sbjct: 556 KGSKGNQGQRGLPGPEGPKGEPGIMGPFGMPGTSIPGPPGPKGDRGGPGIPGFKGEPGLS 615
Query: 291 CRGHQGVRGPDDPVQRAG-QEDGHPPAAGDRLGPG 392
RG +GV+GP PV G + DG+P G R PG
Sbjct: 616 IRGPKGVQGPRGPVGAPGLKGDGYPGVPGPRGLPG 650
[25]
>UniRef100_Q2UY09 Collagen alpha-1(XXVIII) chain n=1 Tax=Homo sapiens
RepID=COSA1_HUMAN
Length = 1125
Score = 55.8 bits (133), Expect = 1e-06
Identities = 38/95 (40%), Positives = 46/95 (48%), Gaps = 4/95 (4%)
Frame = +3
Query: 120 EGDPGGLGQEQGPCPRG--NLPSCVGPRGQAGRSVAALPGWPGDRHDPA-AG*QGWQGRH 290
+G G GQ P P G P +GP G G S+ PG GDR P G +G G
Sbjct: 556 KGSKGNQGQRGLPGPEGPKGEPGIMGPFGMPGTSIPGPPGPKGDRGGPGIPGFKGEPGLS 615
Query: 291 CRGHQGVRGPDDPVQRAG-QEDGHPPAAGDRLGPG 392
RG +GV+GP PV G + DG+P G R PG
Sbjct: 616 IRGPKGVQGPRGPVGAPGLKGDGYPGVPGPRGLPG 650
[26]
>UniRef100_C0PKV4 Putative uncharacterized protein n=1 Tax=Zea mays
RepID=C0PKV4_MAIZE
Length = 246
Score = 55.5 bits (132), Expect = 2e-06
Identities = 51/144 (35%), Positives = 58/144 (40%), Gaps = 9/144 (6%)
Frame = -2
Query: 427 ARGRQPASATSCPGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSC 248
AR R P S + G RRSP S P R G G R WPR P P+
Sbjct: 36 ARPRSPPSWSGRRGRRRSPRP-----SLPRR--GRRGARRGPWPRTPWPPAAGPPSPPRR 88
Query: 247 RSPGQPGRAATERPAWPRGPTQEGRLPRGQGP-CSWPS--------PPGSPSGWRPASHR 95
PG P R T R + P P P G GP C P P G+P+ RP S R
Sbjct: 89 WRPGAPARRRTPRRSTPPAPRTA---PSGAGPACRRPPATRARGTCPSGAPAAARPGSTR 145
Query: 94 QRHPPSSPPPPPPSRRQRGRSTRR 23
++ PPPP GR TRR
Sbjct: 146 PTCTSAAARPPPP-----GRGTRR 164
[27]
>UniRef100_A4R5L4 Putative uncharacterized protein n=1 Tax=Magnaporthe grisea
RepID=A4R5L4_MAGGR
Length = 737
Score = 55.5 bits (132), Expect = 2e-06
Identities = 39/119 (32%), Positives = 51/119 (42%), Gaps = 5/119 (4%)
Frame = -2
Query: 391 PGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCRSPGQPGRAATE 212
P P R P G P P++ G P P P RP P +P A + PG+P E
Sbjct: 5 PPPNRPPPPGKPP---PSKLEGFGKPPAPASPPPNRPPPPVRPPADNPPPPGKPPPNKLE 61
Query: 211 ---RPAWPRGPTQEGRLPRGQGPCSWPSPPGSPSGWRPASHRQRHPPSSPPP--PPPSR 50
+P P P LP + P P PPG P + + P+SPPP PPP++
Sbjct: 62 GFGKPPAPDSPPPNRPLPPVRPPADNPPPPGKPPPNKLEGFGKPPAPASPPPGKPPPNK 120
[28]
>UniRef100_UPI0001B55E13 putative chaplin n=1 Tax=Streptomyces sp. SPB78 RepID=UPI0001B55E13
Length = 293
Score = 55.1 bits (131), Expect = 2e-06
Identities = 45/135 (33%), Positives = 56/135 (41%), Gaps = 14/135 (10%)
Frame = -2
Query: 412 PASATSCPGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCRSPGQ 233
PA ++ P PR +P PA T + PR+P P PA G+ RSP +
Sbjct: 168 PALRSAAPAPRPAP---------PAADTPARAPRSP---------RPRAPAPGTPRSPPR 209
Query: 232 P------GRAATERPAWPRGPTQEGRLPRGQG--PCSWPSPPGSPSGWRPASHRQRHPPS 77
P RAA P+ P P PR P + P P + P +HR R PP
Sbjct: 210 PPGPRPPDRAARAPPSPPASPPPAPAPPRPARARPRAAPRAPADSATPPPRAHRPRAPPR 269
Query: 76 SPP------PPPPSR 50
PP PPPPSR
Sbjct: 270 RPPVRARTEPPPPSR 284
[29]
>UniRef100_UPI00015DEFD1 Proline-rich protein 2 precursor (Proline-rich protein MP-3). n=1
Tax=Mus musculus RepID=UPI00015DEFD1
Length = 227
Score = 54.7 bits (130), Expect = 3e-06
Identities = 41/124 (33%), Positives = 46/124 (37%), Gaps = 5/124 (4%)
Frame = -2
Query: 418 RQPASATSCPGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCRSP 239
R P GP+ P G P P + GP P P+Q P P P R P
Sbjct: 103 RPPQGPPPPGGPQPRPPQGPPPPGGPQQ-RPPQGPPPPGGPQQRPPQGPPPPGGPQPRPP 161
Query: 238 GQPGRAATERPAWPRGPTQEGRLPRGQGPCSWPSPPGS-----PSGWRPASHRQRHPPSS 74
P A +P P+GP G PR P P P G P G P Q PP
Sbjct: 162 QGPPPPAGPQPRPPQGPPPPGPHPR---PTQGPPPTGGPQQRPPQGPPPPGGPQPRPPQG 218
Query: 73 PPPP 62
PPPP
Sbjct: 219 PPPP 222
[30]
>UniRef100_C1MY88 Predicted protein n=1 Tax=Micromonas pusilla CCMP1545
RepID=C1MY88_9CHLO
Length = 1591
Score = 54.7 bits (130), Expect = 3e-06
Identities = 43/117 (36%), Positives = 47/117 (40%), Gaps = 5/117 (4%)
Frame = -2
Query: 391 PGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPA-----AGSCRSPGQPG 227
PG +PA G P S S PR P P + P P +P A S PGQP
Sbjct: 1145 PGAPAAPATPGTPPSPVVVEEKSPPPRAPSEPERSPPRAPSEPGRPPPTAPSPPPPGQPP 1204
Query: 226 RAATERPAWPRGPTQEGRLPRGQGPCSWPSPPGSPSGWRPASHRQRHPPSSPPPPPP 56
R A P+ P P P P S P PP SP PP SPPPPPP
Sbjct: 1205 RPAPPPPSPPPPPPPPPPPPPLPPPPSPPPPPPSP------------PPPSPPPPPP 1249
[31]
>UniRef100_C3YL84 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae
RepID=C3YL84_BRAFL
Length = 414
Score = 54.7 bits (130), Expect = 3e-06
Identities = 36/133 (27%), Positives = 52/133 (39%)
Frame = -2
Query: 412 PASATSCPGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCRSPGQ 233
PA P + P P+ PA+ + P+ P Q P P QP A + P Q
Sbjct: 140 PAQPPKPPAQPQQP-----PAQPPAKPQPPAQPQQPPAQPQQPPAQPQQPPAQPQQPPAQ 194
Query: 232 PGRAATERPAWPRGPTQEGRLPRGQGPCSWPSPPGSPSGWRPASHRQRHPPSSPPPPPPS 53
P A + PA P+ P Q + P P + P PP P +Q P SPP P
Sbjct: 195 PPAQAQQPPAKPQPPAQPPKPPAQSQPPAKPQPPAEPQQPAEQPQKQIEQPPSPPQAPKE 254
Query: 52 RRQRGRSTRRRQQ 14
+ ++ ++
Sbjct: 255 EVKEPEEEKKEEE 267
[32]
>UniRef100_UPI0000EBEFAE PREDICTED: hypothetical protein, partial n=1 Tax=Bos taurus
RepID=UPI0000EBEFAE
Length = 343
Score = 54.3 bits (129), Expect = 4e-06
Identities = 56/160 (35%), Positives = 66/160 (41%), Gaps = 22/160 (13%)
Frame = -2
Query: 421 GRQPASATSCPG-PRRSP----AAGG*PSSWPARCTGSSGPRT-----PWWPRQ*RPCHP 272
GR SA S PG PR +P A G + P G + PR P W R+ R C
Sbjct: 71 GRTLVSALSSPGLPRGTPPVTKATGELLTLDPGAPPGPARPRPVAFSRPTWRRRTRKC-- 128
Query: 271 C-QPAAGSCRSPGQ-PGRAATERPAWPRGPTQEGRLPRGQGPCSWPSPPGSPS-GWRPAS 101
C +P R PG+ RA + P P P Q G P G P PP P+ G RP
Sbjct: 129 CRRPLGARSRRPGEVEPRARSPPPRGPLAPRQPG--PPSPGLTPLPPPPPHPAPGDRP-- 184
Query: 100 HRQRHPPSSPPPPPPSRRQRG---------RSTRRRQQAG 8
PPP PP RR RG R RRR++ G
Sbjct: 185 ---------PPPRPPERRSRGAGEEEGEGEREARRRREGG 215
[33]
>UniRef100_UPI0000E23040 PREDICTED: hypothetical protein isoform 1 n=1 Tax=Pan troglodytes
RepID=UPI0000E23040
Length = 582
Score = 54.3 bits (129), Expect = 4e-06
Identities = 47/147 (31%), Positives = 61/147 (41%), Gaps = 14/147 (9%)
Frame = -2
Query: 424 RGRQPASATSCPG-PRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSC 248
+G + SA S PG P+ P GG P P + G + P+ P P + +P P +
Sbjct: 258 QGDKSRSARSPPGKPQGPPPQGGKPQGPPPQ--GGNQPQGPPPPPE-KPQGPAPQGGSNS 314
Query: 247 RS----PGQPGRAATERPAWPRGPTQ-----EGRLPRGQGPCSWPSPPGSPSGWRPASHR 95
RS PG+P + P+GP +G P+G S SPPG P G P
Sbjct: 315 RSARSPPGKPQGPPPQGGNQPQGPPPPPEKPQGPPPQGDKSRSARSPPGKPQGPPPQGGN 374
Query: 94 QRH----PPSSPPPPPPSRRQRGRSTR 26
Q PP P PPP RS R
Sbjct: 375 QPQGPPPPPGKPQGPPPQGGSNSRSAR 401
[34]
>UniRef100_UPI0001B7BA1F proline-rich protein 15 n=1 Tax=Rattus norvegicus
RepID=UPI0001B7BA1F
Length = 204
Score = 54.3 bits (129), Expect = 4e-06
Identities = 40/121 (33%), Positives = 50/121 (41%), Gaps = 4/121 (3%)
Frame = -2
Query: 391 PGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCRSPGQPGRAATE 212
PG + P G P P + GP P P+Q +P P +G + P PG +
Sbjct: 63 PGKPQGPPPPGGPQQKPPQPGNQQGPPPPGGPQQ-KP-----PQSGKPQGPPPPG-GPQQ 115
Query: 211 RPAWPRGPTQEGRLPRGQGPCSWPSPPGSPSGWRPASHRQRHPPS----SPPPPPPSRRQ 44
RP P +G P G GP P PG P G P Q+ PP PPPP +Q
Sbjct: 116 RPPQPGNQKPQGPPPPG-GPQKKPPQPGKPQGPPPPGGPQQKPPQPGKPQGPPPPGGPQQ 174
Query: 43 R 41
R
Sbjct: 175 R 175
[35]
>UniRef100_B4V2Z1 Serine/threonine protein kinase n=1 Tax=Streptomyces sp. Mg1
RepID=B4V2Z1_9ACTO
Length = 586
Score = 54.3 bits (129), Expect = 4e-06
Identities = 47/144 (32%), Positives = 52/144 (36%), Gaps = 34/144 (23%)
Frame = -2
Query: 349 SWPARCTGSSGPRTPWWPRQ*R------PCHPCQPAAGSCRSPGQPGRAATERPAW---- 200
+WP T GP P PR R P P PA S RS PGRA + PAW
Sbjct: 440 TWP---TAPPGPPPPPPPRPRRAAPGREPRPPRAPARTSRRSSPAPGRAPSPPPAWASRP 496
Query: 199 ----PRGPTQEGR------------LPRGQGPCSWPSPPGSPSGWR--------PASHRQ 92
P GP R P G SWP PP +P W PAS R
Sbjct: 497 SSRSPSGPAGSARSWAATSPSSTSSAPTAAGTGSWPPPPTAPWSWTPPVTPTRLPASVRA 556
Query: 91 RHPPSSPPPPPPSRRQRGRSTRRR 20
P S+ P R+T RR
Sbjct: 557 ARPTSASPSTRTGHCTTSRATARR 580
[36]
>UniRef100_C7BGM8 Formin 2A n=1 Tax=Physcomitrella patens RepID=C7BGM8_PHYPA
Length = 1238
Score = 54.3 bits (129), Expect = 4e-06
Identities = 45/133 (33%), Positives = 50/133 (37%), Gaps = 8/133 (6%)
Frame = -2
Query: 409 ASATSCPGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCRS--PG 236
++A P P +P G P+ P P P R P P P G R P
Sbjct: 668 SNAPPPPPPLPAPPGGARPAGPPP-----PPPPPPGGARPAGPPPPPSPPGGRGRGGPPP 722
Query: 235 QPGRAATERPAWPRGPTQEGRLPRGQGPCSWPSPPGSPSGWRPASHRQRHP------PSS 74
P RPA P P G G+GP P PP P G RPA P P
Sbjct: 723 PPPPPGGARPAVPPPPPPPG----GRGPGGPPPPPPPPGGARPAGAPPPPPPPGGKGPGG 778
Query: 73 PPPPPPSRRQRGR 35
PPPPPP RGR
Sbjct: 779 PPPPPPPGAGRGR 791
[37]
>UniRef100_C5DNX4 ZYRO0A12386p n=1 Tax=Zygosaccharomyces rouxii CBS 732
RepID=C5DNX4_ZYGRC
Length = 743
Score = 54.3 bits (129), Expect = 4e-06
Identities = 40/136 (29%), Positives = 54/136 (39%)
Frame = -2
Query: 412 PASATSCPGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCRSPGQ 233
P SA+S P P +PA + P T S+ P P P +P P P S R P
Sbjct: 512 PRSASSAPAPAPAPAPPSPAAPAPPLPTASAPPVPPATPS--KPSKP--PKNVSSRIPST 567
Query: 232 PGRAATERPAWPRGPTQEGRLPRGQGPCSWPSPPGSPSGWRPASHRQRHPPSSPPPPPPS 53
P +A P+ P P+ P P P+PP + + P + R P + PPPP +
Sbjct: 568 PSSSAPPVPSAPSAPSPPSAPPAPPAP---PTPPSTSAPPLPGTSAPRKPTAPPPPPIST 624
Query: 52 RRQRGRSTRRRQQAGP 5
RR P
Sbjct: 625 SSSYSEEASRRAPPPP 640
[38]
>UniRef100_UPI00015BB2CD proline rich protein HaeIII subfamily 1 precursor n=1 Tax=Mus
musculus RepID=UPI00015BB2CD
Length = 261
Score = 53.9 bits (128), Expect = 5e-06
Identities = 43/131 (32%), Positives = 51/131 (38%), Gaps = 4/131 (3%)
Frame = -2
Query: 418 RQPASATSCPGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCRSP 239
R P GP++ P G P P GP P P+Q P P P R P
Sbjct: 89 RPPQGPPPPGGPQQRPPQGPPPPGGPQH-RPPQGPPPPGGPQQRPPQGPPPPGGPQLRPP 147
Query: 238 GQPGRAATERPAWPRG-PTQEGRLPR-GQGPCSWPSPPGSPSGWRPASHRQRHPPSSPPP 65
P A +P P+G P G PR QGP + P G P Q+ PP PPP
Sbjct: 148 QGPPPPAGPQPRPPQGPPPPAGPQPRPPQGPPTTGPQPRPTQGPPPTGGPQQRPPQGPPP 207
Query: 64 P--PPSRRQRG 38
P P R +G
Sbjct: 208 PGGPQPRPPQG 218
Score = 53.9 bits (128), Expect = 5e-06
Identities = 48/152 (31%), Positives = 53/152 (34%), Gaps = 14/152 (9%)
Frame = -2
Query: 418 RQPASATSCPGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCRSP 239
R P GP+ P G P P + GP P P+ P P PA R P
Sbjct: 103 RPPQGPPPPGGPQHRPPQGPPPPGGPQQ-RPPQGPPPPGGPQLRPPQGPPPPAGPQPRPP 161
Query: 238 GQPGRAATERPAWPRGPTQEGRLPRG-----------QGPCSWPSPPGSPSGWRPASHRQ 92
P A +P P+GP G PR Q P P PPG P Q
Sbjct: 162 QGPPPPAGPQPRPPQGPPTTGPQPRPTQGPPPTGGPQQRPPQGPPPPGGP---------Q 212
Query: 91 RHPPSSPPP---PPPSRRQRGRSTRRRQQAGP 5
PP PPP P PS Q T QQ P
Sbjct: 213 PRPPQGPPPPGGPQPSPTQGPPPTGGPQQTPP 244
[39]
>UniRef100_Q8GFF2 Putative uncharacterized protein n=1 Tax=Streptomyces aureofaciens
RepID=Q8GFF2_STRAU
Length = 579
Score = 53.9 bits (128), Expect = 5e-06
Identities = 47/137 (34%), Positives = 57/137 (41%), Gaps = 5/137 (3%)
Frame = -2
Query: 418 RQPASATSCPGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCRSP 239
R PA A PGPR +P +S PAR TGS P P P P A + +P
Sbjct: 404 RPPARAP--PGPRPAPTRA---ASTPAR-TGSRPASPPTRPTAPSPAPAAPPRAAAAPTP 457
Query: 238 GQ--PGRAATERPAWPRGPTQ-EGRLPRGQGPCSWPSPPGSPSGWRPASHRQRHPPSSPP 68
+ P P RGP GR P + + P SP G P R+R PP P
Sbjct: 458 ARRPPPPPTPPVPRARRGPAAGNGRPPSTRDRTAGTRAPASPPGAPPPVRRRRPPPPRAP 517
Query: 67 PP--PPSRRQRGRSTRR 23
PP P +R R+T R
Sbjct: 518 PPHHPSARNPSARATPR 534
[40]
>UniRef100_C7YHF9 Putative uncharacterized protein n=1 Tax=Nectria haematococca mpVI
77-13-4 RepID=C7YHF9_NECH7
Length = 1285
Score = 53.9 bits (128), Expect = 5e-06
Identities = 46/134 (34%), Positives = 52/134 (38%), Gaps = 13/134 (9%)
Frame = -2
Query: 418 RQPASATSCPGPRRSPAAGG*PSSWPARCTGSSGP--RTPWWPRQ*RPCHPCQPAAG-SC 248
R P S +RSP P P R P R P RP P +P G
Sbjct: 639 RGPNSRPGTSDGKRSPMP---PGMGPPRSPHPQNPNRRPDMGPDGRRPSDPRRPGPGPDG 695
Query: 247 RSPGQPGRAATE--RPAWPRGPTQEGRLP-----RGQGPCS---WPSPPGSPSGWRPASH 98
R P P R + RP+ PRGP +GR P RG GP P PPG +RP
Sbjct: 696 RRPSDPRRPGPDGRRPSDPRGPGPDGRRPSDPRARGNGPPPPGHGPPPPGPYGNFRPGPG 755
Query: 97 RQRHPPSSPPPPPP 56
R P P PP P
Sbjct: 756 RSPGPHGPPGPPGP 769
[41]
>UniRef100_UPI00017C2B55 PREDICTED: similar to collagen, type XXVIII n=1 Tax=Bos taurus
RepID=UPI00017C2B55
Length = 1147
Score = 53.5 bits (127), Expect = 7e-06
Identities = 39/96 (40%), Positives = 50/96 (52%), Gaps = 5/96 (5%)
Frame = +3
Query: 120 EGDPGGLGQE--QGPC-PRGNLPSCVGPRGQAGRSVAALPGWPGDRHDPA-AG*QGWQGR 287
+G G GQ+ QGP P+G+ P +GP G G S+ PG GDR P G +G G
Sbjct: 555 KGSKGNQGQKGSQGPGGPKGD-PGIMGPVGMPGISIPGPPGPKGDRGGPGMPGFKGEPGI 613
Query: 288 HCRGHQGVRGPDDPVQRAG-QEDGHPPAAGDRLGPG 392
RG +G +GP PV G + DG+P G R PG
Sbjct: 614 AIRGPKGAQGPQGPVGAPGLKGDGYPGVPGPRGIPG 649
[42]
>UniRef100_UPI0001795F69 PREDICTED: similar to collagen, type XXVIII n=1 Tax=Equus caballus
RepID=UPI0001795F69
Length = 1127
Score = 53.5 bits (127), Expect = 7e-06
Identities = 38/95 (40%), Positives = 45/95 (47%), Gaps = 4/95 (4%)
Frame = +3
Query: 120 EGDPGGLGQEQGPCPRGNL--PSCVGPRGQAGRSVAALPGWPGDRHDPAA-G*QGWQGRH 290
+G G GQ P P G P +GP G G S PG GDR P G +G G
Sbjct: 556 KGSKGNQGQRGFPGPEGPKGDPGVMGPFGMPGASNPGPPGPKGDRGGPGVPGFKGEPGIS 615
Query: 291 CRGHQGVRGPDDPVQRAGQE-DGHPPAAGDRLGPG 392
RG +G +GP PV G + D +P AAG R PG
Sbjct: 616 IRGPKGAQGPRGPVGAPGPKGDSYPGAAGPRGLPG 650
[43]
>UniRef100_UPI0001552FA1 PREDICTED: hypothetical protein n=1 Tax=Mus musculus
RepID=UPI0001552FA1
Length = 261
Score = 53.5 bits (127), Expect = 7e-06
Identities = 44/134 (32%), Positives = 50/134 (37%), Gaps = 7/134 (5%)
Frame = -2
Query: 418 RQPASATSCPGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCRSP 239
R P GP+ P G P P + GP P P+ P P PA R P
Sbjct: 103 RPPQGPPPPGGPQLRPPQGPPPPGGP-QPRPPQGPPPPGGPQLRPPQGPPPPAGPQPRPP 161
Query: 238 GQPGRAATERPAWPRGPTQEGRLPRGQGPCSWPSPPGSPS-----GWRPASHRQRHPPSS 74
P A +P P+GP G PR P P P G P G P Q PP
Sbjct: 162 QGPPPPAGPQPRPPQGPPTTGPQPR---PTQGPPPTGGPQQRPPQGPPPPGGPQPRPPQG 218
Query: 73 PPPP--PPSRRQRG 38
PPPP P R +G
Sbjct: 219 PPPPGGPQPRPTQG 232
[44]
>UniRef100_UPI0000DA2670 PREDICTED: similar to procollagen, type VI, alpha 2 n=1 Tax=Rattus
norvegicus RepID=UPI0000DA2670
Length = 1141
Score = 53.5 bits (127), Expect = 7e-06
Identities = 37/95 (38%), Positives = 45/95 (47%), Gaps = 4/95 (4%)
Frame = +3
Query: 120 EGDPGGLGQEQGPCPRG--NLPSCVGPRGQAGRSVAALPGWPGDRHDPA-AG*QGWQGRH 290
+G G GQ P P G P +GP G G S+ G GDR P G +G G
Sbjct: 556 KGSKGNQGQRGFPGPEGPKGEPGIMGPFGMPGASIPGPSGPKGDRGGPGMPGLKGEPGLS 615
Query: 291 CRGHQGVRGPDDPVQRAG-QEDGHPPAAGDRLGPG 392
RG +G +GP PV G + DG+P AG R PG
Sbjct: 616 VRGPKGAQGPRGPVGAPGLKGDGYPGVAGPRGLPG 650
[45]
>UniRef100_UPI0000F33194 UPI0000F33194 related cluster n=1 Tax=Bos taurus
RepID=UPI0000F33194
Length = 1152
Score = 53.5 bits (127), Expect = 7e-06
Identities = 39/96 (40%), Positives = 50/96 (52%), Gaps = 5/96 (5%)
Frame = +3
Query: 120 EGDPGGLGQE--QGPC-PRGNLPSCVGPRGQAGRSVAALPGWPGDRHDPA-AG*QGWQGR 287
+G G GQ+ QGP P+G+ P +GP G G S+ PG GDR P G +G G
Sbjct: 559 KGSKGNQGQKGSQGPGGPKGD-PGIMGPVGMPGISIPGPPGPKGDRGGPGMPGFKGEPGI 617
Query: 288 HCRGHQGVRGPDDPVQRAG-QEDGHPPAAGDRLGPG 392
RG +G +GP PV G + DG+P G R PG
Sbjct: 618 AIRGPKGAQGPQGPVGAPGLKGDGYPGVPGPRGIPG 653
[46]
>UniRef100_B9TQ05 Putative uncharacterized protein (Fragment) n=1 Tax=Ricinus
communis RepID=B9TQ05_RICCO
Length = 216
Score = 53.5 bits (127), Expect = 7e-06
Identities = 53/140 (37%), Positives = 61/140 (43%), Gaps = 11/140 (7%)
Frame = -2
Query: 391 PGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCRSPGQPGRAA-- 218
PGP R PA P PA S+ PR P PR R +P + SP RA
Sbjct: 13 PGPARPPA----PPRRPAPPARSALPR-PDCPRSLRS----RPRPRAAASPSARRRAGSR 63
Query: 217 TERPA----WPRGPTQEGRLPRGQGPCSWPSPPGS--PSGWRPAS---HRQRHPPSSPPP 65
T RPA RGP R P P + PSP S P RPA + P ++ P
Sbjct: 64 TPRPAPRSGGDRGPPVSRRRPPASAPSNRPSPGSSRRPRRARPARAAIQARSSPAAAAPR 123
Query: 64 PPPSRRQRGRSTRRRQQAGP 5
PPP R RGR RRR+ A P
Sbjct: 124 PPPGR--RGRPKRRRRAAPP 141
[47][TOP]
>UniRef100_B4UHB4 MaoC domain protein dehydratase n=1 Tax=Anaeromyxobacter sp. K
RepID=B4UHB4_ANASK
Length = 332
Score = 53.1 bits (126), Expect = 9e-06
Identities = 38/127 (29%), Positives = 54/127 (42%), Gaps = 3/127 (2%)
Frame = -2
Query: 391 PGPRR-SPAAGG*PSSWPARCTG--SSGPRTPWWPRQ*RPCHPCQPAAGSCRSPGQPGRA 221
P P+R +P AG P+ PA ++ P P P P QPA + ++P P A
Sbjct: 200 PAPQRPAPPAGARPAPAPAAAPRPPAAAPARPGAA----PSRPAQPARPATQAPRPPASA 255
Query: 220 ATERPAWPRGPTQEGRLPRGQGPCSWPSPPGSPSGWRPASHRQRHPPSSPPPPPPSRRQR 41
A RPA P R P P+ P P+ RPA+ + P P P+R+++
Sbjct: 256 A--RPAPAARPASATRAPAAAAKAKRPAAPARPAAKRPAAANAKGPARPHPAKRPARKEK 313
Query: 40 GRSTRRR 20
R R
Sbjct: 314 AAGARAR 320
[48][TOP]
>UniRef100_C2BJ30 Fe-S oxidoreductase n=1 Tax=Corynebacterium pseudogenitalium ATCC
33035 RepID=C2BJ30_9CORY
Length = 969
Score = 53.1 bits (126), Expect = 9e-06
Identities = 39/122 (31%), Positives = 54/122 (44%), Gaps = 5/122 (4%)
Frame = -2
Query: 412 PASATSCPGPRR--SPAAGG*PS-SWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCRS 242
P SA + P P+ +PAA P+ S P+ + + P P P+ P P PAA S +
Sbjct: 840 PPSAPAAPAPKAPSAPAAPAAPAPSAPSAPSAPTPPAAPQTPQA--PAAPAAPAAPSAPT 897
Query: 241 PGQPGRAATERPAWPRGPTQEG--RLPRGQGPCSWPSPPGSPSGWRPASHRQRHPPSSPP 68
P P A P P PT P P + P PP +P+ +P PP++P
Sbjct: 898 P--PSAPAAPAPKAPAAPTPPNVPAAPAAPTPPAAPKPPQAPAAPKPPQAAPPAPPAAPA 955
Query: 67 PP 62
PP
Sbjct: 956 PP 957
[49][TOP]
>UniRef100_B9RLU7 Putative uncharacterized protein n=1 Tax=Ricinus communis
RepID=B9RLU7_RICCO
Length = 1550
Score = 53.1 bits (126), Expect = 9e-06
Identities = 41/131 (31%), Positives = 49/131 (37%)
Frame = -2
Query: 412 PASATSCPGPRRSPAAGG*PSSWPARCTGSSGPRTPWWPRQ*RPCHPCQPAAGSCRSPGQ 233
P + P P P G P P G+ P P P + P P P G+
Sbjct: 994 PPGRGAPPPPPPPPGRGAPPPPPPPPGRGAPPPPPP--PGRGPPPPPPPPGRGAPPPLPP 1051
Query: 232 PGRAATERPAWPRGPTQEGRLPRGQGPCSWPSPPGSPSGWRPASHRQRHPPSSPPPPPPS 53
PGR A P P G P G+G P PP P G P PP + PPPPP
Sbjct: 1052 PGRGAPPPPPPPGGGGPPPPPPPGRG---GPPPPPPPGGRVPGPPAPPRPPGAGPPPPPP 1108
Query: 52 RRQRGRSTRRR 20
+G +T R
Sbjct: 1109 LGAKGAATDTR 1119
[50][TOP]
>UniRef100_P91250 Collagen protein 73, confirmed by transcript evidence n=1
Tax=Caenorhabditis elegans RepID=P91250_CAEEL
Length = 285
Score = 53.1 bits (126), Expect = 9e-06
Identities = 43/119 (36%), Positives = 52/119 (43%), Gaps = 20/119 (16%)
Frame = +3
Query: 117 PEGDPGGLGQEQGPCPRGNLPSCVGPRGQAG-----------------RSVAALPGWPGD 245
P G PG G EQGP R P GP+G G R+V A PG PG
Sbjct: 157 PPGQPGAPG-EQGPNGRPGAPGAPGPQGPPGTAGNDGTPGQPGAPGQVRTVPAPPGNPGQ 215
Query: 246 RHDPAA-G*QGWQGRHCRGHQGVRGPDDPVQRAGQE--DGHPPAAGDRLGPGHEVADAG 413
+P A G G GR G+ G +GP P GQ+ G+P A G+ PG + A G
Sbjct: 216 PGEPGAQGPPGEDGR--PGNSGPQGPPGPQGEPGQDGAPGNPGAPGEAGEPGKDGAKGG 272
[51][TOP]
>UniRef100_O18286 Protein ZK1010.7, confirmed by transcript evidence n=1
Tax=Caenorhabditis elegans RepID=O18286_CAEEL
Length = 298
Score = 53.1 bits (126), Expect = 9e-06
Identities = 43/102 (42%), Positives = 51/102 (50%), Gaps = 9/102 (8%)
Frame = +3
Query: 114 HP--EGDPGGLGQEQGPCPRGNLPSCV--GPRGQAGR---SVAALPGWPGDRHDPA-AG* 269
HP G+ GG+G + P P GN GPRG+ GR S ALPG PG +P +G
Sbjct: 169 HPGRNGNDGGVGPQGPPGPPGNNGEGGRDGPRGEQGRPAISTPALPGDPGAPGEPGPSGL 228
Query: 270 QGWQGRHCR-GHQGVRGPDDPVQRAGQEDGHPPAAGDRLGPG 392
G QG+ R G G GP P GQ+ GHP AG PG
Sbjct: 229 PGDQGQAGRPGSDGAPGPQGPPGPPGQQ-GHPGQAGPAGQPG 269
[52][TOP]
>UniRef100_A8Q1F6 Collagen col-34, putative n=1 Tax=Brugia malayi RepID=A8Q1F6_BRUMA
Length = 304
Score = 53.1 bits (126), Expect = 9e-06
Identities = 42/139 (30%), Positives = 52/139 (37%), Gaps = 18/139 (12%)
Frame = -2
Query: 412 PASATSCPGPRRSPAAGG*PSSWPAR-CTGSSGPRTPWWPRQ*RPCHPCQPA-------- 260
P PG +P G P P+R C + P PC PC P
Sbjct: 115 PPGKPGKPGKPGAPGLPGNPGKPPSRPCEQVTPP----------PCKPCPPGPPGPPGPP 164
Query: 259 -----AGSCRSPGQPGRAATERPAWPRGPTQEGRLPRGQGPCSWPSPPGSPSGWRPASHR 95
AG +PG+PG A P+GP + P QGP P PG+P+ P
Sbjct: 165 GPPGDAGEPGAPGRPGADAPPGEPGPKGPPGQVGEPGPQGP---PGDPGAPAPSEPLIPG 221
Query: 94 QRHPPSSP----PPPPPSR 50
+ PP P PP PP R
Sbjct: 222 EPGPPGEPGVPGPPGPPGR 240
[53][TOP]
>UniRef100_Q2UY11-2 Isoform 2 of Collagen alpha-1(XXVIII) chain n=1 Tax=Mus musculus
RepID=Q2UY11-2
Length = 699
Score = 53.1 bits (126), Expect = 9e-06
Identities = 37/95 (38%), Positives = 45/95 (47%), Gaps = 4/95 (4%)
Frame = +3
Query: 120 EGDPGGLGQEQGPCPRG--NLPSCVGPRGQAGRSVAALPGWPGDRHDPA-AG*QGWQGRH 290
+G G GQ P P G P +GP G G S+ G GDR P G +G G
Sbjct: 556 KGSKGNQGQRGFPGPEGPKGEPGVMGPFGMPGASIPGPSGPKGDRGGPGMPGLKGEPGLP 615
Query: 291 CRGHQGVRGPDDPVQRAG-QEDGHPPAAGDRLGPG 392
RG +G +GP PV G + DG+P AG R PG
Sbjct: 616 VRGPKGAQGPRGPVGAPGLKGDGYPGVAGPRGLPG 650
[54][TOP]
>UniRef100_Q2UY11 Collagen alpha-1(XXVIII) chain n=1 Tax=Mus musculus
RepID=COSA1_MOUSE
Length = 1141
Score = 53.1 bits (126), Expect = 9e-06
Identities = 37/95 (38%), Positives = 45/95 (47%), Gaps = 4/95 (4%)
Frame = +3
Query: 120 EGDPGGLGQEQGPCPRG--NLPSCVGPRGQAGRSVAALPGWPGDRHDPA-AG*QGWQGRH 290
+G G GQ P P G P +GP G G S+ G GDR P G +G G
Sbjct: 556 KGSKGNQGQRGFPGPEGPKGEPGVMGPFGMPGASIPGPSGPKGDRGGPGMPGLKGEPGLP 615
Query: 291 CRGHQGVRGPDDPVQRAG-QEDGHPPAAGDRLGPG 392
RG +G +GP PV G + DG+P AG R PG
Sbjct: 616 VRGPKGAQGPRGPVGAPGLKGDGYPGVAGPRGLPG 650