
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC122163.2 + phase: 0
(428 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value
UniRef100_Q8L6X7 Hypothetical protein [Arabidopsis thaliana]          555   e-156
UniRef100_O49634 Hypothetical protein AT4g22290 [Arabidopsis tha...   475   e-133
UniRef100_Q6Z1R9 Hypothetical protein OSJNBa0038P10.32 [Oryza sa...   445   e-123
UniRef100_Q9ZVA7 F9K20.7 protein [Arabidopsis thaliana]               433   e-120
UniRef100_Q9FZ45 F6I1.14 protein [Arabidopsis thaliana]               432   e-120
UniRef100_Q7X8W5 OSJNBa0035M09.9 protein [Oryza sativa]               424   e-117
UniRef100_Q7XN56 OSJNBb0103I08.7 protein [Oryza sativa]               192   1e-47
UniRef100_Q67V14 Hypothetical protein OSJNBa0019I19.50 [Oryza sa...   189   2e-46
UniRef100_UPI000036EC71 UPI000036EC71 UniRef100 entry                  44   0.009
UniRef100_O27571 Coenzyme F390 synthetase I [Methanobacterium th...    44   0.009
UniRef100_Q5YZA8 Hypothetical protein [Nocardia farcinica]             42   0.034
UniRef100_Q8MQG9 Prion-like-(Q/n-rich)-domain-bearing protein pr...    42   0.034
UniRef100_O02123 Prion-like-(Q/n-rich)-domain-bearing protein pr...    42   0.034
UniRef100_P70478 Adenomatous polyposis coli protein [Rattus norv...    42   0.044
UniRef100_Q9IB91 Type I collagen alpha 1 [Xenopus laevis]              41   0.057
UniRef100_Q802B5 Col1a1-prov protein [Xenopus laevis]                  41   0.057
UniRef100_UPI00002DAAFB UPI00002DAAFB UniRef100 entry                  41   0.075
UniRef100_UPI00002F9781 UPI00002F9781 UniRef100 entry                  40   0.097
UniRef100_Q50751 Coenzyme F390 synthetase [Methanobacterium ther...    40   0.097
UniRef100_Q9W506 CG3443-PB [Drosophila melanogaster]                   40   0.13
>UniRef100_Q8L6X7 Hypothetical protein [Arabidopsis thaliana]
Length = 445
Score = 555 bits (1429), Expect = e-156
Identities = 289/450 (64%), Positives = 355/450 (78%), Gaps = 27/450 (6%)
Query: 1 MGTRIPSHQLSSGLYVSGRPEQPKERQPPTMASRSVPYTGGDPKKSGELGKMLDIPVLDP 60
M RI SHQL +GLYVSG+ EQPKER PPTMA+R+VPYTGGD KKSGELG+M DI V+D
Sbjct: 1 MAGRIQSHQLPNGLYVSGKLEQPKER-PPTMAARAVPYTGGDIKKSGELGRMFDISVVDS 59
Query: 61 KS-----------HPSSSSSQLSTGP----ARSRPNSGQVGKNITGSGTLSRKSTGSGPI 105
S + S +S+L P + S PNSG V ++ SG++ + S GP+
Sbjct: 60 ASFQGPPPLIVGGNSSGGTSRLQAPPRVSGSSSNPNSGSV-RSGPNSGSVKKFS---GPL 115
Query: 106 A-LQPTGLITSGPVGS-GPV-GASRRSGQLEQSGS---MGKAVYGSAVTSLG-EEVKVGF 158
+ LQPTGLITSG +GS GP+ SRRSGQL+ S K YGS+VTSL + V+VGF
Sbjct: 116 SQLQPTGLITSGSLGSSGPILSGSRRSGQLDHQLSNLASSKPKYGSSVTSLNVDPVRVGF 175
Query: 159 RVSRSVVWVFMVVVAMCLLVGVFLMVAVKKNVILFALGGVIVPVLVLIIWNCVLGRKGLL 218
+V +++VW ++V AM LLVG FL VAVKK V++ A+ + P +V+++WNCV RKGLL
Sbjct: 176 KVPKAMVWAVLIVAAMGLLVGAFLTVAVKKPVVIAAVLAAVCPAIVVLVWNCVWRRKGLL 235
Query: 219 GFVKRYPDAELRGAIDGQYVKVTGVVTCGSIPLESSYQRIPRCVYVSSELYEYKGWGGKS 278
F+K+YPDAELRGAIDGQ+VKVTGVVTCGSIPLESS+QR PRCVYVS+ELYEYKG+GGKS
Sbjct: 236 SFIKKYPDAELRGAIDGQFVKVTGVVTCGSIPLESSFQRTPRCVYVSTELYEYKGFGGKS 295
Query: 279 AHPKHRCFTWGSRYSEKYIADFYISDFQTGLRALVKAGYGNKVAPFVKPTTVVDVTKENR 338
A+PKHRCF+WGSR++EKY++DFYISDFQ+GLRALVKAGYG+KV+PFVKP TV +VT +N+
Sbjct: 296 ANPKHRCFSWGSRHAEKYVSDFYISDFQSGLRALVKAGYGSKVSPFVKPATVANVTTQNK 355
Query: 339 ELSPNFLGWLADRKLSTDDRIMRLKEGHIKEGSTVSVMGVVRRHENVLMIVPPTEPVSTG 398
+LSP+FL WL+DR LS DDR+MRLKEG+IKEGSTVSVMG+VRRH+NVLMIVPP E VS+G
Sbjct: 356 DLSPSFLKWLSDRNLSADDRVMRLKEGYIKEGSTVSVMGMVRRHDNVLMIVPPAEAVSSG 415
Query: 399 CQWMRCLLPTGVEGLIITCEDNQNADVIAV 428
C+W CL PT +GLIITC+DNQNADVI V
Sbjct: 416 CRWWHCLFPTYADGLIITCDDNQNADVIPV 445
>UniRef100_O49634 Hypothetical protein AT4g22290 [Arabidopsis thaliana]
Length = 974
Score = 475 bits (1223), Expect = e-133
Identities = 252/400 (63%), Positives = 313/400 (78%), Gaps = 27/400 (6%)
Query: 1 MGTRIPSHQLSSGLYVSGRPEQPKERQPPTMASRSVPYTGGDPKKSGELGKMLDIPVLDP 60
M RI SHQL +GLYVSG+ EQPKER PPTMA+R+VPYTGGD KKSGELG+M DI V+D
Sbjct: 1 MAGRIQSHQLPNGLYVSGKLEQPKER-PPTMAARAVPYTGGDIKKSGELGRMFDISVVDS 59
Query: 61 KS-----------HPSSSSSQLSTGP----ARSRPNSGQVGKNITGSGTLSRKSTGSGPI 105
S + S +S+L P + S PNSG V ++ SG++ + S GP+
Sbjct: 60 ASFQGPPPLIVGGNSSGGTSRLQAPPRVSGSSSNPNSGSV-RSGPNSGSVKKFS---GPL 115
Query: 106 A-LQPTGLITSGPVGS-GPV-GASRRSGQLEQSGS---MGKAVYGSAVTSLG-EEVKVGF 158
+ LQPTGLITSG +GS GP+ SRRSGQL+ S K YGS+VTSL + V+VGF
Sbjct: 116 SQLQPTGLITSGSLGSSGPILSGSRRSGQLDHQLSNLASSKPKYGSSVTSLNVDPVRVGF 175
Query: 159 RVSRSVVWVFMVVVAMCLLVGVFLMVAVKKNVILFALGGVIVPVLVLIIWNCVLGRKGLL 218
+V +++VW ++V AM LLVG FL VAVKK V++ A+ + P +V+++WNCV RKGLL
Sbjct: 176 KVPKAMVWAVLIVAAMGLLVGAFLTVAVKKPVVIAAVLAAVCPAIVVLVWNCVWRRKGLL 235
Query: 219 GFVKRYPDAELRGAIDGQYVKVTGVVTCGSIPLESSYQRIPRCVYVSSELYEYKGWGGKS 278
F+K+YPDAELRGAIDGQ+VKVTGVVTCGSIPLESS+QR PRCVYVS+ELYEYKG+GGKS
Sbjct: 236 SFIKKYPDAELRGAIDGQFVKVTGVVTCGSIPLESSFQRTPRCVYVSTELYEYKGFGGKS 295
Query: 279 AHPKHRCFTWGSRYSEKYIADFYISDFQTGLRALVKAGYGNKVAPFVKPTTVVDVTKENR 338
A+PKHRCF+WGSR++EKY++DFYISDFQ+GLRALVKAGYG+KV+PFVKP TV +VT +N+
Sbjct: 296 ANPKHRCFSWGSRHAEKYVSDFYISDFQSGLRALVKAGYGSKVSPFVKPATVANVTTQNK 355
Query: 339 ELSPNFLGWLADRKLSTDDRIMRLKEGHIKEGSTVSVMGV 378
+LSP+FL WL+DR LS DDR+MRLKEG+IKEGSTVSVMG+
Sbjct: 356 DLSPSFLKWLSDRNLSADDRVMRLKEGYIKEGSTVSVMGM 395
>UniRef100_Q6Z1R9 Hypothetical protein OSJNBa0038P10.32 [Oryza sativa]
Length = 482
Score = 445 bits (1144), Expect = e-123
Identities = 236/483 (48%), Positives = 316/483 (64%), Gaps = 56/483 (11%)
Query: 1 MGTRIPSHQLSSGLYVSGRPEQPKERQPPTMASRSVPYTGGDPKKSGELGKMLDIPV--- 57
MG+R PSHQLS+GLYVSGRPEQPKE+ P + S ++PYTGGD KKSGELGKM D+ V
Sbjct: 1 MGSRFPSHQLSNGLYVSGRPEQPKEK-APVICSTAMPYTGGDIKKSGELGKMFDLHVEKS 59
Query: 58 -------LDPKSHPSSSSSQLSTGP---ARSRPN-SGQVGKNITGSGTLSRKSTGSGPI- 105
P + S + ++GP A R N SG + ++ G+G +R + SGP+
Sbjct: 60 RKSGPLGNQPSRNTSFGGAGSNSGPVSNALGRSNYSGSISSSVPGAGGSARAKSNSGPLN 119
Query: 106 ------------------------------ALQPTGLITSGPVGSGPV---GASRR-SGQ 131
L TGLITSGP+ SGP+ GA R+ SG
Sbjct: 120 KHGEPGKKSSGPQSGGVTPMARQNSGPLPPVLPTTGLITSGPISSGPLNSSGAPRKVSGP 179
Query: 132 LEQSGSM----GKAVYGSAVTSLGEE--VKVGFRVSRSVVWVFMVVVAMCLLVGVFLMVA 185
L+ S SM + AVT+L + + + ++++W+ +++ M + G F++ A
Sbjct: 180 LDPSVSMKMRATSFAHNPAVTNLNADDGYSIKGSIPKTILWMVILLFLMGFIAGGFILGA 239
Query: 186 VKKNVILFALGGVIVPVLVLIIWNCVLGRKGLLGFVKRYPDAELRGAIDGQYVKVTGVVT 245
V ++L + + V L+IWN G +G+ GFV RYPDA+LR A DGQYVKVTGVVT
Sbjct: 240 VHNPILLVVVVVIFCFVAALVIWNICWGTRGVTGFVSRYPDADLRTAKDGQYVKVTGVVT 299
Query: 246 CGSIPLESSYQRIPRCVYVSSELYEYKGWGGKSAHPKHRCFTWGSRYSEKYIADFYISDF 305
CG+ PLESS+QR+PRCVY S+ LYEY+GW K+A+ +HR FTWG R E++ DFYISDF
Sbjct: 300 CGNFPLESSFQRVPRCVYTSTCLYEYRGWDSKAANTEHRQFTWGLRSMERHAVDFYISDF 359
Query: 306 QTGLRALVKAGYGNKVAPFVKPTTVVDVTKENRELSPNFLGWLADRKLSTDDRIMRLKEG 365
Q+GLRALVK GYG +V P+V + V+D+ +N+++SP FL WL +R LS+DDRIMRLKEG
Sbjct: 360 QSGLRALVKTGYGARVTPYVDESVVIDINPDNKDMSPEFLRWLRERNLSSDDRIMRLKEG 419
Query: 366 HIKEGSTVSVMGVVRRHENVLMIVPPTEPVSTGCQWMRCLLPTGVEGLIITCEDNQNADV 425
+IKEGSTVSVMGVV+R++NVLMIVPP+EP+STGCQW +C+LPT ++GL++ CED N DV
Sbjct: 420 YIKEGSTVSVMGVVQRNDNVLMIVPPSEPISTGCQWAKCILPTSLDGLVLRCEDTSNIDV 479
Query: 426 IAV 428
I V
Sbjct: 480 IPV 482
>UniRef100_Q9ZVA7 F9K20.7 protein [Arabidopsis thaliana]
Length = 468
Score = 433 bits (1113), Expect = e-120
Identities = 235/472 (49%), Positives = 306/472 (64%), Gaps = 48/472 (10%)
Query: 1 MGTRIPSHQLSSGLYVSGRPEQPKERQPPTMASRSVPYTGGDPKKSGELGKMLDIPVLDP 60
MG+R SHQLS+GL+VSGRPEQPKE+ PPTM+S ++PYTGGD KKSGELGKM DIP
Sbjct: 1 MGSRYASHQLSNGLFVSGRPEQPKEK-PPTMSSVAMPYTGGDIKKSGELGKMFDIPTDGT 59
Query: 61 KSHPS------SSSSQLSTGPARSRPNS-GQVGKNITGSGTLSRKSTGSGPIA------- 106
KS S SS S +GP PN+ G++ N+ +G+ S K T SGP++
Sbjct: 60 KSRKSGPITGGSSRSGAQSGPV---PNATGRMSGNLASAGSNSMKKTNSGPLSKHGEPLK 116
Query: 107 --------------------LQPTGLITSGPVGSGPV---GASRR-SGQLEQSGSMG--- 139
L TGLITSGP+ SGP+ GA R+ SG L+ SGSM
Sbjct: 117 KSSGPQSGGVTRQNSGPIPILPTTGLITSGPITSGPLNSSGAPRKISGPLDYSGSMKTHM 176
Query: 140 -KAVYGSAVTSLGEEVKVGFRVS--RSVVWVFMVVVAMCLLVGVFLMVAVKKNVILFALG 196
V+ AVT+L E S + V+W+ +++ M L G F++ AV ++L +
Sbjct: 177 PSVVHNQAVTTLAPEDDFSCMKSFPKPVLWLVILIFVMGFLAGGFILGAVHNAILLIVVA 236
Query: 197 GVIVPVLVLIIWNCVLGRKGLLGFVKRYPDAELRGAIDGQYVKVTGVVTCGSIPLESSYQ 256
+ V L IWN R+G+ F+ RYPDA+LR A +GQYVKVTGVVTCG++PLESS+
Sbjct: 237 VLFTVVAALFIWNISCERRGITDFIARYPDADLRTAKNGQYVKVTGVVTCGNVPLESSFH 296
Query: 257 RIPRCVYVSSELYEYKGWGGKSAHPKHRCFTWGSRYSEKYIADFYISDFQTGLRALVKAG 316
R+PRCVY S+ LYEY+GWG K A+ HR FTWG R +E+++ DFYISDFQ+GLRALVK G
Sbjct: 297 RVPRCVYTSTCLYEYRGWGSKPANASHRRFTWGLRSAERHVVDFYISDFQSGLRALVKTG 356
Query: 317 YGNKVAPFVKPTTVVDVTKENRELSPNFLGWLADRKLSTDDRIMRLKEGHIKEGSTVSVM 376
G KV P V + V+D N + SP+F+ WL + L+ DDRIMRLKEG+IKEGSTVSV+
Sbjct: 357 NGAKVTPLVDDSVVIDFKPGNEQASPDFVRWLGKKNLTNDDRIMRLKEGYIKEGSTVSVI 416
Query: 377 GVVRRHENVLMIVPPTEPVSTGCQWMRCLLPTGVEGLIITCEDNQNADVIAV 428
GVV+R++NVLMIVP TEP++ G QW +C P +EG+++ CED+ N D I V
Sbjct: 417 GVVQRNDNVLMIVPTTEPLAAGWQWSKCTFPASLEGIVLRCEDSSNVDAIPV 468
>UniRef100_Q9FZ45 F6I1.14 protein [Arabidopsis thaliana]
Length = 474
Score = 432 bits (1112), Expect = e-120
Identities = 233/475 (49%), Positives = 311/475 (65%), Gaps = 48/475 (10%)
Query: 1 MGTRIPSHQLSSGLYVSGRPEQPKERQPPTMASRSVPYTGGDPKKSGELGKMLDIPVLDP 60
MG+R PSHQLS+GL+VSGRPEQPKER P TM++ ++PYTGGD K+SGELGKM DIP
Sbjct: 1 MGSRYPSHQLSNGLFVSGRPEQPKERAP-TMSAVAMPYTGGDIKRSGELGKMFDIPADGT 59
Query: 61 KSHPSS------SSSQLSTGPARSRPNS----GQVGKNITGSGTLSRKSTGSGPIA---- 106
KS S S S G A+S P + G++ ++ +G++S K T SGP++
Sbjct: 60 KSRKSGPIPGAPSRSGSFAGTAQSGPGAPMATGRMSGSLASAGSVSMKKTNSGPLSKHGE 119
Query: 107 -----------------------LQPTGLITSGPVGSGPV---GASRR-SGQLEQSGSMG 139
L TGLITSGP+ SGP+ GA R+ SG L+ SG M
Sbjct: 120 PLKKSSGPQSGGVTRQNSGSIPILPATGLITSGPITSGPLNSSGAPRKVSGPLDSSGLMK 179
Query: 140 K----AVYGSAVTSLGEEVKVGFRVS--RSVVWVFMVVVAMCLLVGVFLMVAVKKNVILF 193
V+ AVT+LG E S + V+W+ +++ M L G F++ AV ++L
Sbjct: 180 SHMPTVVHNQAVTTLGPEDDFSCLKSFPKPVLWLVVLIFIMGFLAGGFILGAVHNPILLV 239
Query: 194 ALGGVIVPVLVLIIWNCVLGRKGLLGFVKRYPDAELRGAIDGQYVKVTGVVTCGSIPLES 253
+ + V L IWN GR+G+ F+ RYPDA+LR A +GQ+VKVTGVVTCG++PLES
Sbjct: 240 VVAILFTVVAALFIWNICWGRRGITDFIARYPDADLRTAKNGQHVKVTGVVTCGNVPLES 299
Query: 254 SYQRIPRCVYVSSELYEYKGWGGKSAHPKHRCFTWGSRYSEKYIADFYISDFQTGLRALV 313
S+ R+PRCVY S+ LYEY+GWG K A+ HR FTWG R SE+++ DFYISDFQ+GLRALV
Sbjct: 300 SFHRVPRCVYTSTCLYEYRGWGSKPANSSHRHFTWGLRSSERHVVDFYISDFQSGLRALV 359
Query: 314 KAGYGNKVAPFVKPTTVVDVTKENRELSPNFLGWLADRKLSTDDRIMRLKEGHIKEGSTV 373
K G G KV P V + V+D + + ++SP+F+ WL + L++DDRIMRLKEG+IKEGSTV
Sbjct: 360 KTGSGAKVTPLVDDSVVIDFKQGSEQVSPDFVRWLGKKNLTSDDRIMRLKEGYIKEGSTV 419
Query: 374 SVMGVVRRHENVLMIVPPTEPVSTGCQWMRCLLPTGVEGLIITCEDNQNADVIAV 428
SV+GVV+R++NVLMIVP +EP++ G QW RC PT +EG+++ CED+ N D I V
Sbjct: 420 SVIGVVQRNDNVLMIVPSSEPLAAGWQWRRCTFPTSLEGIVLRCEDSSNVDAIPV 474
>UniRef100_Q7X8W5 OSJNBa0035M09.9 protein [Oryza sativa]
Length = 482
Score = 424 bits (1089), Expect = e-117
Identities = 231/483 (47%), Positives = 311/483 (63%), Gaps = 56/483 (11%)
Query: 1 MGTRIPSHQLSSGLYVSGRPEQPKERQPPTMASRSVPYTGGDPKKSGELGKMLDI-PVLD 59
MG+R PSHQLS+GLYVSGRPEQPKE+ PT+ S ++PYTGGD KKSGELGKM ++ V
Sbjct: 1 MGSRFPSHQLSNGLYVSGRPEQPKEK-APTICSTAMPYTGGDIKKSGELGKMFELHAVKS 59
Query: 60 PKSHP---------------------------------SSSSSQLSTGPARSRPNSGQVG 86
KS P SSS ++G AR++ +SG +
Sbjct: 60 RKSGPLSNAPSRNASFGGAASNSGPVPNAGDRSNYSGSLSSSVPGASGSARAKSSSGPLN 119
Query: 87 KN-----------ITGSGTLSRKSTGSGPIALQPTGLITSGPVGSGPV---GASRR-SGQ 131
K+ G ++R+++G P L TGLITSGP+ SGP+ GA R+ SG
Sbjct: 120 KHGEPVKRSSGPQSGGVTPMARQNSGPLPPMLPTTGLITSGPITSGPLNSSGAQRKVSGP 179
Query: 132 LEQSGSMGKAV----YGSAVTSLGEE--VKVGFRVSRSVVWVFMVVVAMCLLVGVFLMVA 185
L+ + S + AVT + E + +S+ ++ V+ + L+ G+ ++ A
Sbjct: 180 LDSAASKKTRATSFSHNQAVTKITTEDSYSITGSLSKLILGAVGVLFVLGLIAGILILSA 239
Query: 186 VKKNVILFALGGVIVPVLVLIIWNCVLGRKGLLGFVKRYPDAELRGAIDGQYVKVTGVVT 245
V ++L + + V L IWN R+G++GFV RY DA+LR A DGQY+KVTGVVT
Sbjct: 240 VHNAILLIVVLVLFGFVAALFIWNACWARRGVIGFVDRYSDADLRTAKDGQYIKVTGVVT 299
Query: 246 CGSIPLESSYQRIPRCVYVSSELYEYKGWGGKSAHPKHRCFTWGSRYSEKYIADFYISDF 305
CG+ PLESSYQR+PRCVY S+ L+EY+GW K+A+ +H FTWG R E++ DFYISDF
Sbjct: 300 CGNFPLESSYQRVPRCVYTSTTLHEYRGWDSKAANTQHHRFTWGLRSMEQHAVDFYISDF 359
Query: 306 QTGLRALVKAGYGNKVAPFVKPTTVVDVTKENRELSPNFLGWLADRKLSTDDRIMRLKEG 365
Q+GLRALVKAGYG +V PFV + ++D+ +N+++SP F WL +R LS+DDRIMRLKEG
Sbjct: 360 QSGLRALVKAGYGARVTPFVDESVIIDIDPDNKDMSPEFRRWLRERNLSSDDRIMRLKEG 419
Query: 366 HIKEGSTVSVMGVVRRHENVLMIVPPTEPVSTGCQWMRCLLPTGVEGLIITCEDNQNADV 425
+IKEGSTVSVMGVV++++NVLMIVPP EP+STGCQW +C+LP + GL++ CED N DV
Sbjct: 420 YIKEGSTVSVMGVVQKNDNVLMIVPPPEPISTGCQWAKCVLPRDLYGLVLRCEDTSNIDV 479
Query: 426 IAV 428
IAV
Sbjct: 480 IAV 482
>UniRef100_Q7XN56 OSJNBb0103I08.7 protein [Oryza sativa]
Length = 465
Score = 192 bits (489), Expect = 1e-47
Identities = 98/257 (38%), Positives = 157/257 (60%), Gaps = 4/257 (1%)
Query: 162 RSVVWVFMVVVAMCLLVGVFLMVAVKKNVILFALGGVIVPVLVLIIWNCVLGRKG--LLG 219
R+ VW ++A+ L +G ++ V +L + V+ ++WN G L
Sbjct: 207 RAAVWAVAALLAVGLGLGALVLAVVHSAALLVVAVLLSAAVVAFLLWNAAASASGRALRR 266
Query: 220 FVKRYPDAELRGAIDGQYVKVTGVVTCGSIPLESSYQRIPRCVYVSSELYEYKGWGGKSA 279
FV P + LR A D Q VK+TG+V CG I L SSY+++ CVY S+ L + WG + A
Sbjct: 267 FVDGLPASSLRSATDDQLVKITGLVACGDISLISSYEKVENCVYTSTLLRKCGRWGSEVA 326
Query: 280 HPKHRCFTWGSRYSEKYIADFYISDFQTGLRALVKAGYGNKVAPFVKPTTVVDVTKENRE 339
+PK+RC W ++E++ ADFYI+D ++G RALVKAG+ ++V P + +V T N E
Sbjct: 327 NPKNRCSKWKLTHAERFAADFYITDAKSGKRALVKAGHDSRVVPLIDENLLV-TTSGNTE 385
Query: 340 LSPNFLGWLADRKLSTDD-RIMRLKEGHIKEGSTVSVMGVVRRHENVLMIVPPTEPVSTG 398
LS WL +R + +++ +++RL+EG+I EG +SV+G++ + + LMI+PP EP+STG
Sbjct: 386 LSSTLRCWLDERNIPSEECQLIRLEEGYIAEGMRLSVIGILSKKDGDLMILPPPEPISTG 445
Query: 399 CQWMRCLLPTGVEGLII 415
C ++ LLPT +G+++
Sbjct: 446 CVFLSFLLPTYFDGIVL 462
>UniRef100_Q67V14 Hypothetical protein OSJNBa0019I19.50 [Oryza sativa]
Length = 491
Score = 189 bits (479), Expect = 2e-46
Identities = 146/433 (33%), Positives = 207/433 (47%), Gaps = 54/433 (12%)
Query: 8 HQLSSGLYVSG-RPEQPKERQPPTMASRSVP-YTGGDPKKSGELGKMLDIPVLDPKSHPS 65
H++ SG+YVSG P++ KER+ + S + P YTGGD +SGELG+M DI
Sbjct: 101 HKIGSGMYVSGPAPDRGKERRQLSSGSVATPPYTGGDVSRSGELGRMFDI---------- 150
Query: 66 SSSSQLSTGPARSRPNSGQVGKNIT-----GSGTLSRKSTGS---GPIALQPTGLITSGP 117
PA SR +SG + + + SG LS+ S GP P P
Sbjct: 151 ---GGAGVSPASSRRSSGPLPRPLPLLPSPASGPLSQLSHSGLLVGPSPPPPPPQTQQSP 207
Query: 118 VGSGPVGASRRSGQLEQSGSMGKAVYGSAVTSLGEEVKVGFRVSRSVVWVFMVVVAMCLL 177
GS + RR E++ + +A G A LG V V V+ SV L
Sbjct: 208 AGSWRKSSRRR----EEAAAAPEAARGRA--RLG--VSVACYVAASVA------ATAGLG 253
Query: 178 VGVFLMVAVKKNVILFALGGVIVPVLVLIIWNCVLGRKGLLGFVKRYPDA--ELRGAIDG 235
G F +VA + +L A GG + V WN F +R PD + G
Sbjct: 254 AGAFFLVAWHRWEVLSAAGGAVAAVAAAFAWNVRRRDAEAERFFRRLPDTVFDQSDMPIG 313
Query: 236 QYVKVTGVVTCGSIPLESSYQRIPRCVYVSSELYEYKGWGGKSAHPKHRCFTWGSRYSEK 295
+ VK+TG VTCG PL + + RC++ S +LYE +G CF W +SE
Sbjct: 314 ELVKITGQVTCGHQPLGARFHDAARCIFTSVQLYERRGC----------CFRWQQTHSET 363
Query: 296 YIADFYISDFQTGLRALVKAGYGNKVAPFVKPTTVVDVTKENRELSPNFLGWLADRKLST 355
A+FYISD TG R V+AG G K+ +K T + E + S N W+A LS
Sbjct: 364 RTANFYISDRNTGKRFYVRAGEGGKITWMIKQKTD-SLDGERKGASRNLKSWMASNDLSC 422
Query: 356 DDRIMRLKEGHIKEGSTVSVMGVVRRHENVLMIVPPTEPVSTGCQWMRCLLPTGVEGLII 415
D + +KEG EG T SV+GV+++H ++ P+ V+TGCQ+ RC+ P VEGLI+
Sbjct: 423 DGTV-HVKEG---EGDTASVIGVLKKHHAYDIVDAPSGVVTTGCQFTRCMFPVHVEGLIL 478
Query: 416 TCEDNQNADVIAV 428
+++ + +V V
Sbjct: 479 VGDEDPDDEVYMV 491
>UniRef100_UPI000036EC71 UPI000036EC71 UniRef100 entry
Length = 684
Score = 43.9 bits (102), Expect = 0.009
Identities = 46/158 (29%), Positives = 65/158 (41%), Gaps = 13/158 (8%)
Query: 2 GTRIPSHQLSSGLYVSGRPEQPKERQPPTMASRSVPYTGGDPKKSGELGKMLDIPVLDPK 61
G R P SS G+P R PT++S VP SG +
Sbjct: 367 GVRQPGSSSSSA---PGQPSTGVAR--PTVSSGPVPRRQNGSSSSGPERSISGSKKPTND 421
Query: 62 SHPSSSSSQLSTGPARSRPNSGQVGKNITGSGTLSR---KSTGSGPIALQPTGLITS--- 115
S+PS + + GP + +SG G+ I+GSG+ +R S G G P L
Sbjct: 422 SNPSRRTVSGTCGPGQPASSSGGPGRPISGSGSSARPLGSSRGPGRPVSSPHELRRPVSG 481
Query: 116 -GPVGSGPVGASRR-SGQLEQSGSMGKAVYGSAVTSLG 151
GP G +G R SG + ++ +V G V+SLG
Sbjct: 482 LGPPGRSVIGPGRSISGSIPAGRTVSNSVPGRPVSSLG 519
>UniRef100_O27571 Coenzyme F390 synthetase I [Methanobacterium thermoautotrophicum]
Length = 453
Score = 43.9 bits (102), Expect = 0.009
Identities = 24/57 (42%), Positives = 28/57 (49%), Gaps = 2/57 (3%)
Query: 267 ELYEYKGWGGKSAHPKHRCFTWGS--RYSEKYIADFYISDFQTGLRALVKAGYGNKV 321
E+Y G S PK TWG RY+EKY F F+TG R +V A YG V
Sbjct: 94 EIYTIHETSGTSGRPKSFFLTWGDWQRYAEKYARSFVSQGFETGDRVVVCASYGMNV 150
>UniRef100_Q5YZA8 Hypothetical protein [Nocardia farcinica]
Length = 461
Score = 42.0 bits (97), Expect = 0.034
Identities = 41/159 (25%), Positives = 58/159 (35%), Gaps = 28/159 (17%)
Query: 12 SGLYVSGRPEQPKERQPPTMASRSVPYTGGDPKKSGELGKMLDIPVLDPKSHPSSSSSQL 71
+G SG P P + P +SR+ P G DP D P DP+ SS
Sbjct: 279 AGTNPSGNPNTPAAQDP---SSRNDPTAGDDPAD--------DSPYEDPQGTNPQSSEPQ 327
Query: 72 STGPARSRPNSGQVGKNIT------GSGTLSRKSTGSGPIALQPTGLI-----------T 114
ST P + P++ G +++ S T S S G G L G T
Sbjct: 328 STVPQSTVPSAAPPGSSLSDPSGRPNSSTPSLGSPGLGAPGLGSPGATPSPGSSVPGAKT 387
Query: 115 SGPVGSGPVGASRRSGQLEQSGSMGKAVYGSAVTSLGEE 153
+ P G G +G ++G G G+ G+E
Sbjct: 388 AQPTGLGTAATRAATGAAGRAGMPGMGAMGAGAGRRGDE 426
>UniRef100_Q8MQG9 Prion-like-(Q/n-rich)-domain-bearing protein protein 75, isoform c
[Caenorhabditis elegans]
Length = 539
Score = 42.0 bits (97), Expect = 0.034
Identities = 43/138 (31%), Positives = 50/138 (36%), Gaps = 33/138 (23%)
Query: 22 QPKERQPPTMASRSV------PYTGGDPKKSGELGKMLDIPVLDPKSHP----------S 65
Q ER PPT + + P GG K S E + + P P+ P S
Sbjct: 359 QAPERSPPTGSPPTGSPPTGRPPRGGPGKSSEESSESREGPRGGPRGGPRGGPRKSSEES 418
Query: 66 SSSSQLSTGPARSRPNSGQVGKNITGSGTLSRKSTGSGPIALQPTGLIT----------- 114
S S + GP RS P G TGS R GS P PTGL +
Sbjct: 419 SESREEPRGPRRSPPT----GSPPTGSPPTGRPPRGSPPTGSPPTGLPSRQKRQAPEDRP 474
Query: 115 --SGPVGSGPVGASRRSG 130
S P GS P G R G
Sbjct: 475 TGSPPTGSPPTGRPHRGG 492
Score = 34.7 bits (78), Expect = 5.3
Identities = 32/114 (28%), Positives = 42/114 (36%), Gaps = 10/114 (8%)
Query: 20 PEQPKERQPPTMASRSVPYTGGDPKKSGELGKM-LDIPVLDPKSHPSS--SSSQLSTGPA 76
P P+ P P TG P+ S G +P + P + S + P
Sbjct: 425 PRGPRRSPPTGSPPTGSPPTGRPPRGSPPTGSPPTGLPSRQKRQAPEDRPTGSPPTGSPP 484
Query: 77 RSRPNSGQVGKNITGSGTLSRKSTGSGPIALQPTGLITSGPVGSGPVGASRRSG 130
RP+ G GK S + + GP PTG S P GS P GA + G
Sbjct: 485 TGRPHRGGPGK----SESSESREGPRGPRRSPPTG---SPPTGSPPTGAPPKKG 531
>UniRef100_O02123 Prion-like-(Q/n-rich)-domain-bearing protein protein 75, isoform a
[Caenorhabditis elegans]
Length = 524
Score = 42.0 bits (97), Expect = 0.034
Identities = 43/138 (31%), Positives = 50/138 (36%), Gaps = 33/138 (23%)
Query: 22 QPKERQPPTMASRSV------PYTGGDPKKSGELGKMLDIPVLDPKSHP----------S 65
Q ER PPT + + P GG K S E + + P P+ P S
Sbjct: 344 QAPERSPPTGSPPTGSPPTGRPPRGGPGKSSEESSESREGPRGGPRGGPRGGPRKSSEES 403
Query: 66 SSSSQLSTGPARSRPNSGQVGKNITGSGTLSRKSTGSGPIALQPTGLIT----------- 114
S S + GP RS P G TGS R GS P PTGL +
Sbjct: 404 SESREEPRGPRRSPPT----GSPPTGSPPTGRPPRGSPPTGSPPTGLPSRQKRQAPEDRP 459
Query: 115 --SGPVGSGPVGASRRSG 130
S P GS P G R G
Sbjct: 460 TGSPPTGSPPTGRPHRGG 477
Score = 34.7 bits (78), Expect = 5.3
Identities = 32/114 (28%), Positives = 42/114 (36%), Gaps = 10/114 (8%)
Query: 20 PEQPKERQPPTMASRSVPYTGGDPKKSGELGKM-LDIPVLDPKSHPSS--SSSQLSTGPA 76
P P+ P P TG P+ S G +P + P + S + P
Sbjct: 410 PRGPRRSPPTGSPPTGSPPTGRPPRGSPPTGSPPTGLPSRQKRQAPEDRPTGSPPTGSPP 469
Query: 77 RSRPNSGQVGKNITGSGTLSRKSTGSGPIALQPTGLITSGPVGSGPVGASRRSG 130
RP+ G GK S + + GP PTG S P GS P GA + G
Sbjct: 470 TGRPHRGGPGK----SESSESREGPRGPRRSPPTG---SPPTGSPPTGAPPKKG 516
>UniRef100_P70478 Adenomatous polyposis coli protein [Rattus norvegicus]
Length = 2842
Score = 41.6 bits (96), Expect = 0.044
Identities = 33/127 (25%), Positives = 48/127 (36%), Gaps = 3/127 (2%)
Query: 19 RPEQPKERQPPTMASRSVPYTGGDPKKSGELGKMLDIPVLDPKSHPSSSSSQLSTGPAR- 77
+P E P T + + + P +SG P P S P S + S P R
Sbjct: 2275 KPAVKSELSPITRQTSHISGSNKGPSRSGSRDSTPSRPTQQPLSRPMQSPGRNSISPGRN 2334
Query: 78 --SRPNSGQVGKNITGSGTLSRKSTGSGPIALQPTGLITSGPVGSGPVGASRRSGQLEQS 135
S PN + T S KS+GSG ++ G S S G S+ + + +S
Sbjct: 2335 GISTPNKLSQLPRTSSPSTASTKSSGSGKMSYTSPGRQLSQQNLSKQTGLSKNASSIPRS 2394
Query: 136 GSMGKAV 142
S K +
Sbjct: 2395 ESASKGL 2401
>UniRef100_Q9IB91 Type I collagen alpha 1 [Xenopus laevis]
Length = 1447
Score = 41.2 bits (95), Expect = 0.057
Identities = 39/131 (29%), Positives = 51/131 (38%), Gaps = 8/131 (6%)
Query: 17 SGRPEQPKERQPP-TMASRSVPYTGGDPKKSGELG------KMLDIPVLDPKSHPSSSSS 69
+G+P +P ER PP +R +P T G P G G D PK P S
Sbjct: 221 AGKPGRPGERGPPGPQGARGLPGTAGLPGMKGHRGFNGLDGAKGDSGPAGPKGEPGSPGE 280
Query: 70 QLSTGPARSRPNSGQVGKNITGSGTLSRKSTGSGPIALQPTGLITSGPVG-SGPVGASRR 128
+ G R SG+ G+ +R + G+ A P SGP G G VG
Sbjct: 281 NGAPGQVGPRGLSGERGRPGPSGPAGARGNDGAPGAAGPPGSTGPSGPPGFPGGVGPKGD 340
Query: 129 SGQLEQSGSMG 139
+G GS G
Sbjct: 341 AGPQGSRGSDG 351
>UniRef100_Q802B5 Col1a1-prov protein [Xenopus laevis]
Length = 1449
Score = 41.2 bits (95), Expect = 0.057
Identities = 39/131 (29%), Positives = 51/131 (38%), Gaps = 8/131 (6%)
Query: 17 SGRPEQPKERQPP-TMASRSVPYTGGDPKKSGELG------KMLDIPVLDPKSHPSSSSS 69
+G+P +P ER PP +R +P T G P G G D PK P S
Sbjct: 221 AGKPGRPGERGPPGPQGARGLPGTAGLPGMKGHRGFNGLDGAKGDSGPAGPKGEPGSPGE 280
Query: 70 QLSTGPARSRPNSGQVGKNITGSGTLSRKSTGSGPIALQPTGLITSGPVG-SGPVGASRR 128
+ G R SG+ G+ +R + G+ A P SGP G G VG
Sbjct: 281 NGAPGQVGPRGLSGERGRPGPSGPAGARGNDGAPGAAGPPGSTGPSGPPGFPGGVGPKGD 340
Query: 129 SGQLEQSGSMG 139
+G GS G
Sbjct: 341 AGPQGSRGSDG 351
>UniRef100_UPI00002DAAFB UPI00002DAAFB UniRef100 entry
Length = 764
Score = 40.8 bits (94), Expect = 0.075
Identities = 37/125 (29%), Positives = 50/125 (39%), Gaps = 13/125 (10%)
Query: 19 RPEQPKERQPPTMASRSVPYTGGDPKKSGELGKMLDIPVLDPKSHPSSSSSQLSTG--PA 76
RPE P P A R+V GG P S + P P++ PSS+ + ST P
Sbjct: 575 RPEAPA----PKAAPRTVGPVGGSPAPSTQ-------PSSRPQAAPSSTPAPASTPAPPT 623
Query: 77 RSRPNSGQVGKNITGSGTLSRKSTGSGPIALQPTGLITSGPVGSGPVGASRRSGQLEQSG 136
+ PN GQ + ++ + R P P SGP P+ R + E S
Sbjct: 624 ATLPNPGQEPRKLSYAEVCQRPPKDPPPATPAPAPSGPSGPSSGQPLRELRVNKPEEPSS 683
Query: 137 SMGKA 141
S G A
Sbjct: 684 SSGPA 688
>UniRef100_UPI00002F9781 UPI00002F9781 UniRef100 entry
Length = 1403
Score = 40.4 bits (93), Expect = 0.097
Identities = 42/153 (27%), Positives = 59/153 (38%), Gaps = 22/153 (14%)
Query: 17 SGRPEQPKERQPP-TMASRSVPYTGGDPKKSGELGKMLDIPVLD----------PKSHPS 65
+GRP +P +R P +R P T G P G G P LD PK
Sbjct: 139 NGRPGKPGDRGVPGPQGARGFPGTPGLPGMKGHRG----YPGLDGRKGESGAAGPKGETG 194
Query: 66 SSSSQLSTGPARSRPNSGQVGKNITGSGTLSRKSTGS-------GPIALQPTGLITSGPV 118
+ + S G A +R + G+ G+ T +R + G+ GP+ GP
Sbjct: 195 ARGAAGSPGQAGARGSPGERGRAGPAGATGARGADGNVGPAGAAGPVGNAGPPGFPGGPG 254
Query: 119 GSGPVGASRRSGQLEQSGSMGKAVYGSAVTSLG 151
G VG + SG GS G+ AV +G
Sbjct: 255 PKGDVGPAGSSGPSGPQGSRGEPGPNGAVGPVG 287
>UniRef100_Q50751 Coenzyme F390 synthetase [Methanobacterium thermoformicicum]
Length = 377
Score = 40.4 bits (93), Expect = 0.097
Identities = 22/57 (38%), Positives = 27/57 (46%), Gaps = 2/57 (3%)
Query: 267 ELYEYKGWGGKSAHPKHRCFTWGS--RYSEKYIADFYISDFQTGLRALVKAGYGNKV 321
++Y G S PK TWG RY+EKY F F+ G R +V A YG V
Sbjct: 90 DIYTIHETSGTSGRPKSFFLTWGDWQRYAEKYARSFVSQGFERGDRVVVCASYGMNV 146
>UniRef100_Q9W506 CG3443-PB [Drosophila melanogaster]
Length = 3433
Score = 40.0 bits (92), Expect = 0.13
Identities = 40/138 (28%), Positives = 54/138 (38%), Gaps = 7/138 (5%)
Query: 2 GTRIPSHQLSSGLYVSGRPEQPKERQPPTMASRSVPYTGGDPKKSGELGKMLDIPVLDPK 61
G P+ SSG S RPE R ++P G GE GK L P +
Sbjct: 2944 GASGPTRATSSGGVRSRRPEVGSSRGRDHERRATLPIASGGA--GGEPGKDLTAPQTQVE 3001
Query: 62 SHPSSSSSQLSTGPARSRPNSG-QVGKNITGSGTLSRKSTGSGPIALQPTGLITSG---- 116
PS S++LS+ G +G IT G RK+ G + TG +S
Sbjct: 3002 HEPSPRSTKLSSSSGSLGLGMGVGLGNIITTPGDYPRKTKGPICLTAVDTGTSSSAAEGK 3061
Query: 117 PVGSGPVGASRRSGQLEQ 134
P GSG V + G + +
Sbjct: 3062 PAGSGTVAGTGVGGVIRK 3079
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda      K        H
 0.317    0.136    0.407
Gapped
Lambda      K        H
 0.267    0.0410   0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 747,773,666
Number of Sequences: 2790947
Number of extensions: 33131520
Number of successful extensions: 108332
Number of sequences better than 10.0: 292
Number of HSP's better than 10.0 without gapping: 16
Number of HSP's successfully gapped in prelim test: 304
Number of HSP's that attempted gapping in prelim test: 106853
Number of HSP's gapped (non-prelim): 1276
length of query: 428
length of database: 848,049,833
effective HSP length: 130
effective length of query: 298
effective length of database: 485,226,723
effective search space: 144597563454
effective search space used: 144597563454
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 76 (33.9 bits)
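The figures in the statistics block can be cross-checked against each other. The sketch below is not part of the BLAST output; it assumes the standard Karlin-Altschul relations (effective lengths are the raw lengths minus the effective HSP length per sequence, bit score S' = (lambda*S - ln K) / ln 2, and E = search space * 2^-S'). The function names are hypothetical and chosen for illustration only; the constants are copied from the report.

```python
import math

# Values copied from the statistics block of the report.
QUERY_LEN = 428
DB_LETTERS = 848_049_833
DB_SEQS = 2_790_947
HSP_LEN = 130            # "effective HSP length"
LAMBDA_GAPPED = 0.267    # gapped Lambda
K_GAPPED = 0.0410        # gapped K


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, their product)."""
    eff_query = query_len - hsp_len
    # The HSP length is subtracted once for every database sequence.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


eff_query, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN
)
print(eff_query, eff_db, space)  # 298 485226723 144597563454

# Raw score 102 is the value reported for the UniRef100_O27571 hit.
bits = bit_score(102, LAMBDA_GAPPED, K_GAPPED)
print(round(bits, 1), expect_value(bits, space))
```

Under these assumptions the three integers reproduce the report's effective query length, effective database length, and effective search space, and a raw score of 102 comes out near the 43.9 bits and E of about 0.009 listed for that hit. Because Lambda and K are printed to only three significant figures, small rounding differences from the reported values are to be expected.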
Medicago: description of AC122163.2