Medicago
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC141111.4 - phase: 0 /pseudo
         (1069 letters)

Database: MTGI 
           36,976 sequences; 27,044,181 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

AL366725                                                              259  5e-69
TC93153 similar to GP|14715220|emb|CAC44106. gag polyprotein {Ci...   116  5e-26
BG586326 similar to PIR|G84493|G8 probable retroelement pol poly...    47  3e-05
TC88387 similar to GP|13357253|gb|AAK20050.1 putative zinc finge...    46  7e-05
TC78961 similar to GP|18252179|gb|AAL61922.1 unknown protein {Ar...    44  4e-04
TC82733 similar to GP|10177404|dbj|BAB10535. gene_id:K24M7.12~pi...    43  7e-04
TC81207 similar to GP|21322752|dbj|BAB78536. cold shock protein-...    41  0.002
BG452991 PIR|A25875|A25 histone H4 - Tetrahymena thermophila, pa...    37  0.003
BG644693 weakly similar to GP|18767374|g Putative 22 kDa kafirin...    38  0.018
TC89725 similar to PIR|T05494|T05494 glycine-rich protein T19K4....    32  1.3
TC80683 homologue to GP|8777424|dbj|BAA97014.1 gb|AAF56406.1~gen...    31  2.9
TC81797 similar to GP|17065224|gb|AAL32766.1 Unknown protein {Ar...    30  3.8
TC83030 weakly similar to GP|18855061|gb|AAL79753.1 putative RNA...    30  3.8

>AL366725 
          Length = 485

 Score =  259 bits (661), Expect = 5e-69
 Identities = 119/161 (73%), Positives = 136/161 (83%)
 Frame = +2

Query: 43  KFYPHYTAETAEFSKCIKIENGLRADIKRAIGYQKIRNFSELVSSCRIYEEDTKAHYKVM 102
           KFYPHY AETAEFSKCIK ENGLR DIKRAIGYQ++R F +LV++CRIYEEDTKAH KV+
Sbjct: 2   KFYPHYAAETAEFSKCIKFENGLRPDIKRAIGYQQLRVFPDLVNTCRIYEEDTKAHDKVV 181

Query: 103 SERRGKGQLSRPKPYSAPPDKGK*RLKDERRPKMRDAPTDIVCFKCGEKGHKSNVCDRDE 162
           +ER+ KGQ SRPKPYSAP DKGK R+ D+RRPK +DAP +IVCF  GEKGHKSNVC ++ 
Sbjct: 182 NERKTKGQ*SRPKPYSAPADKGKQRMVDDRRPKKKDAPAEIVCFNYGEKGHKSNVCPKEI 361

Query: 163 KKCFRCGKKGHTLADCKRGDIVCYNFNEEGHISLQCTQPKK 203
           KKC RC KKGH +ADCKR DIVC+N NEEGHI  QC QPK+
Sbjct: 362 KKCVRCDKKGHIVADCKRNDIVCFNCNEEGHIGSQCKQPKR 484


>TC93153 similar to GP|14715220|emb|CAC44106. gag polyprotein {Cicer
           arietinum}, partial (8%)
          Length = 516

 Score =  116 bits (290), Expect = 5e-26
 Identities = 55/59 (93%), Positives = 57/59 (96%)
 Frame = +2

Query: 1   RREFLNRYFPEDVRGKKEIEFLELKQGDMSVTEYAAKFVELAKFYPHYTAETAEFSKCI 59
           R+EFL RYFPEDVRGKKEIEFLELKQGDMSVTEYAAKFVELA FYPHY+AETAEFSKCI
Sbjct: 338 RKEFLGRYFPEDVRGKKEIEFLELKQGDMSVTEYAAKFVELATFYPHYSAETAEFSKCI 514


>BG586326 similar to PIR|G84493|G8 probable retroelement pol polyprotein
           [imported] - Arabidopsis thaliana, partial (13%)
          Length = 736

 Score = 47.4 bits (111), Expect = 3e-05
 Identities = 45/113 (39%), Positives = 56/113 (48%)
 Frame = +3

Query: 686 FIVMRPSWA*VVF*CKMVKW*LMLLGS*RFMKRIILHMIWSWLRWFLF*KFGGIICTVPD 745
           FI  R S  *V +*  M +      GS*  M+     MI  WLR +  *+FG   C VP 
Sbjct: 54  FIQTRLSLD*VAY*PSMRRSSPTRQGS*ENMRETTPPMI*KWLR*YSP*RFGAHTCMVPR 233

Query: 746 LKCLVITKV*NIYLIRKS*I*GREDGLTC*RIMILV*ITIRVKLMCLQML*AG 798
            + +   KV*+I+L   S* *GR  G   *   I   + IR KL+  Q L*AG
Sbjct: 234 FRYIRTIKV*SIFLPSLS*T*GRGGGWNS*LTTI*TSLIIREKLIW*QTL*AG 392


>TC88387 similar to GP|13357253|gb|AAK20050.1 putative zinc finger protein
           {Oryza sativa (japonica cultivar-group)}, partial (96%)
          Length = 1286

 Score = 46.2 bits (108), Expect = 7e-05
 Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 2/77 (2%)
 Frame = +1

Query: 129 KDERRPK--MRDAPTDIVCFKCGEKGHKSNVCDRDEKKCFRCGKKGHTLADCKRGDIVCY 186
           K+ +RP   +R+ P   VC  C   GH ++ C   +  C+ C + GH  + C    I C+
Sbjct: 427 KNCKRPGHYVRECPNVAVCHNCSLPGHIASECST-KSLCWNCKEPGHMASSCPNEGI-CH 600

Query: 187 NFNEEGHISLQCTQPKK 203
              + GH + +CT P+K
Sbjct: 601 TCGKAGHRARECTVPQK 651



 Score = 44.3 bits (103), Expect = 3e-04
 Identities = 26/77 (33%), Positives = 38/77 (48%), Gaps = 1/77 (1%)
 Frame = +1

Query: 140 PTDI-VCFKCGEKGHKSNVCDRDEKKCFRCGKKGHTLADCKRGDIVCYNFNEEGHISLQC 198
           P D+ +C  C ++GH +  C  +EK C  C K GH   DC   D +C   N  GH++ QC
Sbjct: 655 PGDLRLCNNCYKQGHIAVECT-NEKACNNCRKTGHLARDCP-NDPICNLCNISGHVARQC 828

Query: 199 TQPKKVRTGGKVFALTG 215
            +   +   G   +L G
Sbjct: 829 PKSNVIGDRGGGGSLRG 879



 Score = 43.1 bits (100), Expect = 6e-04
 Identities = 19/65 (29%), Positives = 31/65 (47%), Gaps = 6/65 (9%)
 Frame = +1

Query: 140 PTDIVCFKCGEKGHKSNVCDRDEKK------CFRCGKKGHTLADCKRGDIVCYNFNEEGH 193
           P + +C  CG+ GH++  C   +K       C  C K+GH   +C   +  C N  + GH
Sbjct: 580 PNEGICHTCGKAGHRARECTVPQKPPGDLRLCNNCYKQGHIAVEC-TNEKACNNCRKTGH 756

Query: 194 ISLQC 198
           ++  C
Sbjct: 757 LARDC 771



 Score = 41.6 bits (96), Expect = 0.002
 Identities = 23/78 (29%), Positives = 31/78 (39%)
 Frame = +1

Query: 132 RRPKMRDAPTDIVCFKCGEKGHKSNVCDRDEKKCFRCGKKGHTLADCKRGDIVCYNFNEE 191
           RR   R    D +C  C   GH    C  +   C  C   GH  ++C    + C+N  E 
Sbjct: 385 RRDSRRGFSQDNLCKNCKRPGHYVRECP-NVAVCHNCSLPGHIASECSTKSL-CWNCKEP 558

Query: 192 GHISLQCTQPKKVRTGGK 209
           GH++  C       T GK
Sbjct: 559 GHMASSCPNEGICHTCGK 612



 Score = 36.2 bits (82), Expect = 0.069
 Identities = 21/85 (24%), Positives = 30/85 (34%), Gaps = 23/85 (27%)
 Frame = +1

Query: 137 RDAPTDIVCFKCGEKGHKSNVCDRD----------------------EKKCFRCGKKGHT 174
           RD P D +C  C   GH +  C +                       +  C  C + GH 
Sbjct: 763 RDCPNDPICNLCNISGHVARQCPKSNVIGDRGGGGSLRGGYRDGGFRDVVCRSCQQFGHM 942

Query: 175 LADCKRGDI-VCYNFNEEGHISLQC 198
             DC  G + +C N    GH + +C
Sbjct: 943 SRDCMGGPLMICQNCGGRGHQAYEC 1017



 Score = 33.1 bits (74), Expect = 0.59
 Identities = 16/44 (36%), Positives = 20/44 (45%), Gaps = 1/44 (2%)
 Frame = +1

Query: 142 DIVCFKCGEKGHKSNVCDRDEKK-CFRCGKKGHTLADCKRGDIV 184
           D+VC  C + GH S  C       C  CG +GH   +C  G  V
Sbjct: 904 DVVCRSCQQFGHMSRDCMGGPLMICQNCGGRGHQAYECPSGRFV 1035


>TC78961 similar to GP|18252179|gb|AAL61922.1 unknown protein {Arabidopsis
           thaliana}, partial (71%)
          Length = 974

 Score = 43.5 bits (101), Expect = 4e-04
 Identities = 21/44 (47%), Positives = 23/44 (51%), Gaps = 3/44 (6%)
 Frame = +2

Query: 164 KCFRCGKKGHTLADCKRGD--IVCYNFNEEGHISLQC-TQPKKV 204
           +CF CG  GH   DCK GD    CY   E GHI   C   PKK+
Sbjct: 356 RCFNCGIDGHWARDCKAGDWKNKCYRCGERGHIEKNCKNSPKKL 487



 Score = 40.8 bits (94), Expect = 0.003
 Identities = 16/37 (43%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
 Frame = +2

Query: 145 CFKCGEKGHKSNVCDRDE--KKCFRCGKKGHTLADCK 179
           CF CG  GH +  C   +   KC+RCG++GH   +CK
Sbjct: 359 CFNCGIDGHWARDCKAGDWKNKCYRCGERGHIEKNCK 469


>TC82733 similar to GP|10177404|dbj|BAB10535.
           gene_id:K24M7.12~pir||S42136~similar to unknown protein
           {Arabidopsis thaliana}, partial (57%)
          Length = 710

 Score = 42.7 bits (99), Expect = 7e-04
 Identities = 21/67 (31%), Positives = 34/67 (50%), Gaps = 12/67 (17%)
 Frame = +3

Query: 144 VCFKCGEKGHKSNVCD-----RDEKKCFRCGKKGHTLADC----KRGDIV---CYNFNEE 191
           +C +C  +GH++  C       D K  + CG  GH+LA+C    + G  +   C+   E+
Sbjct: 354 ICLRCRRRGHRAQNCPDGGSKEDFKY*YNCGDNGHSLANCPHPLQEGGTMFAQCFVCKEQ 533

Query: 192 GHISLQC 198
           GH+S  C
Sbjct: 534 GHLSKNC 554



 Score = 41.6 bits (96), Expect = 0.002
 Identities = 35/119 (29%), Positives = 50/119 (41%), Gaps = 11/119 (9%)
 Frame = +3

Query: 105 RRGKGQLSRPKPYS-APPDKGK*RLKDERRPKMRDAPTDIVCFKCGEKGHKSNVCDRD-- 161
           ++ K +  R KP S + P  GK  L   R P M+   +   CF C    H +  C +   
Sbjct: 177 KKKKNKFKRKKPDSNSKPRTGKRPL---RVPGMKPGDS---CFICKGLDHIAKFCTQKAE 338

Query: 162 ---EKKCFRCGKKGHTLADCKRGDI-----VCYNFNEEGHISLQCTQPKKVRTGGKVFA 212
               K C RC ++GH   +C  G         YN  + GH    C  P  ++ GG +FA
Sbjct: 339 WEKNKICLRCRRRGHRAQNCPDGGSKEDFKY*YNCGDNGHSLANCPHP--LQEGGTMFA 509


>TC81207 similar to GP|21322752|dbj|BAB78536. cold shock protein-1 {Triticum
           aestivum}, partial (39%)
          Length = 630

 Score = 41.2 bits (95), Expect = 0.002
 Identities = 23/88 (26%), Positives = 30/88 (33%), Gaps = 31/88 (35%)
 Frame = +3

Query: 145 CFKCGEKGHKSNVCDRDEKK-----------------CFRCGKKGHTLADCKR------- 180
           C+ CG+ GH +  CDR ++                  C+ CG   H   DC R       
Sbjct: 351 CYTCGDTGHIARDCDRSDRNDRNDRSGGGGGGDRDRACYTCGSFEHFARDCMRGGGNNNN 530

Query: 181 -------GDIVCYNFNEEGHISLQCTQP 201
                  G   CY     GHI+  C  P
Sbjct: 531 GGGGYGGGGTSCYRCGGVGHIARDCATP 614


>BG452991 PIR|A25875|A25 histone H4 - Tetrahymena thermophila, partial (33%)
          Length = 560

 Score = 37.4 bits (85), Expect(2) = 0.003
 Identities = 26/61 (42%), Positives = 36/61 (58%)
 Frame = +1

Query: 818 ET*VLSVSCHLRVYSWVC*RLIVIS*TVSEKHIKLMSSLLI*WLLVMKLKIMTSRLMIKV 877
           ET*V  V C  RV +W C*R       VS++  + M S  I* L +++LK++  +LMIK 
Sbjct: 82  ET*VWFVKCRHRV*NWGC*RSTTNFWIVSKRLRRWM*SW*I*CLGIIRLKMVILKLMIKE 261

Query: 878 C 878
           C
Sbjct: 262 C 264



 Score = 22.3 bits (46), Expect(2) = 0.003
 Identities = 9/12 (75%), Positives = 10/12 (83%)
 Frame = +3

Query: 802 TCLL*W*KSLSC 813
           TCLL*W +S SC
Sbjct: 33  TCLL*WLESWSC 68


>BG644693 weakly similar to GP|18767374|g Putative 22 kDa kafirin cluster;
           Ty3-Gypsy type {Oryza sativa}, partial (15%)
          Length = 716

 Score = 38.1 bits (87), Expect = 0.018
 Identities = 16/27 (59%), Positives = 20/27 (73%)
 Frame = +2

Query: 557 LDKFVVVFIDDILIYSETEEEHAEHLK 583
           LD  V+VF +DILIYS+ E EH  HL+
Sbjct: 506 LDSLVIVFSNDILIYSKNENEHENHLR 586


>TC89725 similar to PIR|T05494|T05494 glycine-rich protein T19K4.150 -
           Arabidopsis thaliana, partial (17%)
          Length = 378

 Score = 32.0 bits (71), Expect = 1.3
 Identities = 15/57 (26%), Positives = 19/57 (33%), Gaps = 20/57 (35%)
 Frame = +1

Query: 145 CFKCGEKGHKSNVCDRDEK--------------------KCFRCGKKGHTLADCKRG 181
           C+ CGE GH +  C                          C+ CG+ GH   DC  G
Sbjct: 16  CYNCGESGHMARECTSGGGGGGGRYGGGGGGGGGGGGGGSCYSCGESGHFARDCPTG 186


>TC80683 homologue to GP|8777424|dbj|BAA97014.1
           gb|AAF56406.1~gene_id:K9P8.7~strong similarity to
           unknown protein {Arabidopsis thaliana}, partial (16%)
          Length = 1360

 Score = 30.8 bits (68), Expect = 2.9
 Identities = 22/81 (27%), Positives = 38/81 (46%), Gaps = 6/81 (7%)
 Frame = +1

Query: 105 RRGKGQLSRPKPYSAPPDKGK*RLK-----DERRPKMRDAPTDIVCFKCGEKGHKSNVCD 159
           R  KG+L + K   A  D+ +  ++        +P  ++    ++  +  +KG KS+   
Sbjct: 886 RGQKGKLKKMKEKYADQDEEERSIRMSLLASSGKPIKKEETLPVI--ETSDKGKKSDSGP 1059

Query: 160 RDEKK-CFRCGKKGHTLADCK 179
            D  K C++C K GH   DCK
Sbjct: 1060IDAPKICYKCKKVGHLSRDCK 1122


>TC81797 similar to GP|17065224|gb|AAL32766.1 Unknown protein {Arabidopsis
           thaliana}, partial (17%)
          Length = 758

 Score = 30.4 bits (67), Expect = 3.8
 Identities = 17/43 (39%), Positives = 29/43 (66%)
 Frame = -2

Query: 233 IVLL*LLL*ILVLLIVSLLLIVHISWVWLYLI*KDKWLLKLQL 275
           +VL+ +++ +L+LL++ LLL+    W+WL+L     WLL L L
Sbjct: 493 LVLMAIMMFLLLLLLLVLLLL----WLWLWL-----WLLLLML 392


>TC83030 weakly similar to GP|18855061|gb|AAL79753.1 putative RNA helicase
           {Oryza sativa}, partial (7%)
          Length = 624

 Score = 30.4 bits (67), Expect = 3.8
 Identities = 12/20 (60%), Positives = 14/20 (70%), Gaps = 2/20 (10%)
 Frame = +1

Query: 165 CFRCGKKGHTLADC--KRGD 182
           CF CG+ GH  +DC  KRGD
Sbjct: 166 CFTCGESGHRASDCPNKRGD 225


  Database: MTGI
    Posted date:  Oct 22, 2004  3:39 PM
  Number of letters in database: 27,044,181
  Number of sequences in database:  36,976
  
Lambda     K      H
   0.369    0.167    0.628 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 36,590,598
Number of Sequences: 36976
Number of extensions: 599713
Number of successful extensions: 7899
Number of sequences better than 10.0: 27
Number of HSP's better than 10.0 without gapping: 1701
Number of HSP's successfully gapped in prelim test: 293
Number of HSP's that attempted gapping in prelim test: 5861
Number of HSP's gapped (non-prelim): 2411
length of query: 1069
length of database: 9,014,727
effective HSP length: 106
effective length of query: 963
effective length of database: 5,095,271
effective search space: 4906745973
effective search space used: 4906745973
frameshift window, decay const: 50,  0.1
T: 13
A: 40
X1: 14 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 36 (21.7 bits)
S2: 63 (28.9 bits)


Medicago: description of AC141111.4