Medicago
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC141111.5 - phase: 0 /pseudo
         (1478 letters)

Database: LJGI 
           28,460 sequences; 14,692,800 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

BG662087                                                               92  6e-19
BE122516                                                               78  9e-15
TC18927 similar to PIR|AI2934|AI2934 chromate transport protein ...    74  2e-13
AV410603                                                               72  8e-13
AU089582                                                               50  6e-08
AU251673                                                               55  1e-07
TC11885 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, parti...    53  3e-07
BI418821                                                               46  5e-05
TC9521 similar to UP|Q9FYA7 (Q9FYA7) Splicing factor RSZ33, part...    44  2e-04
TC10011 similar to UP|Q9FYA7 (Q9FYA7) Splicing factor RSZ33, par...    42  5e-04
TC18698                                                                39  0.006
TC13053 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, parti...    37  0.023
TC12574                                                                36  0.051
TC17929                                                                34  0.19
AV779679                                                               34  0.19
BP039068                                                               31  1.2
TC9039                                                                 30  2.1
TC8917 similar to UP|AAH64262 (AAH64262) MGC76273 protein, parti...    29  6.2
TC17786 similar to PIR|T46177|T46177 villin 3 homolog T8H10.10 -...    28  8.1
TC19927 weakly similar to UP|O80689 (O80689) F8K4.2 protein, par...    28  8.1

>BG662087 
          Length = 373

 Score = 92.0 bits (227), Expect = 6e-19
 Identities = 45/113 (39%), Positives = 68/113 (59%)
 Frame = +1

Query: 557 GSMRLCIDYRQLNKVTIKNRYPLPRIDDLMDQLVGAKVFSKIDLRSGYHQIKVKDEDMQK 616
           G  R+ +DY  LNK   K+ YPLP ID L+D     ++ S +D  SGYHQIK+   D  K
Sbjct: 16  GKWRMWVDYTDLNKACPKDSYPLPSIDKLVDGASDNELLSLMDAYSGYHQIKMHPSDEDK 195

Query: 617 TAFSTRYGHYEYKVMPFGVTNAPGVFMEYMNRIFHAFLDRFVVVFIDDILIYS 669
           TAF T   +Y Y+ +PFG+ NA   +   M+R+F   + R + V++D++++ S
Sbjct: 196 TAFMTARVNYCYQTIPFGLKNAGATYQXLMDRVFXDXVGRNMEVYLDNMIVKS 354


>BE122516 
          Length = 364

 Score = 78.2 bits (191), Expect = 9e-15
 Identities = 46/116 (39%), Positives = 71/116 (60%)
 Frame = +2

Query: 408 LGMNWLEYNHVHINYFTKSVYFSSVEEESGAEFLSTKQLKQMERDGILMYPLMASLSFEN 467
           +GMNWL  N   +N   K+V F + E ++     + K  K  E +  ++  L A  + ++
Sbjct: 14  VGMNWLTANDATLNCRKKTVTFGTSEGDAKRVKRTDKVGKASECESDVL--LGALETDKS 187

Query: 468 QAVIDKLQVVCDFPEVFPDEIPDVPLEREVEFSIDLILGTKPVSMAPYRMSASELA 523
              ++ + VV +F +VFP+E+ ++P EREVEFSID + GT P+S+APYRMS  ELA
Sbjct: 188 DTGVEGIPVVREFSDVFPEEVSELPPEREVEFSID*VPGTGPISIAPYRMSLVELA 355


>TC18927 similar to PIR|AI2934|AI2934 chromate transport protein chrA
           [imported] - Agrobacterium                tumefaciens
           (strain C58, Dupont) {Agrobacterium tumefaciens;},
           partial (6%)
          Length = 561

 Score = 73.6 bits (179), Expect = 2e-13
 Identities = 45/156 (28%), Positives = 68/156 (42%), Gaps = 7/156 (4%)
 Frame = -2

Query: 201 RKTKGQQSRPKPYSAPADKGKQRMVDDR------RPKKKDAPAEITCFNCGEKGHKSNVC 254
           R  + +  + KP+  P ++G              RP + D  +EI C  C +KGH +N C
Sbjct: 464 RFDRNKSFQKKPFQRPQNRGTSSGYSHSFGNFVPRPTQSDT-SEIVCHRCSKKGHFANRC 288

Query: 255 PEEIKKCVRCGKKGHIVADCKRNDI-VCFNFNEEGHIGSQCKQPKKSPTTGRVFALDGTQ 313
           P+ +  C  C K GH   DC    +    N            + K+   + RV+ + G +
Sbjct: 287 PDLV--CWNCQKTGHSGKDCTNPKVEAATNAIAARRPAPAANKGKRPVASARVYTVSGAE 114

Query: 314 TENEDRLIRCTCYINNTPLVAIIDTGATHCFIAFDC 349
           +   D LIR    +N  PL  + D+GATH FI   C
Sbjct: 113 SHRADGLIRSVGSVNCKPLTILFDSGATHSFIDLAC 6


>AV410603 
          Length = 162

 Score = 71.6 bits (174), Expect = 8e-13
 Identities = 30/53 (56%), Positives = 42/53 (78%)
 Frame = +1

Query: 572 TIKNRYPLPRIDDLMDQLVGAKVFSKIDLRSGYHQIKVKDEDMQKTAFSTRYG 624
           T+K+ +P+P +D+L+D+L G++ FSK+DLRSGYHQI VK ED  KT F T +G
Sbjct: 4   TVKDSFPMPTVDELLDELRGSQFFSKLDLRSGYHQILVKPEDRHKTVFRTHHG 162


>AU089582 
          Length = 383

 Score = 50.4 bits (119), Expect(2) = 6e-08
 Identities = 30/84 (35%), Positives = 42/84 (49%)
 Frame = +1

Query: 1161 SCCPVGRDLYQGDREVTWCSFEHCIR*RSKIYF*VLEKLARGFGFKVEIEFGLSSADRWS 1220
            SC  +  DL   D  + WC+  +  R RS I+   LE  +  FG  ++ E+  SS++ WS
Sbjct: 7    SCVSIC*DLLG*DCFLAWCTCVYNFRSRSSIHITFLEVFSNCFGNSIKNEYRFSSSN*WS 186

Query: 1221 VGEDNSVARGFVESLCS*ARRNLG 1244
            V ED S  RG+   LC    R  G
Sbjct: 187  VREDYSDLRGYASCLCVXPERXAG 258



 Score = 24.6 bits (52), Expect(2) = 6e-08
 Identities = 19/43 (44%), Positives = 23/43 (53%)
 Frame = +2

Query: 1240 RRNLG*SSSVDRVHIQ**LSF*YWNDTF*GFVWSEMQNSVVLV 1282
            R  LG     D + I **LS *+ + T *  VW EMQ +  LV
Sbjct: 245  RGXLGSVFVFDGIRI***LSV*HLDGTI*SLVW*EMQVTYRLV 373


>AU251673 
          Length = 413

 Score = 54.7 bits (130), Expect = 1e-07
 Identities = 37/138 (26%), Positives = 65/138 (46%), Gaps = 1/138 (0%)
 Frame = +2

Query: 56  CSEVQKVRFGTHQLAEEADDWWVSLLPNLDQDGVDVTWAVF*REFMRRYFPEDVRRKKEI 115
           CS+ + V   + QL   A DW+  L           TWA F  EFM R+ P+ VR     
Sbjct: 5   CSDTRAVELASFQLEGVARDWYNVLTRAKPVGSPPWTWADFSAEFMNRFLPQSVRDGFVR 184

Query: 116 EFLELKQG-NMYVTEYAAKFVELAEFYPHYAAETVEFSKCIKFENGLRPDIKRAIEYQQI 174
           +F  L+Q   M V+EY+A F  L+ + P+     +E  +  +F  GL+  + +++   + 
Sbjct: 185 DFERLEQAEGMTVSEYSAHFTHLSRYVPY---PLLEEERVKRFVRGLKEYLFKSVVGSKS 355

Query: 175 RVFPDLVNSCRIYEEDTK 192
               ++++   + E+  K
Sbjct: 356 STLSEVLSLALLVEQRQK 409


>TC11885 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, partial (26%)
          Length = 555

 Score = 53.1 bits (126), Expect = 3e-07
 Identities = 26/93 (27%), Positives = 42/93 (44%)
 Frame = +2

Query: 202 KTKGQQSRPKPYSAPADKGKQRMVDDRRPKKKDAPAEITCFNCGEKGHKSNVCPEEIKKC 261
           +++ +   P      +D+   R    RR  ++    +  C NC   GH +  CP  +  C
Sbjct: 227 RSRSRSRSPMDRKIRSDRFSYRDAPYRRDSRRGFSRDNLCKNCKRPGHFARECP-NVAIC 403

Query: 262 VRCGKKGHIVADCKRNDIVCFNFNEEGHIGSQC 294
             CG  GHI ++C    + C+N  E GH+ S C
Sbjct: 404 HNCGLPGHIASECTTKSL-CWNCKEPGHMASSC 499


>BI418821 
          Length = 614

 Score = 45.8 bits (107), Expect = 5e-05
 Identities = 20/71 (28%), Positives = 30/71 (42%), Gaps = 15/71 (21%)
 Frame = +2

Query: 241 CFNCGEKGHKSNVCPEEIKK--------CVRCGKKGHIVADCKRND-------IVCFNFN 285
           C+NCG+ GH +  C              C  CG  GH+  DC R++         C+N  
Sbjct: 392 CYNCGDTGHLARDCHRSNNNGGGGGGAACYNCGDAGHLARDCNRSNNNSGGGGAGCYNCG 571

Query: 286 EEGHIGSQCKQ 296
           + GH+   C +
Sbjct: 572 DTGHLARDCNR 604



 Score = 38.5 bits (88), Expect = 0.008
 Identities = 15/45 (33%), Positives = 21/45 (46%), Gaps = 7/45 (15%)
 Frame = +2

Query: 241 CFNCGEKGHKSNVCPEEIKK-------CVRCGKKGHIVADCKRND 278
           C+NCG+ GH +  C             C  CG  GH+  DC R++
Sbjct: 476 CYNCGDAGHLARDCNRSNNNSGGGGAGCYNCGDTGHLARDCNRSN 610



 Score = 35.4 bits (80), Expect = 0.066
 Identities = 14/52 (26%), Positives = 22/52 (41%), Gaps = 8/52 (15%)
 Frame = +2

Query: 261 CVRCGKKGHIVADCKRND--------IVCFNFNEEGHIGSQCKQPKKSPTTG 304
           C  CG  GH+  DC R++          C+N  + GH+   C +   +   G
Sbjct: 392 CYNCGDTGHLARDCHRSNNNGGGGGGAACYNCGDAGHLARDCNRSNNNSGGG 547


>TC9521 similar to UP|Q9FYA7 (Q9FYA7) Splicing factor RSZ33, partial (56%)
          Length = 598

 Score = 43.5 bits (101), Expect = 2e-04
 Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
 Frame = +1

Query: 236 PAEITCFNCGEKGHKSNVCP--EEIKKCVRCGKKGHIVADCKRN 277
           P    CFNCG  GH +  C   +   KC RCG++GHI  +CK +
Sbjct: 376 PGSGRCFNCGIDGHWARDCKAGDWKNKCYRCGERGHIEKNCKNS 507



 Score = 38.5 bits (88), Expect = 0.008
 Identities = 19/43 (44%), Positives = 21/43 (48%), Gaps = 3/43 (6%)
 Frame = +1

Query: 260 KCVRCGKKGHIVADCKRND--IVCFNFNEEGHIGSQCK-QPKK 299
           +C  CG  GH   DCK  D    C+   E GHI   CK  PKK
Sbjct: 388 RCFNCGIDGHWARDCKAGDWKNKCYRCGERGHIEKNCKNSPKK 516


>TC10011 similar to UP|Q9FYA7 (Q9FYA7) Splicing factor RSZ33, partial (62%)
          Length = 684

 Score = 42.4 bits (98), Expect = 5e-04
 Identities = 18/44 (40%), Positives = 24/44 (53%), Gaps = 2/44 (4%)
 Frame = +2

Query: 236 PAEITCFNCGEKGHKSNVCP--EEIKKCVRCGKKGHIVADCKRN 277
           P    CFNCG  GH +  C   +   KC RCG +GH+  +CK +
Sbjct: 356 PGSGRCFNCGLDGHWARDCKAGDWKNKCYRCGDRGHVERNCKNS 487



 Score = 37.4 bits (85), Expect = 0.017
 Identities = 17/44 (38%), Positives = 22/44 (49%), Gaps = 3/44 (6%)
 Frame = +2

Query: 260 KCVRCGKKGHIVADCKRND--IVCFNFNEEGHIGSQCK-QPKKS 300
           +C  CG  GH   DCK  D    C+   + GH+   CK  PKK+
Sbjct: 368 RCFNCGLDGHWARDCKAGDWKNKCYRCGDRGHVERNCKNSPKKN 499


>TC18698 
          Length = 808

 Score = 38.9 bits (89), Expect = 0.006
 Identities = 17/55 (30%), Positives = 32/55 (57%)
 Frame = -2

Query: 615 QKTAFSTRYGHYEYKVMPFGVTNAPGVFMEYMNRIFHAFLDRFVVVFIDDILIYS 669
           +KT       +Y Y+VMP G+ N    +   M++IFH  + + V V+++D+++ S
Sbjct: 804 KKTTLKINRVNYYYQVMPLGLKNI*TTYQRLMDKIFHKQI*KNVEVYVEDMIVKS 640


>TC13053 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, partial (9%)
          Length = 450

 Score = 37.0 bits (84), Expect = 0.023
 Identities = 13/36 (36%), Positives = 19/36 (52%)
 Frame = +3

Query: 239 ITCFNCGEKGHKSNVCPEEIKKCVRCGKKGHIVADC 274
           + C NC + GH S  C   +  C  CG +GH+  +C
Sbjct: 3   VVCRNCQQLGHMSRDCMGPLMICHNCGGRGHLAYEC 110



 Score = 31.2 bits (69), Expect = 1.2
 Identities = 11/34 (32%), Positives = 17/34 (49%)
 Frame = +3

Query: 261 CVRCGKKGHIVADCKRNDIVCFNFNEEGHIGSQC 294
           C  C + GH+  DC    ++C N    GH+  +C
Sbjct: 9   CRNCQQLGHMSRDCMGPLMICHNCGGRGHLAYEC 110


>TC12574 
          Length = 325

 Score = 35.8 bits (81), Expect = 0.051
 Identities = 14/30 (46%), Positives = 22/30 (72%)
 Frame = +2

Query: 642 FMEYMNRIFHAFLDRFVVVFIDDILIYSKD 671
           F   +N IF +F + F++VFI+DIL Y++D
Sbjct: 2   FKNSVNHIFESFFEHFMIVFINDILSYTED 91


>TC17929 
          Length = 791

 Score = 33.9 bits (76), Expect = 0.19
 Identities = 12/27 (44%), Positives = 16/27 (58%)
 Frame = +2

Query: 231 KKKDAPAEITCFNCGEKGHKSNVCPEE 257
           +  D+    TC+ CGE GHK   CP+E
Sbjct: 32  RPNDSKFRQTCYRCGESGHKMRNCPKE 112


>AV779679 
          Length = 440

 Score = 33.9 bits (76), Expect = 0.19
 Identities = 15/30 (50%), Positives = 21/30 (70%)
 Frame = +3

Query: 558 SMRLCIDYRQLNKVTIKNRYPLPRIDDLMD 587
           +M+LC DY QL+ VTI N+  LP +D+  D
Sbjct: 345 TMQLCDDYMQLDYVTIPNKSLLPHLDEWSD 434


>BP039068 
          Length = 467

 Score = 31.2 bits (69), Expect = 1.2
 Identities = 11/20 (55%), Positives = 12/20 (60%)
 Frame = +3

Query: 236 PAEITCFNCGEKGHKSNVCP 255
           P +  C NC E GH SN CP
Sbjct: 297 PRQTVCMNCQETGHASNDCP 356


>TC9039 
          Length = 1218

 Score = 30.4 bits (67), Expect = 2.1
 Identities = 18/65 (27%), Positives = 29/65 (43%), Gaps = 7/65 (10%)
 Frame = +2

Query: 197 VVNERKTKGQQSRPKPY-SAPADKGKQRMVDDRRPKKKDAPA------EITCFNCGEKGH 249
           V  E + K ++     + S   DKGK++   + + +  DAPA      + TC+ C   GH
Sbjct: 692 VQEEERLKQERKESAHFVSTSKDKGKRKKTVEPKNEAADAPAPKKQKEDDTCYFCNVSGH 871

Query: 250 KSNVC 254
               C
Sbjct: 872 MKKKC 886


>TC8917 similar to UP|AAH64262 (AAH64262) MGC76273 protein, partial (3%)
          Length = 676

 Score = 28.9 bits (63), Expect = 6.2
 Identities = 16/58 (27%), Positives = 29/58 (49%), Gaps = 4/58 (6%)
 Frame = +2

Query: 188 EEDTKAHYK----VVNERKTKGQQSRPKPYSAPADKGKQRMVDDRRPKKKDAPAEITC 241
           EE+ +A  K       E K + ++  PKP   P ++  +  V + +PK ++A +  TC
Sbjct: 323 EEEVQAEPKEEKATTEEVKVETKEVNPKPEEEPKEEEPKAQVQEEKPKTEEA*S*ETC 496


>TC17786 similar to PIR|T46177|T46177 villin 3 homolog T8H10.10 - Arabidopsis
            thaliana (fragment) {Arabidopsis thaliana;}, partial
            (42%)
          Length = 1228

 Score = 28.5 bits (62), Expect = 8.1
 Identities = 8/18 (44%), Positives = 11/18 (60%)
 Frame = -1

Query: 1331 RRRPRVFESHSCDWCWMC 1348
            R R    E++ C+WCW C
Sbjct: 106  REREEKSENYCCEWCWCC 53


>TC19927 weakly similar to UP|O80689 (O80689) F8K4.2 protein, partial (14%)
          Length = 529

 Score = 28.5 bits (62), Expect = 8.1
 Identities = 13/44 (29%), Positives = 26/44 (58%)
 Frame = -1

Query: 1329 VSRRRPRVFESHSCDWCWMCFEVKEVDSEIFRSVSDIEKIWNGG 1372
            V ++RP    S+ C + +   +  +V +++F S+  IE I++GG
Sbjct: 139  VIKQRPHKVPSNICSFSYGTGQRIQVTTQVFHSIGIIEYIFSGG 8


  Database: LJGI
    Posted date:  Jul 30, 2004 11:16 AM
  Number of letters in database: 14,692,800
  Number of sequences in database:  28,460
  
Lambda     K      H
   0.349    0.155    0.537 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 24,666,281
Number of Sequences: 28460
Number of extensions: 338562
Number of successful extensions: 3030
Number of sequences better than 10.0: 40
Number of HSP's better than 10.0 without gapping: 2921
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3006
length of query: 1478
length of database: 4,897,600
effective HSP length: 102
effective length of query: 1376
effective length of database: 1,994,680
effective search space: 2744679680
effective search space used: 2744679680
frameshift window, decay const: 50,  0.1
T: 13
A: 40
X1: 14 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.8 bits)
S2: 61 (28.1 bits)


Medicago: description of AC141111.5