Medicago
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC126008.1 - phase: 0 /pseudo
         (790 letters)

Database: LJGI 
           28,460 sequences; 14,692,800 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

BE122516                                                               96  2e-20
BG662087                                                               91  5e-19
TC18927 similar to PIR|AI2934|AI2934 chromate transport protein ...    75  4e-14
AV410603                                                               70  1e-12
BI418821                                                               56  2e-08
AU251673                                                               55  4e-08
TC11885 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, parti...    54  7e-08
TC11095 similar to PIR|T05112|T05112 splicing factor 9G8-like SR...    43  2e-04
TC18698                                                                43  2e-04
TC9521 similar to UP|Q9FYA7 (Q9FYA7) Splicing factor RSZ33, part...    42  4e-04
TC13053 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, parti...    40  0.001
TC10011 similar to UP|Q9FYA7 (Q9FYA7) Splicing factor RSZ33, par...    40  0.001
AV420911                                                               39  0.002
TC18003 similar to PIR|T05112|T05112 splicing factor 9G8-like SR...    39  0.004
TC12574                                                                36  0.027
AV773507                                                               32  0.39
TC17853 similar to UP|Q42412 (Q42412) RNA-binding protein RZ-1, ...    32  0.39
TC9039                                                                 32  0.51
TC19289 similar to GB|AAK15470.1|13194578|AF308596 endoribonucle...    30  1.1
TC12928                                                                30  1.5

>BE122516 
          Length = 364

 Score = 96.3 bits (238), Expect = 2e-20
 Identities = 55/114 (48%), Positives = 74/114 (64%)
 Frame = +2

Query: 430 GMDWLLAFGVSINCLTKSVTFSKPVEELGREFLTAEQVKKSLDGEACVFMMFASLKVSGE 489
           GM+WL A   ++NC  K+VTF     +  R   T ++V K+ + E+ V +       S +
Sbjct: 17  GMNWLTANDATLNCRKKTVTFGTSEGDAKRVKRT-DKVGKASECESDVLLGALETDKS-D 190

Query: 490 KGASVLPVVQEFPEVFPEDITELPSEREVEFAIDLVPGTSPISIAPYRMSASEL 543
            G   +PVV+EF +VFPE+++ELP EREVEF+ID VPGT PISIAPYRMS  EL
Sbjct: 191 TGVEGIPVVREFSDVFPEEVSELPPEREVEFSID*VPGTGPISIAPYRMSLVEL 352


>BG662087 
          Length = 373

 Score = 91.3 bits (225), Expect = 5e-19
 Identities = 47/119 (39%), Positives = 67/119 (55%)
 Frame = +1

Query: 578 GSMRLCVDYRQLNKLTIKNRYPLPRIDD*MDQLVGAEVFSKIDLRSGYHQIRVKAEDISK 637
           G  R+ VDY  LNK   K+ YPLP ID  +D     E+ S +D  SGYHQI++   D  K
Sbjct: 16  GKWRMWVDYTDLNKACPKDSYPLPSIDKLVDGASDNELLSLMDAYSGYHQIKMHPSDEDK 195

Query: 638 TAFRTRYGHYEYSVMPFGVSNAPGVFMEYMNRIFHPYLDRFVVVFIDDILVYSKSEEEH 696
           TAF T   +Y Y  +PFG+ NA   +   M+R+F   + R + V++D+++V S     H
Sbjct: 196 TAFMTARVNYCYQTIPFGLKNAGATYQXLMDRVFXDXVGRNMEVYLDNMIVKSALRANH 372


>TC18927 similar to PIR|AI2934|AI2934 chromate transport protein chrA
           [imported] - Agrobacterium                tumefaciens
           (strain C58, Dupont) {Agrobacterium tumefaciens;},
           partial (6%)
          Length = 561

 Score = 75.1 bits (183), Expect = 4e-14
 Identities = 46/144 (31%), Positives = 68/144 (46%), Gaps = 14/144 (9%)
 Frame = -2

Query: 241 KPYSKDKGKKRESGGGS-------RPSLAD---VKCYRCGTLGHYANECMNDLNCHKCGK 290
           KP+ + + +   SG          RP+ +D   + C+RC   GH+AN C  DL C  C K
Sbjct: 434 KPFQRPQNRGTSSGYSHSFGNFVPRPTQSDTSEIVCHRCSKKGHFANRCP-DLVCWNCQK 258

Query: 291 AGHKAVDCR--VVAREITCYNCGEKGHISTKPKK--AAGKVFALNAKEVEQPDNLIRGMC 346
            GH   DC    V               + K K+  A+ +V+ ++  E  + D LIR + 
Sbjct: 257 TGHSGKDCTNPKVEAATNAIAARRPAPAANKGKRPVASARVYTVSGAESHRADGLIRSVG 78

Query: 347 FINSTPLIAIIDTGATHSFISASC 370
            +N  PL  + D+GATHSFI  +C
Sbjct: 77  SVNCKPLTILFDSGATHSFIDLAC 6


>AV410603 
          Length = 162

 Score = 70.1 bits (170), Expect = 1e-12
 Identities = 30/53 (56%), Positives = 42/53 (78%)
 Frame = +1

Query: 593 TIKNRYPLPRIDD*MDQLVGAEVFSKIDLRSGYHQIRVKAEDISKTAFRTRYG 645
           T+K+ +P+P +D+ +D+L G++ FSK+DLRSGYHQI VK ED  KT FRT +G
Sbjct: 4   TVKDSFPMPTVDELLDELRGSQFFSKLDLRSGYHQILVKPEDRHKTVFRTHHG 162


>BI418821 
          Length = 614

 Score = 56.2 bits (134), Expect = 2e-08
 Identities = 30/84 (35%), Positives = 37/84 (43%), Gaps = 14/84 (16%)
 Frame = +2

Query: 248 GKKRESGGGSRPSLADVKCYRCGTLGHYANECMNDLN---------CHKCGKAGHKAVDC 298
           G++R  GGG         CY CG  GH A +C    N         C+ CG AGH A DC
Sbjct: 347 GERRNGGGGGGGGGG---CYNCGDTGHLARDCHRSNNNGGGGGGAACYNCGDAGHLARDC 517

Query: 299 RVVARE-----ITCYNCGEKGHIS 317
                        CYNCG+ GH++
Sbjct: 518 NRSNNNSGGGGAGCYNCGDTGHLA 589



 Score = 28.5 bits (62), Expect = 4.3
 Identities = 19/79 (24%), Positives = 27/79 (34%), Gaps = 6/79 (7%)
 Frame = +2

Query: 212 CRMFDDAGKAKSNYYKVQNEKKGKGH------GVGKPYSKDKGKKRESGGGSRPSLADVK 265
           C    D G    + ++  N   G G       G     ++D  +   + GG         
Sbjct: 392 CYNCGDTGHLARDCHRSNNNGGGGGGAACYNCGDAGHLARDCNRSNNNSGGGGAG----- 556

Query: 266 CYRCGTLGHYANECMNDLN 284
           CY CG  GH A +C    N
Sbjct: 557 CYNCGDTGHLARDCNRSNN 613


>AU251673 
          Length = 413

 Score = 55.1 bits (131), Expect = 4e-08
 Identities = 34/111 (30%), Positives = 56/111 (49%), Gaps = 1/111 (0%)
 Frame = +2

Query: 86  DDQKVRLATHMFAEEAEYWWINAKGRLELGGDVVTWAKFKVEFLRKYFPEDLRTKKEVEF 145
           D + V LA+      A  W+        +G    TWA F  EF+ ++ P+ +R     +F
Sbjct: 11  DTRAVELASFQLEGVARDWYNVLTRAKPVGSPPWTWADFSAEFMNRFLPQSVRDGFVRDF 190

Query: 146 LNLKQGI-MSVAEYAAKFEELSRFCPYINAEDAMVSKCVKFESGLRPVIYQ 195
             L+Q   M+V+EY+A F  LSR+ PY   E+  V + V+   GL+  +++
Sbjct: 191 ERLEQAEGMTVSEYSAHFTHLSRYVPYPLLEEERVKRFVR---GLKEYLFK 334


>TC11885 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, partial (26%)
          Length = 555

 Score = 54.3 bits (129), Expect = 7e-08
 Identities = 29/77 (37%), Positives = 39/77 (49%)
 Frame = +2

Query: 242 PYSKDKGKKRESGGGSRPSLADVKCYRCGTLGHYANECMNDLNCHKCGKAGHKAVDCRVV 301
           PY +D  +     G SR +L    C  C   GH+A EC N   CH CG  GH A +C   
Sbjct: 299 PYRRDSRR-----GFSRDNL----CKNCKRPGHFARECPNVAICHNCGLPGHIASEC--- 442

Query: 302 AREITCYNCGEKGHIST 318
             +  C+NC E GH+++
Sbjct: 443 TTKSLCWNCKEPGHMAS 493



 Score = 37.4 bits (85), Expect = 0.009
 Identities = 14/33 (42%), Positives = 17/33 (51%)
 Frame = +2

Query: 266 CYRCGTLGHYANECMNDLNCHKCGKAGHKAVDC 298
           C+ CG  GH A+EC     C  C + GH A  C
Sbjct: 401 CHNCGLPGHIASECTTKSLCWNCKEPGHMASSC 499



 Score = 34.3 bits (77), Expect = 0.078
 Identities = 19/50 (38%), Positives = 25/50 (50%)
 Frame = +2

Query: 285 CHKCGKAGHKAVDCRVVAREITCYNCGEKGHISTKPKKAAGKVFALNAKE 334
           C  C + GH A +C  VA    C+NCG  GHI++   +   K    N KE
Sbjct: 344 CKNCKRPGHFARECPNVA---ICHNCGLPGHIAS---ECTTKSLCWNCKE 475



 Score = 25.8 bits (55), Expect(2) = 0.24
 Identities = 8/13 (61%), Positives = 10/13 (76%)
 Frame = +1

Query: 286 HKCGKAGHKAVDC 298
           H CGK GH+A +C
Sbjct: 517 HTCGKVGHRAREC 555



 Score = 25.4 bits (54), Expect(2) = 0.24
 Identities = 10/25 (40%), Positives = 12/25 (48%)
 Frame = +2

Query: 266 CYRCGTLGHYANECMNDLNCHKCGK 290
           C+ C   GH A+ C ND  C    K
Sbjct: 458 CWNCKEPGHMASSCPNDGICTPVAK 532


>TC11095 similar to PIR|T05112|T05112 splicing factor 9G8-like SR protein
           RSZp22 [validated] -                Arabidopsis thaliana
           {Arabidopsis thaliana;}, partial (89%)
          Length = 912

 Score = 43.1 bits (100), Expect = 2e-04
 Identities = 21/56 (37%), Positives = 29/56 (51%)
 Frame = +1

Query: 224 NYYKVQNEKKGKGHGVGKPYSKDKGKKRESGGGSRPSLADVKCYRCGTLGHYANEC 279
           N ++V+     +G G G    +  G+    GGGS     D+KCY CG  GH+A EC
Sbjct: 268 NGWRVELSHNSRGGGGGGGGGRGGGRSGGGGGGS-----DMKCYECGEPGHFAREC 420


>TC18698 
          Length = 808

 Score = 42.7 bits (99), Expect = 2e-04
 Identities = 20/60 (33%), Positives = 33/60 (54%)
 Frame = -2

Query: 637 KTAFRTRYGHYEYSVMPFGVSNAPGVFMEYMNRIFHPYLDRFVVVFIDDILVYSKSEEEH 696
           KT  +    +Y Y VMP G+ N    +   M++IFH  + + V V+++D++V S  E  H
Sbjct: 801 KTTLKINRVNYYYQVMPLGLKNI*TTYQRLMDKIFHKQI*KNVEVYVEDMIVKSSQE*FH 622


>TC9521 similar to UP|Q9FYA7 (Q9FYA7) Splicing factor RSZ33, partial (56%)
          Length = 598

 Score = 42.0 bits (97), Expect = 4e-04
 Identities = 18/43 (41%), Positives = 23/43 (52%)
 Frame = +1

Query: 285 CHKCGKAGHKAVDCRVVAREITCYNCGEKGHISTKPKKAAGKV 327
           C  CG  GH A DC+    +  CY CGE+GHI    K +  K+
Sbjct: 391 CFNCGIDGHWARDCKAGDWKNKCYRCGERGHIEKNCKNSPKKL 519



 Score = 29.3 bits (64), Expect = 2.5
 Identities = 10/17 (58%), Positives = 10/17 (58%)
 Frame = +1

Query: 265 KCYRCGTLGHYANECMN 281
           KCYRCG  GH    C N
Sbjct: 454 KCYRCGERGHIEKNCKN 504


>TC13053 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, partial (9%)
          Length = 450

 Score = 40.4 bits (93), Expect = 0.001
 Identities = 17/36 (47%), Positives = 20/36 (55%), Gaps = 1/36 (2%)
 Frame = +3

Query: 264 VKCYRCGTLGHYANECMNDLN-CHKCGKAGHKAVDC 298
           V C  C  LGH + +CM  L  CH CG  GH A +C
Sbjct: 3   VVCRNCQQLGHMSRDCMGPLMICHNCGGRGHLAYEC 110



 Score = 33.1 bits (74), Expect = 0.17
 Identities = 12/33 (36%), Positives = 20/33 (60%)
 Frame = +3

Query: 285 CHKCGKAGHKAVDCRVVAREITCYNCGEKGHIS 317
           C  C + GH + DC  +   + C+NCG +GH++
Sbjct: 9   CRNCQQLGHMSRDC--MGPLMICHNCGGRGHLA 101


>TC10011 similar to UP|Q9FYA7 (Q9FYA7) Splicing factor RSZ33, partial (62%)
          Length = 684

 Score = 40.4 bits (93), Expect = 0.001
 Identities = 16/42 (38%), Positives = 22/42 (52%)
 Frame = +2

Query: 285 CHKCGKAGHKAVDCRVVAREITCYNCGEKGHISTKPKKAAGK 326
           C  CG  GH A DC+    +  CY CG++GH+    K +  K
Sbjct: 371 CFNCGLDGHWARDCKAGDWKNKCYRCGDRGHVERNCKNSPKK 496



 Score = 29.3 bits (64), Expect = 2.5
 Identities = 10/17 (58%), Positives = 10/17 (58%)
 Frame = +2

Query: 265 KCYRCGTLGHYANECMN 281
           KCYRCG  GH    C N
Sbjct: 434 KCYRCGDRGHVERNCKN 484


>AV420911 
          Length = 418

 Score = 39.3 bits (90), Expect = 0.002
 Identities = 28/90 (31%), Positives = 38/90 (42%)
 Frame = +2

Query: 190 RPVIYQYMCVQEIRDFDTLVHKCRMFDDAGKAKSNYYKVQNEKKGKGHGVGKPYSKDKGK 249
           RP  Y ++   + RD    +H        GK   N ++V+     KG G G+        
Sbjct: 206 RPPGYAFLEFDDKRDALDAIHALD-----GK---NGWRVELSHNSKGGGGGR-------- 337

Query: 250 KRESGGGSRPSLADVKCYRCGTLGHYANEC 279
               GGG      D+KCY CG  GH+A EC
Sbjct: 338 ----GGGRGRGGEDLKCYECGEPGHFAREC 415



 Score = 29.6 bits (65), Expect = 1.9
 Identities = 10/18 (55%), Positives = 14/18 (77%)
 Frame = +2

Query: 282 DLNCHKCGKAGHKAVDCR 299
           DL C++CG+ GH A +CR
Sbjct: 365 DLKCYECGEPGHFARECR 418


>TC18003 similar to PIR|T05112|T05112 splicing factor 9G8-like SR protein
           RSZp22 [validated] -                Arabidopsis thaliana
           {Arabidopsis thaliana;}, partial (42%)
          Length = 587

 Score = 38.5 bits (88), Expect = 0.004
 Identities = 20/56 (35%), Positives = 26/56 (45%)
 Frame = +1

Query: 224 NYYKVQNEKKGKGHGVGKPYSKDKGKKRESGGGSRPSLADVKCYRCGTLGHYANEC 279
           N ++V+     KG G G+            GGG      D+KCY CG  GH+A EC
Sbjct: 76  NGWRVELSHNSKGGGGGR------------GGGRGRGGEDLKCYECGEPGHFAREC 207



 Score = 29.6 bits (65), Expect = 1.9
 Identities = 10/18 (55%), Positives = 14/18 (77%)
 Frame = +1

Query: 282 DLNCHKCGKAGHKAVDCR 299
           DL C++CG+ GH A +CR
Sbjct: 157 DLKCYECGEPGHFARECR 210


>TC12574 
          Length = 325

 Score = 35.8 bits (81), Expect = 0.027
 Identities = 14/33 (42%), Positives = 23/33 (69%)
 Frame = +2

Query: 663 FMEYMNRIFHPYLDRFVVVFIDDILVYSKSEEE 695
           F   +N IF  + + F++VFI+DIL Y++ +EE
Sbjct: 2   FKNSVNHIFESFFEHFMIVFINDILSYTEDKEE 100


>AV773507 
          Length = 496

 Score = 32.0 bits (71), Expect = 0.39
 Identities = 23/105 (21%), Positives = 45/105 (41%), Gaps = 9/105 (8%)
 Frame = +2

Query: 203 RDFDTLVHKCRMFDDAGKAKSNYYKVQNEKKGKGHGVGKPYSKDKGKKRESGGGSRPSLA 262
           RD   L  +  M  D G+ +   +    +++G    + + + ++ G +  S   ++P + 
Sbjct: 77  RDGKILECQIMMERDTGRPRGFGFITFADRRGMEDAIKEMHGREIGDRIISVNKAQPKMG 256

Query: 263 DVKC--------YRCGTLGHY-ANECMNDLNCHKCGKAGHKAVDC 298
                       Y  G  G Y A + +   +C +CG+ GH+A DC
Sbjct: 257 GDDADQGYRGGGYSSGGRGSYGAGDRVGQDDCFECGRPGHRARDC 391


>TC17853 similar to UP|Q42412 (Q42412) RNA-binding protein RZ-1, partial
           (58%)
          Length = 881

 Score = 32.0 bits (71), Expect = 0.39
 Identities = 16/39 (41%), Positives = 24/39 (61%), Gaps = 2/39 (5%)
 Frame = +2

Query: 246 DKGKKRESGG--GSRPSLADVKCYRCGTLGHYANECMND 282
           D+G +  S G  GSR S    +C++CG  GH+A EC ++
Sbjct: 356 DRGDRDRSRGYGGSRGSNGG-ECFKCGKPGHFARECPSE 469



 Score = 30.8 bits (68), Expect = 0.87
 Identities = 19/64 (29%), Positives = 25/64 (38%)
 Frame = +2

Query: 235 KGHGVGKPYSKDKGKKRESGGGSRPSLADVKCYRCGTLGHYANECMNDLNCHKCGKAGHK 294
           K    G    +D G +    G  R    D    R  + G+  +   N   C KCGK GH 
Sbjct: 281 KAQPQGSSRDRDDGDRYRERGRDRDDRGD----RDRSRGYGGSRGSNGGECFKCGKPGHF 448

Query: 295 AVDC 298
           A +C
Sbjct: 449 AREC 460


>TC9039 
          Length = 1218

 Score = 31.6 bits (70), Expect = 0.51
 Identities = 30/144 (20%), Positives = 56/144 (38%), Gaps = 11/144 (7%)
 Frame = +2

Query: 147 NLKQGIMSVAEYAAKFEELSRFCPYINAEDAMVSKCV--KFESGLRPVIYQYMCVQEIRD 204
           N+++ IM ++  A+K + L      +   D ++   V     +        Y C +E   
Sbjct: 503 NIREYIMGMSNIASKLKALK-----LELSDDLLIHLVLLSLPAQFSQFKISYNCPKEKWS 667

Query: 205 FDTLVHKCRMFDDAGKAKSNYYKVQNEKKGKGHGVGKPYSKDKGKKRES---------GG 255
            + L+  C   ++         +++ E+K   H V    SKDKGK++++           
Sbjct: 668 LNELISFCVQEEE---------RLKQERKESAHFVST--SKDKGKRKKTVEPKNEAADAP 814

Query: 256 GSRPSLADVKCYRCGTLGHYANEC 279
             +    D  CY C   GH   +C
Sbjct: 815 APKKQKEDDTCYFCNVSGHMKKKC 886


>TC19289 similar to GB|AAK15470.1|13194578|AF308596 endoribonuclease/protein
           kinase IRE1-like protein {Arabidopsis thaliana;},
           partial (18%)
          Length = 470

 Score = 30.4 bits (67), Expect = 1.1
 Identities = 19/57 (33%), Positives = 30/57 (52%), Gaps = 3/57 (5%)
 Frame = +1

Query: 706 VLKENQLCAKLSK---CEFWLKEVSFLGHVISKGGISVDPSKVDAVLQWESPKSVFE 759
           ++KE  LCAKLS     +  LK++S LGH  + GG S           W++P+ + +
Sbjct: 7   IIKERSLCAKLSDMGISKRLLKDMSSLGHNATGGGSS----------GWQAPEQLVQ 147


>TC12928 
          Length = 755

 Score = 30.0 bits (66), Expect = 1.5
 Identities = 16/38 (42%), Positives = 20/38 (52%)
 Frame = +3

Query: 38 RNEGEAEERRLDRFLRNNPPTFKGRFDPEGAQTWVQGM 75
          R+EGE    R+D F+  + P    RFD   A TW  GM
Sbjct: 6  RHEGETSFHRMDMFINFSDPFIAKRFD-VNACTWAFGM 116


  Database: LJGI
    Posted date:  Jul 30, 2004 11:16 AM
  Number of letters in database: 14,692,800
  Number of sequences in database:  28,460
  
Lambda     K      H
   0.321    0.138    0.414 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,864,030
Number of Sequences: 28460
Number of extensions: 173295
Number of successful extensions: 827
Number of sequences better than 10.0: 55
Number of HSP's better than 10.0 without gapping: 754
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 804
length of query: 790
length of database: 4,897,600
effective HSP length: 98
effective length of query: 692
effective length of database: 2,108,520
effective search space: 1459095840
effective search space used: 1459095840
frameshift window, decay const: 50,  0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 59 (27.3 bits)


Medicago: description of AC126008.1