Medicago
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC144765.5 - phase: 0 /pseudo
         (443 letters)

Database: GMGI 
           63,676 sequences; 37,918,896 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

NP595172 polyprotein [Glycine max]                                    169  3e-42
CO982196                                                              125  5e-29
BI425021                                                               87  2e-17
BI317638 weakly similar to GP|9294238|dbj| contains similarity t...    84  2e-17
BM084967                                                               85  6e-17
AW507721 weakly similar to GP|27764548|gb polyprotein {Glycine m...    72  7e-13
TC233069                                                               69  6e-12
BQ299538                                                               50  2e-06
CA820403 weakly similar to GP|13273463|gb| pol protein integrase...    32  0.75
AW472663                                                               31  0.98
BU090971                                                               28  8.3

>NP595172 polyprotein [Glycine max]
          Length = 4659

 Score =  169 bits (427), Expect = 3e-42
 Identities = 98/268 (36%), Positives = 139/268 (51%)
 Frame = +1

Query: 1    NFGLNYHPDKAKVVADALSRKTLHMSALMVKEFELLEQFRDLSLVCELSSQSVQLGMLKI 60
            +F + Y P K    ADALSR  +                               L   + 
Sbjct: 2959 DFKIEYKPGKDNQAADALSRMFM-------------------------------LAWSEP 3045

Query: 61   NSDFLGSIREAQQVDVKFVDLMVVSNQAEESDFKVDEQGVLRFRGRICIPDNEELKKLIL 120
            +S FL  +R     D     LM    Q  ++      +G+L ++ R+ IP  EE+   IL
Sbjct: 3046 HSIFLEELRARLISDPHLKQLMETYKQGADASHYTVREGLLYWKDRVVIPAEEEIVNKIL 3225

Query: 121  EEGHKSNLSIHLGATKMYQDLKKLFWWSGLKKDVARFVYACLTCQKSKVEHQRPAGLLTP 180
            +E H S +  H G T+    LK  F+W  +++DV  ++  CL CQ++K  +  PAGLL P
Sbjct: 3226 QEYHSSPIGGHAGITRTLARLKAQFYWPKMQEDVKAYIQKCLICQQAKSNNTLPAGLLQP 3405

Query: 181  LDVPEWKWDSISMDFVSSLPNTSRGHDSIWVVVDRLTKSAHFIPINISYPVAQLAEIYIQ 240
            L +P+  W+ ++MDF++ LPN S G   I VV+DRLTK AHFIP+   Y    +AE ++ 
Sbjct: 3406 LPIPQQVWEDVAMDFITGLPN-SFGLSVIMVVIDRLTKYAHFIPLKADYNSKVVAEAFMS 3582

Query: 241  NIVKLHGVPSSIVSDRDPRFTSRFWRSL 268
            +IVKLHG+P SIVSDRD  FTS FW+ L
Sbjct: 3583 HIVKLHGIPRSIVSDRDRVFTSTFWQHL 3666


>CO982196 
          Length = 812

 Score =  125 bits (313), Expect = 5e-29
 Identities = 67/166 (40%), Positives = 98/166 (58%)
 Frame = +1

Query: 99  GVLRFRGRICIPDNEELKKLILEEGHKSNLSIHLGATKMYQDLKKLFWWSGLKKDVARFV 158
           G L F+ R+ +  N     L+L+E   S L  H G  + ++ +  + +W G+KK    +V
Sbjct: 316 GKLYFKDRLVLSKNSTKIPLLLKELQDSPLGGHSGFFRTFKRVANVVFWQGMKKTTRDYV 495

Query: 159 YACLTCQKSKVEHQRPAGLLTPLDVPEWKWDSISMDFVSSLPNTSRGHDSIWVVVDRLTK 218
            AC  C+++K     PAGLL  L +P   W  ISMDF+  LP  ++G D+I VVVDRLTK
Sbjct: 496 AACEICRRNKTSTLSPAGLL*LLPIPTKVWTDISMDFIGGLPK-AQGKDNILVVVDRLTK 672

Query: 219 SAHFIPINISYPVAQLAEIYIQNIVKLHGVPSSIVSDRDPRFTSRF 264
            AHF  ++  Y   ++AE++I+ +V+LHG P+SIVSD    F S F
Sbjct: 673 YAHFFALSHPYTAKEVAELFIKELVRLHGFPASIVSDXXRLFMSLF 810


>BI425021 
          Length = 426

 Score = 87.0 bits (214), Expect = 2e-17
 Identities = 41/91 (45%), Positives = 58/91 (63%)
 Frame = -1

Query: 178 LTPLDVPEWKWDSISMDFVSSLPNTSRGHDSIWVVVDRLTKSAHFIPINISYPVAQLAEI 237
           L PL VP+  W+ +SMDF+  LP    GH +I+VVV+R +K  H   +  S+    +A +
Sbjct: 426 LCPLPVPQRPWEDLSMDFIVGLP-PYHGHTTIFVVVNRFSKGIHLGTLPTSHTAHMVASL 250

Query: 238 YIQNIVKLHGVPSSIVSDRDPRFTSRFWRSL 268
           ++  ++KLHG P SIVSDRDP F S FW+ L
Sbjct: 249 FLNIVIKLHGFPRSIVSDRDPLFISHFWQDL 157


>BI317638 weakly similar to GP|9294238|dbj| contains similarity to reverse
           transcriptase~gene_id:K11J14.5 {Arabidopsis thaliana},
           partial (5%)
          Length = 420

 Score = 84.3 bits (207), Expect(2) = 2e-17
 Identities = 45/116 (38%), Positives = 67/116 (56%)
 Frame = -2

Query: 153 DVARFVYACLTCQKSKVEHQRPAGLLTPLDVPEWKWDSISMDFVSSLPNTSRGHDSIWVV 212
           + A+ +  CL CQ +K E +R   LL PL VP   W+ +S+DF++ L      H +I VV
Sbjct: 350 ECAQMLPNCLDCQHTKYETKRIVDLLCPLLVPHRPWEDLSLDFITGLL-PYHVHTAILVV 174

Query: 213 VDRLTKSAHFIPINISYPVAQLAEIYIQNIVKLHGVPSSIVSDRDPRFTSRFWRSL 268
           VD  +K  H   +  S+    +A ++I ++ KLHG+P S+VSD D  F S FW+ L
Sbjct: 173 VDHFSKGIHLGMLPSSHTAHTVACLFIDSVAKLHGLPRSLVSDCDLLFVSHFWQEL 6



 Score = 22.7 bits (47), Expect(2) = 2e-17
 Identities = 9/26 (34%), Positives = 13/26 (49%)
 Frame = -1

Query: 131 HLGATKMYQDLKKLFWWSGLKKDVAR 156
           H G  K    L K  +W G++ DV +
Sbjct: 405 HTGIAKTLA*LSKNIYWFGMRTDVTQ 328


>BM084967 
          Length = 426

 Score = 85.1 bits (209), Expect = 6e-17
 Identities = 48/118 (40%), Positives = 66/118 (55%)
 Frame = -2

Query: 98  QGVLRFRGRICIPDNEELKKLILEEGHKSNLSIHLGATKMYQDLKKLFWWSGLKKDVARF 157
           Q ++   G I +P        +L E H S    H+G TK    L + F W G++KDV +F
Sbjct: 356 QDLILKNGCIWLPSGFSFIPTLLLEYHSSPTDAHIGVTKTMARLSENFTWIGIRKDVEQF 177

Query: 158 VYACLTCQKSKVEHQRPAGLLTPLDVPEWKWDSISMDFVSSLPNTSRGHDSIWVVVDR 215
           V ACL CQ +K E Q+ AGLL PL VP   W+ +S +F+  L +  RG+ +I VVV R
Sbjct: 176 VAACLDCQYTKYEAQKMAGLLCPLPVPCRPWEDLSFNFIIGL-SEFRGYTAILVVVGR 6


>AW507721 weakly similar to GP|27764548|gb polyprotein {Glycine max}, partial
           (3%)
          Length = 464

 Score = 71.6 bits (174), Expect = 7e-13
 Identities = 35/95 (36%), Positives = 57/95 (59%), Gaps = 1/95 (1%)
 Frame = +1

Query: 85  SNQAEESDFKVDEQGVLRFRGRICIPDNEELKKLILEEGHKSNLSIHLGATKMYQDLKKL 144
           SN A +S   V    V+ ++GRI +P++ +L K+I+ E H S +  H G T+    +   
Sbjct: 178 SNNAGKSGDYVLHHDVIIWKGRIMLPNDSQLLKMIMTESHASKVGGHAGTTRTIVRINAQ 357

Query: 145 FWWSGLKKDVARFVYACLTCQKSKVEHQ-RPAGLL 178
           F+W  +++D+ +FV  C+ CQ++KV H   PAGLL
Sbjct: 358 FYWPKMREDIMKFVQECVICQQAKVTHSLLPAGLL 462


>TC233069 
          Length = 881

 Score = 68.6 bits (166), Expect = 6e-12
 Identities = 37/82 (45%), Positives = 50/82 (60%), Gaps = 2/82 (2%)
 Frame = +2

Query: 150 LKKDVARFVYACLTCQKSKVEHQRPAGLLTPLDVPEWKWDSISMDFVSSLPNTSRGHDSI 209
           + +DV   + AC  CQ++K   Q+  GLL PL +P   W  ISMDFV+ LP  S G   I
Sbjct: 224 MARDVCDHICACTNCQQNKYSTQK*FGLLQPLPIP*QVWKDISMDFVTHLP-PS*GKKMI 400

Query: 210 WVVVDRLTKSAHF--IPINISY 229
           WV+VD  TK +HF  +P ++SY
Sbjct: 401 WVIVDCWTKYSHFLSLPAHLSY 466


>BQ299538 
          Length = 426

 Score = 50.4 bits (119), Expect = 2e-06
 Identities = 21/40 (52%), Positives = 28/40 (69%)
 Frame = +1

Query: 229 YPVAQLAEIYIQNIVKLHGVPSSIVSDRDPRFTSRFWRSL 268
           Y    LAEI+ + +V LHGVP+S++SD DP F S FW+ L
Sbjct: 13  YSARVLAEIFTKEVVHLHGVPASVLSDEDPIFVSSFWKEL 132


>CA820403 weakly similar to GP|13273463|gb| pol protein integrase region
           {Ginkgo biloba}, partial (52%)
          Length = 421

 Score = 31.6 bits (70), Expect = 0.75
 Identities = 18/32 (56%), Positives = 20/32 (62%), Gaps = 1/32 (3%)
 Frame = -3

Query: 238 YIQNIVKLHGVPSSIVSDRDPRF-TSRFWRSL 268
           +I+  VKLHG  SSIVSD D  F  S FW  L
Sbjct: 335 FIKEAVKLHGCSSSIVSDWDRLFLIS*FWTEL 240


>AW472663 
          Length = 190

 Score = 31.2 bits (69), Expect = 0.98
 Identities = 15/41 (36%), Positives = 22/41 (53%)
 Frame = -2

Query: 206 HDSIWVVVDRLTKSAHFIPINISYPVAQLAEIYIQNIVKLH 246
           HD   VV  R++K A+F   + SY +  L  I    ++KLH
Sbjct: 168 HDMAIVVTFRVSKQAYFYSFSFSYSILGLNMILYLELIKLH 46


>BU090971 
          Length = 447

 Score = 28.1 bits (61), Expect = 8.3
 Identities = 11/21 (52%), Positives = 15/21 (71%)
 Frame = -3

Query: 1   NFGLNYHPDKAKVVADALSRK 21
           NF + Y PDK+ +V DALS +
Sbjct: 361 NFDILYKPDKSNIVVDALSHQ 299


  Database: GMGI
    Posted date:  Oct 22, 2004  4:58 PM
  Number of letters in database: 37,918,896
  Number of sequences in database:  63,676
  
Lambda     K      H
   0.341    0.148    0.472 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 18,789,759
Number of Sequences: 63676
Number of extensions: 246768
Number of successful extensions: 1883
Number of sequences better than 10.0: 22
Number of HSP's better than 10.0 without gapping: 1865
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1878
length of query: 443
length of database: 12,639,632
effective HSP length: 100
effective length of query: 343
effective length of database: 6,272,032
effective search space: 2151306976
effective search space used: 2151306976
frameshift window, decay const: 50,  0.1
T: 13
A: 40
X1: 15 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.9 bits)
S2: 60 (27.7 bits)


Medicago: description of AC144765.5