Medicago
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC140022.1 + phase: 0 /pseudo
         (583 letters)

Database: GMGI 
           63,676 sequences; 37,918,896 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

AI959950                                                               50  4e-06
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co...    43  4e-04
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi...    39  0.005
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ...    37  0.032
BM307983                                                               37  0.032
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete              37  0.032
BF596801 weakly similar to GP|29423270|g gag-pol polyprotein {Gl...    34  0.16
BF595216                                                               33  0.27
BI321712                                                               33  0.35
BI469652 weakly similar to GP|18149115|dbj| reverse transcriptas...    31  1.3
TC204444 homologue to UP|CADH_MEDSA (P31656) Cinnamyl-alcohol de...    29  5.1
AW459126                                                               29  6.7
TC203693 similar to UP|Q949G4 (Q949G4) N3 like protein, partial ...    29  6.7
TC226121 similar to UP|Q9LL85 (Q9LL85) DNA-binding protein p24, ...    28  8.7
TC218212 similar to UP|Q9LK27 (Q9LK27) Gb|AAF01563.1, partial (11%)    28  8.7
TC232593 weakly similar to UP|Q9XG91 (Q9XG91) Tpv2-1c protein (F...    28  8.7
TC226120 similar to UP|Q9LL85 (Q9LL85) DNA-binding protein p24, ...    28  8.7

>AI959950 
          Length = 466

 Score = 49.7 bits (117), Expect = 4e-06
 Identities = 43/118 (36%), Positives = 59/118 (49%)
 Frame = -2

Query: 83  *KLWMKKSMQLRRIKHGS*LNYRKTRSQ*E*SGCTRQSTNPVVRLIAIKRGWWLKATNKN 142
           *K   K  +  +RI   S LNY+K R   E*+G    +   +VRL   K+   LK T+  
Sbjct: 396 *KRCKKNLISFKRIMSRSSLNYQKERR*LE*NGYFVTN*TRMVRL*DTKQD*LLKVTHNR 217

Query: 143 QVLIILKYLLLLQD*IQFACLFHSQLKITGKYIKWMLSLHFLMVLWKKKCMLSSLQDM 200
           +V    K L LL  *  +A  FH Q  +    IKWM  +HF M   K+K ML++  D+
Sbjct: 216 KV*TTQKPLHLLHV*K*YASYFHLQPIVI*SCIKWM*KVHF*MA*SKRKFMLNNRLDL 43


>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
          Length = 4734

 Score = 42.7 bits (99), Expect = 4e-04
 Identities = 116/432 (26%), Positives = 176/432 (39%), Gaps = 8/432 (1%)
 Frame = +2

Query: 136  LKATNKNQVLIILKYLLLLQD*IQFACLFHSQLKITGKYIKWMLSLHFLMVLWKKKCMLS 195
            LKAT + +V  ++K   LL D     C             +WM    F M    KK M S
Sbjct: 3404 LKATLRLKV*TLMKLSPLLLDLSPSDCYLV*LASSNSSCTRWM*RARF*MDT*MKKPMWS 3583

Query: 196  SLQDMWLEERRIKYID*RKHCMA*SRRQEHGTKR-LILILFKTVFRDVHSSTHSTSNSLI 254
            S +D+ ++  +I Y   R+  M *S+ QE G K     +L K + R+   +  S SN ++
Sbjct: 3584 SQRDL*IQLIQIMYTGSRRLSMD*SKLQELGMKG*QSSLLSKGIGRE-ELTRLSLSNKML 3760

Query: 255  LE-------MFLLCASMSMI*YSPVTIQR*SLNSGRL**VILK*QIWA*CPIFSALRSFN 307
                     +  LC     +    +   R +LN   L *V+L+  +     IF   +   
Sbjct: 3761 KT***HRYMLMTLCLEGCRMRCFDILSNRCNLN---LR*VLLES*L-----IFWDSK*SR 3916

Query: 308  RRM*SLSLRRSMQVIF*RNLRWSIQSQFPRRLKKS*S*QEKAMVKG*TQLITKV*LEV*D 367
             +    S + SMQ    R+L W + +        +*S Q+  +     ++ T+ *L    
Sbjct: 3917 WKTPYSSHKASMQRTLSRSLGWKMPAIKEHLHLLT*SCQKMKLAPVLIKVCTEA*LGAYY 4096

Query: 368  I*LQQGQI*YMELVYLANTWRIRVLVTCKEPRGFFVILKVL*PKEFFMVIIVM*SLLDIQ 427
            I*        M+ V++ +   I   VT  + R F  +           VI+ +   L I 
Sbjct: 4097 I*QLADLTSPMQ*VFVQDIKPILR*VT*IK*REF*NM*MAPVTMGLCTVIVQIQCWLGIV 4276

Query: 428  IVIGQEIQKQEKARQGTHFI*EPVQYHGLRRNNLWSLFQQQK*NI**QPVVLLKQCG*EE 487
            ++IG E+Q  EKA      I EP+ +HG  R+     +   K +I  Q   +    G*  
Sbjct: 4277 MLIGLEVQMTEKALLVDVSIWEPILFHGSARSRTVCPYLLLKQSILQQEAAVHN*FG*SR 4456

Query: 488  F*K*CIMSRTLLQRYIVITSQQLH*AKIQFFMDGPSILTSGFTRYES*LLRKKW*SSTVP 547
             *+   MS  +     V T   L   KI F    PS LT   T  E  L+ K    S + 
Sbjct: 4457 C*R-STMSNKMS*HCTVTT*VLLIFLKILFNTAEPSTLTLDITILEILLMIKLSHWSMLT 4633

Query: 548  LKSKLQIFLQSH 559
            L++K QIF Q H
Sbjct: 4634 LRNK*QIFSQRH 4669


>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
           partial (34%)
          Length = 407

 Score = 39.3 bits (90), Expect = 0.005
 Identities = 25/76 (32%), Positives = 38/76 (49%)
 Frame = -3

Query: 115 GCTRQSTNPVVRLIAIKRGWWLKATNKNQVLIILKYLLLLQD*IQFACLFHSQLKITGKY 174
           G T     P+VRLI ++  W LKAT++   LI +   LL  +   F C     L +TG  
Sbjct: 366 GSTLLKLGPLVRLIGLRLVW*LKATHRYMALITVILSLLSPNSPLFVCFLLWLLFVTGPS 187

Query: 175 IKWMLSLHFLMVLWKK 190
           I  +L + F  V+ ++
Sbjct: 186 ISLILRMPFSTVILRR 139


>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
           (japonica cultivar-group)}, partial (10%)
          Length = 463

 Score = 36.6 bits (83), Expect = 0.032
 Identities = 25/73 (34%), Positives = 40/73 (54%)
 Frame = -1

Query: 149 KYLLLLQD*IQFACLFHSQLKITGKYIKWMLSLHFLMVLWKKKCMLSSLQDMWLEERRIK 208
           K++L LQD*    C  H         IKWML + F MV +KKK ML++ Q +     ++ 
Sbjct: 265 KHMLQLQD*KSLECF*HMYP**ILNSIKWMLKVLF*MV*FKKKYMLNNPQALKSRINQLM 86

Query: 209 YID*RKHCMA*SR 221
           +I+ ++  M *++
Sbjct: 85  FINCKRLFMV*NK 47


>BM307983 
          Length = 406

 Score = 36.6 bits (83), Expect = 0.032
 Identities = 29/89 (32%), Positives = 42/89 (46%), Gaps = 1/89 (1%)
 Frame = +3

Query: 130 IKRGWWLKATNKNQVLIILKYLLL-LQD*IQFACLFHSQLKITGKYIKWMLSLHFLMVLW 188
           I+RGW  + T+K    I+ ++LL    +  Q   L  S+  + G+ I  ML +   M  W
Sbjct: 60  IRRGWLQRDTSKPMGSIMRRHLLSGKNEYSQDHHLLSSKHNLVGRCINLMLKMPSYMEAW 239

Query: 189 KKKCMLSSLQDMWLEERRIKYID*RKHCM 217
           KKK       DM      IK+ D*R+  M
Sbjct: 240 KKKYTWRFHLDMVPVMEEIKFAD*RRPYM 326


>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
          Length = 4731

 Score = 36.6 bits (83), Expect = 0.032
 Identities = 47/142 (33%), Positives = 63/142 (44%)
 Frame = +2

Query: 416  VIIVM*SLLDIQIVIGQEIQKQEKARQGTHFI*EPVQYHGLRRNNLWSLFQQQK*NI**Q 475
            VI+ +   L I ++IG E+Q  EKA      I E   +HG  R+     + QQK +I  Q
Sbjct: 4238 VIVQIQCWLGIVMLIGLEVQMTEKALLVDASIWETTLFHGSARSRTVCPYLQQKPSILQQ 4417

Query: 476  PVVLLKQCG*EEF*K*CIMSRTLLQRYIVITSQQLH*AKIQFFMDGPSILTSGFTRYES* 535
               +    G*   *+   MS  +     V T   L   KI F    PS LT   T  E  
Sbjct: 4418 EAAVHS*FG*SRC*R-STMSNKMS*HCTVTT*VLLIFLKILFNTAEPSTLTLDITISEIL 4594

Query: 536  LLRKKW*SSTVPLKSKLQIFLQ 557
            L+ K    S + L++K QIF Q
Sbjct: 4595 LMIK*SH*SMLTLRNK*QIFSQ 4660



 Score = 34.7 bits (78), Expect = 0.12
 Identities = 34/115 (29%), Positives = 48/115 (41%)
 Frame = +2

Query: 114  SGCTRQSTNPVVRLIAIKRGWWLKATNKNQVLIILKYLLLLQD*IQFACLFHSQLKITGK 173
            SG +R      V     +  W LKAT + +V  +++ L  L D           +     
Sbjct: 3335 SGSSRTKPMKKVS*PETRPDWLLKATLRLKV*TLMRLLPQLLDLSPSDYYLV*LVSSNSS 3514

Query: 174  YIKWMLSLHFLMVLWKKKCMLSSLQDMWLEERRIKYID*RKHCMA*SRRQEHGTK 228
              +WM   HF M    KK M SS +D+     +I Y   R+  M *S+ QE G K
Sbjct: 3515 CTRWM*RAHF*MDT*MKKSMWSSQRDLQTRLIQIMYTGSRRLSMD*SKLQELGMK 3679


>BF596801 weakly similar to GP|29423270|g gag-pol polyprotein {Glycine max},
           partial (7%)
          Length = 336

 Score = 34.3 bits (77), Expect = 0.16
 Identities = 35/106 (33%), Positives = 48/106 (45%)
 Frame = +1

Query: 74  KKPRVMKIG*KLWMKKSMQLRRIKHGS*LNYRKTRSQ*E*SGCTRQSTNPVVRLIAIKRG 133
           KKP  M IG     K    L+   +G+*    K     E +G    +   +V L+ IK G
Sbjct: 19  KKP**MIIGSLSCKKN*TNLKETMYGN**KNLKIILLLEQNGFLEIN*MNMV*LLEIKPG 198

Query: 134 WWLKATNKNQVLIILKYLLLLQD*IQFACLFHSQLKITGKYIKWML 179
              K   K +   + K++LLLQD*    C +H     T  +IKWML
Sbjct: 199 **RKDIIKKRE*TMKKHMLLLQD*KPLECFWHMHP**TLNFIKWML 336


>BF595216 
          Length = 421

 Score = 33.5 bits (75), Expect = 0.27
 Identities = 16/33 (48%), Positives = 22/33 (66%)
 Frame = +1

Query: 170 ITGKYIKWMLSLHFLMVLWKKKCMLSSLQDMWL 202
           ITG +  WM  + FLMV +K+K + SSLQ + L
Sbjct: 280 ITGPFSSWM*IMPFLMVFYKRKSICSSLQVLTL 378


>BI321712 
          Length = 399

 Score = 33.1 bits (74), Expect = 0.35
 Identities = 38/115 (33%), Positives = 53/115 (46%)
 Frame = -1

Query: 261 CASMSMI*YSPVTIQR*SLNSGRL**VILK*QIWA*CPIFSALRSFNRRM*SLSLRRSMQ 320
           C  M M *    TIQ  S +S ++  + L+*+IW    I SA +   +     S +++M 
Sbjct: 384 CVCM*MT*SLQGTIQACSKSSRKICQMNLR*RIWGSWHIISASK*NKKTKEFSSPKKAMP 205

Query: 321 VIF*RNLRWSIQSQFPRRLKKS*S*QEKAMVKG*TQLITKV*LEV*DI*LQQGQI 375
               R+ RW    Q   R   + S*      +   QL TKV*LEV   *  QG+I
Sbjct: 204 KKSLRSSRWMTPIQLAPRWNVAAS*ASMKKERMWIQLFTKV*LEVYVT*HVQGRI 40


>BI469652 weakly similar to GP|18149115|dbj| reverse transcriptase {Silene
           noctiflora}, partial (60%)
          Length = 427

 Score = 31.2 bits (69), Expect = 1.3
 Identities = 16/37 (43%), Positives = 22/37 (59%)
 Frame = +1

Query: 164 FHSQLKITGKYIKWMLSLHFLMVLWKKKCMLSSLQDM 200
           F  QL  T  + KW+L + F M L K+KCM  +LQ +
Sbjct: 220 FPLQLIKT*IFFKWILKVVF*MTLLKRKCMSDNLQTL 330


>TC204444 homologue to UP|CADH_MEDSA (P31656) Cinnamyl-alcohol dehydrogenase 
           (CAD) , partial (22%)
          Length = 790

 Score = 29.3 bits (64), Expect = 5.1
 Identities = 17/62 (27%), Positives = 34/62 (54%), Gaps = 3/62 (4%)
 Frame = +1

Query: 140 NKNQVLIILKYLLLLQD*IQFACLF---HSQLKITGKYIKWMLSLHFLMVLWKKKCMLSS 196
           N N +++  KY++L+Q *+ F  ++     +  ITG +I  M     ++  WK+K + S 
Sbjct: 391 NGNTIVLYQKYIILIQ-*LHFNDVYLGCEGRKSITGSFIGSMKETEEMLEFWKEKGLSSM 567

Query: 197 LQ 198
           ++
Sbjct: 568 IE 573


>AW459126 
          Length = 339

 Score = 28.9 bits (63), Expect = 6.7
 Identities = 16/59 (27%), Positives = 30/59 (50%)
 Frame = +1

Query: 194 LSSLQDMWLEERRIKYID*RKHCMA*SRRQEHGTKRLILILFKTVFRDVHSSTHSTSNS 252
           ++ L   WLE R+I+Y    K   A +++Q   +K    I+ K + +    S  +T+N+
Sbjct: 67  INDLTGKWLESRQIRYN*ATKRASASNKKQSSNSK----IIIKLINKSSEESXETTNNN 231


>TC203693 similar to UP|Q949G4 (Q949G4) N3 like protein, partial (90%)
          Length = 1795

 Score = 28.9 bits (63), Expect = 6.7
 Identities = 19/60 (31%), Positives = 30/60 (49%)
 Frame = +3

Query: 137 KATNKNQVLIILKYLLLLQD*IQFACLFHSQLKITGKYIKWMLSLHFLMVLWKKKCMLSS 196
           K+T +N + +   +L+LL   +Q  C   + L   GK   + L L  L +LW +   LSS
Sbjct: 204 KSTRRNPLKVSSHFLMLLHCSVQ--CFGFTMLS*KGKLPSFSLPLTHLELLWSQFTFLSS 377


>TC226121 similar to UP|Q9LL85 (Q9LL85) DNA-binding protein p24, partial
           (18%)
          Length = 805

 Score = 28.5 bits (62), Expect = 8.7
 Identities = 13/27 (48%), Positives = 15/27 (55%)
 Frame = -3

Query: 162 CLFHSQLKITGKYIKWMLSLHFLMVLW 188
           C  HS L I G    W  SLH+L+ LW
Sbjct: 698 CSIHSLLHILG----WHYSLHYLLALW 630


>TC218212 similar to UP|Q9LK27 (Q9LK27) Gb|AAF01563.1, partial (11%)
          Length = 712

 Score = 28.5 bits (62), Expect = 8.7
 Identities = 12/49 (24%), Positives = 24/49 (48%)
 Frame = +2

Query: 114 SGCTRQSTNPVVRLIAIKRGWWLKATNKNQVLIILKYLLLLQD*IQFAC 162
           SGC +      + L+ I  GWWL+A   +  ++ + Y++   + +   C
Sbjct: 518 SGCLQ*K*K*YINLVFINTGWWLEAKASHPFVVRMDYVVQKIEFVNIHC 664


>TC232593 weakly similar to UP|Q9XG91 (Q9XG91) Tpv2-1c protein (Fragment),
           partial (16%)
          Length = 562

 Score = 28.5 bits (62), Expect = 8.7
 Identities = 36/106 (33%), Positives = 55/106 (50%), Gaps = 1/106 (0%)
 Frame = +2

Query: 274 IQR*SLNSGRL**VILK*QIWA*CPIFSALRSFNRRM*SLSLRRSMQVIF*RNLRWSIQS 333
           +Q *  +S + *  +LK* I     IF  LRS   R    S++ +MQ  F*R+ +W   +
Sbjct: 239 MQG*LRSSSKK*CKLLK*LILVS*LIFLELRSSKVRTKC*SVKGNMQKKF*RSFKWRNAN 418

Query: 334 QFPRR-LKKS*S*QEKAMVKG*TQLITKV*LEV*DI*LQQGQI*YM 378
               + +K+  S +   ++K    +I   *L+V* I LQQGQ  Y+
Sbjct: 419 LLAHQ*IKRRSSTR*TVLIKLMKDII-GA*LDV*CISLQQGQTFYL 553


>TC226120 similar to UP|Q9LL85 (Q9LL85) DNA-binding protein p24, partial
           (50%)
          Length = 642

 Score = 28.5 bits (62), Expect = 8.7
 Identities = 13/27 (48%), Positives = 15/27 (55%)
 Frame = -1

Query: 162 CLFHSQLKITGKYIKWMLSLHFLMVLW 188
           C  HS L I G    W  SLH+L+ LW
Sbjct: 498 CSIHSLLHILG----WHYSLHYLLALW 430


  Database: GMGI
    Posted date:  Oct 22, 2004  4:58 PM
  Number of letters in database: 37,918,896
  Number of sequences in database:  63,676
  
Lambda     K      H
   0.369    0.164    0.579 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 25,042,367
Number of Sequences: 63676
Number of extensions: 345793
Number of successful extensions: 4653
Number of sequences better than 10.0: 34
Number of HSP's better than 10.0 without gapping: 1977
Number of HSP's successfully gapped in prelim test: 232
Number of HSP's that attempted gapping in prelim test: 2651
Number of HSP's gapped (non-prelim): 2286
length of query: 583
length of database: 12,639,632
effective HSP length: 102
effective length of query: 481
effective length of database: 6,144,680
effective search space: 2955591080
effective search space used: 2955591080
frameshift window, decay const: 50,  0.1
T: 13
A: 40
X1: 14 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 36 (21.8 bits)
S2: 62 (28.5 bits)


Medicago: description of AC140022.1