Medicago
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC149471.17 - phase: 0 /pseudo
         (1391 letters)

Database: GMGI 
           63,676 sequences; 37,918,896 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co...   543  e-154
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete             542  e-154
TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, ...   214  2e-55
BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Ara...   189  6e-48
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag...   177  2e-44
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi...   164  2e-40
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V...   161  2e-39
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ...   160  3e-39
BQ296988 similar to GP|21740616|em OSJNBb0089K24.12 {Oryza sativ...   157  2e-38
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro...   143  5e-34
TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, parti...   136  7e-32
TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotr...   126  1e-31
CO982036                                                              132  1e-30
CO981879                                                               94  2e-30
BM307983                                                              128  2e-29
TC222001 similar to UP|C716_NEPRA (O04164) Cytochrome P450 71A6 ...   128  2e-29
TC232995                                                              124  3e-28
BU548243                                                              123  6e-28
BU764568                                                              100  6e-27
TC211311 weakly similar to UP|O24587 (O24587) Pol protein, parti...   102  1e-26

>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
          Length = 4734

 Score =  543 bits (1398), Expect = e-154
 Identities = 346/1103 (31%), Positives = 560/1103 (50%), Gaps = 19/1103 (1%)
 Frame = +1

Query: 302  SSSSQASSNHISANPLISTTISAPESSSAGIIPKPSYWLLDSGANEHISCNLSFFSSFYR 361
            SSSS      +  + ++S  +     +SA        W LDSG + H++    F  +   
Sbjct: 1591 SSSSGRKMMWVPKHKIVSLVVHTSLRASA-----KEDWYLDSGCSRHMTGVKEFLVNIEP 1755

Query: 362  IPPVYVSLPNKTCVLVQYAGTVSFTSNFYLSHVLYSPAFTHNLISVAKLCESLSYSLHFT 421
                YV+  + +   +   G +       L+ VL     T NLIS+++LC+   ++++FT
Sbjct: 1756 CSTSYVTFGDGSKGKITGMGKLVHDGLPSLNKVLLVKGLTANLISISQLCDE-GFNVNFT 1932

Query: 422  SAHCIIQDTMSLKMIGLAKQLDGLYKYTP--SSCSSNSVFSSVSHKSCNVVATISCNSSS 479
             + C++ +  S  ++  ++  D  Y +TP  +S SS  +FS                   
Sbjct: 1933 KSECLVTNEKSEVLMKGSRSKDNCYLWTPQETSYSSTCLFSKEDEVK------------- 2073

Query: 480  SIPSNALWHFRLGHLSHQRLHSMSLLY--------PNIISSNNKDVCDLCHFAKHKHLPF 531
                  +WH R GHL    L  M  +         PN+     + +C  C   K   +  
Sbjct: 2074 ------IWHQRFGHL---HLRGMKKIIDKGAVRGIPNLKIEEGR-ICGECQIGKQVKMS- 2220

Query: 532  NSSISHASTN--FELLHLDIWGPLSIASVHGHRYFLTIVDDHSRFLWVILLKSKAEVSTH 589
            +  + H +T+   ELLH+D+ GP+ + S+ G RY   +VDD SRF WV  ++ K++    
Sbjct: 2221 HQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVNFIREKSDTFEV 2400

Query: 590  VINFITMIQTQFHITPKFIRTDNGPEF---MLSTFYASHGIIHQKSCVETPQQNGRVERK 646
                   +Q +     K IR+D+G EF     + F  S GI H+ S   TPQQNG VERK
Sbjct: 2401 FKELSLRLQREKDCVIKRIRSDHGREFENSKFTEFCTSEGITHEFSAAITPQQNGIVERK 2580

Query: 647  HQHILNVGRALLFQSKLPPSFWSYAILHAVFLINRVPTPILHNQSPYFVLHHQLPALNLF 706
            ++ +    R +L   +LP + W+ A+  A ++ NRV        + Y +   + P +  F
Sbjct: 2581 NRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYEIWKGRKPTVKHF 2760

Query: 707  KVFGCLCYASTLQSHRTKLQPRARKSIFLGYKSGFKGFTLYDIQSREIFVSRHVTFHETF 766
             +FG  CY    +  R K+ P++   IFLGY +  + + +++ ++R +  S +V   +  
Sbjct: 2761 HIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESINVVVDDLT 2940

Query: 767  LPYP---HTSLSTTPNWEYFSSSNFSDVSNQPTPINSPAIIDDILPPSPPINPPPPPPIP 823
                      + T+ +    ++ +  +  N  +  + P I      PS  I    P  + 
Sbjct: 2941 PARKKDVEEDVRTSGDNVADTAKSAENAENSDSATDEPNINQPDKRPSIRIQKMHPKELI 3120

Query: 824  VVSPASRTSTRQTTTPSYLQDYVCNNIHTSPYPINNYISHHNLSNNYSSFVMSLHTTTEP 883
            +  P      R  TT S   + V N                      S FV  +    EP
Sbjct: 3121 IGDP-----NRGVTTRSREIEIVSN----------------------SCFVSKI----EP 3207

Query: 884  KSYAEASKHDCWKQAMQVELQALEKTGTWQLVDLPSNIKPIGCRWIYKVKYHADGSIERH 943
            K+  EA   + W  AMQ EL+  ++   W+LV  P     IG +WI+K K + +G I R+
Sbjct: 3208 KNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRN 3387

Query: 944  KARLVAKGYNQIEGLDYFDTYSPVAKLTTIRLVIALSSIHNWHLHQLDVNNAFLHGDLQE 1003
            KARLVA+GY QIEG+D+ +T++PVA+L +IRL++ ++ I  + L+Q+DV +AFL+G L E
Sbjct: 3388 KARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNE 3567

Query: 1004 DVYMLIPPG-IKSNKPNQVCKLQKSLYGLKQASRKWYEKLTSVLSHHHYIQASSDHSLFV 1062
            + Y+  P G +    P+ V +L+K+LYGLKQA R WYE+LT  L+   Y +   D +LFV
Sbjct: 3568 EAYVEQPKGFVDPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFV 3747

Query: 1063 KKTSSSFTILLVYVDDIIIAGDSLTEFTYIKSVLDASFKIKDLGQLKYFLGIEVAHSKLG 1122
            K+ + +  I  +YVDDI+  G S     +    + + F++  +G+L YFLG++V   +  
Sbjct: 3748 KQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDS 3927

Query: 1123 ISLCQRKYCLDLLADSGTIDSKPVSTPSDSSIKLHQDSSPSYADIPSYRRLVGRLLYLNT 1182
            I L Q KY  +++   G  ++    TP+ + +KL +D + +  D   YR ++G LLYL  
Sbjct: 3928 IFLSQSKYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTA 4107

Query: 1183 TRPDITFITQQLSQFLSQPTQAHHTAALRVLRYLKGCPGRGLFFPRNSSINLQGFSDADW 1242
            +RPDIT+     +++ + P  +H     R+L+Y+ G    G+ +   S   L G+ DADW
Sbjct: 4108 SRPDITYAVGVCARYQANPKISHLNQVKRILKYVNGTSDYGIMYCHCSDSMLVGYCDADW 4287

Query: 1243 AGCLDTRRSISGQCFFLGNSLISWRTKKQITVSRSSSEAEYRALASATCELQWILYLLQD 1302
            AG  D R+S SG CF+LG +LISW +KKQ  VS S++EAEY A  S+  +L W+  +L++
Sbjct: 4288 AGSADDRKSTSGGCFYLGTNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKE 4467

Query: 1303 IHISCPKLPVLYCDNQSALHIAANPVFHERTKHLEIDCHIVREKVQAGILKLLPVSSQDQ 1362
             ++    +  LYCDN SA++I+ NPV H RTKH++I  H +R+ V   ++ L  V +++Q
Sbjct: 4468 YNVE-QDVMTLYCDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLEHVDTEEQ 4644

Query: 1363 VADFFTKALLPKPFNILLSKMGL 1385
            +AD FTKAL    F  L  K+G+
Sbjct: 4645 IADIFTKALDANQFEKLRGKLGI 4713


>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
          Length = 4731

 Score =  542 bits (1397), Expect = e-154
 Identities = 340/1064 (31%), Positives = 549/1064 (50%), Gaps = 17/1064 (1%)
 Frame = +1

Query: 339  WLLDSGANEHISCNLSFFSSFYRIPPVYVSLPNKTCVLVQYAGTVSFTSNFYLSHVLYSP 398
            W LDSG + H++    F  +       YV+  + +   +   G +       L+ VL   
Sbjct: 1684 WYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIGMGKLVHDGLPSLNKVLLVK 1863

Query: 399  AFTHNLISVAKLCESLSYSLHFTSAHCIIQDTMSLKMIGLAKQLDGLYKYTPSSCSSNSV 458
              T NLIS+++LC+   ++++FT + C++ +  S  ++  ++  D  Y +TP   S +S 
Sbjct: 1864 GLTANLISISQLCDE-GFNVNFTKSECLVTNEKSEVLMKGSRSKDNCYLWTPQETSYSS- 2037

Query: 459  FSSVSHKSCNVVATISCNSSSSIPSNALWHFRLGHLSHQRLHSMSLLY--------PNII 510
                           +C SS       +WH R GHL    L  M  +         PN+ 
Sbjct: 2038 ---------------TCLSSKE-DEVRIWHQRFGHL---HLRGMKKIIDKGAVRGIPNLK 2160

Query: 511  SSNNKDVCDLCHFAKHKHLPFNSSISHASTN--FELLHLDIWGPLSIASVHGHRYFLTIV 568
                + +C  C   K   +  +  + H +T+   ELLH+D+ GP+ + S+ G RY   +V
Sbjct: 2161 IEEGR-ICGECQIGKQVKMS-HQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVV 2334

Query: 569  DDHSRFLWVILLKSKAEVSTHVINFITMIQTQFHITPKFIRTDNGPEF---MLSTFYASH 625
            DD SRF WV  ++ K+E           +Q +     K IR+D+G EF     + F  S 
Sbjct: 2335 DDFSRFTWVNFIREKSETFEVFKELSLRLQREKDCVIKRIRSDHGREFENSRFTEFCTSE 2514

Query: 626  GIIHQKSCVETPQQNGRVERKHQHILNVGRALLFQSKLPPSFWSYAILHAVFLINRVPTP 685
            GI H+ S   TPQQNG VERK++ +    R +L   +LP + W+ A+  A ++ NRV   
Sbjct: 2515 GITHEFSAAITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLR 2694

Query: 686  ILHNQSPYFVLHHQLPALNLFKVFGCLCYASTLQSHRTKLQPRARKSIFLGYKSGFKGFT 745
                 + Y +   + P++  F +FG  CY    +  R K+ P++   IFLGY +  + + 
Sbjct: 2695 RGTPTTLYEIWKGRKPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYR 2874

Query: 746  LYDIQSREIFVSRHVTFHETFLPYP---HTSLSTTPNWEYFSSSNFSDVSNQPTPINSPA 802
            +++ ++R +  S +V   +            + T+ +    ++ +  +  N  +  +   
Sbjct: 2875 VFNSRTRTVMESINVVVDDLSPARKKDVEEDVRTSGDNVADAAKSGENAENSDSATDESN 3054

Query: 803  IIDDILPPSPPINPPPPPPIPVVSPASRTSTRQTTTPSYLQDYVCNNIHTSPYPINNYIS 862
            I       S  I    P  + +  P      R  TT S   + V N              
Sbjct: 3055 INQPDKRSSTRIQKMHPKELIIGDP-----NRGVTTRSREVEIVSN-------------- 3177

Query: 863  HHNLSNNYSSFVMSLHTTTEPKSYAEASKHDCWKQAMQVELQALEKTGTWQLVDLPSNIK 922
                    S FV  +    EPK+  EA   + W  AMQ EL+  ++   W+LV  P    
Sbjct: 3178 --------SCFVSKI----EPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTN 3321

Query: 923  PIGCRWIYKVKYHADGSIERHKARLVAKGYNQIEGLDYFDTYSPVAKLTTIRLVIALSSI 982
             IG +WI+K K + +G I R+KARLVA+GY QIEG+D+ +T++PVA+L +IRL++ ++ I
Sbjct: 3322 VIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVACI 3501

Query: 983  HNWHLHQLDVNNAFLHGDLQEDVYMLIPPGIKS-NKPNQVCKLQKSLYGLKQASRKWYEK 1041
              + L+Q+DV +AFL+G L E+VY+  P G      P+ V +L+K+LYGLKQA R WYE+
Sbjct: 3502 LKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFADPTHPDHVYRLKKALYGLKQAPRAWYER 3681

Query: 1042 LTSVLSHHHYIQASSDHSLFVKKTSSSFTILLVYVDDIIIAGDSLTEFTYIKSVLDASFK 1101
            LT  L+   Y +   D +LFVK+ + +  I  +YVDDI+  G S     +    + + F+
Sbjct: 3682 LTEFLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEFE 3861

Query: 1102 IKDLGQLKYFLGIEVAHSKLGISLCQRKYCLDLLADSGTIDSKPVSTPSDSSIKLHQDSS 1161
            +  +G+L YFLG++V   +  I L Q +Y  +++   G  ++    TP+ + +KL +D +
Sbjct: 3862 MSLVGELTYFLGLQVKQMEDSIFLSQSRYAKNIVKKFGMENASHKRTPAPTHLKLSKDEA 4041

Query: 1162 PSYADIPSYRRLVGRLLYLNTTRPDITFITQQLSQFLSQPTQAHHTAALRVLRYLKGCPG 1221
             +  D   YR ++G LLYL  +RPDIT+     +++ + P  +H T   R+L+Y+ G   
Sbjct: 4042 GTSVDQSLYRSMIGSLLYLTASRPDITYAVGVCARYQANPKISHLTQVKRILKYVNGTSD 4221

Query: 1222 RGLFFPRNSSINLQGFSDADWAGCLDTRRSISGQCFFLGNSLISWRTKKQITVSRSSSEA 1281
             G+ +   S+  L G+ DADWAG  D R+S SG CF+LGN+LISW +KKQ  VS S++EA
Sbjct: 4222 YGIMYCHCSNPMLVGYCDADWAGSADDRKSTSGGCFYLGNNLISWFSKKQNCVSLSTAEA 4401

Query: 1282 EYRALASATCELQWILYLLQDIHISCPKLPVLYCDNQSALHIAANPVFHERTKHLEIDCH 1341
            EY A  S+  +L W+  +L++ ++    +  LYCDN SA++I+ NPV H RTKH++I  H
Sbjct: 4402 EYIAAGSSCSQLVWMKQMLKEYNVE-QDVMTLYCDNMSAINISKNPVQHSRTKHIDIRHH 4578

Query: 1342 IVREKVQAGILKLLPVSSQDQVADFFTKALLPKPFNILLSKMGL 1385
             +R+ V   ++ L  V +++Q+AD FTKAL    F  L  K+G+
Sbjct: 4579 YIRDLVDDKVITLKHVDTEEQIADIFTKALDANQFEKLRGKLGI 4710


>TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, partial (16%)
          Length = 662

 Score =  214 bits (545), Expect = 2e-55
 Identities = 113/188 (60%), Positives = 142/188 (75%), Gaps = 4/188 (2%)
 Frame = +3

Query: 1208 AALRVLRYLKGCPGRGLFFPRNSSINLQGFSDADWAGCLDTRRSISGQCFFLGNSLISWR 1267
            AA RVL+YLKGCP +GL F R S I + GFSDADWA C+D+ +SI+  CFFLG+SLISW+
Sbjct: 18   AATRVLKYLKGCPRKGLSFSRESPIQILGFSDADWATCIDSSKSITWYCFFLGSSLISWK 197

Query: 1268 TKKQITVSR--SSSEAEYRALASATCELQWILYLLQDIHISCPKLPVLYCDNQSALH-IA 1324
             KKQ TVSR  SSSEA+YRAL S TCELQW+ YLL+D+H++     ++YCDNQSAL  + 
Sbjct: 198  AKKQNTVSRSSSSSEAKYRALTSTTCELQWLTYLLKDLHVT-----LIYCDNQSALQ*LP 362

Query: 1325 ANPVFHERTKHLEIDCHIVREKVQAGILK-LLPVSSQDQVADFFTKALLPKPFNILLSKM 1383
               ++H +   LEIDCHIVREK Q G++  LLPVSS +Q+AD FTKAL PK F+  LSK+
Sbjct: 363  IKVIYHGQ---LEIDCHIVREKTQQGLMHCLLPVSSSNQLADIFTKALSPKLFSSNLSKL 533

Query: 1384 GLINIYQP 1391
            GL +I+ P
Sbjct: 534  GLSDIFLP 557


>BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Arabidopsis
            thaliana}, partial (18%)
          Length = 421

 Score =  189 bits (481), Expect = 6e-48
 Identities = 94/140 (67%), Positives = 113/140 (80%)
 Frame = -2

Query: 1026 KSLYGLKQASRKWYEKLTSVLSHHHYIQASSDHSLFVKKTSSSFTILLVYVDDIIIAGDS 1085
            KSLYGLKQASRKWYEKLT++L    YIQ+ SD+SLF     ++FT LLVYVDDII+AGDS
Sbjct: 420  KSLYGLKQASRKWYEKLTNLLLKEGYIQSISDYSLFTLTKGNTFTALLVYVDDIILAGDS 241

Query: 1086 LTEFTYIKSVLDASFKIKDLGQLKYFLGIEVAHSKLGISLCQRKYCLDLLADSGTIDSKP 1145
            + EF  IK+VLD +FKIK+LG+LKYFLG+EVAHS+LGI++ QRKYCLDLL DSG +  KP
Sbjct: 240  IDEFDRIKNVLDLAFKIKNLGKLKYFLGLEVAHSRLGITISQRKYCLDLLKDSGLLGCKP 61

Query: 1146 VSTPSDSSIKLHQDSSPSYA 1165
             STP D+SIKLH  +   YA
Sbjct: 60   ASTPLDTSIKLHSAAGTPYA 1


>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
            (30%)
          Length = 687

 Score =  177 bits (450), Expect = 2e-44
 Identities = 85/159 (53%), Positives = 112/159 (69%), Gaps = 1/159 (0%)
 Frame = +2

Query: 1234 LQGFSDADWAGCLDTRRSISGQCFFLGNSLISWRTKKQITVSRSSSEAEYRALASATCEL 1293
            L G+ DADWAGC   RRS SG C F+G +L+SW++KKQ  V+RSS+EAEYR++A  TCEL
Sbjct: 17   LSGYCDADWAGCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVTCEL 196

Query: 1294 QWILYLLQDIHISCPKLPV-LYCDNQSALHIAANPVFHERTKHLEIDCHIVREKVQAGIL 1352
             WI   LQ++   C +L + LYCDNQ+ALHIA+NPVFHERTKH+EIDCH +REK+ +  +
Sbjct: 197  MWIKQFLQELRF-CEELQMKLYCDNQAALHIASNPVFHERTKHIEIDCHFIREKLLSKEI 373

Query: 1353 KLLPVSSQDQVADFFTKALLPKPFNILLSKMGLINIYQP 1391
                + S DQ  D  TK+L      I+ SK+G  ++Y P
Sbjct: 374  VTEFIGSNDQPVDILTKSLRGPKIQIVCSKLGAYDLYAP 490


>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
            partial (34%)
          Length = 407

 Score =  164 bits (416), Expect = 2e-40
 Identities = 76/133 (57%), Positives = 99/133 (74%), Gaps = 1/133 (0%)
 Frame = -2

Query: 915  VDLPSNIKPIGCRWIYKVKYHADGSIERHKARLVAKGYNQIEGLDYFDTYSPVAKLTTIR 974
            V LP    P+GCRW+Y VK    G ++R KARLVAKGY Q+ G+DY DT+SPVAKLTT+R
Sbjct: 406  VPLPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVR 227

Query: 975  LVIALSSIHNWHLHQLDVNNAFLHGDLQEDVYMLIPPG-IKSNKPNQVCKLQKSLYGLKQ 1033
            L +A+++I +W LHQLD+ NAFLHGDL+ED+YM  PPG +   +   VCKL +SLYGLKQ
Sbjct: 226  LFLAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQ 47

Query: 1034 ASRKWYEKLTSVL 1046
            + R W+ K + V+
Sbjct: 46   SPRAWFGKFSHVV 8


>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
            vinifera}, partial (34%)
          Length = 409

 Score =  161 bits (408), Expect = 2e-39
 Identities = 74/134 (55%), Positives = 97/134 (72%)
 Frame = +3

Query: 877  LHTTTEPKSYAEASKHDCWKQAMQVELQALEKTGTWQLVDLPSNIKPIGCRWIYKVKYHA 936
            L + T P +  EA  H  W+QAM  E+QALE  GTW+LV LP     +GCRW+Y VK   
Sbjct: 3    LSSLTVPSTIREALDHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGP 182

Query: 937  DGSIERHKARLVAKGYNQIEGLDYFDTYSPVAKLTTIRLVIALSSIHNWHLHQLDVNNAF 996
            +G ++R KARLVAKGY Q+ G++Y DT+SPV  LTT+RL +A+++I +W LHQLD+ NAF
Sbjct: 183  NGKVDRLKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAF 362

Query: 997  LHGDLQEDVYMLIP 1010
            LHGDL+ED+YM  P
Sbjct: 363  LHGDLEEDIYMEQP 404


>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
            (japonica cultivar-group)}, partial (10%)
          Length = 463

 Score =  160 bits (406), Expect = 3e-39
 Identities = 79/150 (52%), Positives = 109/150 (72%), Gaps = 1/150 (0%)
 Frame = -3

Query: 898  AMQVELQALEKTGTWQLVDLPSNIKPIGCRWIYKVKYHADGSIERHKARLVAKGYNQIEG 957
            AMQ EL   E+   W+LV+ P N   IG +W+++ K    G I R+KARLVAKGYNQ EG
Sbjct: 458  AMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEEG 279

Query: 958  LDYFDTYSPVAKLTTIRLVIALSSIHNWHLHQLDVNNAFLHGDLQEDVYMLIPPGIK-SN 1016
            +DY +TY+PVA+L  IR+++A  SI N+ L+Q+DV +AFL+G +QE+VY+  PPG +  +
Sbjct: 278  IDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIPD 99

Query: 1017 KPNQVCKLQKSLYGLKQASRKWYEKLTSVL 1046
            KP  V KLQK+LYGLKQA R WYE++++ L
Sbjct: 98   KPTHVYKLQKALYGLKQAPRAWYERISNFL 9


>BQ296988 similar to GP|21740616|em OSJNBb0089K24.12 {Oryza sativa (japonica
           cultivar-group)}, partial (1%)
          Length = 408

 Score =  157 bits (398), Expect = 2e-38
 Identities = 68/136 (50%), Positives = 104/136 (76%)
 Frame = -1

Query: 33  RCNHLVQSWLINSVSDSIAQTIVFYDTAFEVWHDLQERFSKVDRIRIANLRSTINNLKQG 92
           RCN L+ SW++NSV  SI+++IVF D A +VW DL+ERFS+ D +R++ ++  I  L QG
Sbjct: 408 RCNMLIHSWILNSVEPSISRSIVFMDNASDVWLDLKERFSQGDLVRVSEIQQEIYALTQG 229

Query: 93  SKSVLDYFTEMKALWEELASHRPIPNCSCIHPCRCEASKVAKIHRNEDQIMQFLTGLNDQ 152
           ++SV  ++++ KALWEEL  + PIPNC+C H C C+A ++A+ H +   +M+FLTGLND+
Sbjct: 228 TRSVTTFYSDKKALWEELEIYMPIPNCTCHHRCSCDAMRLARRHHHTLHVMRFLTGLNDE 49

Query: 153 FSIVRTQVLLLDPLPS 168
           F+ V++Q+LL++PLPS
Sbjct: 48  FNAVKSQILLIEPLPS 1


>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein, partial
            (7%)
          Length = 804

 Score =  143 bits (361), Expect = 5e-34
 Identities = 73/221 (33%), Positives = 130/221 (58%), Gaps = 3/221 (1%)
 Frame = +1

Query: 1166 DIPSYRRLVGRLLYLNTTRPDITFITQQLSQFLSQPTQAHHTAALRVLRYLKGCPGRGLF 1225
            D+  +RRL+G L YL  +RP+I F    +S+F+ +P  +H  AA RVLR +KG  G G+ 
Sbjct: 10   DVTEFRRLIGSLRYLCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGVL 189

Query: 1226 FP---RNSSINLQGFSDADWAGCLDTRRSISGQCFFLGNSLISWRTKKQITVSRSSSEAE 1282
            FP   ++   +L G++D+DW    +  +S  G  F   ++ ++  +KKQ  ++ S+ EAE
Sbjct: 190  FPFKAKSGKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEAE 369

Query: 1283 YRALASATCELQWILYLLQDIHISCPKLPVLYCDNQSALHIAANPVFHERTKHLEIDCHI 1342
            Y A +   C+  W++ LL+++ +   K   L  DN+SA+++A +P  H R+KH+E+  H 
Sbjct: 370  YVAASLGACQAVWMMNLLEELKLRERKPVNLLIDNKSAINLAKHPTLHGRSKHIELRFHY 549

Query: 1343 VREKVQAGILKLLPVSSQDQVADFFTKALLPKPFNILLSKM 1383
            +R++V  G + +    +++Q+AD  TK +    F  + S++
Sbjct: 550  IRDQVSKGNVTVEYCKAEEQLADLMTKPIQVSRFKQICSEL 672


>TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, partial (10%)
          Length = 558

 Score =  136 bits (342), Expect = 7e-32
 Identities = 78/179 (43%), Positives = 106/179 (58%), Gaps = 2/179 (1%)
 Frame = +1

Query: 938  GSIERHKARLVAKGYNQIEGLDYFDTYSPVAKLTTIRLVIALSSIHNWHLHQLDVNNAFL 997
            G+I++ KARLVAK Y Q+ G DY  T+SPVAK+  + L+ +++ + +W L  LD  NAFL
Sbjct: 28   GTIDQFKARLVAKSYTQVYGQDYTGTFSPVAKMAYVHLLWSMAVVCHWPLF*LDAKNAFL 207

Query: 998  HGDLQEDVYMLIPPGI--KSNKPNQVCKLQKSLYGLKQASRKWYEKLTSVLSHHHYIQAS 1055
            HG L+E+VYM  P G   +    N VC+L +S YGLKQ+ R W        +   Y    
Sbjct: 208  HGYLEEEVYMEQPLGFVAQGESSNMVCQLCRSFYGLKQSPRAW--PFLYCGAAIWYDSHE 381

Query: 1056 SDHSLFVKKTSSSFTILLVYVDDIIIAGDSLTEFTYIKSVLDASFKIKDLGQLKYFLGI 1114
            +DHS+F   +      L+VYVDDI I G      T +K  L   F+ KDLG+L+YFLGI
Sbjct: 382  ADHSVFYCHSPQGCIYLIVYVDDIGITGSDQHGIT*LK*XLCCQFQTKDLGKLRYFLGI 558


>TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotransposon
            Hopscotch polyprotein, partial (7%)
          Length = 1446

 Score =  126 bits (317), Expect(2) = 1e-31
 Identities = 61/109 (55%), Positives = 77/109 (69%)
 Frame = +2

Query: 1239 DADWAGCLDTRRSISGQCFFLGNSLISWRTKKQITVSRSSSEAEYRALASATCELQWILY 1298
            DA+WA     R S  G C  +G +L+ W++ K   V+RSS+EAEY+A+  ATCEL WI  
Sbjct: 8    DANWAVSPIDRGSTLGYCVSIGENLVLWKSNK*NVVARSSAEAEYKAMTVATCELIWIKQ 187

Query: 1299 LLQDIHISCPKLPVLYCDNQSALHIAANPVFHERTKHLEIDCHIVREKV 1347
            LLQ++     +   L CDNQ+ALHIA+NPVFHERTKH+EIDCH VREKV
Sbjct: 188  LLQELKFGSTQQMKLCCDNQAALHIASNPVFHERTKHIEIDCHFVREKV 334



 Score = 30.0 bits (66), Expect(2) = 1e-31
 Identities = 14/33 (42%), Positives = 20/33 (60%)
 Frame = +3

Query: 1357 VSSQDQVADFFTKALLPKPFNILLSKMGLINIY 1389
            VSS DQ+A+ FTK+L       + SK+G   +Y
Sbjct: 363  VSSNDQLANIFTKSLRGPRIQNICSKLGAFELY 461


>CO982036 
          Length = 674

 Score =  132 bits (332), Expect = 1e-30
 Identities = 84/211 (39%), Positives = 116/211 (54%), Gaps = 3/211 (1%)
 Frame = -2

Query: 1064 KTSSSFTILLVYVDDIIIAGDSLTEFTYIKSVLDASFKIKDLGQLKYFLGIEVAHSKLGI 1123
            KT      LLVYVD III G S T    + S L++SF +K LG+L YF+ IEV  S   +
Sbjct: 670  KTHILTVYLLVYVD-IIITGSSCTLIQNLTSKLNSSFPLKLLGKLDYFVEIEVK-SMPDL 497

Query: 1124 SLCQRKYCLDLLADSGTIDSKPVSTPSDSSIKLHQDSSPSYADIPSYRRLVGRLLYLNTT 1183
                R    ++        ++P+S+P  ++ KL +  S  ++    YR +VG L Y    
Sbjct: 496  LFSLRTSIFEIFCRKPR*QAQPISSPMTTTCKLSKSDSDLFSGPTFYRSVVGALQYTTVI 317

Query: 1184 RPDITFITQQLSQFLSQPTQAHHTAALRVLRYLKGCPGRGL-FFPRNSS--INLQGFSDA 1240
            RP+I+F   ++ QF+S P  +H T   R+LRYLKG    GL   P  SS  + ++GF DA
Sbjct: 316  RPEISFAVNKVCQFMSNPLDSHWTEVKRILRYLKGSLSYGL*LKPAISSQPLPIRGFCDA 137

Query: 1241 DWAGCLDTRRSISGQCFFLGNSLISWRTKKQ 1271
            DWA  +D +RS SG   FLG +LISW   KQ
Sbjct: 136  DWASAVDDKRSTSGAAVFLGPNLISWWXXKQ 44


>CO981879 
          Length = 576

 Score = 94.4 bits (233), Expect(2) = 2e-30
 Identities = 48/92 (52%), Positives = 59/92 (63%), Gaps = 3/92 (3%)
 Frame = -1

Query: 593 FITMIQTQFHITPKFIRTDNGPEFM---LSTFYASHGIIHQKSCVETPQQNGRVERKHQH 649
           F  MIQTQF +  K  R+DNG E+    LS     +GIIHQ SCV+TPQQNG  ERK++H
Sbjct: 558 FFQMIQTQFQVKIKVFRSDNGREYFNKHLSKXXLENGIIHQSSCVDTPQQNGVAERKNRH 379

Query: 650 ILNVGRALLFQSKLPPSFWSYAILHAVFLINR 681
           +  V RALLFQ+K P   W  AIL   +L N+
Sbjct: 378 LXEVARALLFQNKAPKYXWGEAILTGTYLKNK 283



 Score = 58.5 bits (140), Expect(2) = 2e-30
 Identities = 31/89 (34%), Positives = 50/89 (55%), Gaps = 5/89 (5%)
 Frame = -2

Query: 681 RVPTPILHNQSPYFVLHHQLPALNL-----FKVFGCLCYASTLQSHRTKLQPRARKSIFL 735
           R+P+ IL+ ++P  V     P   L      K+FGC  +    + ++ KL+PRA+K +F+
Sbjct: 284 RMPSKILNFRTPLDVFTSAFPNNRLSCTLPLKIFGCTVFVHIHEPNQGKLEPRAKKCVFV 105

Query: 736 GYKSGFKGFTLYDIQSREIFVSRHVTFHE 764
           GY    KG+  +D  S++ FV+  VTF E
Sbjct: 104 GYAPNQKGYKCFDPTSKKTFVTIDVTFFE 18


>BM307983 
          Length = 406

 Score =  128 bits (321), Expect = 2e-29
 Identities = 65/133 (48%), Positives = 89/133 (66%), Gaps = 2/133 (1%)
 Frame = +2

Query: 924  IGCRWIYKVKYHADGSIERHKARLVAKGYNQIEGLDYFDTYSPVAK-LTTIRLVIALSSI 982
            +GCRWIY VKY AD +++R+KARLVAKGY Q  G+DY +T++   K + +        + 
Sbjct: 2    VGCRWIYTVKY*ADDTLDRYKARLVAKGYIQTYGIDYEETFAQWQK*IQSGSSSP*QQAQ 181

Query: 983  HNWHLHQLDVNNAFLHGDLQEDVYMLIPPGI-KSNKPNQVCKLQKSLYGLKQASRKWYEK 1041
              W +HQ DV NAFLHG L+E+VYM IPPG   SN  N+VC+L+K+LYGLKQ+ R W+ +
Sbjct: 182  FGWEMHQFDVKNAFLHGSLEEEVYMEIPPGYGASNGGNKVCRLKKALYGLKQSPRAWFGR 361

Query: 1042 LTSVLSHHHYIQA 1054
             T  +    Y Q+
Sbjct: 362  FTQAMLSLGYKQS 400


>TC222001 similar to UP|C716_NEPRA (O04164) Cytochrome P450 71A6  (Fragment)
           , partial (21%)
          Length = 912

 Score =  128 bits (321), Expect = 2e-29
 Identities = 54/102 (52%), Positives = 76/102 (73%)
 Frame = -2

Query: 663 LPPSFWSYAILHAVFLINRVPTPILHNQSPYFVLHHQLPALNLFKVFGCLCYASTLQSHR 722
           +PP+FW+YA+LHA +LIN +PTP L N SPY  LH  +P ++  ++FGCLCYAST++++R
Sbjct: 911 MPPNFWNYALLHAAYLINCIPTPFLQNTSPYERLHGHIPDISHLRIFGCLCYASTIKANR 732

Query: 723 TKLQPRARKSIFLGYKSGFKGFTLYDIQSREIFVSRHVTFHE 764
            KL+PRA   IF+G+K   KG+ LYD+ S  I  SR+V F+E
Sbjct: 731 KKLEPRAHPCIFIGFKPNTKGYMLYDLHSHNIITSRNVVFYE 606


>TC232995 
          Length = 1009

 Score =  124 bits (311), Expect = 3e-28
 Identities = 63/170 (37%), Positives = 102/170 (59%), Gaps = 1/170 (0%)
 Frame = +2

Query: 1010 PPGIK-SNKPNQVCKLQKSLYGLKQASRKWYEKLTSVLSHHHYIQASSDHSLFVKKTSSS 1068
            PPG + S+KPN V KLQK+LYGLKQA R WYE+L++ L    + +   D +LF+K+  + 
Sbjct: 11   PPGFEISDKPNHVYKLQKALYGLKQAPRAWYERLSNFLLEKEFSRGKVDTTLFIKRKHND 190

Query: 1069 FTILLVYVDDIIIAGDSLTEFTYIKSVLDASFKIKDLGQLKYFLGIEVAHSKLGISLCQR 1128
              ++ +YVDDII    + +        + + F++  +G+LKYFLG+++  ++ GI + Q 
Sbjct: 191  ILLVQIYVDDIIFGSTNDSLCKEFSLDMQSEFEMSMMGELKYFLGLQIKQTQ*GIFINQS 370

Query: 1129 KYCLDLLADSGTIDSKPVSTPSDSSIKLHQDSSPSYADIPSYRRLVGRLL 1178
            KYC +L+   G   +K +STP  ++  L +D S    DI  YR  +G ++
Sbjct: 371  KYCKELIKRFGMDSAKHMSTPMSTNCYLDKDESGQSIDIKQYRDAIGEVV 520


>BU548243 
          Length = 599

 Score =  123 bits (308), Expect = 6e-28
 Identities = 67/145 (46%), Positives = 89/145 (61%)
 Frame = -1

Query: 1239 DADWAGCLDTRRSISGQCFFLGNSLISWRTKKQITVSRSSSEAEYRALASATCELQWILY 1298
            DA WA  +D  RS  G   FLG +LISW ++KQ   ++SS+EAEYR++A  + EL WI  
Sbjct: 587  DAGWASDVDDHRSTLGSAIFLGPNLISWWSRKQQVTAQSSTEAEYRSIAQTSAELTWIQA 408

Query: 1299 LLQDIHISCPKLPVLYCDNQSALHIAANPVFHERTKHLEIDCHIVREKVQAGILKLLPVS 1358
            LL ++ I     PV+ CDN+SA+ IA N VFH RTKH+EID   V EKV +  L++  + 
Sbjct: 407  LLMELQIPFTP-PVILCDNKSAVAIAHNLVFHSRTKHMEIDVFFVHEKVLSKQLQIFHIP 231

Query: 1359 SQDQVADFFTKALLPKPFNILLSKM 1383
            + DQ A   TK L    F  L SK+
Sbjct: 230  ALDQWAGILTKPLSSARFTFLKSKL 156


>BU764568 
          Length = 420

 Score =  100 bits (248), Expect(2) = 6e-27
 Identities = 46/84 (54%), Positives = 61/84 (71%)
 Frame = +3

Query: 1253 SGQCFFLGNSLISWRTKKQITVSRSSSEAEYRALASATCELQWILYLLQDIHISCPKLPV 1312
            SG C  +G +LISW++KKQ  V++SS+EAEYRA+A  TCEL W+  LL ++         
Sbjct: 168  SGYCVLIGGNLISWKSKKQSVVAKSSAEAEYRAMALVTCELIWLKQLL*ELKFEEDTQMT 347

Query: 1313 LYCDNQSALHIAANPVFHERTKHL 1336
            L CDNQ+ALHIA+NP+FH RTKH+
Sbjct: 348  LICDNQAALHIASNPIFH*RTKHI 419



 Score = 40.8 bits (94), Expect(2) = 6e-27
 Identities = 20/50 (40%), Positives = 27/50 (54%)
 Frame = +1

Query: 1195 SQFLSQPTQAHHTAALRVLRYLKGCPGRGLFFPRNSSINLQGFSDADWAG 1244
            SQFL+ P Q H  A   +L+  K  PG+GL +       + G+SDAD  G
Sbjct: 1    SQFLNSPCQDHWNAVS*ILK*TKSAPGKGLIYEDKGHSQIIGYSDAD*VG 150


>TC211311 weakly similar to UP|O24587 (O24587) Pol protein, partial (15%)
          Length = 1213

 Score =  102 bits (253), Expect(2) = 1e-26
 Identities = 59/186 (31%), Positives = 94/186 (49%)
 Frame = +3

Query: 1068 SFTILLVYVDDIIIAGDSLTEFTYIKSVLDASFKIKDLGQLKYFLGIEVAHSKLGISLCQ 1127
            +F I+ +YVDDII    S         ++   F+    G+LK+ LG+++     GI + Q
Sbjct: 483  TFLIIHIYVDDIIFGATSKRMCKEFFELMKDGFETSMKGELKFLLGLQIIQKVYGIFIHQ 662

Query: 1128 RKYCLDLLADSGTIDSKPVSTPSDSSIKLHQDSSPSYADIPSYRRLVGRLLYLNTTRPDI 1187
             KY    L      ++KP++TP   S  + +D   ++     Y  ++  L YL ++RPDI
Sbjct: 663  EKYTKSHLKRFRMDEAKPMATPMHRSTIIDKDEKGNHTS*KEYSGMIDSLSYLTSSRPDI 842

Query: 1188 TFITQQLSQFLSQPTQAHHTAALRVLRYLKGCPGRGLFFPRNSSINLQGFSDADWAGCLD 1247
             F+    ++F S P  +H TA  R+LRYL G     L+F + S  +L G+ D  +AG   
Sbjct: 843  VFVVCLCARFQSYPKISHVTAVKRILRYLVGTTNHCLWFKKRSEFDLLGYCDVYFAGDKV 1022

Query: 1248 TRRSIS 1253
             R+S S
Sbjct: 1023 ERKSTS 1040



 Score = 38.1 bits (87), Expect(2) = 1e-26
 Identities = 18/41 (43%), Positives = 26/41 (62%)
 Frame = +2

Query: 1023 KLQKSLYGLKQASRKWYEKLTSVLSHHHYIQASSDHSLFVK 1063
            K    +YGLKQA R WYE+L+S L  + + +  +D +LF K
Sbjct: 347  KTLSCVYGLKQALRAWYERLSSFLVSNGFTRGITDPALFRK 469


  Database: GMGI
    Posted date:  Oct 22, 2004  4:58 PM
  Number of letters in database: 37,918,896
  Number of sequences in database:  63,676
  
Lambda     K      H
   0.320    0.133    0.403 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 75,241,980
Number of Sequences: 63676
Number of extensions: 1392486
Number of successful extensions: 25098
Number of sequences better than 10.0: 761
Number of HSP's better than 10.0 without gapping: 13138
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 18505
length of query: 1391
length of database: 12,639,632
effective HSP length: 109
effective length of query: 1282
effective length of database: 5,698,948
effective search space: 7306051336
effective search space used: 7306051336
frameshift window, decay const: 50,  0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 65 (29.6 bits)


Medicago: description of AC149471.17
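
Note on the score columns: the bit scores and Expect values listed above can be roughly reproduced from the raw scores (the numbers in parentheses after "bits") and the statistics in the report footer. The sketch below is an illustration, not part of the BLAST output. It assumes the standard Karlin-Altschul relations, bits = (lambda*S - ln K) / ln 2 and E = K * m' * n' * exp(-lambda*S), and uses the gapped Lambda and K and the effective lengths exactly as printed in the footer (rounded to three digits there, so results are approximate). The function names are hypothetical. Hits reported with "Expect(2)" use sum statistics over two HSPs, so the single-HSP formula is not expected to match their E values.

```python
import math

# Gapped Karlin-Altschul parameters copied from the "Gapped" block of the report.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410

# Effective lengths copied from the report footer
# (effective query length = 1391 - 109 = 1282).
EFF_QUERY_LEN = 1282
EFF_DB_LEN = 5_698_948


def bit_score(raw_score: int) -> float:
    """Convert a raw alignment score S to bits: (lambda*S - ln K) / ln 2."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect_value(raw_score: int) -> float:
    """Approximate single-HSP E value: K * m' * n' * exp(-lambda*S)."""
    search_space = EFF_QUERY_LEN * EFF_DB_LEN
    return K_GAPPED * search_space * math.exp(-LAMBDA_GAPPED * raw_score)


if __name__ == "__main__":
    # Raw scores of the TC204438, TC231544 and BI317550 hits in this report.
    for raw in (1398, 545, 481):
        print(raw, f"{bit_score(raw):.2f} bits", f"E ~ {expect_value(raw):.1e}")
```

For scores of 100 bits or more, the integer values printed in this report appear to match the computed value with the fraction dropped rather than rounded. The product of the two effective lengths equals the "effective search space" line of the footer.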