GENSCAN 1.0 Date run: 3-Nov-116 Time: 23:58:51 Sequence gi568815584f:58538203_58747131 : 208929 bp : 41.12% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1883 1934 52 1 1 55 110 27 0.447 2.97 1.02 Term + 9579 9730 152 1 2 99 47 179 0.525 11.99 1.03 PlyA + 9858 9863 6 1.05 2.03 PlyA - 10190 10185 6 1.05 2.02 Term - 21381 21005 377 1 2 55 37 163 0.029 1.82 2.01 Init - 41993 41795 199 2 1 83 64 85 0.468 4.71 2.00 Prom - 42171 42132 40 -4.25 3.00 Prom + 49903 49942 40 -5.75 3.01 Init + 53095 53829 735 0 0 55 86 307 0.216 22.00 3.02 Intr + 57945 58049 105 2 0 83 79 29 0.211 0.99 3.03 Intr + 59659 59815 157 0 1 50 77 98 0.188 3.56 3.04 Intr + 67939 68094 156 2 0 63 45 100 0.029 2.26 3.05 Intr + 89745 89849 105 1 0 35 86 72 0.001 0.97 3.06 Intr + 95696 95983 288 0 0 39 86 159 0.117 7.09 3.07 Intr + 99903 100396 494 1 2 52 36 468 0.058 29.79 3.08 Intr + 102534 102666 133 2 1 89 109 142 0.999 15.70 3.09 Intr + 103390 103545 156 2 0 92 110 53 0.967 6.96 3.10 Intr + 107056 108926 1871 2 2 59 6 1357 0.006 110.97 3.11 Intr + 125488 125631 144 0 0 78 37 90 0.009 2.46 3.12 Term + 139754 139870 117 1 0 98 42 64 0.021 0.36 3.13 PlyA + 140028 140033 6 1.05 4.09 PlyA - 140850 140845 6 1.05 4.08 Term - 141949 141843 107 2 2 75 49 108 0.402 3.19 4.07 Intr - 142940 142762 179 1 2 68 45 52 0.092 -2.46 4.06 Intr - 149106 149007 100 1 1 124 40 112 0.148 8.25 4.05 Intr - 149784 149610 175 0 1 37 46 82 0.365 -2.51 4.04 Intr - 153853 153744 110 2 2 134 105 113 0.989 16.68 4.03 Intr - 165703 165478 226 1 1 73 43 118 0.094 2.44 4.02 Intr - 183944 183844 101 0 2 66 38 125 0.073 4.21 4.01 Init - 185938 185737 202 1 1 85 34 117 0.667 5.09 4.00 Prom - 186865 186826 40 -6.95 5.00 Prom + 191185 191224 40 -5.25 5.01 Init + 194898 194979 82 2 1 65 85 61 0.390 4.68 5.02 Term + 199597 199820 224 2 2 59 47 205 0.961 9.60 5.03 PlyA + 199887 199892 6 1.05 6.03 PlyA - 200699 200694 6 1.05 6.02 Term - 204573 204418 156 0 0 81 32 108 0.404 1.45 6.01 Init - 207713 207708 6 2 0 93 100 10 0.578 3.06 6.00 Prom - 208239 208200 40 -3.35 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 80589 80395 195 1 0 88 62 156 0.826 11.46 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:58538203_58747131|GENSCAN_predicted_peptide_1|67_aa MDRTQIELNPYLTCVFSGGKAVPLSASQMPPAKMSVMLPSVNLEDCSQSLSLSTMQEDME SSGADTF >gi568815584f:58538203_58747131|GENSCAN_predicted_CDS_1|204_bp atggatcggacacaaattgagcttaatccgtacctcacatgtgtattttcaggtgggaaa gcagtgccactctccgcttcacagatgccccctgccaagatgtcagtgatgctgccgtca gtgaacctcgaggactgctctcagtctctgagtctcagcacaatgcaggaggacatggag tcttcgggggcagataccttctga >gi568815584f:58538203_58747131|GENSCAN_predicted_peptide_2|191_aa MDEAGNHHSQQTNTRTENQTPHVLTHKWELNNENTWTQGREHHTMGPVSGWGSMGDMALR EIPNVDAARNGEIPYARLTLLPVTIISSGQNIKTSCLKALENDQKQAETENTYTLKGGTQ LVRSTFIQGTSHCLCAWMTHSCSSDSVAGLGGQNKYPELPERLKIEQKNRIKKRATDGGI PNICIQTLSNS >gi568815584f:58538203_58747131|GENSCAN_predicted_CDS_2|576_bp atggatgaagctggaaatcatcattctcagcaaactaacacaagaacagaaaaccaaaca cctcatgttctcactcataagtgggagttgaacaatgagaacacttggacacagggaagg gaacatcacacaatggggcctgtcagtgggtgggggtctatgggagatatggcattaaga gaaatacctaatgtagatgctgccaggaatggtgaaattccttatgccagactaaccctc ctgccagtaacaattataagctctggacaaaatataaaaaccagttgtttgaaagcactg gagaatgaccaaaagcaagcagaaactgagaatacttacactttgaaaggaggaacacag ttggtgagatccacatttatccagggcacttcccattgcctgtgtgcatggatgacacac tcatgcagcagtgacagtgttgctgggcttggaggtcagaacaagtatccagagctccca gaaaggctaaaaattgagcagaaaaatcgcataaagaagagagctacagatggggggatc cccaatatctgcatacagactctctcaaattcttag >gi568815584f:58538203_58747131|GENSCAN_predicted_peptide_3|1486_aa MCKSINVIHHINRTKDKNHMIISIDAEKAFDKIQQRFMLKTLNKLGIDGTYLEIIRAIYD KPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNILLEVLARAIRQEKEIKGIQIGKEEV KLSLFADDMIVYLENPIISAQNILKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMS ELPFTVASKRIKYLGIQLTRDVKDLFKENYKPLFNEIEEDTNKWKNIPCSWVGRINIVKM AILPKETEEGERTERVCARSCLPFKWLSYKSCNPRSSVFWMQADLAAWWLFVSNGFSGDE FSGFTSLYREGTHQVRKAQSPGLLPFASSKAGDRPSAVSQLQIWADTIQPSCSVSRRRIS FKINEDSQVQWLTSVITVLWDAEAGARCSVMHTGTPPRGAAEPSFPPSVLTEAAAGSWQG ICTHGTLHLPSVKMTGRNGRAKGQKSEAGGDLITLISDYDGGECNSLPDILESLESTVLC RRKQEPLEKLTTKLHGITLAFFNKNLFNSTKTQLKGRHEPTRGRSPSALLLRLGGPAAVT ALAARLGAMKPSPAGTAKELEPPAPARGEQRTAEPEGRWREKGEADTERQRTRERQEATL AGLAELEYLRQRQELLVRGALRGAGGAGAAAPRAGELLGEAAQRSRLEEKFLEENILLLR KQLVGRARRWSTAVPAQGLENCLRRRDAGLLNQLQELDKQISDLRLDVEKTSEEHLETDS RPSSGFYELSDGASGSLSNSSNSVFSECLSSCHSSTCFCSPLEATLSLSDGCPKSADLIG LLEYKEGHCEDQASGAVCRSLSTPQFNSLDVIADVNPKYQCDLVSKNGNDVYRYPSPLHA VAVQSPMFLLCLTGNPLREEDRLGNHASDICGGSELDAVKTDSSLPSPSSLWSASHPSSS KKMDGYILSLVQKKTHPVRTNKPRTSVNADPTKGLLRNGSVCVRAPGGVSQGNSVNLKNS KQACLPSGGIPSLNNGTFSPPKQWSKESKAEQAESKRVPLPEGCPSGAASDLQSKHLPKT AKPASQEHARCSAIGTGESPKESAQLSGASPKESPSRGPAPPQENKVVQPLKKMSQKNSL QGVPPATPPLLSTAFPVEERPALDFKSEGSSQSLEEAHLVKAQFIPGQQPSVRLHRGHRN MGVVKNSSLKHRGPALQGLENGLPTVREKTRAGSKKCRFPDDLDTNKKLKKASSKGRKSG GGPEAGVPGRPAGGGHRAGSRAHGHGREAVVAKPKHKRTDYRRWKSSAEISYEEALRRAR RGRRENVGLYPAPVPLPYASPYAYVASDSEYSAECESLFHSTVVDTSEDEQSNYTTNCFG DSESSVSEGEFVGESTTTSDSEESGGLIWSQFVQTLPIQTVTAPDLHNHPAKTFVKIKAS HNLKKKILRFRSGSLKLMTTEAQVRASLKADGSGETTSGQQLVFCKQPSSARCLPVLQLV LETKSSVKALSVAYFGRPSSVLKTTDAIHAVQHQGRQQSSYGTHQS >gi568815584f:58538203_58747131|GENSCAN_predicted_CDS_3|4461_bp atgtgcaaatcaataaacgtaatccatcacataaacagaaccaaagacaaaaaccacatg attatctcaatagatgcagaaaaggccttcgacaaaattcagcagcgcttcatgctaaaa actctcaataaattaggtattgatgggacatatctcgaaataataagagctatttatgac aaacccacagccaatatcatactgaatggccaaaaactggaagcattccctttgaaaacc ggcacaagacaaggatgccctctctcaccactcctattcaacatactattggaagttctg gccagggcaatcaggcaggagaaagaaataaagggtattcaaataggaaaagaggaagtc aaattgtccctgtttgcagatgacatgattgtatatctagaaaaccccatcatctcagcc caaaatatccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgta caaaagtcacaagcattcttatacaccaataacagacaaacagagagccaaatcatgagt gaactcccattcacagttgcttcaaaaagaataaaatacctaggaatccaacttacaagg gatgtgaaggacctcttcaaggagaactacaaaccactgttcaacgaaatagaagaggac acaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatattgtgaaaatg gccatactgcccaaggaaactgaagaaggagaaaggacagaaagagtgtgtgctagatcc tgtcttccgttcaagtggctttcttacaaatcttgtaacccaaggtcctcagttttttgg atgcaagcagatctggctgcctggtggctgtttgtttctaatggattcagtggagatgag ttcagtgggttcaccagcttatacagagaagggactcaccaggtcagaaaagcccagagc cctgggttgttgccttttgcatcttctaaggctggagatagaccctctgccgtgtctcag ctacagatatgggccgacaccattcagccctcatgctcagtttcaagaaggcgaatctca tttaagattaatgaagatagccaggtgcagtggctcacatctgtaatcacagtactttgg gacgccgaagcaggtgcacggtgcagcgtgatgcatacagggacacccccccgtggtgca gcagaaccaagttttcctccctctgttctgaccgaagcagctgctggcagttggcaaggt atttgcacacatggaactctgcaccttccatctgtaaagatgaccggtagaaatggcaga gctaagggacaaaagtctgaggctggtggagacttaatcaccttaatcagtgattacgac ggtggggaatgcaactcacttccagacattctagaatctttggaatccacagtgctctgc aggaggaagcaggagcctcttgaaaaacttaccaccaaacttcatggaatcaccctggcc ttcttcaacaaaaacctctttaacagcactaaaacacagctaaaagggcggcatgagccc acccgcggccgcagccctagcgccctgctcctccgcctgggcggcccggctgcggtgacg gctctcgctgcccgactgggggccatgaagccgagtccggccgggacggcgaaggagctg gagcctccggcgccggcccgaggcgagcagcgcacggcggagcccgaggggcgctggcgg gagaagggcgaggcagacaccgagcggcagcgcacccgggagcggcaggaggccacgctg gccgggctggcggagctggagtacctgcgccagcgccaagagctgctggtcaggggcgcc ctgcgcggcgccgggggtgcgggagccgctgcgccccgcgctggggagctactgggggag gcggcgcagcgcagtcgcctggaggagaagttcttggaggagaacatcttgctgctaaga aagcaattggtaggtcgtgcccgaaggtggagcacggctgttcctgcccaggggctggag aactgtttgaggcgaagagatgctggtttgttgaatcagttgcaagagcttgacaagcag ataagtgacctgagactggatgtagaaaagacatctgaagagcacctggagacagacagt cggcctagctcagggttttatgagctgagtgatggggcttcaggatccctttccaattcc tctaactcggtgttcagtgagtgtttatccagttgtcattccagcacctgcttttgcagc cccttggaggcgaccttgagtctctcagatggttgccccaaatctgcagatctcatagga ttgttggaatataaagaaggccactgtgaagaccaggcctcaggggcagtttgccgttcc ctctccacaccacaatttaattcccttgatgtcattgcagatgtgaatcccaagtaccag tgtgatctggtgtctaaaaacgggaatgatgtatatcgctatcccagtccacttcatgct gtggctgtgcagagcccaatgtttctcctttgtctgacgggcaaccctctgagggaagag gacaggcttggaaaccatgccagtgacatttgcggtggatctgagctagatgccgtcaaa acagacagttccttaccgtccccaagcagtctgtggtctgcttcccatccttcatccagc aagaaaatggatggctacattctgagcctggtccagaaaaaaacacaccctgtaaggacc aacaaaccaagaaccagcgtgaacgctgaccccacgaaagggcttctgaggaacgggagc gtttgtgtcagagccccgggcggtgtctcacagggcaacagtgtgaaccttaagaattcg aaacaggcgtgtctgccctctggcgggataccttctctgaacaatgggacattctcccca ccgaagcagtggtcgaaagaatcaaaggccgaacaagccgaaagcaagagggtgcccctg ccagagggctgcccctcaggcgctgcctccgaccttcagagtaagcacctgccaaaaacg gccaagccagcctcgcaagaacatgctcggtgttccgccattgggacaggggagtcccct aaggaaagcgctcagctctcaggggcctctccaaaagagagtcctagcagaggccctgcc ccgccgcaggagaacaaagttgtacagcccctgaaaaagatgtcacagaaaaacagcctg cagggcgtccccccggccactcctcccctgctgtctacagctttccccgtggaagagagg cctgccttggatttcaagagcgagggctcttcccaaagcctggaggaagcgcacctggtc aaggcccagtttatcccggggcagcagcccagtgtcaggctccaccggggccacaggaac atgggcgtcgtgaagaactccagcctgaagcaccgcggcccagccctccaggggctggag aacggcttgcccaccgtcagggagaaaacgcgggccgggagcaagaagtgtcgcttccca gatgacttggatacaaataagaaactcaagaaagcctcctccaaggggaggaagagtggg ggcgggcccgaggctggtgttcccggcaggcccgcgggcgggggccacagggcggggagc agggcgcatggccacggacgggaggcggtggtggccaaacctaagcacaagcgaactgac taccggcggtggaagtcctcggccgagatttcctacgaagaggccctgaggagggcccgg cgcggtcgccgggagaatgtggggctgtaccccgcgcctgtgcctctgccctacgccagc ccctacgcctacgtggctagcgactccgagtactcggccgagtgcgagtccctgttccac tccaccgtggtggacaccagtgaggacgagcagagcaattacaccaccaactgcttcggg gacagcgagtcgagtgtgagcgagggcgagttcgtgggggagagcacaaccaccagcgac tctgaagaaagcgggggcttaatttggtcccagtttgtccagactctgcccattcaaacg gtaacggccccagaccttcacaaccaccccgcaaaaacctttgtcaaaattaaggcctca cataacctcaagaagaagatcctccgctttcggtctggctctttgaaactgatgacgacg gaagcccaggtgagggcctctttaaaagcagatggctctggagaaaccacctcaggccag cagctggtgttctgtaaacagccctcatctgccagatgcttgcctgttctccagttggtg ttagagactaagagctctgtcaaagccctttcagtggcatactttggcagaccttcaagc gtcctgaagaccactgatgccattcatgcagtccagcaccagggcagacagcagagttca tatggcacacaccagagttaa >gi568815584f:58538203_58747131|GENSCAN_predicted_peptide_4|399_aa MDGYGGIKKNESMSFATTWVQLKAIILSELTQKQKTKYYMFSLISGSSTLSTRGHKEGNK RHWGLFEDMSTNDVPGYVLRNGDKGEKADSGTNSLMGKDKQIMEEKQLSEESGFELGFEL PEQYKSQRLSSQSPSFPLRIQAAGEREKKHKTEVSSCYTHNLTANCQGDIDLNHIKGTYS HEGVASKAADMIPFQGLGLLVQIPTAKPIKARELPVLRNGQAHSKIHIEMQVTQKSQNSF EKEEQSQRIYTPHFQNLLQSYSNQDSVIVVRAPGPIDHPKAEECRPTAWDWQASPPGATG VGSTSFLFDLTISWKRALGLKIYKPLFVLILILLRCLAPNHKVEHYFNLGISTTPAYRGA LKPLATSTRNQALPDESDRQDSYDRSHRCGKIEIQTVVN >gi568815584f:58538203_58747131|GENSCAN_predicted_CDS_4|1200_bp atggatggctatggaggcataaaaaagaatgaaagcatgtcctttgcaacaacatgggtg cagctgaaggccattatcctaagtgaactaacacagaaacagaaaaccaaatattacatg ttctcacttataagtgggagctcaacactgagtacacgtggacacaaagaaggcaacaaa agacactggggcctgtttgaggatatgagcaccaatgatgtgccaggctatgtgctacgc aatggagataaaggtgaaaaagcagactcagggaccaacagtctcatgggaaaagataag caaataatggaggagaagcaattatctgaagagtcaggatttgagctgggatttgaactt cctgagcaatacaagagtcaaaggttgagctcccaatcccccagcttcccactcagaatt caggctgctggtgaaagagagaaaaagcataaaactgaggtctcgagttgctatactcat aacctcacagcaaactgccaaggggacattgatctgaatcacattaaaggtacttactct catgaaggtgttgcctccaaggcagctgacatgattcccttccagggtctaggccttctg gtgcagattcccacagccaaacccataaaggccagagagctgcctgttttgagaaatggg caagctcattctaaaattcatatagaaatgcaagtaacccagaagagccaaaatagtttt gaaaaagaagaacaaagccagaggatttacactccccatttccaaaacttactacaaagc tacagtaatcaagatagtgtgatagtggtgagagcacccggtcccatcgaccacccaaag gctgaggagtgcaggcccacagcatgggactggcaggcatctccacctggggccacgggt gtgggatccactagctttctctttgatctaacgatctcatggaaaagggcattagggctg aaaatttataagcctctctttgtccttattctgatcctgcttcgctgcttagccccaaat cataaagtagagcactatttcaatttgggtatcagcaccacacctgcttatcgtggagcg cttaagccacttgctacatccaccaggaaccaagccctcccagatgagtctgacaggcag gactcctatgaccgttctcacagatgtggaaaaatagaaattcagactgtagtcaactag >gi568815584f:58538203_58747131|GENSCAN_predicted_peptide_5|101_aa MGNLERTEIYILTILEAGTSKIKVPASAQLLSLFESVLLLLMRKKRVFVGQPQAGSLGGI SEEGIVIIGDDSSVHVIAPEDLPVGQDVEVEDSDIDDLAPV >gi568815584f:58538203_58747131|GENSCAN_predicted_CDS_5|306_bp atgggtaacttagaaagaacagaaatttatattcttactattctggaagctggaacatcc aagatcaaggtgccagcatctgcacagttgttgtcattatttgagagtgtactccttcta cttatgcggaaaaaaagggtttttgtaggacagcctcaggcaggttctttaggtggtatt tcagaagaaggcattgttatcataggagatgacagctctgtgcatgttattgcccctgaa gaccttccagtgggacaagatgtggaggtggaagacagtgatattgatgatcttgcccct gtgtag >gi568815584f:58538203_58747131|GENSCAN_predicted_peptide_6|53_aa MKVHRLKDHFCGNTGKNKKCNRNKPLLFPHALQSSSSTCIIEPNAEPAGKGEM >gi568815584f:58538203_58747131|GENSCAN_predicted_CDS_6|162_bp atgaaggtgcatagattaaaggaccatttttgtggcaacactggcaagaataaaaaatgc aacaggaacaagcccctcctttttcctcatgccttacagtcttcctccagtacctgtatc atagagcctaatgcagagccagctggtaaaggagaaatgtaa