GENSCAN 1.0 Date run: 7-Nov-116 Time: 23:55:51 Sequence gi568815592f:85350140_85571466 : 221327 bp : 39.34% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 9934 10049 116 2 2 106 80 71 0.197 7.35 1.02 Intr + 11624 11750 127 1 1 9 66 109 0.086 0.13 1.03 Term + 16797 18307 1511 1 2 57 43 582 0.040 40.28 1.04 PlyA + 18314 18319 6 1.05 2.00 Prom + 19226 19265 40 -7.45 2.01 Sngl + 28088 28477 390 1 0 88 54 317 0.968 24.27 2.02 PlyA + 28501 28506 6 1.05 3.00 Prom + 30282 30321 40 -1.45 3.01 Init + 48486 48646 161 2 2 86 76 79 0.526 5.84 3.02 Intr + 49747 49823 77 1 2 86 66 52 0.097 1.04 3.03 Intr + 54497 54583 87 0 0 58 78 54 0.027 0.42 3.04 Intr + 54906 55002 97 1 1 77 38 102 0.030 2.25 3.05 Intr + 55379 55698 320 2 2 18 91 167 0.021 4.68 3.06 Intr + 56870 56982 113 0 2 59 47 121 0.278 4.28 3.07 Intr + 66953 67096 144 1 0 34 77 75 0.014 0.56 3.08 Intr + 76877 76973 97 1 1 55 64 85 0.050 1.46 3.09 Term + 77568 77830 263 1 2 84 45 187 0.197 8.70 3.10 PlyA + 77992 77997 6 1.05 4.04 PlyA - 78707 78702 6 1.05 4.03 Term - 86456 86376 81 2 0 47 48 66 0.512 -4.79 4.02 Intr - 87187 86997 191 2 2 74 78 99 0.563 5.88 4.01 Init - 87746 87695 52 2 1 64 82 34 0.775 1.77 4.00 Prom - 96957 96918 40 -2.15 5.00 Prom + 98172 98211 40 -7.05 5.01 Init + 100001 100339 339 1 0 41 95 704 0.982 61.70 5.02 Intr + 111777 111827 51 2 0 97 65 53 0.019 2.09 5.03 Intr + 116921 117143 223 1 1 100 68 123 0.988 8.28 5.04 Intr + 121098 121286 189 1 0 92 91 141 0.967 13.44 5.05 Term + 126395 126783 389 0 2 23 34 260 0.570 8.22 5.06 PlyA + 129037 129042 6 1.05 6.00 Prom + 131190 131229 40 -1.25 6.01 Init + 134322 134424 103 2 1 53 63 61 0.217 0.55 6.02 Intr + 135096 135293 198 1 0 91 115 210 0.995 22.40 6.03 Intr + 137196 137350 155 1 2 106 115 53 0.997 8.67 6.04 Intr + 139355 139460 106 1 1 66 68 125 0.999 7.17 6.05 Intr + 140369 140518 150 0 0 46 115 225 0.999 20.21 6.06 Intr + 141838 142038 201 2 0 82 107 155 0.984 15.14 6.07 Term + 143702 143865 164 0 2 79 42 27 0.227 -5.68 6.08 PlyA + 144993 144998 6 1.05 7.14 PlyA - 145483 145478 6 1.05 7.13 Term - 147399 146994 406 2 1 37 54 264 0.889 11.57 7.12 Intr - 152974 152923 52 2 1 116 95 20 0.076 2.55 7.11 Intr - 157920 157774 147 1 0 70 58 190 0.496 13.49 7.10 Intr - 163756 163661 96 2 0 86 44 56 0.494 0.06 7.09 Intr - 164490 164367 124 0 1 52 82 85 0.975 3.54 7.08 Intr - 167736 167617 120 0 0 64 79 79 0.870 4.37 7.07 Intr - 167909 167869 41 0 2 85 105 0 0.802 -1.58 7.06 Intr - 176098 175987 112 1 1 64 111 75 0.743 6.43 7.05 Intr - 178223 178123 101 0 2 52 75 40 0.659 -1.99 7.04 Intr - 183661 183460 202 1 1 70 91 169 0.912 13.34 7.03 Intr - 186785 186653 133 1 1 87 78 108 0.456 9.43 7.02 Intr - 199740 199584 157 1 1 59 75 97 0.381 3.65 7.01 Init - 207876 207837 40 0 1 87 85 18 0.393 1.80 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 147966 147734 233 0 2 59 58 141 0.925 5.98 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:85350140_85571466|GENSCAN_predicted_peptide_1|584_aa XNCLITKANQRTVPPFHSSTTCTSEILLPLEIILHCHTKRKQKSQLPMWATVVNSFNSEG SSKAGSCPGLTTQEACQRTRDDSSYYFEIRPIFNTPLSTLDRSTRQKVNKDTQELNSALH QADLIDIYRTLHPKSTEYIFFSAPHHTYSKIDNIVGSKALLSKCKRTEIITNCLSDHSAI KLELRLKKLTQNHSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNENKDITYQNLWDTFKA VCRGKFIALNAHKRKQERSKIEILTSQLKELEKQEQTHSKASRRQEITKIRAELKEIDTQ KTLQKINESRSWVFERINKIDRPVARLTKKKREKNQIDTIKNDKGDITTDPTEIQTTIRE YHKHLYANKLENLEEMDKFLDTYTIPRLNQEEIESLNRPLTGSEIVAIINSLPTKKSPGP DGFTAEFYQRYKEELVPFLLKLFQSIEQEGILPNSFYENSIILIPKPGRDTTKKENFRPI FLMNIDAKILKKIMANRIQQQIKKLIHHDQLGFIPGMQGWFNICKSINVIQHINRTKDKN HMIISIDAEKAFDKIQQRFMLKTVNKLGIDGMKTSFLVRKTGTL >gi568815592f:85350140_85571466|GENSCAN_predicted_CDS_1|1755_bp nggaactgcctcatcaccaaagcaaatcagaggactgtccctcctttccacagtagcacc acctgcacttctgagatcttgcttccccttgaaattatccttcactgtcatacgaaacgc aagcagaagtcgcaactgcccatgtgggccacagtagtgaacagctttaacagtgaaggc agtagtaaagctggcagctgcccagggttgaccactcaggaagcctgccagaggaccaga gatgatagctcttattattttgagatacgtcccatctttaacaccccactgtcaacatta gacagatcaacgagacagaaagttaacaaggatacccaggaattgaactcagctctgcac caagcggacctaatagacatctacagaactctccaccccaaatcaacagaatatatattt ttttcagcaccgcaccacacctattccaaaattgacaacatagttggaagtaaagctctc ctcagcaaatgtaaaagaacagaaattataacaaactgtctctcagaccacagtgcaatc aaactagaactcaggcttaagaaactcactcaaaaccactcaactacatggaaactcaac aacctgctcctgaatgactactgggtacataatgaaatgaaggcagaaataaagatgttc tttgaaaccaatgagaacaaagacataacataccagaatctctgggacacattcaaagca gtgtgtagagggaaatttatagcactaaatgcccacaagagaaagcaggaaagatccaaa attgagatcctaacatcacaattaaaagaactagaaaagcaagagcaaacacattcaaaa gctagcagaaggcaagaaataactaaaatcagagcagaactgaaggaaatagacacacaa aaaacccttcaaaaaattaatgaatccaggagctgggtttttgaaaggatcaacaaaatt gatagaccagtagcaagactaacaaagaagaaaagagagaagaatcaaatagacacaata aaaaatgataaaggggatatcaccactgatcccacagaaatacaaactaccatcagagaa taccacaaacacctctacgcaaataaactagaaaatctagaagaaatggataaattcctg gacacatacaccatcccaagactaaaccaggaagaaattgaatctctgaatagaccatta acaggctctgaaattgtggcaataatcaatagcttaccaaccaaaaagagtccaggacca gatggattcacagccgaattctaccagaggtacaaggaggaactggtaccattccttctg aaattattccaatcaatagaacaagagggaatcctccctaactcattttatgagaacagc atcatcctgataccaaagccaggcagagacacaaccaaaaaagagaattttagaccaata ttcttgatgaacattgatgcaaaaatcctcaagaaaataatggcaaaccgaatccagcag cagatcaaaaagcttatccaccatgatcaattgggcttcatccctgggatgcaaggctgg ttcaatatatgcaaatcaataaatgtaatccagcatataaacagaaccaaagacaaaaac cacatgattatctccatagatgcagaaaaggcctttgacaaaattcagcaacgcttcatg ctaaaaaccgtcaataaattaggtattgatgggatgaaaacatcttttcttgtcaggaag acaggcaccttatag >gi568815592f:85350140_85571466|GENSCAN_predicted_peptide_2|129_aa MGKKQSRKAENSKNQSTSPPPKERSSSPATQQSWTENDFDKLKEEDFRRANFSELKEEVR THCKEAKIHEKRLDEWLTRIISVEKFLNDLMELKTMAQELHDKCTSFSSQFDQLQGRVSV IEDQINEMK >gi568815592f:85350140_85571466|GENSCAN_predicted_CDS_2|390_bp atggggaaaaaacagagcagaaaagctgaaaattctaaaaatcagagcacctctccccct ccaaaggaacgcagctcctcaccagcaacgcagcaaagctggacggagaatgactttgac aagttgaaagaagaagacttcagaagagcaaacttctccgagctaaaggaggaagttcga acccattgcaaagaagctaaaatacatgaaaaaagattagacgaatggctaactagaata atcagtgtagagaagttcttaaatgatctgatggagctgaaaactatggcacaagaatta catgataaatgcacaagcttcagtagccaattcgatcaactacaaggaagggtatcagtg attgaagatcaaattaatgaaatgaagtga >gi568815592f:85350140_85571466|GENSCAN_predicted_peptide_3|452_aa MNRAASCYPKQTNAGTENQIPQVFTYKWDLNIENMDTKKSTTDTEAYLRVEGEGERNKEP WTGKRGIYQCTLELSAVAAARLHPSEINSLVAHTKPVWWYLHTDAHDRAQRDLTANLAEG LSETRTQELICASELMEGNHRMDNCPNSNQTMSPLWANPDSDLGKSGGWGSPALCRQPPK SLLLSGLSPPELPTPTQLYPLVLFSKKLLHRETTSPFSPFVGNMKMHLTAPLTQGASERQ GAPCSEPDTHIIIIIIIIIIIFGSHLENHIFHLPLLRTLPPHEFSSNPKYLGRSSGHSIK ANPMTHSTIDRKAGSEMQSYQETVTGIHGVLCKTLRLEPWHAQRAGQGTAACHPPPQASR FAGGNGFFCEACFLQVMVHDSKDTSDYCAFGTFAFSAFSGGISTNEQVINVTLCPTYQFP LHLQAQPIRSLPDFVNVGEHLIMADEVPVDKS >gi568815592f:85350140_85571466|GENSCAN_predicted_CDS_3|1359_bp atgaatagagctgcaagctgttatcctaagcaaactaatgcaggaacagaaaaccaaata ccacaggttttcacttataaatgggatctaaacattgagaacatggacacaaagaagagc acaacagacactgaggcctacctgagggtggagggtgaaggggagagaaacaaagagcca tggacaggtaagaggggaatctaccagtgtaccctagagctctctgctgtagctgcagcc cgcctgcacccaagtgaaataaacagtcttgttgctcacacaaagcctgtttggtggtat cttcacacggacgcgcatgacagagcccagagagatcttactgcaaacctagctgaaggg ctctcagaaaccagaacacaggagctcatttgtgcttccgagttgatggaagggaatcac aggatggacaactgccccaactctaaccagacaatgtctcctctgtgggccaaccctgat tctgacttggggaaatctgggggctggggaagccctgctctgtgcagacaacctccaaaa tccctcctgctttctggcctttctccacctgaactgcccactccaacacaactttatcca cttgttctcttcagcaagaaactccttcacagagaaaccaccagcccattcagccccttt gtggggaatatgaaaatgcacctcacagcaccactgacccagggtgcctctgagaggcag ggagcaccctgtagcgagccagacacacacatcatcatcatcatcatcatcatcatcatc atctttggttcccaccttgaaaaccacatctttcatcttccactactcaggactcttcct cctcatgagttttcttctaaccccaaatacctaggaagatcctcaggccacagcatcaaa gccaaccccatgacccacagcaccatagatagaaaagctggctcagaaatgcaatcttac caggaaacagtgactggaatccatggagtgctttgcaagaccctcaggttggagccatgg catgctcagagagctgggcaaggcacagctgcatgtcaccctcctccacaggccagccga tttgctgggggaaatggcttcttttgtgaagcttgtttcctgcaggtgatggttcatgac agtaaggataccagtgattactgtgcttttggtaccttcgctttcagtgccttcagcgga ggcatttccaccaatgagcaagtcatcaatgttaccctctgtcctacttaccaatttccc ctccacctccaggcacagcccatccgcagtctcccggattttgtaaatgtgggagaacat ctcatcatggctgatgaggtcccagtagataaatcatga >gi568815592f:85350140_85571466|GENSCAN_predicted_peptide_4|107_aa MGKISNRYLTKEDIHIANCDLECLTVWECSPADLTLILHSPYSRWSCSGSNASDNTTTHL LEWSKSRTLTTPSAGEGVEQQEMRPDPDPKRGFLDFTQERIQGESVK >gi568815592f:85350140_85571466|GENSCAN_predicted_CDS_4|324_bp atgggcaaaatatctaacagataccttactaaagaagacatccacattgcaaactgtgac ttagaatgcctaactgtctgggaatgcagcccagcggatcttacccttattttacacagc ccctattcacgatggagttgctctggttcaaatgcctctgacaataccactacacacctt ttagaatggtcaaaatccagaacactgacaacaccaagtgctggtgagggtgtagagcaa caggaaatgaggcctgacccagatcccaaaagagggttcttggatttcacacaagaaaga atccaaggagagtctgtaaagtga >gi568815592f:85350140_85571466|GENSCAN_predicted_peptide_5|396_aa MCPRAARAPATLLLALGAVLWPAAGAWELTILHTNDVHSRLEQTSEDSSKCVNASRCMGG VARLFTKVQQIRRAEPNVLLLDAGDQYQGTIWFTVYKGAEVAHFMNALRYDAMANHQQRR ALSSSDAVSEALGNHEFDNGVEGLIEPLLKEAKFPILSANIKAKGPLASQISGLYLPYKV LPVGDEVVGIVGYTSKETPFLSNPGTNLVFEDEITALQPEVDKLKTLNVNKIIALGHSGF EMDKLIAQKVRGVDVVVGGHSNTFLYTALLSMDSLSVLTAQCDSPLYPELLFSNQEGSAC TNKLKMVNVENFIADENGSQWDGELERGWSGKVSPPGVRPSPAELFSEVLPSSHPSEVKL LFTSVKLLLFSPSLPFHCQWILGFLWGQDGGQGGPG >gi568815592f:85350140_85571466|GENSCAN_predicted_CDS_5|1191_bp atgtgtccccgagccgcgcgggcgcccgcgacgctactcctcgccctgggcgcggtgctg tggcctgcggctggcgcctgggagcttacgattttgcacaccaacgacgtgcacagccgg ctggagcagaccagcgaggactccagcaagtgcgtcaacgccagccgctgcatgggtggc gtggctcggctcttcaccaaggttcagcagatccgccgcgccgaacccaacgtgctgctg ctggacgccggcgaccagtaccagggcactatctggttcaccgtgtacaagggcgccgag gtggcgcacttcatgaacgccctgcgctacgatgccatggccaaccaccagcagagaagg gcgcttagctccagtgacgctgtgagtgaggcactgggaaatcatgaatttgataatggt gtggaaggactgatcgagccactcctcaaagaggccaaatttccaattctgagtgcaaac attaaagcaaaggggccactagcatctcaaatatcaggactttatttgccatataaagtt cttcctgttggtgatgaagttgtgggaatcgttggatacacttccaaagaaacccctttt ctctcaaatccagggacaaatttagtgtttgaagatgaaatcactgcattacaacctgaa gtagataagttaaaaactctaaatgtgaacaaaattattgcactgggacattcgggtttt gaaatggataaactcatcgctcagaaagtgaggggtgtggacgtcgtggtgggaggacac tccaacacatttctttacacagctttgctgtccatggacagcttaagtgttttaacagct cagtgtgacagccctctgtatcctgaactcttgttcagcaaccaggaaggatcagcttgc acgaacaaattgaagatggtgaatgtggagaactttattgctgatgaaaatggctctcag tgggatggggagctggaaaggggatggagtgggaaggtgtctccccctggagttaggcca tccccagctgaactcttctccgaggtcctaccgtcaagccatccctctgaagtcaagctg cttttcaccagtgtcaagcttcttctcttctctccttctctgccattccactgccagtgg atcctggggtttttatggggacaggatggggggcagggcgggccagggtga >gi568815592f:85350140_85571466|GENSCAN_predicted_peptide_6|358_aa MSSSHISPLRQRPAEPDQCEGWLGSGVCLLVGPQGNPPSKEVPAGKYPFIVTSDDGRKVP VVQAYAFGKYLGYLKIEFDERGNVISSHGNPILLNSSIPEDPSIKADINKWRIKLDNYST QELGKTIVYLDGSSQSCRFRECNMGNLICDAMINNNLRHTDEMFWNHVSMCILNGGGIRS PIDERNNGTITWENLAAVLPFGGTFDLVQLKGSTLKKAFEHSVHRYGQSTGEFLQVGGIH VVYDLSRKPGDRVVKLDVLCTKCRVPSYDPLKMDEVYKVILPNFLANGGDGFQMIKDELL RHDSGDQDINVVSTYISKMKVIYPAVEGRIKFSTGSHCHGSFSLIFLSLWAVIFVLYQ >gi568815592f:85350140_85571466|GENSCAN_predicted_CDS_6|1077_bp atgagcagctcccacatctcccctttgagacagaggcctgcagaacctgaccagtgtgag ggctggctggggtcaggagtgtgcctgctggttggccctcaaggcaatccaccttccaaa gaggtgcctgctgggaagtacccattcatagtcacttctgatgatgggcggaaggttcct gtagtccaggcctatgcttttggcaaatacctaggctatctgaagatcgagtttgatgaa agaggaaacgtcatctcttcccatggaaatcccattcttctaaacagcagcattcctgaa gatccaagcataaaagcagacattaacaaatggaggataaaattggataattattctacc caggaattagggaaaacaattgtctatctggatggctcctctcaatcatgccgctttaga gaatgcaacatgggcaacctgatttgtgatgcaatgattaacaacaacctgagacacacg gatgaaatgttctggaaccacgtatccatgtgcattttaaatggaggtggtatccggtcg cccattgatgaacgcaacaatggcacaattacctgggagaacctggctgctgtattgccc tttggaggcacatttgacctagtccagttaaaaggttccaccctgaagaaggcctttgag catagcgtgcaccgctacggccagtccactggagagttcctgcaggtgggcggaatccat gtggtgtatgatctttcccgaaaacctggagacagagtagtcaaattagatgttctttgc accaagtgtcgagtgcccagttatgaccctctcaaaatggacgaggtatataaggtgatc ctcccaaacttcctggccaatggtggagatgggttccagatgataaaagatgaattatta agacatgactctggtgaccaagatatcaacgtggtttctacatatatctccaaaatgaaa gtaatttatccagcagttgaaggtcggatcaagttttccacaggaagtcactgccatgga agcttttctttaatatttctttcactttgggcagtgatctttgttttataccaatag >gi568815592f:85350140_85571466|GENSCAN_predicted_peptide_7|576_aa MKHIEVIVKARQKVKNTEFLQQAALEEYGPELHVALRSRRDELHYLRKLTELLFPYILPP KATDCRNTQKRGESFGISRIGSKIKGVFKSTTMEGAMLPNYGVAEGEDDFIEEGIVVMED DSPVEAVSTPNTPRNLAAWKISIPYVDFFEDPSSERKEKKERIPVFCIDVERNDRRAGAF PDAQLPSKRIIGPKNYEFLKSKREEFQEYLQKLLQHPELSNSQLLADFLSPNGGETQFLD KILPDVNLGKIIKSVPGKLMKEKGQHLEPFIMNFINSCESPKPKPSRPELTILSPTSENN KKLFNDLFKNNANRAENTERKQNQNYFMEVMTVEGVYDYLMYVDAIFCENTEPRSLQDKQ KGAKQTFEEMMNYIPDLLVKCIGEETKYESIRLLFDGLQQPVLNKQVKLIKARKLMLARF GERRILLSKVSVTQGQPRSENRDLVPCIPAALAPTVAKGGQVAAWAMPSEGANPKPWQLP CGVEPAGAQKSRIKIWEPPPRFQRMYGNAWMSRQKFTVVAGPSWRASARAVLNGNVGLDP TYTVPTGALPNGAMRRGPTGALPNGAMRRGAVRRGL >gi568815592f:85350140_85571466|GENSCAN_predicted_CDS_7|1731_bp atgaagcatatagaagtgatagttaaagccagacagaaagtaaaaaatacagagttttta cagcaagctgctttagaagaatatggtccagagcttcatgttgctttgagaagtcgaaga gatgaattgcactatttaaggaaacttactgaactgctttttccttatattttgcctcct aaagcaacagactgcaggaacacacagaaaaggggagaatcatttggaatcagcagaata ggtagcaaaattaaaggagtattcaaaagtaccacaatggagggagctatgttgcctaat tatggtgtagctgaaggtgaagatgattttattgaagaaggtattgttgtaatggaagat gattctccagtggaggctgtgagcacacctaatactccccgaaaccttgctgcatggaaa attagcattccatatgtagacttttttgaggatccctcctctgaaaggaaggagaaaaaa gaaagaattcctgtgttttgtattgatgttgaaagaaatgatagaagagcaggtgcattt cctgatgcccagcttccttctaagaggatcattggccccaaaaattatgaattcttaaag tcaaagagggaagagttccaagaatatctacagaaacttctgcagcatccagaactgagt aatagtcaacttctggcagactttctttcccctaatggtggggaaacacaatttcttgat aagatactaccagatgtaaatcttgggaaaattataaaatctgttcctggaaaactaatg aaagagaaaggtcagcatttggaaccttttatcatgaatttcattaattcttgtgagtct ccaaagcctaaaccaagtagaccagaactgaccattctcagccctacttcagaaaacaac aagaagcttttcaatgatctgtttaaaaataatgcaaaccgtgctgaaaatacagagaga aagcaaaatcagaattattttatggaggtgatgactgtagaaggagtctatgattacctg atgtatgtagatgctatattctgtgaaaacactgaacctcgctctctccaagataagcaa aaaggagcaaaacagacttttgaagaaatgatgaattacattccagatctgttagtcaag tgtattggtgaagaaaccaagtatgaaagcatcagacttctgtttgatggcttacagcaa ccagtactcaacaagcaggtaaagctcattaaagctcgtaaactcatgctggctaggttt ggtgaacgacgtattttgctttccaaggtttcagttacccaaggtcaaccaaggtctgaa aatagggacttggtgccctgcatcccagctgctttagctccaaccgtggctaaagggggc caagttgcagcttgggctatgccttcagagggtgcaaaccccaagccttggcagcttcca tgtggtgttgagcctgcaggtgcacagaaatcaagaatcaagatttgggaaccacctcct agatttcagaggatgtatggaaatgcctggatgtccaggcagaagtttactgtggtggca gggccctcatggagagcctctgctagggcagtgttgaatggaaatgtggggttggatccc acatacacagttcccactggggcactacctaatggagctatgagaagagggcccactggg gcactgcctaatggagccatgagaagaggagctgttagaagggggctgtga