GENSCAN 1.0	Date run: 8-Nov-116	Time: 04:14:56

Sequence gi568815596r:47420910_47670245 : 249336 bp : 45.45% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   8833   9032  200  1  2   26   87   154 0.737   8.09
 1.02 Intr +  24639  24748  110  1  2   88   85    67 0.674   6.40
 1.03 Intr +  30997  31017   21  0  0  137   72    13 0.480   2.34
 1.04 Intr +  42122  42245  124  1  1   98   91    64 0.922   7.86
 1.05 Intr +  45749  45899  151  0  1   34   41   147 0.995   3.82
 1.06 Intr +  50056  50153   98  1  2   75   97    76 0.976   6.85
 1.07 Intr +  54116  54361  246  0  0   75   58   175 0.981  10.63
 1.08 Intr +  55458  55662  205  1  1   77  111    70 0.981   6.46
 1.09 Intr +  57363  57610  248  0  2   52   95    82 0.958   2.40
 1.10 Intr +  59787  59962  176  1  2  104   68    75 0.952   6.76
 1.11 Term +  61870  62040  171  0  0   56   48   107 0.908   1.33
 1.12 PlyA +  62291  62296    6                               1.05

 2.00 Prom +  62777  62816   40                              -3.86
 2.01 Init +  63583  63667   85  0  1   76  115    43 0.432   6.78
 2.02 Term +  72035  72213  179  0  2   55   49   149 0.511   5.55
 2.03 PlyA +  72789  72794    6                               1.05

 3.00 Prom +  79684  79723   40                              -4.26
 3.01 Init +  89315  89414  100  1  1  104   50    56 0.548   4.06
 3.02 Intr +  91394  91511  118  0  1  130   77    35 0.985   6.12
 3.03 Intr +  96887  96940   54  2  0   78   77    43 0.529   0.29
 3.04 Intr +  98801  98976  176  2  2   97   36   137 0.954   8.98
 3.05 Term +  99282  99352   71  1  2  123   54    29 0.962   1.10
 3.06 PlyA +  99869  99874    6                              -0.45

 4.11 PlyA -  99896  99891    6                               1.05
 4.10 Term - 100899  99998  902  1  2  119   41  2185 0.526 209.41
 4.09 Intr - 122680 122609   72  2  0   86   44    64 0.002   1.18
 4.08 Intr - 142545 142521   25  0  1   85  100    43 0.412   2.80
 4.07 Intr - 148843 148764   80  2  2  103   33    65 0.265   1.77
 4.06 Intr - 149747 149032  716  1  2  -12   71   960 0.420  75.38
 4.05 Intr - 158820 158702  119  0  2   74   82    48 0.373   2.06
 4.04 Intr - 165996 165712  285  0  0   74   65   153 0.904   9.14
 4.03 Intr - 167991 167940   52  2  1   32  110    32 0.480  -1.29
 4.02 Intr - 168835 168700  136  2  1  106   80    46 0.642   5.23
 4.01 Init - 178298 178250   49  2  1  100   81    39 0.085   5.55
 4.00 Prom - 181861 181822   40                              -5.86

 5.00 Prom + 193794 193833   40                              -1.36
 5.01 Init + 206308 206396   89  0  2   87   94    71 0.355   7.71
 5.02 Term + 230610 230658   49  0  1  113   48    42 0.049  -0.52
 5.03 PlyA + 231396 231401    6                               1.05

 6.04 PlyA - 231513 231508    6                               1.05
 6.03 Term - 238600 238404  197  2  2   97   44   103 0.933   4.37
 6.02 Intr - 239271 239122  150  1  0   45   65   126 0.973   6.13
 6.01 Intr - 239622 239522  101  2  2   66   75    93 0.972   5.55

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:47420910_47670245|GENSCAN_predicted_peptide_1|583_aa
XLNLVEAFVEDAELRQTLQEDLLRRFPDLNRLAKKFQRQAANLQDCYRLYQGINQLPNVI
QALEKHEGKHQKLLLAVFVTPLTDLRSDFSKFQEMIETTLDMDQVNSVNILVENHEFLVK
PSFDPNLSELREIMNDLEKKMQSTLISAARDLGLDPGKQIKLDSSAQFGYYFRVTCKEEK
VLRNNKNFSTVDIQKNGVKFTNSKLTSLNEEYTKNKTEYEEAQDAIVKEIVNISSGYVEP
MQTLNDVLAQLDAVVSFAHVSNGAPVPYVRPAILEKGQGRIILKASRHACVEVQDEIAFI
PNDVYFEKDKQMFHIITGPNMGGKSTYIRQTGVIVLMAQIGCFVPCESAEVSIVDCILAR
VGAGDSQLKGVSTFMAEMLETASILRSATKDSLIIIDELGRGTSTYDGFGLAWAISEYIA
TKIGAFCMFATHFHELTALANQIPTVNNLHVTALTTEETLTMLYQVKKGVCDQSFGIHVA
ELANFPKHVIECAKQKALELEEFQYIGESQGYDIMEPAAKKCYLEREQGEKIIQEFLSKV
KQMPFTEMSEENITIKLKQLKAEVIAKNNSFVNEIISRIKVTT

>gi568815596r:47420910_47670245|GENSCAN_predicted_CDS_1|1752_bp
nnattgaatttagtggaagcttttgtagaagatgcagaattgaggcagactttacaagaa
gatttacttcgtcgattcccagatcttaaccgacttgccaagaagtttcaaagacaagca
gcaaacttacaagattgttaccgactctatcagggtataaatcaactacctaatgttata
caggctctggaaaaacatgaaggaaaacaccagaaattattgttggcagtttttgtgact
cctcttactgatcttcgttctgacttctccaagtttcaggaaatgatagaaacaacttta
gatatggatcaggtaaacagtgttaacatcctggtggaaaaccatgaattccttgtaaaa
ccttcatttgatcctaatctcagtgaattaagagaaataatgaatgacttggaaaagaag
atgcagtcaacattaataagtgcagccagagatcttggcttggaccctggcaaacagatt
aaactggattccagtgcacagtttggatattactttcgtgtaacctgtaaggaagaaaaa
gtccttcgtaacaataaaaactttagtactgtagatatccagaagaatggtgttaaattt
accaacagcaaattgacttctttaaatgaagagtataccaaaaataaaacagaatatgaa
gaagcccaggatgccattgttaaagaaattgtcaatatttcttcaggctatgtagaacca
atgcagacactcaatgatgtgttagctcagctagatgctgttgtcagctttgctcacgtg
tcaaatggagcacctgttccatatgtacgaccagccattttggagaaaggacaaggaaga
attatattaaaagcatccaggcatgcttgtgttgaagttcaagatgaaattgcatttatt
cctaatgacgtatactttgaaaaagataaacagatgttccacatcattactggccccaat
atgggaggtaaatcaacatatattcgacaaactggggtgatagtactcatggcccaaatt
gggtgttttgtgccatgtgagtcagcagaagtgtccattgtggactgcatcttagcccga
gtaggggctggtgacagtcaattgaaaggagtctccacgttcatggctgaaatgttggaa
actgcttctatcctcaggtctgcaaccaaagattcattaataatcatagatgaattggga
agaggaacttctacctacgatggatttgggttagcatgggctatatcagaatacattgca
acaaagattggtgctttttgcatgtttgcaacccattttcatgaacttactgccttggcc
aatcagataccaactgttaataatctacatgtcacagcactcaccactgaagagacctta
actatgctttatcaggtgaagaaaggtgtctgtgatcaaagttttgggattcatgttgca
gagcttgctaatttccctaagcatgtaatagagtgtgctaaacagaaagccctggaactt
gaggagtttcagtatattggagaatcgcaaggatatgatatcatggaaccagcagcaaag
aagtgctatctggaaagagagcaaggtgaaaaaattattcaggagttcctgtccaaggtg
aaacaaatgccctttactgaaatgtcagaagaaaacatcacaataaagttaaaacagcta
aaagctgaagtaatagcaaagaataatagctttgtaaatgaaatcatttcacgaataaaa
gttactacgtga

>gi568815596r:47420910_47670245|GENSCAN_predicted_peptide_2|87_aa
MQIREMQVKTTMSYNFTLITIVKIKKSDGHLMAQDDCQGFNQCIHILGSKNDESAKMLIP
PLKKQKYTASTCIFHGRTESHIYAVGA

>gi568815596r:47420910_47670245|GENSCAN_predicted_CDS_2|264_bp
atgcagatcagggaaatgcaagtcaaaaccacaatgagctacaacttcacactgattacg
atagttaaaatcaaaaagtcagatggtcacctcatggcccaagatgactgccagggcttc
aaccaatgcatccacattttaggcagcaagaacgatgagagtgcaaagatgcttatccca
cctttaaagaaacaaaagtacacagcttccacttgcatcttccatgggagaactgagtca
cacatctatgctgttggagcctag

>gi568815596r:47420910_47670245|GENSCAN_predicted_peptide_3|172_aa
MERMAEREGGASLTDGITVTALAPDHLLQSSGYEPQSDRAKRPVPHFADMEKETSWGKEI
CLQSARETEPGQCSEKAASQDSGFSVATVIWLCMEKTNPLDKQYLERDSIFPDPKHIMAS
GAKYPDICKMASFGFLVLQALVPKDAAGRGALDRPPLEVAALCSVLNGLTLS

>gi568815596r:47420910_47670245|GENSCAN_predicted_CDS_3|519_bp
atggaaaggatggcagagcgggaaggaggagccagccttaccgatggcatcactgtcact
gcgctagccccagaccacctgctccagagttctggttatgaacctcagagtgacagagcc
aaaagaccagtgcctcattttgctgacatggaaaaggaaacttcgtgggggaaagagatc
tgcttgcagtcggccagagagacagaaccagggcagtgctctgagaaggctgctagccaa
gactctggattctctgtggccacagtcatatggctgtgcatggagaagacaaatcctctt
gataaacaatatttagaaagggattctatctttcctgaccccaaacacatcatggcctct
ggagccaaataccctgacatttgcaagatggcttcttttgggttcctggtgctgcaggcc
ctggttcccaaggacgcagctggcagaggtgccctggaccgcccaccattagaagtagct
gccctgtgctctgtgctaaatggactaactctgagctga

>gi568815596r:47420910_47670245|GENSCAN_predicted_peptide_4|811_aa
MEAGKSKIKALADLVADEQERVSSLAQSHTDNHRLHEPGLQEGIRAVPREDPQWNYQADS
PRGPLDHHRRRASGNSQWRQAKLIALTRALTLAKGLRINIYTDSKYAFRILHHHAVIWAE
RGFLPTQGSSIINATLIKTLLKAALLPKEAGVIHCKGHQKASDPITQGNAYADKPIGFGL
EKLLTFHLSQLQEYRGTKWREKSHRKVNHDENTRGLAAEPRARERRVRSRGSRRGVGARR
GAPLPGHPLGHGTRVWPERRGGGGGAPLSGLWEGDEGLCEGGEGLRGGPGPLATILTLLL
PPGGCRGPGPERRPGPGAELRTMSSRSPRPPPRRSRRRLPRPSCCCCCCRRSHLNEDTGR
FVLLAALIGLYLVAGATVFSALESPGEAEARARWGATLRNFSAAHGVAEPELRAFLRHYE
AALAAGVRADALRPRWDFPGAFYFVGTVVSTIVREESPPLALTPGRLCSNTGRLCDLTFK
SYINIAKEQEHPAIQQSFPRVSTVSSENRKEGFGMTTPATVGGKAFLIAYGLFGCAGTIL
FFNLFLERIISLLAFIMRACRERQLRRSGLLPATFRRGSALSEADSLAGWKPSVYHVLLI
LGLFAVLLSCCASAMYTSVEGWDYVDSLYFCFVTFSTIGFGDLVSSQHAAYRNQGLYRLG
NFLFILLGVCCIYSLFNVISILIKQVLNWMLRKLSCRCCARCCPAPGAPLARRNAITPGS
RLRRRLAALGADPAARDSDAEGRRLSGELISMRDLTASNKVSLALLQKQLSETANGYPRS
VCVNTRQNGFSGGVGALGIMNNRLAETSASR

>gi568815596r:47420910_47670245|GENSCAN_predicted_CDS_4|2436_bp
atggaggctgggaagtccaagatcaaggcactggcagatttggtagctgatgaacaggaa
agagtttcttctctagcccaatctcacactgacaaccaccggcttcatgagccaggcctc
caggaaggcattagagcagttccccgagaagatccccaatggaactatcaggcagattcc
cccagaggccccctggaccatcatagacgccgagcttcaggtaactcacagtggaggcaa
gccaaactcattgccttaactcgggccctcactcttgcaaagggactacgcatcaatatt
tatactgactctaaatatgccttccgtatcctgcaccaccatgctgttatatgggctgaa
agaggtttcctccctacgcaagggtcctccatcattaatgccactttaataaaaactctt
ctcaaggccgctttacttccaaaggaagctggagtcattcactgcaagggccatcaaaag
gcatcagatcccatcactcagggcaacgcttatgctgataagcctattggttttggattg
gaaaagttattgacatttcatctctcccaattgcaagaatatagaggaaccaagtggagg
gaaaaatcccacaggaaagtcaaccatgatgagaacacaaggggcctagctgcggagccc
cgcgcccgagagcggcgggtaaggagccgcgggagccggcgaggcgtcggggcgcgcaga
ggagcgcccctgcccgggcacccgctgggccacgggactcgcgtgtggcctgagcgccgg
ggaggaggcggaggcgcccctctgtccgggctctgggaaggcgacgaggggctctgcgaa
ggcggcgaggggctccgcggcggccccggacccctggccaccatcctcacgctcctgctc
ccgccggggggatgtcgtggcccgggccccgagcgccgccccggccccggggctgagctc
cggaccatgtcctcccgcagcccccggcccccgccccgccgtagccgccgccgcctgccg
cgcccctcctgctgctgctgctgctgccgccgttcgcacctcaacgaggacaccggccgc
ttcgtgctgctggcggcgctcatcggcctctacctggtggcgggtgccacagtcttctcg
gcgctcgagagccccggcgaggcggaggcgcgggcgcgctggggcgccacgctgcgcaac
ttcagcgctgcgcacggcgtggccgagccagagctgcgcgccttcctccggcactacgag
gccgcgctggccgccggcgtccgcgccgacgcgctgcgcccgcgctgggacttccccggc
gccttctacttcgtgggcaccgtggtgtcaaccatagtgagggaagaaagcccacctctg
gcgctcaccccgggccgcctgtgctccaacactggccggctctgtgatctgacctttaag
agttacatcaatattgccaaagaacaggagcacccagcaatacagcagagcttcccacgg
gtttctacagtgtcttcagagaaccgcaaggagggtttcggcatgaccacccccgcgacg
gtgggcgggaaggccttcctcatcgcctacgggctgttcggctgcgctgggaccatcctg
ttcttcaacctcttcctggagcgcatcatctcgctgctggccttcatcatgcgcgcctgc
cgggagcgccagctgcgccgcagcggcctgctgcccgccaccttccgccgcggctccgcg
ctctcggaggccgacagcctggcgggctggaagccctcggtgtaccacgtgctgctcatc
ctgggcctgttcgccgtgctgctgtcctgctgcgcctcggccatgtacaccagcgtggag
ggctgggactacgtggactcgctctacttctgcttcgtcaccttcagcaccatcggcttc
ggggacctggtgagcagccagcacgccgcctaccggaaccaggggctctaccgcctgggc
aacttcctcttcatcctgctcggcgtgtgctgcatttactcgctcttcaacgtcatctcc
atcctcatcaagcaggtgctcaactggatgctgcgcaagctgagctgccgctgctgcgcg
cgctgctgcccggctcctggcgcgcccctggcccggcgcaatgccatcaccccaggctcc
cggctgcgccgccgcctggccgcgctcggtgccgaccccgcggcccgcgacagcgacgcc
gagggccgccgcctctcgggcgagctcatctccatgcgcgacctcacggcctccaacaag
gtgtcgctggcgctgctgcagaagcagctgtcggagacggccaacggctacccgcgcagc
gtgtgcgtcaacacgcgccagaacggcttctcgggcggcgtgggcgcgctgggcatcatg
aacaaccggctggccgagaccagcgcctccaggtag

>gi568815596r:47420910_47670245|GENSCAN_predicted_peptide_5|45_aa
MMKRDFQGAVGGMEEKTSYGPGQRQGGLNRALSKPFFLWGRVPVT

>gi568815596r:47420910_47670245|GENSCAN_predicted_CDS_5|138_bp
atgatgaagagggactttcaaggtgcagttggagggatggaagagaagacaagttatggt
ccaggacaaagacaaggaggcctgaacagggctctgtcgaagcccttcttcctctggggc
agggtgccggtcacttga

>gi568815596r:47420910_47670245|GENSCAN_predicted_peptide_6|149_aa
XPVTKKVKKNRLYCDCFSLWEDCLANWEVKPDEKGHNGKVRATGKKVPGVDKNKETVIPA
KRKDRPFQRETEQKAMMYDLQATCAVHRVASPMGYGQGLKAKATNMCGSPQGTLLILSDY
GCAFTVWGEDVPLEALMVKFQDYHNLALL

>gi568815596r:47420910_47670245|GENSCAN_predicted_CDS_6|450_bp
nngccagttaccaaaaaggtaaagaagaaccgcctgtactgtgactgcttctccttatgg
gaagactgtttagctaactgggaagtcaaacctgatgaaaaaggccacaacggcaaagtt
agagccacaggaaaaaaagttccaggagttgacaaaaataaggagacagttatcccagcc
aagcgaaaagatagaccttttcaaagggaaacagaacagaaggcaatgatgtatgacctg
caagccacgtgtgccgtacaccgagttgccagccccatgggatatggccagggcctcaag
gcaaaggctaccaacatgtgtgggtctcctcagggcacactccttattttgtctgactat
gggtgtgcatttacagtctggggtgaagatgtgcctctagaagcactaatggtgaagttt
caggactatcacaacctggccttgctctag