GENSCAN 1.0 Date run: 4-Nov-116 Time: 00:28:09

Sequence gi568815596f:75546808_75755282 : 208475 bp : 40.00% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.13 PlyA -    288    283    6                               1.05
 1.12 Term -   2050   1952   99  1  0   33   49   109 0.514  -1.35
 1.11 Intr -   2961   2796  166  2  1   -1   79    96 0.126  -1.16
 1.10 Intr -   6814   6609  206  1  2   39   98   148 0.579   7.98
 1.09 Intr -   7370   7206  165  2  0   92   53    84 0.732   4.54
 1.08 Intr -   7625   7505  121  1  1  127   81   -25 0.359   0.28
 1.07 Intr -  13491  13232  260  0  2   38   74   120 0.413   0.94
 1.06 Intr -  13910  13726  185  0  2   56   41   151 0.709   5.79
 1.05 Intr -  14112  14007  106  0  1   74   69    72 0.665   2.77
 1.04 Intr -  19658  19583   76  1  1   62   98    42 0.235   1.20
 1.03 Intr -  23192  23065  128  2  2   24   75    84 0.346  -0.74
 1.02 Intr -  24343  24137  207  1  0  101   84   140 0.705  13.25
 1.01 Init -  33207  33085  123  0  0   94   59    43 0.307   2.22
 1.00 Prom -  44288  44249   40                              -3.75

 2.09 PlyA -  45682  45677    6                               1.05
 2.08 Term -  51015  50751  265  2  1   67   39   266 0.787  13.60
 2.07 Intr -  51817  51556  262  2  1   64   -3   216 0.356   5.72
 2.06 Intr -  52271  52129  143  1  2    4   72    64 0.244  -4.52
 2.05 Intr -  53942  53602  341  2  2  116   62   205 0.538  14.05
 2.04 Intr -  62041  61915  127  0  1  143   89     6 0.004   5.96
 2.03 Intr -  62574  62393  182  0  2   22    2   217 0.001   4.24
 2.02 Intr -  68872  68685  188  2  2   56   44    98 0.001   0.69
 2.01 Init -  74757  74685   73  0  1  100   82    71 0.182   8.98
 2.00 Prom -  82419  82380   40                              -6.55

 3.00 Prom +  86335  86374   40                              -5.45
 3.01 Init + 100001 100103  103  1  1   91   92    96 0.699   9.70
 3.02 Intr + 100295 100412  118  0  1  121   94   358 0.977  38.30
 3.03 Intr + 105335 105453  119  2  2  109   57    54 0.998   3.59
 3.04 Intr + 105716 105850  135  0  0   70  115   153 0.949  15.82
 3.05 Intr + 107929 108110  182  2  2   72   64   226 0.945  17.27
 3.06 Term + 108257 108478  222  1  0   83   37   197 0.986  10.03
 3.07 PlyA + 108620 108625    6                               1.05

 4.00 Prom + 133242 133281   40                              -5.45
 4.01 Init + 135676 135903  228  0  0   88   57   211 0.436  16.42
 4.02 Term + 136609 136752  144  0  0   46   39   195 0.458   7.33
 4.03 PlyA + 136849 136854    6                               1.05

 5.14 PlyA - 137960 137955    6                               1.05
 5.13 Term - 141170 141006  165  2  0   29   37   145 0.667   0.43
 5.12 Intr - 142418 142219  200  0  2   80   63   105 0.907   5.35
 5.11 Intr - 143274 143162  113  2  2   26   86    98 0.971   2.50
 5.10 Intr - 143912 143831   82  0  1   49   77   106 0.948   3.38
 5.09 Intr - 145293 145170  124  0  1   55   37    79 0.855  -1.26
 5.08 Intr - 147620 147434  187  1  1   66   83   114 0.979   7.47
 5.07 Intr - 149508 149393  116  0  2   90   71    69 0.975   3.73
 5.06 Intr - 154480 154383   98  2  2   59   74    95 0.975   4.01
 5.05 Intr - 155616 155473  144  1  0   87   44   166 0.416  11.43
 5.04 Intr - 159844 159716  129  2  0   31   92   121 0.801   6.55
 5.03 Intr - 164037 163453  585  1  0   30  -40   494 0.803  22.49
 5.02 Intr - 165235 164960  276  2  0   69   68   307 0.974  23.37
 5.01 Init - 166028 165926  103  2  1   60   68   120 0.649   5.66
 5.00 Prom - 185402 185363   40                              -4.75

 6.03 PlyA - 185546 185541    6                               1.05
 6.02 Term - 187117 186924  194  2  2   21   40   166 0.107   1.70
 6.01 Init - 192321 192249   73  0  1   77   87    49 0.416   5.00

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 62388 62534 147 1 0 55 109 154 0.917 13.49 S.002 Init - 90814 90752 63 1 0 54 66 79 0.868 3.50 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:75546808_75755282|GENSCAN_predicted_peptide_1|613_aa MSTARLLPTFTQGPRALQSACGKCCQPGSLPSEQEAPLWLRDLAATQLMTCNFYLPPPPN HAFSFLTGWAKIKEIKIIHEKKWVSAAAANTLRQQPKSAAQVSSSNKLGHVECANLRQRD EQEPIDLTGLGYGRRNRVQSSTVKMCHRCPGLRGLSSMKSSFERLTCAPGILSYSFVQLL RGLAPPSVSTQAGTPHPQASPDPGGPSAGIPFRAQSPEVRAGAPAKVRGARERSLNSGGA RHPQRAPASPKRPLFGNTQSTSDLKQLLCSQVPAQPARGGPARNVRAAPWTIYLVCFGGS QLAARIWRPPGPRKPSASGSERLLAPPKVPVGRFAFGEGRADSPRCSIWEVCVGVPTPTI EGLASEAVPGGSEMLFVLGFKNSLPPLAPSLITKAACTLPSMMGLVVAAGSRLQGILFLQ GDFDQKGCLKLSFSGHLHKHTHLVKTRQAIAAHGRRRRQLGGLGVSAQLKLPTLIDTFMC QAQFQALETGMNKILFVPKELSNPVWRQMRKSEIVMQCFAVHLSLQPHGKWPWGIFLSPH KAAAQLNYLFDLEPSGIRKSKQYIIQESHFQLEGSLEDSLKLLRDGHSRSKYTDKIHDDY KLVTAILDKNRVL >gi568815596f:75546808_75755282|GENSCAN_predicted_CDS_1|1842_bp atgagtactgccagactactgccaacgttcactcaaggcccaagggctcttcagtcagct tgtggtaaatgctgccagcctgggtctctcccctcagagcaggaggctcctctgtggctc agggatttagctgccactcagctcatgacctgtaacttctatctcccgccccctcccaac cacgctttcagttttctcacaggctgggcaaaaattaaagagattaaaatcatccatgag aaaaagtgggttagtgcagctgctgcaaataccctgcgtcagcagcccaagtcagcagct caagtcagcagctcaaataaacttggccatgtggagtgtgcaaacctgagacagagagat gaacaagaacctattgacctcacaggactcggatatgggagacgtaacagggtgcagagc tctacagtgaaaatgtgtcacagatgcccaggactcagaggtcttagttccatgaaaagt tcttttgaacgactgacgtgtgctcctggcatcctctcatattcctttgtacagcttctg cggggcttggcgccgccgtccgtgagcacccaggccgggaccccacatccccaggccagc cctgaccccggcggcccctccgccgggattcctttccgagcgcagtccccggaagtgcgc gccggggccccagcaaaggtaagaggtgcccgagagcgctccctgaactctgggggcgcc cggcatccccagcgggctccggcttctccgaagcgccctcttttcggaaacacccagagc acttctgacctgaagcagctgctttgttcccaagtccctgcccagcctgcgcggggcgga cctgccagaaatgtacgtgctgcaccgtggactatttacctggtttgtttcggaggcagc caactcgccgcgcgcatttggcgacccccaggaccccgaaagccgagtgcatcagggagt 
gaaaggctgttggcgccccctaaagtcccagtgggacgctttgcttttggagaggggcgc gcagacagtccacgctgctctatctgggaggtctgcgtgggggtccccacccctactatt gagggcctggcatctgaggctgtgcctgggggcagtgagatgctctttgttctggggttc aaaaactctcttcctcctcttgccccctctttgatcaccaaggcagcctgcactttgccc tccatgatggggctggtggtagctgcagggagcagacttcaggggatcttgtttctgcaa ggggactttgatcaaaagggctgcctaaagctgtctttctctgggcacctccacaaacac actcatcttgtcaagacaagacaggccatagcagcccatgggaggagacggagacagctt ggtggccttggtgtgtctgctcagctcaagctgccaacacttattgacaccttcatgtgc caggcacagttccaggctctggaaacagggatgaacaagatactctttgtgcccaaggag ctcagcaatccagtgtggagacagatgaggaaatcagagatagtaatgcaatgttttgct gtccatctctcattacagccacacgggaaatggccttggggtatttttctttccccacac aaagcagcagctcagctgaattatttgtttgatcttgagccttcaggtatcaggaaaagt aaacagtacatcattcaagagtcacatttccagcttgaaggttccctggaggacagcttg aagttattacgggatgggcattcaagaagtaaatacacagataaaatacatgatgattat aaactggtaactgctatcctggacaagaacagggtgctctag >gi568815596f:75546808_75755282|GENSCAN_predicted_peptide_2|526_aa MKNLGSLGGALDFTWHHVEAEPLESNSPEQGQEEETCGVDSRLDVHICPSCEDHYVSTLF FAAQNICLILTATPGMGAFMPFLMMRPQVVIRKPASPRDSPVLEGPGVCENRDALSAAAR RRHTVAPLDGPTVFHKAAFVRIPAQPPRLAPLMEARLVFPLVYKPKKLALFLPGKGTTDL IFKSFASTPLQALVRPAATSQSQQGYYKMLGPAASLNSALQYSIDKANKASLFVLRMSSE MSWQFCSNTEELHFSASLNYSCMSIQVLSTEYTSSPSYPIPPTLRQSVKPAFKYQRKLQI LSRRLSGSRAHSLNPFIPLASCVDLFLKSPPLVTRFAYELLPMRKGKNDVEGEKPEAHRS EQGLWWIKNDSELAQVHRAALHGGRPVGVSVGLEDLGLIIGRQGWGWLLLRGTAALERWL VVGGQRVGILVGFAEGGRGPYGTSGFHFLLWESGLSEPKCHTARKLQQPQRHMWEGVRVP GQQPLPTVSILELTRKWILQLHSRLPCWKYMEQEQPSAQSSPQNAD >gi568815596f:75546808_75755282|GENSCAN_predicted_CDS_2|1581_bp atgaagaatcttgggagtcttggaggggcactggacttcacatggcatcatgtagaagca gaacctttggaatcaaatagcccagagcaggggcaggaagaagaaacctgtggtgtggac tccaggcttgatgttcacatctgcccttcctgtgaggaccactatgtctctacactgttc tttgctgcacaaaacatttgtttaatccttacagctactcctggaatgggcgcatttatg ccctttttaatgatgaggccacaagtagtgatcagaaaaccagcatctcctcgagactca cctgtcttagaaggtccaggtgtgtgtgagaaccgggatgcgctgtctgccgctgcccgg cgccgccacacggtggcgccgttggacggtcccacagtcttccacaaagcggcttttgtc 
cgtatccccgcacagccgccgaggttagcacccctcatggaagccaggctggttttccct ctggtctacaagccaaagaagctggcgctattcttaccaggaaaggggactacagattta atttttaaatcttttgcaagcacccctcttcaagcactggtgagacctgcagccacctct cagagtcagcagggctattacaagatgttggggccagccgcctccttaaactcagcacta cagtacagcattgataaagcaaacaaggcaagcttgtttgtacttcgtatgtcttcagaa atgagctggcagttttgtagtaacactgaggaattacactttagtgcatcactgaactat tcctgcatgtccatacaagtgttgtccacagagtacacatcttccccgtcctatcccata cccccaaccctaagacagtctgtcaagccagctttcaagtatcagagaaagctgcagatt ctatctagaagattgtctggatctagagcccactcattgaaccctttcattccacttgcg agctgtgttgatctctttttgaaatctcctccactggttacacgttttgcttatgagtta ttgcccatgagaaagggcaagaatgatgtggaaggtgagaagcctgaggcccacaggtca gagcaaggattatggtggattaaaaatgacagcgaattggcccaggtacacagggctgcc ctacatggtgggcgtcctgtaggcgtctcagttggcttggaggacctggggcttataatt gggcgccagggttgggggtggctgcttctgcggggcaccgcagccttggagaggtggctg gtagttggagggcagcgagtaggcattctggtgggctttgcggaaggagggcgggggcct tacgggaccagcggcttccacttcctgctctgggaatctggactctcggaacccaaatgc catactgcgaggaagcttcagcagccccagagacacatgtgggaaggagttagggtccct ggtcagcagcctctgccgacagtcagcattttggagctaactaggaagtggatcctccag ctccattcaagacttccttgttggaaatacatggaacaagaacagccgtctgctcagagt tctccccaaaatgcagattaa >gi568815596f:75546808_75755282|GENSCAN_predicted_peptide_3|292_aa MAACIAAGHWAAMGLGRSFQAARTLLPPPASIACRVHAGPVRQQSTGPSEPGAFQPPPKP VIVDKHRPVEPERRFLSPEFIPRRGRTDPLKFQIERKDMLERRKVLHIPEFYVGSILRVT TADPYASGKISQFLGICIQRSGRGLGATFILRNVIEGQGVEICFELYNPRVQEIQVVKLE KRLDDSLLYLRDALPEYSTFDVNMKPVVQEPNQKVPVNELKVKMKPKPWSKRWERPNFNI KGIRFDLCLTEQQMKEAQKWNQPWLEFDMMREYDTSKIEAAIWKEIEASKRS >gi568815596f:75546808_75755282|GENSCAN_predicted_CDS_3|879_bp atggcggcctgcattgcagcggggcactgggctgcaatgggcctaggccggagtttccaa gccgccaggactctgctccccccgccggcctctatcgcctgcagggtccacgcggggcct gtccggcagcagagcactgggccttccgagcccggtgcgttccaaccgccgccgaaaccg gtcatcgtggacaagcaccgccccgtggaaccggaacgcaggttcttgagtcctgaattc attcctcgaaggggaagaacagatcctctgaaatttcaaatagaaagaaaagatatgtta gaaaggagaaaagtactccacattccagagttctatgttggaagtattcttcgtgttact 
acagctgacccatatgccagtggaaaaatcagccagtttctggggatttgcattcagaga tcaggaagaggacttggagctactttcatccttaggaatgttatcgaaggacaaggtgtc gagatttgctttgaactttataatcctcgggtccaggagattcaggtggtcaaattagag aaacggctggatgatagcttgctatacttacgagatgcccttcctgaatatagcactttt gatgtgaatatgaagccagtagtacaagagcctaaccaaaaagttcctgttaatgagctg aaagtaaaaatgaagcctaagccctggtctaaacgctgggaacgtccaaattttaatatt aaaggaatcagatttgatctttgtttaactgaacagcaaatgaaagaagctcagaagtgg aatcagccatggcttgaatttgatatgatgagggaatatgatacttcaaaaattgaagct gcaatatggaaggaaattgaagcgtcgaaaaggtcttga >gi568815596f:75546808_75755282|GENSCAN_predicted_peptide_4|123_aa MGRNQCKKAENSKNQNARSSKNHNSSPAREQNRMENEFDEVTEVGFRRWVITNSSELKEH VLTQWKEAKNLEKRLEACLTRAPEGSAKHGKVQLVPDAAKTYRIVKTINAMKKLRQLMSK MTS >gi568815596f:75546808_75755282|GENSCAN_predicted_CDS_4|372_bp atggggagaaaccagtgcaaaaaggctgaaaattccaaaaaccagaatgcccgttcttca aagaatcacaactcatcaccagcaagggaacaaaaccggatggagaatgagtttgacgaa gtgacagaagtaggcttcagaaggtgggtaataacaaactcctctgagctaaaggagcat gttctaacccaatggaaggaagctaagaaccttgaaaaaaggttagaggcctgccttaca agagctcctgaaggaagcgctaaacatggaaaggtacaactggtaccagacgctgcaaaa acataccgaattgtaaagaccatcaacgctatgaagaaactgcgtcaactaatgagcaaa atgaccagctag >gi568815596f:75546808_75755282|GENSCAN_predicted_peptide_5|773_aa MKLRTLAVSVTALKVARLGSAPSDVQMCLEFLPSDSGAQLASPSGSRTGAAGGAACQSGA VRSYSSALGWSMGLGAVEQGVVLVGDARAAQEPMEWMGGSGMAGCRSRALHRGKAAKARR EIEHSAGRKGLFGSARLIPATAMAPRSRLLSLGRRGNFRSRVLRRKSRPLEEAARRWRDC PTGFGALVAGAGSGRAPGVPPKRLPARTKAQVSRETRGSGEVTAQGGKGPRGLWILLWGF YNLLPTTSHAESYPERLIQKVVSEKIARALARYLHLPFGLLGGNKTIVYSWEALRFHHQL TPSVITNTVIKVYEEPKLSQQKSRTLDVSTDEEDKIHHSSESKDDQGLSSDSSSSLGEKE LSSTVKIPDAAFIQAARRKRELARAQDDYISLDVQHTSSISGMKRESEDDPEISRNEETS EESQEDEKQDTWEQQQMRKAVKIIEERDIDLSCGNGSSKVKKFDTSISFPPVNLEIIKKQ LNTRLTLLQETHRSHLREYEKYVQDVKSSKSTIQNLESSSNQALNCKFYKSMKIYVENLI DCLNEKIINIQEIESSMHALLLKQAMTFMKRRQDELKHESTYLQQLSRKDETSTSGNFSV DEKTQWILEEIESRRTKRRQARVLSGNCNHQEGTSSDDELPSAEMIDFQKSQGDILQKQK KVFEEVQDDFCNIQNILLKFQQWREKFPDSYYEAFISLCIPKLLNPLIRVQLIDWNPLKL 
ESTGLKEMPWFKSVEEFMDSSVEDSKKESSSDKKVLSAIINKTIIPRLTGTAS >gi568815596f:75546808_75755282|GENSCAN_predicted_CDS_5|2322_bp atgaagctgcggaccctcgcggtgagcgttacagctcttaaggtggcgcgtctggggtct gccccttctgatgttcagatgtgtttggagtttcttccttctgactcaggagcccaactg gcttcacctagtggatcccgcaccggggctgcaggtggagctgcctgccagtccggcgcc gtgcgctcgtactcctcagcccttgggtggtcgatgggactgggcgccgtggagcagggg gtggtgctcgtcggggatgctcgggcagcacaggagcccatggagtggatgggaggctca ggcatggcgggctgcaggtcccgagccctgcaccgcgggaaggcagctaaggcccggcga gaaatcgagcacagcgccggccgaaaaggacttttcggcagcgcgcggctgattccagcg acagcgatggcgccgaggagtcgcctgctgagcctggggcgccgagggaacttccggtcc cgggttctgcggaggaagagccgccctctggaggaggccgcgcgcaggtggcgggactgc cccaccgggttcggggccctcgtggccggggccgggtctgggcgagctcccggcgtgcca ccaaagcggctccccgcgcggacgaaggctcaggtgtccagggagacgcggggaagcggg gaggtgacggcacagggcgggaaggggccgcggggtctctggatccttctctgggggttc tataacttgctgccgaccacaagccatgctgagtcttatccagaaagacttatccagaag gtagtgagcgagaaaatagctcgagcacttgcccgctatctgcatctgccttttggactc cttgggggaaataaaacaatcgtgtacagttgggaggctcttcgttttcaccatcagttg actccatcagttataactaacacagttatcaaagtttatgaagaacctaaactgtcacaa caaaaatccagaacccttgatgtgtccacagatgaagaggataaaatacatcactcctca gaaagtaaggatgatcagggtttgtcttctgacagttctagctctcttggagaaaaagaa ctttcatcaacagttaagatcccagatgcagcttttattcaggcagcccgcagaaaacgt gaattggccagggcccaagatgactatatttctttggatgtacaacatacctcctccatc tctggtatgaagagagagagcgaagatgaccctgagataagcagaaatgaagaaacaagt gaagaaagtcaggaagatgaaaagcaagatacttgggaacaacagcaaatgaggaaagca gttaaaatcatagaggaaagagacatagatctttcctgtggcaatggatcttcaaaagtg aagaaatttgatacttccatttcatttccgccagtaaatttagaaattataaagaagcaa ttaaatactagattaacattactacaggaaactcaccgctcacacctgagggagtatgaa aaatacgtacaagatgtcaaaagctcaaagagtaccatccagaacctagagagttcatca aatcaagctctaaattgtaaattctataaaagcatgaaaatttatgtggaaaatttaatt gactgccttaatgaaaagattatcaacatccaagaaatagaatcatccatgcatgcactc cttttaaaacaagctatgacctttatgaaacgcaggcaagatgaattaaaacatgaatca acgtatttacaacagttatcacgcaaagatgagacatccacaagtggaaacttctcagta 
gatgaaaaaactcagtggattttagaagagattgaatctcgaaggacaaaaagaagacaa gcaagggtgctttctgggaattgtaaccatcaggaaggaacatctagtgatgatgaactg ccttcagcagagatgattgacttccaaaaaagccaaggtgacattttacagaaacagaag aaagtttttgaagaagtgcaagatgatttttgtaacatccagaatattttgttgaaattt cagcaatggcgagaaaagtttcctgactcctattatgaagctttcattagtttatgcata ccaaagcttttaaatcccctaatacgagttcagttgattgattggaatcctcttaagttg gaatccacaggtttaaaagagatgccatggttcaaatctgtagaagaatttatggatagc agtgtggaagattcaaagaaggaaagtagttcagataaaaaagtcttgtctgcaatcatc aacaaaacaattattccccgacttacaggtactgcttcgtaa >gi568815596f:75546808_75755282|GENSCAN_predicted_peptide_6|88_aa MQALAVPPTREHVCWRWLFYRSDAGSNVQLGPWLQRVEAPSLGIFHMPGCPGKSLLQGQG LMENLCYGNAEGKCGVGAPTQSPYWGTA >gi568815596f:75546808_75755282|GENSCAN_predicted_CDS_6|267_bp atgcaggctctggctgttccaccaacccgggagcatgtctgctggaggtggctgttttac aggtcagatgcagggtccaatgtacagcttgggccatggctccagagggtggaagcccca agccttggcatcttccacatgcctggatgcccaggcaaaagtttgctgcaggggcagggc ctcatggagaacctctgctacggcaatgcagaagggaaatgtggggtgggagcccccaca cagagtccctactggggcactgcctag