GENSCAN 1.0 Date run: 2-Nov-116 Time: 22:52:38 Sequence gi568815597r:186343982_186546482 : 202501 bp : 36.10% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.25 Intr - 597 394 204 0 0 45 62 225 0.960 14.07 1.24 Intr - 1715 1599 117 2 0 68 108 163 0.999 16.04 1.23 Intr - 2306 2154 153 2 0 16 99 228 0.999 16.05 1.22 Intr - 3477 3311 167 1 2 103 91 205 0.995 21.06 1.21 Intr - 6407 6242 166 2 1 40 94 52 0.890 -0.39 1.20 Intr - 7489 7349 141 1 0 75 76 67 0.908 3.83 1.19 Intr - 8129 7995 135 2 0 77 91 66 0.980 5.64 1.18 Intr - 9869 9707 163 1 1 53 81 190 0.998 13.86 1.17 Intr - 11571 11429 143 0 2 63 127 107 0.882 10.33 1.16 Intr - 11787 11654 134 1 2 -2 103 146 0.971 6.54 1.15 Intr - 12468 12305 164 2 2 64 78 173 0.999 12.60 1.14 Intr - 13642 13416 227 1 2 65 108 201 0.995 15.66 1.13 Intr - 14669 14562 108 2 0 11 66 133 0.855 2.96 1.12 Intr - 16015 15818 198 1 0 71 81 299 0.995 26.13 1.11 Intr - 16383 16292 92 1 2 23 94 103 0.679 3.19 1.10 Intr - 16924 16784 141 2 0 37 86 74 0.674 1.50 1.09 Intr - 17728 17641 88 1 1 21 115 118 0.999 6.42 1.08 Intr - 17888 17808 81 2 0 59 94 56 0.822 2.22 1.07 Intr - 18399 18307 93 0 0 111 61 55 0.953 4.34 1.06 Intr - 19020 18856 165 0 0 44 107 118 0.972 8.54 1.05 Intr - 27062 26989 74 0 2 37 110 105 0.998 5.81 1.04 Intr - 29482 29378 105 2 0 72 89 74 0.862 5.17 1.03 Intr - 31029 30897 133 0 1 60 115 193 0.507 18.50 1.02 Intr - 31749 31578 172 2 1 132 103 260 0.702 30.92 1.01 Init - 32598 32588 11 0 2 72 69 2 0.790 -3.39 1.00 Prom - 33365 33326 40 -2.25 2.00 Prom + 34770 34809 40 -9.15 2.01 Init + 35805 35903 99 2 0 58 91 53 0.854 2.91 2.02 Intr + 39041 39175 135 1 0 25 28 148 0.590 2.34 2.03 Intr + 42007 42102 96 0 0 73 89 29 0.574 0.79 2.04 Intr + 46730 46870 141 1 0 68 76 36 0.470 0.03 2.05 Intr + 47715 47810 96 2 0 30 111 89 0.896 4.69 2.06 Intr + 54332 54460 129 1 0 11 87 118 0.830 3.97 2.07 Intr + 59163 59220 58 2 1 16 87 7 0.159 -9.06 2.08 Intr + 62102 62287 186 0 0 101 41 145 0.437 9.84 2.09 Term + 75028 75095 68 2 2 102 46 44 0.107 -1.28 2.10 PlyA + 75153 75158 6 1.05 3.02 PlyA - 76629 76624 6 1.05 3.01 Sngl - 83452 83045 408 1 0 9 49 310 0.926 15.24 3.00 Prom - 91079 91040 40 -4.75 4.06 PlyA - 91848 91843 6 1.05 4.05 Term - 93219 93152 68 1 2 124 41 96 0.955 5.62 4.04 Intr - 97750 97636 115 1 1 74 72 33 0.007 -0.60 4.03 Intr - 100525 100046 480 1 0 109 73 361 0.669 29.00 4.02 Intr - 102596 102445 152 0 2 60 116 61 0.419 5.06 4.01 Init - 112563 112557 7 0 1 64 85 0 0.128 -1.54 4.00 Prom - 114073 114034 40 -5.35 5.00 Prom + 130138 130177 40 -3.25 5.01 Init + 136260 136306 47 2 2 45 115 52 0.655 3.81 5.02 Term + 156563 156698 136 2 1 102 48 109 0.346 4.81 5.03 PlyA + 161461 161466 6 1.05 6.03 PlyA - 161676 161671 6 1.05 6.02 Term - 168055 167683 373 0 1 39 44 202 0.508 4.08 6.01 Init - 168663 168569 95 0 2 87 51 79 0.506 4.00 6.00 Prom - 170817 170778 40 -6.35 7.00 Prom + 173681 173720 40 -6.25 7.01 Init + 179011 179221 211 0 1 60 57 186 0.897 11.69 7.02 Intr + 179751 179910 160 1 1 29 75 131 0.294 4.02 7.03 Intr + 185556 185610 55 0 1 79 89 26 0.268 -0.14 7.04 Term + 197237 197518 282 1 0 64 46 170 0.581 4.84 7.05 PlyA + 198574 198579 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 94032 93981 52 0 1 73 86 22 0.823 1.87 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:186343982_186546482|GENSCAN_predicted_peptide_1|1125_aa MIQKASAKATAVATPPPSGRHRERGGIPEPAVRPYRDRCPLVAQCPEAANEDDPWHPGAL TQVLERTELNKLPKSVQNKLEKFLADQQSEIDGLKGRHEKFKVESEQQYFEIEKRLSHSQ ERLVNETRECQSLRLELEKLNNQLKALTEKNKELEIAQDRNIAIQYREKRLEQEKELLHS QNTWLNTELKTKTDELLALGREKGNEILELKCNLENKKEEVSRLEEQMNGLKTSNEHLQK HVEDLLTKLKEAKEQQASMEEKFHNELNAHIKLSNLYKSAADDSEAKSNELTRAVEELHK LLKEAGEANKAIQDHLLEVEQSKDQMEKEMLEKIGRLEKELENANDLLSATKRKGAILSE EELAAMSPTAAAVAKIVKPGMKLTELYNAYVETQDQLLLEKLENKRINKYLDEIVKEVEA KAPILKRQREEYERAQKAVASLSVKLEQAMKEIQRLQEDTDKANKQSSVLERDNRRMEIQ VKDLSQQIRVLLMELEEARGNHVIRDEEVSSADISSSSEVISQHLVSYRNIEELQQQNQR LLVALRELGETREREEQETTSSKITELQLKLESALTELEQLRKSRQHQMQLVDSIVRQRD MYRILLSQTTGVAIPLHASSLDDVSLASTPKRPSTSQTVSTPAPVPVIESTEAIEAKAAL KQEIFENYKKEKAENEKIQNEQLEKLQEQVTDLRSQNTKISTQLDFASKRYEMLQDNVEG YRREITSLHERNQKLTATTQKQEQIINTMTQDLRGANEKLAVAEVRAENLKKEKEMLKLS EVRLSQQRESLLAEQRGQNLLLTNLQTIQGILERSETETKQRLSSQIEKLEHEISHLKKK LENEVEQRHTLTRNLDVQLLDTKRQLDTETNLHLNTKELLKNAQKEIATLKQHLSNMEVQ VASQSSQRTGKGQPSNKEDVDDLVSQLRQTEEQVNDLKERLKTSTSNVEQYQAMVTSLEE SLNKEKQVTEEVRKNIEVRLKESAEFQTQLEKKLMEVEKEKQELQDDKRRAIESMEQQLS ELKKTLSSVQNEVQEALQRASTALSNEQQARRDCQEQAKIAVEAQNKYERELMLHAADVE ALQAAKEQVSKMASVRQHLEETTQKAESQLLECKASWEERERMLK >gi568815597r:186343982_186546482|GENSCAN_predicted_CDS_1|3375_bp atgattcaaaaagcctcagccaaagcgactgcggtcgccaccccgcccccttctgggcgc catagagagcgtggcggtatcccggagcctgccgtccggccctaccgggaccgatgccct ctagtggctcagtgtccagaagccgccaacgaggacgatccatggcatcccggggctttg acgcaagtcctggagcgcacggagctgaacaagctgcccaagtctgtccagaacaaactt gaaaagttccttgctgatcagcaatccgagatcgatggcctgaaggggcggcatgagaaa tttaaggtggagagcgaacaacagtattttgaaatagaaaagaggttgtcccacagtcag gagagacttgtgaatgaaacccgagagtgtcaaagcttgcggcttgagctagagaaactc aacaatcaactgaaggcactaactgagaaaaacaaagaacttgaaattgctcaggatcgc aatattgccattcagtatcgagaaaaacgcttggagcaagaaaaggaattgctacatagt cagaatacatggctgaatacagagttgaaaaccaaaactgatgaacttctggctcttgga agagaaaaagggaatgagattctagagcttaaatgtaatcttgaaaataaaaaagaagag gtttctagactggaagaacaaatgaatggcttaaaaacatcaaatgaacatcttcaaaag catgtggaggatctgttgaccaaattaaaagaggccaaggaacaacaggccagtatggaa gagaaattccacaatgaattaaatgcccacataaaactttctaatttgtacaagagtgcc gctgatgactcagaagcaaagagcaatgaactaacccgggcagtagaggaactacacaaa cttttgaaagaagctggtgaagccaacaaagcaatacaagatcatcttctagaggtggag caatccaaagatcaaatggaaaaagaaatgcttgagaaaatagggagattggagaaggaa ttagagaatgcaaatgaccttctttctgccacaaaacgtaaaggagccatattgtctgaa gaagagcttgccgccatgtctcctactgcagcagctgtagctaagatagtgaaacctggg atgaaactaactgagctctataatgcttatgtggaaactcaggatcagttgcttttggag aaactagagaacaaaagaattaataagtacctagatgaaatagtgaaagaagtggaagcc aaagcaccaattttgaaacgccagcgtgaggaatatgaacgtgcacagaaagctgtagca agtttatctgttaagcttgaacaagctatgaaggagattcagcgattgcaggaggacact gataaagccaacaagcaatcatctgtacttgagagagataatcgaagaatggaaatacaa gtaaaagatctttcacaacagattagagtgcttttgatggaacttgaagaagcaaggggt aaccacgtaattcgtgatgaggaagtaagctctgctgatataagtagttcatctgaggta atatcacagcatctagtatcttacagaaatattgaagagcttcaacaacaaaatcaacgt ctcttagtggcccttagagagcttggggaaaccagagaaagagaagaacaagaaacaact tcatccaaaatcactgagcttcagctcaaacttgagagtgcccttactgaactagaacaa ctccgcaaatcacgacagcatcaaatgcagcttgttgattccatagttcgtcagcgtgat atgtaccgtattttattgtcacaaacaacaggagttgccattccattacatgcttcaagc ttagatgatgtttctcttgcatcaactccaaaacgtccaagtacatcacagactgtttcc actcctgctccagtacctgttattgaatcaacagaggctatagaggctaaggctgccctt aaacaggaaatttttgagaactacaaaaaagaaaaagcagaaaatgaaaaaatacaaaat gagcagcttgagaaacttcaagaacaagttacagatttgcgatcacaaaataccaaaatt tctacccagctagattttgcttctaaacgttatgaaatgctgcaagataatgttgaagga tatcgtcgagaaataacatcacttcatgagagaaatcagaaactcactgccacaactcaa aagcaagaacagattatcaatacgatgactcaagatttgagaggagcaaatgagaagcta gctgtcgcagaagtaagagcagaaaatttgaagaaggaaaaggaaatgcttaaattgtct gaagttcgtctttctcagcaaagagagtctttgttagctgaacaaagggggcaaaactta ctgctaactaatctgcaaacaattcagggaatactggagcgatctgaaacagaaaccaaa caaaggcttagtagccagatagaaaaactggaacatgagatctctcatctaaagaagaag ttggaaaatgaggtggaacaaaggcatacacttactagaaatctagatgttcaactttta gatacaaagagacaactggatacagagacaaatcttcatcttaacacaaaagaactatta aaaaatgctcaaaaagaaattgccacattgaaacagcacctcagtaatatggaagtccaa gttgcttctcagtcttcacagagaactggtaaaggtcagcctagcaacaaagaagatgtg gatgatcttgtgagtcagctaagacagacagaagagcaggtgaatgacttaaaggagaga ctcaaaacaagtacgagcaatgtggaacaatatcaagcaatggttactagtttagaagaa tccctgaacaaggaaaaacaggtgacagaagaagtgcgtaagaatattgaagttcgttta aaagagtcagctgaatttcagacacagttggaaaagaagttgatggaagtagagaaggaa aaacaagaacttcaggatgataaaagaagagccatagagagcatggaacaacagttatct gaattgaagaaaacactttctagtgttcagaatgaagtacaagaagctcttcagagagca agcacagctttaagtaatgagcagcaagccagacgtgactgtcaggaacaagctaaaata gctgtggaagctcagaataagtatgagagagaattgatgctgcatgctgctgatgttgaa gctctacaagctgcgaaggagcaggtttcaaaaatggcatcagtccgtcagcatttggaa gaaacaacacagaaagcagaatcacagttgttggagtgtaaagcatcttgggaggaaaga gagagaatgttaaag >gi568815597r:186343982_186546482|GENSCAN_predicted_peptide_2|335_aa MGRTYIVEETVGQYLSNINLQGKAFVSGLLIGQCSSQKDYVILATRTPPKEEQSENLKHP KAKLDNLDEEWATEHACQVSRMLPGGLLVLGVFIITTLELANDFQNALRRSSARPADWKY QSGLSSSWLSLECTVHINIHIPLSATSVSYTLEKNTKNGLTRWAKEIENGVYLINGQVKD EDCDLLEGQLLNSDHRSTATVQICSGSVNLKGAVKCRAYIHSSKPKVKDAVQDNYSNNIL VDLRYQDKRIHYSEKEFHVLPYRVFVPLPGSTVMLCDYKFDDESAEEIRDHFMEMLDHTI QIEDLEIAEETNTGVIAAFTVAVLAAGISFHYFSD >gi568815597r:186343982_186546482|GENSCAN_predicted_CDS_2|1008_bp atgggaagaacctacattgtagaagagactgttggccagtatctttcaaacataaatctc caaggaaaggcttttgtctctggccttttaataggacagtgttcgtcacaaaaggattat gtgattcttgccactagaacgccacccaaagaggagcaaagtgagaacctcaaacatccc aaagctaagttggataacttggatgaagaatgggccacagaacatgcctgccaggtatcc agaatgctaccagggggacttttagttcttggagtatttattattactactttagaactg gcaaatgattttcaaaatgccctgcgtagaagttcagcaagaccagcagattggaagtat caaagtggattatcatcctcatggctttctttagagtgtacagttcacattaatattcac atcccactttctgctacttctgtcagctatactctggagaaaaatacaaagaatggactt acacgctgggccaaggaaatagaaaatggtgtttatttgattaatggacaagttaaagat gaagattgtgacctattagaaggacagctcctgaattcagaccacagatccacagccaca gtccagatatgtagcggttctgtaaaccttaagggtgctgtgaaatgcagagcttatatc cacagcagtaaacccaaagttaaagatgctgtgcaggataattattccaacaacatatta gttgatcttaggtatcaagataaaagaatacattattctgaaaaagagttccacgtcctc ccttatcgagtctttgttccccttcctggatccactgtaatgttgtgtgattataaattt gacgatgagtcagctgaagaaatcagggaccattttatggagatgttggatcacacaatt caaatagaagatttggaaattgcagaggaaacaaacacaggtgtgattgcagcatttaca gttgcagtccttgctgcgggtatctcctttcattacttcagtgattag >gi568815597r:186343982_186546482|GENSCAN_predicted_peptide_3|135_aa MNEELFLKDEQIKWLLEIEPTPGKDAVNTVEMTTKDSECHTNLVEKEAIGFEGTDSNSES FTVGKMLSNGITCYRKIFHEKKSQLMWQMSLSFKKLPWPPQPSATTTLISQQPSTSRQKP STIKKIMTEGYEDYD >gi568815597r:186343982_186546482|GENSCAN_predicted_CDS_3|408_bp atgaatgaggagttgtttcttaaggatgagcaaataaagtggcttctggagatagaacct actcctggcaaagatgctgtgaacactgttgaaatgacaacaaaggattcagaatgtcac acaaacttagttgaaaaagaagccatagggtttgaggggactgactccaattctgaaagt tttactgtgggtaaaatgctatcaaatggcatcacctgctacagaaaaatctttcatgaa aagaagagtcaattgatgtggcaaatgtcattgtctttcaagaaattgccatggccaccc caaccttcagcaaccaccaccctaatcagtcaacagccatcaacatcaagacaaaaacct tccaccatcaaaaagattatgactgaaggttatgaagattatgactga >gi568815597r:186343982_186546482|GENSCAN_predicted_peptide_4|273_aa MSGPKGVINDWRKFKLESQDSDSIPPSKKEILRQMSSPQSRNGKDSKERVSRKMSIQEYE LIHKEKEDENCLRKYRRQCMQDMHQKLSFGPRYGFVYELETGKQFLETIEKELKITTIVV HIYEDGIKGCDALNSSLTCLAAEYPIVKFCKIKASNTGAGDRFSLDVLPTLLIYKGGELI SNFISVAEQFAEEFFAGDVESFLNEYGLLPERETLKSYVSIAGREERRKQMVWQWRGRRS AIVLDFGETKVEPFIVRIQLKARGQENPVDEVC >gi568815597r:186343982_186546482|GENSCAN_predicted_CDS_4|822_bp atgtcaggacccaaaggagtaataaatgattggagaaagtttaaattagagagtcaagac agtgattcaattccacctagcaagaaggagattctcaggcaaatgtcttctcctcagagt aggaatggcaaagattcaaaggaacgagtcagcagaaagatgagcattcaagaatatgaa ctaatccataaagagaaagaggatgaaaactgccttcgtaaataccgtagacagtgtatg caggatatgcaccagaagctgagttttgggcctagatatgggtttgtgtatgagctggaa actggaaagcaattcctagaaacaattgaaaaggaactgaagatcaccacaattgttgtt cacatttatgaagatggtattaagggttgtgatgctctaaacagtagtttaacatgcctt gcagcagaataccctatagttaagttttgtaaaataaaagcttcgaatacaggtgctggg gaccgcttttccttagatgtacttcctacactgctcatctataaaggtggggaactcata agcaattttattagtgttgctgaacagtttgctgaagaattttttgctggggatgtggag tctttcctaaatgaatatgggttactacctgaaagagagactctaaaatcctatgtctca attgcaggaagggaagaaaggaggaagcaaatggtgtggcagtggaggggaagaaggtct gcgattgtactagattttggtgaaacaaaagtggaacctttcattgtcagaatccagctg aaagccagaggacaagagaaccctgttgatgaagtctgttaa >gi568815597r:186343982_186546482|GENSCAN_predicted_peptide_5|60_aa MKNDTEMELEVEETIWLVGSSALHHPYCRTQPGAAVTIWGIAGHCGKGKERKCEELCTGS >gi568815597r:186343982_186546482|GENSCAN_predicted_CDS_5|183_bp atgaaaaatgacactgaaatggagctagaggtagaggaaacaatatggttggttggcagt tctgctctgcatcatccttactgcaggacccagcctggtgcagcagtgactatctggggc attgctggtcattgtggcaaagggaaagaaaggaagtgtgaggaattgtgcactggctct taa >gi568815597r:186343982_186546482|GENSCAN_predicted_peptide_6|155_aa MRDCAMRNGAPGPKYYAFPVVFANRRPGDSLSTLGERVAVGRASADLSIPACWLCAVDLP AQRSSSAKGQTASSSGSLTTLSPDWETSPSRGQQTPHIGELWLASGGCPSGINLPEEGTG SNLCCSGAFTGDTQANRAWSGPRGNSSRPATQGPD >gi568815597r:186343982_186546482|GENSCAN_predicted_CDS_6|468_bp atgagggactgtgccatgaggaacggtgcacccgggcccaaatactatgcttttcccgtg gtctttgcaaaccgcagaccaggagattccctcagcaccctgggggaaagggtggctgtg ggcagagcttcagcagacttaagcattcctgcctgctggctctgtgcagtggatctccca gcacagcgttcgagctctgctaagggtcagactgcctcctcaagtgggtccctgaccacc ttgtctcctgactgggagacatctcccagcaggggtcaacagacacctcatataggagag ctctggctggcatctggtgggtgcccctctgggataaatcttccagaggaaggaacaggc agcaatctttgctgttctggagccttcactggtgatacccaggcaaacagggcctggagt ggacctcgaggaaactccagcagacccgcaacacaggggcctgactga >gi568815597r:186343982_186546482|GENSCAN_predicted_peptide_7|235_aa MWKRLWNWVTGRDWKNLEGSEEDRKMQESLELPRDLLTGFDKNADSDMDNEVQAEVVSDG DAKLAGNWNKGHHAAQAIASEGKSPKPWQLTHGVGPIGAQKLRIEVWEPQPGFQRMYKNT YMSSSFAFAEEGFTSDYVIHFRLVPAVTTGWPEHRPVYPGFSPPFRTENIAWGPGDCPIQ ATTSGISALLVEPAVGPKLPAATTSAGTYLQVPSLGLDIGLSSPSQPLPISRHTA >gi568815597r:186343982_186546482|GENSCAN_predicted_CDS_7|708_bp atgtggaagcgactttggaactgggtaacaggcagagattggaaaaatttggagggctca gaagaagacaggaaaatgcaggaaagtttggaacttcctagagacttgttgactggcttt gacaaaaatgctgatagtgatatggacaatgaagtccaggctgaggtggtctcagatgga gatgcaaaacttgctgggaactggaataaaggccaccatgctgctcaggccattgcttca gagggtaaaagccccaagccttggcagcttacacatggtgttgggcctattggtgcacag aagttaagaattgaggtttgggaacctcaacctggatttcagaggatgtataaaaatacc tatatgtccagctcctttgcatttgctgaggagggttttacttccgattatgtgatccat tttagactggtgcctgctgtcaccactgggtggcctgagcacaggcctgtctaccctggt ttcagccctcccttcaggacagagaacatagcttggggtcctggagattgcccaattcaa gctaccacttcgggcatctcagcactccttgtggagcctgcggttggacctaaactccca gctgctaccacttcagctggcacctacctgcaagtgccatctttaggcttggatattggc ctgtctagtccatcgcagccactaccaatatcaaggcacactgcttga