GENSCAN 1.0 Date run: 5-Nov-116 Time: 23:29:01 Sequence gi568815595f:190288120_190510033 : 221914 bp : 36.88% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 39 34 6 1.05 1.01 Sngl - 4789 4340 450 1 0 59 47 317 0.438 20.66 1.00 Prom - 11245 11206 40 -7.15 2.07 PlyA - 11412 11407 6 1.05 2.06 Term - 11842 11736 107 2 2 83 48 99 0.927 2.99 2.05 Intr - 18606 18474 133 0 1 94 61 98 0.383 7.00 2.04 Intr - 20320 20164 157 0 1 107 -7 99 0.704 1.39 2.03 Intr - 22134 22050 85 1 1 82 97 53 0.970 3.56 2.02 Intr - 24917 24753 165 0 0 29 90 119 0.170 5.21 2.01 Init - 34087 33865 223 1 1 87 73 437 0.479 38.86 2.00 Prom - 34361 34322 40 -4.15 3.00 Prom + 36907 36946 40 -5.85 3.01 Init + 48223 48303 81 0 0 71 98 72 0.779 7.52 3.02 Intr + 52920 52985 66 2 0 81 86 46 0.410 1.88 3.03 Term + 53259 53384 126 2 0 90 44 81 0.498 1.20 3.04 PlyA + 54375 54380 6 1.05 4.02 PlyA - 56503 56498 6 1.05 4.01 Sngl - 58729 58511 219 1 0 89 54 146 0.841 6.23 4.00 Prom - 68120 68081 40 -4.65 5.00 Prom + 70213 70252 40 -6.85 5.01 Init + 71773 71821 49 0 1 69 63 90 0.310 5.76 5.02 Intr + 82774 82882 109 2 1 85 84 99 0.982 7.62 5.03 Term + 84837 84939 103 0 1 139 38 63 0.883 3.17 5.04 PlyA + 85096 85101 6 1.05 6.00 Prom + 91391 91430 40 -5.35 6.01 Init + 100211 100324 114 1 0 96 94 70 0.900 8.82 6.02 Intr + 110162 110263 102 1 0 67 11 121 0.615 1.75 6.03 Intr + 114218 114320 103 1 1 84 57 100 0.958 5.33 6.04 Intr + 116643 116807 165 1 0 94 86 138 0.901 13.21 6.05 Intr + 120195 120386 192 1 0 99 116 -21 0.420 0.34 6.06 Term + 121784 121917 134 0 2 70 38 97 0.802 0.17 6.07 PlyA + 122819 122824 6 1.05 7.02 PlyA - 123574 123569 6 1.05 7.01 Sngl - 129285 128608 678 0 0 59 54 266 0.973 16.23 7.00 Prom - 139173 139134 40 -6.75 8.03 PlyA - 139900 139895 6 1.05 8.02 Term - 141612 141476 137 1 2 75 34 117 0.088 2.20 8.01 Init - 152164 152125 40 1 1 99 101 35 0.174 6.30 8.00 Prom - 159621 159582 40 -3.15 9.03 PlyA - 159846 159841 6 1.05 9.02 Term - 163657 163520 138 1 0 69 41 111 0.120 1.48 9.01 Init - 186767 186630 138 2 0 85 61 105 0.451 7.69 9.00 Prom - 187391 187352 40 -4.95 10.03 PlyA - 188661 188656 6 1.05 10.02 Term - 189641 189435 207 2 0 82 49 90 0.689 0.86 10.01 Init - 193903 193853 51 1 0 50 105 96 0.736 6.72 10.00 Prom - 202275 202236 40 -3.85 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 46595 46302 294 2 0 66 39 171 0.925 5.45 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:190288120_190510033|GENSCAN_predicted_peptide_1|149_aa MWESLELPRDLLNGFDQNADSDVDNEVQAEVVSDGDEELTGNWNEDHFCYALAKRLVAFC LFPRDMWNFKLERDDLKLEVMFKREAEHKSLENLQPDDAIEKEKPISGEKFKPAAEICIS NKEPNINSQDNGENISRACQRSLRQPFPS >gi568815595f:190288120_190510033|GENSCAN_predicted_CDS_1|450_bp atgtgggaaagtttggaacttcctagagacttgttgaatggttttgaccaaaatgctgat agtgatgtggacaatgaagtccaagctgaggtggtctcagatggagatgaggaacttact gggaactggaatgaagatcacttttgctatgctttagcaaagagactggtggcattttgc ctcttccctagagatatgtggaactttaaacttgagagagatgatctgaaattggaagtt atgtttaaaagggaagcagagcataaaagtttggaaaatttgcagcctgatgatgcaata gaaaaagaaaaacccatttctggggagaaattcaagccagctgcagaaatttgcataagt aataaggagccaaacattaatagccaagacaatggggaaaatatctccagggcatgtcag agatctttgaggcagccctttccatcatag >gi568815595f:190288120_190510033|GENSCAN_predicted_peptide_2|289_aa MANAGLQLLGFILAFLGWIGAIVSTALPQWRIYSYAGDNIVTAQAMYEGLWMSCVSQSTG QIQCKVFDSLLNLSSTLQATRALMVVGILLGVIAIFVATVGMKCMKCLEDDEVQKMRMAV IGGAIFLLAGLAILVATAWYGNRIVQEFYDPMTPVNARYEFGQALFTGWAAASLCLLGGA LLCCSCPRKTTSYPTPRPYPKPAPSSGKDYSVQNAISLEQDDVMERVLALVSGDLDLSLG AINHRLCLSKAFGCLYGFRSHGYFGSRKVTLPSLLPPYASQEDEDAKEG >gi568815595f:190288120_190510033|GENSCAN_predicted_CDS_2|870_bp atggccaacgcggggctgcagctgttgggcttcattctcgccttcctgggatggatcggc gccatcgtcagcactgccctgccccagtggaggatttactcctatgccggcgacaacatc gtgaccgcccaggccatgtacgaggggctgtggatgtcctgcgtgtcgcagagcaccggg cagatccagtgcaaagtctttgactccttgctgaatctgagcagcacattgcaagcaacc cgtgccttgatggtggttggcatcctcctgggagtgatagcaatctttgtggccaccgtt ggcatgaagtgtatgaagtgcttggaagacgatgaggtgcagaagatgaggatggctgtc attgggggtgcgatatttcttcttgcaggtctggctattttagttgccacagcatggtat ggcaatagaatcgttcaagaattctatgaccctatgaccccagtcaatgccaggtacgaa tttggtcaggctctcttcactggctgggctgctgcttctctctgccttctgggaggtgcc ctactttgctgttcctgtccccgaaaaacaacctcttacccaacaccaaggccctatcca aaacctgcaccttccagcgggaaagactactctgtacagaatgctatttcacttgagcaa gatgatgtaatggaaagggtgttggcattggtgtctggagacctggatttgagtcttggt gctatcaatcaccgtctgtgtttgagcaaggcatttggctgcttgtatggcttccgttct catggttattttggttccaggaaggttactctaccctcactcttaccaccttatgcaagt caagaagatgaagacgcaaaagaaggataa >gi568815595f:190288120_190510033|GENSCAN_predicted_peptide_3|90_aa MQIQWLATYTMLLGIQWSGAYSNIPSKGRAFPPAAFMGWHDIECLQLFQAQHQVEAAKVW ELHPLKSWPELYVGPFQPWLEQLGCRVPSP >gi568815595f:190288120_190510033|GENSCAN_predicted_CDS_3|273_bp atgcagattcagtggcttgccacatatacgatgcttttagggatccagtggtctggtgca tactcgaacatcccttccaagggtagagcttttcctccagctgctttcatgggctggcat gacattgagtgtctgcagcttttccaggctcaacaccaggtggaagctgccaaggtttgg gaattgcaccctctgaagtcatggcctgagctctatgttggcccctttcagccatggctg gagcagctgggatgcagggtaccaagtccctag >gi568815595f:190288120_190510033|GENSCAN_predicted_peptide_4|72_aa MGRGKAMGDTGENVMWKGRGWGDGSASQATPGIVSNHYKLGEGREGYFPRSPSDKAYDLT GILISDVWAPEK >gi568815595f:190288120_190510033|GENSCAN_predicted_CDS_4|219_bp atgggcagagggaaggcaatgggagacacaggggagaatgtcatgtggaaaggcagaggc tggggtgatggatctgcaagccaagcaacaccagggattgtgagtaaccactacaagcta ggagaaggcagggaaggatacttcccaagatccccttcagacaaagcctatgaccttact ggtatcctgatctcagacgtctgggctccagaaaaatga >gi568815595f:190288120_190510033|GENSCAN_predicted_peptide_5|86_aa MRLDVKKHGSRVLALKDDSCPQTSDSKFLSFGTQTGFLAPQLADELLWDLVIIISACVGI LTVHADNLPDSSKGSPFVCSAFQSLT >gi568815595f:190288120_190510033|GENSCAN_predicted_CDS_5|261_bp atgaggctcgatgtgaagaaacatggatctcgagttttggcactgaaagatgattcctgc cctcaaacatcggactccaagttcttgagttttgggactcagactggctttcttgctcct cagcttgcagatgaactactgtgggatcttgtgatcatcatttctgcctgtgtcggaatc cttactgtacatgctgataatctgcctgactcttcaaagggcagcccctttgtttgttct gccttccagtccctgacatag >gi568815595f:190288120_190510033|GENSCAN_predicted_peptide_6|269_aa MRDLLQYIACFFAFFSAGFLIVATWTDCWMVNADDSLEFWRLEAYNQEVGKAALPLKVLE ESVLSLFQLLVAVSTKCRGLWWECVTNAFDGIRTCDEYDSILAEHPLKLVVTRALMITAD ILAGFGFLTLLLGLDCVKFLPDEPYIKVRICFVAGATLLIAGTPGIIGSVWYAVDVYVER STLVLHNIFLGIQYKFGWSCWLGMAGSLGCFLAGAVLTCCLYLFKDVGPERNYPYSLRKA YSAAGVSMAKSYSAPRTETAKMYAVDTRV >gi568815595f:190288120_190510033|GENSCAN_predicted_CDS_6|810_bp atgagggatcttcttcaatacatcgcttgcttctttgcctttttctctgctgggtttttg attgtggccacctggactgactgttggatggtgaatgctgatgactctctggagttctgg aggctggaagcttataatcaagaagttggcaaggctgcgctgcccctgaaagttctggaa gaatccgttcttagcctcttccagcttctggtggctgtgagcacaaaatgccgaggcctc tggtgggaatgcgtcacaaatgcttttgatgggattcgcacctgtgatgagtacgattcc atacttgcggagcatcccttgaagctggtggtaactcgagcgttgatgattactgcagat attctagctgggtttggatttctcaccctgctccttggtcttgactgcgtgaaattcctc cctgatgagccgtacattaaagtccgcatctgctttgttgctggagccacgttactaata gcaggtaccccaggaatcattggctctgtgtggtatgctgttgatgtgtatgtggaacgt tctactttggttttgcacaatatatttcttggtatccaatataaatttggttggtcctgt tggctcggaatggctgggtctctgggttgctttttggctggagctgttctcacctgctgc ttatatctttttaaagatgttggacctgagagaaactatccttattccttgaggaaagcc tattcagccgcgggtgtttccatggccaagtcatactcagcccctcgcacagagacggcc aaaatgtatgctgtagacacaagggtgtaa >gi568815595f:190288120_190510033|GENSCAN_predicted_peptide_7|225_aa MWESLEPPRDLLNGLHQNADSDVDSEVQAEVVSNGDKELSRSWNEDYSCCALAKRLVAFC LCPIYLWNFELQRDYLKLDLMFKKEIEHKSLENLWPDNVIEKKNPFSGEKFKLAVEISLS NKKPNINSQDNWENISRACQRLLRQPLPSQAWKPRRKRWFRGPGPWLCCFVQPWDLVSCV PAAPAPVGTTRGQGTAQAIVLEAANPWQLPHGVGPVGAQNSRIKV >gi568815595f:190288120_190510033|GENSCAN_predicted_CDS_7|678_bp atgtgggaaagtttggaacctcctagagatttgttgaatggtttgcaccaaaatgctgat agtgatgtggacagtgaagttcaagccgaggtggtctcaaatggagataaggaacttagt aggagctggaatgaagattactcttgctgtgctttagcaaagagactggtggcattttgc ctctgccctatatatctgtggaactttgaacttcaaagagattatctgaaattggatctt atgtttaaaaaggaaatagagcataaaagtttggaaaatttgtggcctgacaatgtgata gaaaagaaaaacccattttctggggagaaattcaagctggctgtagaaattagcctgagt aataagaagccaaacattaatagccaagacaattgggaaaatatctccagggcatgtcag agacttttgaggcagcccctcccatcacaggcctggaagcctaggaggaaaagatggttt cgtgggccaggcccctggctctgctgctttgtgcagccttgggatttggtgtcctgtgtc ccagcagctccagctccagttggcactacaaggggccaaggaacagctcaggccatagtt ttagaagctgccaatccttggcagcttccacatggtgttgggcctgtgggtgcacagaat tcaagaattaaggtttga >gi568815595f:190288120_190510033|GENSCAN_predicted_peptide_8|58_aa MAVFAVGDLDSIYGTEAAVSPTVGIHLQTQTPDLYPVPAPCFGPLGSPPPYEEIVKTT >gi568815595f:190288120_190510033|GENSCAN_predicted_CDS_8|177_bp atggcagtttttgctgttggagacttggactctatttatgggacagaagcagctgtgagt ccaactgttggaattcaccttcaaactcaaacccctgacctatatcctgttcctgctcca tgttttggccctttaggctccccacctccatatgaagaaattgtaaaaacaacctga >gi568815595f:190288120_190510033|GENSCAN_predicted_peptide_9|91_aa MGPYWVWLCLAAGSPNTEDMHISAGLTMVICPNPRIQRLKTKYSHTASETLKMERIWGIN NTVGLVYFSPIGIGTKPSSALKFMAATVAAY >gi568815595f:190288120_190510033|GENSCAN_predicted_CDS_9|276_bp atggggccatattgggtttggctctgccttgcagctggcagtccaaacactgaggacatg catatttctgcaggactcactatggtcatctgtccaaaccccagaatccagagactcaaa acgaaatacagtcatacagcttcagagacactcaaaatggagcgtatctgggggataaat aatactgtaggtttggtttacttttctcctattggcattggcaccaagccttcctctgct ttgaaattcatggctgcgaccgtggctgcatactaa >gi568815595f:190288120_190510033|GENSCAN_predicted_peptide_10|85_aa MRWGFSLLLSLLREPQLLGLQTLLGCCKSDYGFCHFLMAKTTVTLQQPNMLITVVKVDIF VMFQILEERLSVFPIQYDNGYGSAI >gi568815595f:190288120_190510033|GENSCAN_predicted_CDS_10|258_bp atgcgctggggattctctctgctcctgtctctgctcagggagccccagctgctaggcctt caaacactattaggttgctgcaaaagtgactatggtttttgccattttttaatggcaaaa accacagttacgttgcagcagcctaatatgttgataacagtggtgaaagtggacatcttt gtcatgttccagatcttagaggaaagactttcagttttccctattcagtatgataatggc tatgggtctgccatatag