GENSCAN 1.0    Date run: 3-Nov-116    Time: 18:30:40

Sequence gi568815583f:80350882_80693695 : 342814 bp : 44.24% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  12490  12648  159  0  0  115   59    56 0.847   5.46
 1.02 Intr +  13799  13924  126  1  0   70   99    63 0.963   6.48
 1.03 Intr +  14478  14654  177  2  0   72   61    63 0.721   2.12
 1.04 Term +  44487  44606  120  2  0   97   46    69 0.265   2.07
 1.05 PlyA +  45270  45275    6                               1.05

 2.11 PlyA -  51243  51238    6                               1.05
 2.10 Term -  54065  53820  246  2  0   68   44   165 0.170   5.89
 2.09 Intr -  54528  54425  104  1  2  138   91   -30 0.129   2.19
 2.08 Intr -  56497  56353  145  1  1   44   11   121 0.011  -0.14
 2.07 Intr -  68623  68587   37  0  1  138   81    19 0.089   4.46
 2.06 Intr -  74270  74206   65  2  2   84   80    67 0.496   3.02
 2.05 Intr -  80978  80886   93  2  0   82   70    58 0.737   3.56
 2.04 Intr -  81332  81143  190  1  1   68   61    91 0.754   3.99
 2.03 Intr -  85506  85313  194  0  2  132   52    75 0.939   6.59
 2.02 Intr -  85788  85684  105  0  0   68   26    92 0.679   1.41
 2.01 Init -  86516  86463   54  2  0   81   72    16 0.553   0.58
 2.00 Prom -  88914  88875   40                              -1.86

 3.00 Prom +  95948  95987   40                              -3.86
 3.01 Init + 100001 100113  113  1  2   80  102   108 0.457  11.22
 3.02 Term + 100735 100954  220  1  1   69   41   258 0.998  15.71
 3.03 PlyA + 101522 101527    6                               1.05

 4.00 Prom + 104858 104897   40                              -5.36
 4.01 Init + 105226 105347  122  0  2   55   85    71 0.596   3.16
 4.02 Intr + 107048 107095   48  2  0   83  115    87 0.899   8.70
 4.03 Intr + 111339 111504  166  0  1   50   22   126 0.521   2.16
 4.04 Intr + 113889 113930   42  2  0   58   87    61 0.049   1.44
 4.05 Intr + 117731 117851  121  1  1  110   40    37 0.177   1.17
 4.06 Intr + 119339 119550  212  0  2  -13   80   293 0.289  17.03
 4.07 Intr + 124129 124342  214  0  1   85   82   190 0.411  16.39
 4.08 Intr + 132731 132898  168  0  0   66   39    98 0.011   2.62
 4.09 Term + 151362 151468  107  1  2   71   54   159 0.240   9.37
 4.10 PlyA + 152556 152561    6                               1.05

 5.05 PlyA - 152665 152660    6                               1.05
 5.04 Term - 154937 154832  106  1  1   87   39    70 0.108  -0.12
 5.03 Intr - 157428 157250  179  0  2    5   76   199 0.466   9.12
 5.02 Intr - 162527 162458   70  1  1   66   80    84 0.697   4.58
 5.01 Init - 163581 163574    8  0  2   99   80    11 0.708   1.12
 5.00 Prom - 167261 167222   40                              -1.56

 6.00 Prom + 170465 170504   40                              -6.66
 6.01 Init + 176806 176857   52  0  1   56   95    40 0.046   2.82
 6.02 Intr + 200318 200394   77  0  2  100  110    74 0.306  10.03
 6.03 Intr + 201759 201893  135  2  0   99  109   141 0.999  18.06
 6.04 Intr + 204184 204258   75  0  0   65   88    34 0.623   0.81
 6.05 Intr + 208476 208577  102  2  0   46   81    64 0.464   1.87
 6.06 Intr + 210188 210316  129  1  0   80   92    30 0.732   3.49
 6.07 Intr + 212207 212358  152  1  2  111   70   297 0.933  29.16
 6.08 Intr + 223267 223339   73  1  1  146  109    59 0.998  13.21
 6.09 Intr + 224106 224229  124  2  1   40  103    90 0.957   5.86
 6.10 Intr + 225985 226084  100  2  1   86   91    41 0.471   3.37
 6.11 Intr + 227605 227709  105  1  0   42   46   131 0.340   3.63
 6.12 Intr + 229530 229668  139  0  1   96   80     8 0.468   1.27
 6.13 Intr + 229886 230045  160  1  1   99   59    55 0.539   3.26
 6.14 Intr + 231098 231126   29  0  2   45   99    23 0.392  -3.07
 6.15 Intr + 232327 232434  108  0  0   61   80   107 0.973   7.68
 6.16 Intr + 240687 240776   90  2  0   45   89   110 0.736   6.99
 6.17 Term + 242719 242817   99  0  0  145   37   108 0.957   9.53
 6.18 PlyA + 247027 247032    6                               1.05

 7.09 PlyA - 248270 248265    6                               1.05
 7.08 Term - 260853 260779   75  0  0  107   48    55 0.527   1.24
 7.07 Intr - 264411 264262  150  0  0   77  103    19 0.208   2.66
 7.06 Intr - 270160 270036  125  2  2   82   32   102 0.137   4.30
 7.05 Intr - 280685 280542  144  0  0  100   28    69 0.143   2.35
 7.04 Intr - 287994 287873  122  2  2  111   38    41 0.706   1.54
 7.03 Intr - 289862 289819   44  2  2   96   76    41 0.751   0.74
 7.02 Intr - 290309 290208  102  2  0   79  116    29 0.732   5.17
 7.01 Init - 318311 318231   81  2  0   87  110    -9 0.095   2.17

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 138934 138925   10  1  1  111  109    18 0.808   6.24
S.002 Init + 309384 309433   50  2  2   79   75    25 0.830   0.72
S.003 Term + 309827 309923   97  2  1  111   33    91 0.894   3.34

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583f:80350882_80693695|GENSCAN_predicted_peptide_1|193_aa
MALLLLTVLGPWNCGHPGLSKIGNQQKFKSEVKAQMAEYEKNLFENKNHFCGGRSWQEGK
AQKSLDLIKQQGQPQWNWRLSEVTQQRKTKRMFSLCWDYRCEPLCPATFQVFEAELSSQS
IGPPPSTDHTRSLVSPVKPLEEEPNACTLKPAGQLIESWFPVQVLDELATQGNGEFLEQS
AVTAAIARTTSGF

>gi568815583f:80350882_80693695|GENSCAN_predicted_CDS_1|582_bp
atggccctgctgctcctcactgttctgggcccatggaactgtggacacccaggactctca
aaaattgggaatcagcagaagtttaaaagtgaagtaaaagcacaaatggcagaatatgaa
aaaaatttatttgaaaacaaaaaccatttctgtggaggtaggtcctggcaagaagggaaa
gcccaaaagagcctggacctgataaaacaacagggccagcctcaatggaattggagacta
agtgaagtaactcagcaacggaaaaccaaacgtatgttctcactctgctgggattacagg
tgtgagccactatgcccggccaccttccaagtctttgaagctgaactttcaagccagagt
attggtcctcctcccagtacagaccacaccaggtccctggtcagccccgtcaagcctctt
gaggaggagcctaatgcctgcacacttaagcctgctgggcagttaatagaaagttggttt
ccagtgcaagtattagatgaattagcaacacagggaaatggtgagtttctggagcagtca
gcagttactgcagccatagctagaactacttctgggttctga

>gi568815583f:80350882_80693695|GENSCAN_predicted_peptide_2|410_aa
MRPQKRREDKNGIREMEWTQREQLPSFLVPGPTQVMKQTQPPPGYQMVNHLEDVGGTPRG
QEAVSIHLSYGITTPFSDGHAFLVAFFEIEWAPQNGVPRSLGDCSDPSCFTCQHPETQDA
GEGLVRVHYFIKVHTLQGAVVEWETDAAFLPIDSRPLIFAAELSGDSSINSPWRKTFTRI
MEVLREGFGHQGVFEEALEEQSPPSAPKGRELKQGYCEQLFDKTKTSVPRLGHRSPRESQ
CDSGKKSNMSNMSVKTPTSPVTNKTLENPHYTSQKSVIVNGSAVVKWSAGKKGTRTQQPA
VPTAASAQHPKVCTSLDVHVSLLPHPAHQRGALHQPGLADAVMPQKPGPEEKNAWDGEGK
GGYFQNFGGSRACPSGSRGCEASVRPPAGPDRRLPGTGWAAIGMQMPLWC

>gi568815583f:80350882_80693695|GENSCAN_predicted_CDS_2|1233_bp
atgaggccacaaaagaggagagaagacaaaaatgggataagagagatggagtggacacaa
cgagagcagctaccatccttcctggtgccagggcccacacaagtcatgaagcaaacacag
cccccgccaggataccagatggttaatcatctggaggatgtaggaggaactccccgaggt
caggaagcagtgtctattcatttgtcgtatggaatcaccacacccttcagtgatggccat
gctttccttgtcgccttctttgagattgagtgggccccacagaacggcgttcccagaagc
cttggtgactgctcagatcccagctgctttacctgccagcatcctgaaactcaagacgca
ggtgaagggctggtgcgggtgcactatttcatcaaagtccacaccctccagggagcagta
gttgaatgggaaacagatgctgcatttttacctattgacagtaggcccttaatctttgca
gctgagctttcaggagacagcagcatcaattctccgtggagaaaaactttcacaaggatc
atggaggtcctgagagagggcttcggccaccagggggtctttgaggaagcactggaggaa
caaagcccaccctcagcaccaaaagggagagagctgaaacaaggctactgtgagcagctc
ttcgataaaacaaaaaccagtgtgccaagacttggccacagatcacccagggaaagccag
tgtgacagtggaaagaagagtaacatgagtaacatgagcgtgaagaccccaacatcaccg
gtcaccaacaagaccctagagaatccccactacacatcacagaagagtgtcattgtaaat
ggttctgctgttgtgaagtggtcagctgggaaaaaaggaacaaggacccagcaacctgcc
gtgcccacagcagcctctgcacaacacccaaaagtttgtactagcttagatgttcacgtt
tctcttctcccacaccctgctcatcagcgcggagccctccaccagccaggactggctgac
gccgtgatgccacaaaagccgggaccggaagagaaaaatgcgtgggatggggaaggcaaa
ggcggttattttcagaacttcgggggctcccgggcctgcccttcgggttcccgaggatgc
gaggccagcgtccggccgcccgcgggtccggaccggcgcctgcccgggaccggatgggca
gcgattgggatgcagatgccgctctggtgctga

>gi568815583f:80350882_80693695|GENSCAN_predicted_peptide_3|110_aa
MASDIPGSVTLPVAPMAATGQVRMAGAMPARGGKRRSGPEKPTNQPTNQPTNQPTTHQPS
NQPTNQPTNQPINQPTNQPINQPTNYPPTIQPTKILQEKHDGHFQTQDIT

>gi568815583f:80350882_80693695|GENSCAN_predicted_CDS_3|333_bp
atggcttcagacatacctggatctgtgacgttgcccgttgcccccatggcggccaccgga
caggtgaggatggcgggggccatgcctgcccgtggaggaaagcggcgttccggacctgaa
aaaccaaccaaccaaccaactaaccaaccaaccaaccaaccaactacccaccaaccatcc
aaccaaccaaccaatcaaccaactaaccaaccaatcaaccaaccaaccaaccaaccaatc
aaccaaccaaccaactaccccccaaccatccaaccaaccaaaatcctgcaggaaaaacat
gatggacattttcagactcaagatatcacctag

>gi568815583f:80350882_80693695|GENSCAN_predicted_peptide_4|399_aa
MQNIDSFDSFRKSCIQLPLVHNIGNSKAAARCLVSQTTLQRMDFDDEDGEGPSKFSREFQ
DTFSYRQLGSAATSTWRHLKYCFIRGITNIPELSSTESKLYTVLIVCLTYVNHAWERLAM
KDVPGKHPQPQDHLLLFAISFSIVVFAFHLPFEAFNHTLRFISTMQENHSEIERRRRNKM
TQYITELSDMVPTCSALARKPDKLTILRMAVSHMKSMRGTGNKSTDGAYKPSFLTEQELK
HLILEAADGFLFVVAAETGRVIYVSDSVTPVLNQPQSEWFGSTLYEQVHPDDVEKLREQL
CTSENSMTGSAPSLQVQQAASSTVEQESEELVACSSCHPGRTQTRAGRKPVIHDAARVSV
HLLEHGAVIDARKFPDVAVGMAGCPGQEESVLRIEEDTS

>gi568815583f:80350882_80693695|GENSCAN_predicted_CDS_4|1200_bp
atgcagaatattgatagctttgattcgtttagaaagtcttgcattcagctcccacttgtc
cacaacatcggcaactctaaagctgctgcccggtgcctggtttcccaaactaccttgcag
agaatggacttcgatgatgaagatggtgaaggccccagtaaattttcaagggagttccag
gacaccttctcataccggcagctggggtcagctgccaccagtacatggaggcatcttaaa
tattgtttcataaggggcattactaacatcccagagctgtcatcaacagagtcaaaatta
tacaccgtgctcatcgtgtgccttacgtatgtgaatcatgcctgggagcgactggccatg
aaggatgttcctggaaagcaccctcagccccaggaccacctcctcctcttcgctatatcg
ttctccattgtggtttttgcttttcacctgccttttgaagcatttaatcatactctgcgt
ttcatttccactatgcaagagaatcatagtgaaatcgaaaggcgcagacggaacaagatg
actcagtacatcacggagctctccgacatggtccccacatgcagcgcactggctcggaag
ccagacaagctcaccatcctccgcatggccgtctcgcacatgaagtccatgaggggtaca
gggaacaagtccaccgatggcgcgtacaagccttccttcctcacagagcaggaactgaag
catctcatccttgaagcagctgatggatttctgtttgtggtggctgctgagacagggcga
gtgatttatgtgtctgactccgtcacccctgttctgaaccagccccagtcagagtggttt
gggagcacactgtatgaacaggtgcatcctgatgacgtggagaagctgagagagcaactg
tgcacctcagaaaactcaatgacagggtccgcaccaagtttgcaggttcagcaggcggca
tcctccactgtggaacaggagagcgaggagctggtagcatgctcatcctgtcaccctgga
aggacccagactagagcaggaagaaagccagttatacatgatgctgcaagagtcagcgtg
cacctcctggaacatggagcagttatagatgcccgcaagttcccagatgtggcagtgggc
atggctggctgccccggacaggaggaaagcgtgctcagaattgaggaggacacgtcctga

>gi568815583f:80350882_80693695|GENSCAN_predicted_peptide_5|120_aa
MVSTRDNQTAGSLLMAEESVGMRDKQKKEQTNHLMACCSPKTDLILQMKDRREPMHILMD
DCCPSFLTVPVFRSKIRPKRERRTVTSKSKFFFLSKAALSPQSPIFVIGTTTTQLLKLEI

>gi568815583f:80350882_80693695|GENSCAN_predicted_CDS_5|363_bp
atggtcagcactagggacaaccagacagctggaagtctgctgatggcagaggagagcgtg
gggatgagagataaacagaaaaaggagcaaaccaaccacctgatggcctgctgctccccc
aaaacagacctcatcctgcagatgaaagaccgccgcgagcccatgcacatcctcatggat
gactgctgcccttctttcttgaccgtcccagtcttcaggtccaagatccggcctaagaga
gagagaaggacagttacgtccaaaagcaaattcttcttcctgtccaaagctgccctttct
ccacagtcccctatctttgtcattggcaccaccactacccagttgctcaaactagaaatt
tag

>gi568815583f:80350882_80693695|GENSCAN_predicted_peptide_6|582_aa
MRIGIVKKSAQFAQLVTGMTIPEEDADVGQGSKYCLVAIGRLQVTSSPVCMDMNGMSVPT
EFLSRHNSDGIITFVDPRCISVIGYQPQDLLGKDILEFCHPEDQSHLRESFQQGTVGSGQ
AATGTSWSTPHLGPVLVFFGLRPLTAKVLRIEMGYSQVTKMSLSPCCMQSGQDDRHHRRI
DHKCGGCEEWVVKLKGQVLSVMYRFRTKNREWMLIRTSSFTFQNPYSDEIEYIICTNTNV
KQLQQQQAELEVHQRDGLSSYDLSQVPVPNLPAGVHEAGKSVEKADAIFSQERDPRFAEM
FAGISASEKKMMSSASAAGTQQIYSQGSPFPSGHSGKAFSSSRGDHAEEEDTITDDLWEK
SEAADRAQGRQSSVSSSVVHVPGVNDIQSSSSTGQNMSQISRQLNQSQVAWTGSRPPFPG
QPCPEAAGRERLGRLRHLIKNEPLTTLDLWLPLRQEGMKPPTLVTSPDPHSHCEGQAHKR
PVVLARTVVMSDGVKRGFGAELTRNRGKVLEHPTVAVTRQLKVDKVAGSSKGGPRKSGRS
GKASTMASRADMLPMPGDPTQGTGNYNIEDFADLGMFPPFSE

>gi568815583f:80350882_80693695|GENSCAN_predicted_CDS_6|1749_bp
atgaggattggaatagtgaagaaatcagcccagtttgcacagctggtaactggaatgacc
atacctgaagaagacgctgatgtgggacaaggcagtaaatattgcctcgtggcaattggg
agactccaggtgaccagctctcctgtatgcatggacatgaatgggatgtcggtgcccaca
gagttcttatcccggcataactccgatggaatcatcacatttgtggatccaagatgtatc
agtgtgattggctaccaaccccaggatcttctgggaaaggacattttggaattctgccac
cctgaggatcaaagccatctgcgtgagagcttccagcagggcactgtgggcagcggccaa
gcagccacagggacatcctggagcacaccgcacctcggccccgtgcttgtgttctttggt
ctccggccactgacagccaaggtcttgaggatagagatggggtattcacaggtgaccaag
atgtccctgtccccatgctgcatgcagtctggccaagacgacagacaccaccgaagaata
gaccacaagtgtggcgggtgtgaggaatgggtggttaagctgaaaggccaagtcctgtcg
gtcatgtatcgatttcgcaccaagaaccgggagtggatgttgatccgcaccagcagcttc
acattccagaatccctattctgatgagattgagtacatcatctgcaccaacaccaacgtc
aagcaacttcagcaacagcaggcagaattggaagtgcaccagagagatggattgtcatcg
tatgacttatcccaggtccccgtccccaacctaccagccggtgttcatgaggccgggaag
tccgtggaaaaggcggatgcaatcttctcccaggaaagagatcctcggtttgctgaaatg
tttgcaggaattagtgcatcggagaagaagatgatgagctcagcctctgcagcaggaacc
cagcagatctactcccaaggaagcccatttccctctggacactccgggaaggccttcagc
agcagccgtggggaccatgctgaggaggaggacaccatcactgatgacctttgggagaag
agtgaagctgcagacagggctcagggcagacagtccagcgtgagctcttcagtggttcat
gtgcctggagtgaatgatattcagtcctcttcttccacgggccagaacatgtcccaaatc
tcccggcagctaaaccagagtcaggtggcatggacagggagtcgtccgccctttccggga
cagccatgccctgaggcagctggcagggaaaggttgggcaggttgagacacctcatcaag
aatgagcccctcactactctggacctctggctgcccctgcgccaggagggcatgaaacca
ccaacccttgtgacatccccagaccctcatagtcactgtgagggacaagctcataagaga
cccgtggtcctggccaggacagtcgttatgtctgatggggtcaagagaggcttcggagca
gagctaacacgcaaccgtggcaaggtcttggagcatcccactgtggccgtgacgagacag
ctgaaagtggacaaagtagcgggcagttccaagggcggccctcggaagtctggtcgcagt
ggcaaagccagcaccatggccagcagagcggacatgctgcccatgccaggagatccaacc
caggggactggcaactataacatcgaagactttgccgacctgggcatgtttccaccgttt
tctgagtag

>gi568815583f:80350882_80693695|GENSCAN_predicted_peptide_7|280_aa
MGHLLQGTQQGESGSSCLRPEVPDGLQIIVTKEVFIEFNTFLIPSFWLLCLQEANRKMEA
KASFTKGSSCHSGLPRLLLRMTFRRFYLISVISVDDSLEQSQTGSQDRPDSLTRKMRPLE
PTYHLGFEEESSDHVGSTLLMRGPSSTIESFSLTCNVAILSCILCQAFQGMTPQSEDTNG
CDEVIATELGTAMSRGVELLEKKPRREAGPFLPAPGMRCCEACTCTMFPGSPPPTAHTMN
PNVVLAMRRDRRRSRHAQLGSCRHQVILLLVLPKKPVSKT

>gi568815583f:80350882_80693695|GENSCAN_predicted_CDS_7|843_bp
atggggcatttattgcagggcacccagcaaggagaatcaggcagctcatgcttaagacct
gaagtccctgatggattacagataattgttactaaagaagtattcattgaattcaacact
ttcttgattccctctttctggcttctctgtctgcaagaagcaaacaggaaaatggaagcc
aaggctagcttcaccaagggcagcagctgtcactctggcctgccaaggcttctgctcaga
atgacttttagaagattctacctgatatctgtgatatctgtggatgattcactcgagcag
agtcagactggctcccaagaccgtccagactccctgaccaggaaaatgcgacccctggaa
ccaacttaccacctgggctttgaggaggaatccagtgaccatgtggggtccacccttctg
atgaggggccccagcagcactatcgagtcattttcacttacttgcaatgtggccatactc
tcatgtatcctgtgccaagccttccaaggaatgacaccacaaagtgaagacacaaatgga
tgtgacgaagtgattgccactgagcttggcacagccatgagtagaggagtggagctgctg
gagaagaaaccacgcagggaggcaggcccttttctgccggcccctggcatgaggtgttgt
gaagcgtgcacttgcaccatgtttcctggcagccctccacccactgcccacactatgaat
ccgaatgtggtcctggcgatgcggagagatagaaggagaagccggcatgctcagctcggt
tcctgccgccatcaggtcatcttgctgcttgttctgcccaagaagcctgtttctaaaacc
tag