GENSCAN 1.0 Date run: 4-Nov-116 Time: 16:07:45

Sequence gi568815590f:78498385_78701818 : 203434 bp : 35.24% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   8276   8349   74  1  2    3   43   293 0.945  13.59
 1.02 Term +   9929  10046  118  2  1   69   42   179 0.996   8.43
 1.03 PlyA +  10789  10794    6                               1.05

 2.00 Prom +  16456  16495   40                              -4.05
 2.01 Sngl +  17615  18073  459  1  0   45   42   226 0.392   9.62
 2.02 PlyA +  21063  21068    6                               1.05

 3.00 Prom +  44970  45009   40                              -3.25
 3.01 Init +  60099  60172   74  2  2   36   63   133 0.268   6.19
 3.02 Intr +  80784  80920  137  0  2   85   92    31 0.055   2.49
 3.03 Intr +  96217  96341  125  2  2  118   30    22 0.049  -1.22
 3.04 Intr +  99974 100151  178  1  1   94  111    94 0.820  10.87
 3.05 Term + 103358 103437   80  0  2  105   41   125 0.994   6.45
 3.06 PlyA + 104811 104816    6                               1.05

 4.00 Prom + 108645 108684   40                              -6.25
 4.01 Init + 114983 115061   79  1  1   69   83    97 0.852   8.57
 4.02 Intr + 146091 146177   87  1  0   91  107    62 0.248   7.42
 4.03 Intr + 146512 146817  306  2  0   87  -29   218 0.149   5.50
 4.04 Intr + 147964 148236  273  2  0   37   90   219 0.172  13.59
 4.05 Intr + 150001 150108  108  2  0  105    5    78 0.363   0.54
 4.06 Term + 153755 154374  620  0  2   53   49   228 0.592   9.01
 4.07 PlyA + 154655 154660    6                              -1.75

 5.00 Prom + 154692 154731   40                              -6.75
 5.01 Sngl + 155057 155839  783  1  0   75   41   273 0.355  17.05
 5.02 PlyA + 157586 157591    6                               1.05

 6.04 PlyA - 157994 157989    6                               1.05
 6.03 Term - 167916 167669  248  1  2   49   38   149 0.525   0.97
 6.02 Intr - 168533 168356  178  2  1  107   56   196 0.963  16.87
 6.01 Init - 172687 172667   21  1  0   43  119    44 0.417   0.86
 6.00 Prom - 175016 174977   40                              -3.65

 7.00 Prom + 176859 176898   40                              -7.85
 7.01 Init + 179628 179636    9  2  0   82   48    27 0.042  -2.80
 7.02 Intr + 180179 180295  117  1  0   60   55    81 0.041   1.74
 7.03 Intr + 188083 188224  142  0  1   65   28   194 0.047  10.11
 7.04 Intr + 190838 190989  152  0  2   82  103    50 0.044   4.86
 7.05 Intr + 199023 199122  100  2  1   70   63    73 0.002   1.76
 7.06 Intr + 200042 200129   88  0  1   65  110    52 0.001   3.21

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 188083 188271  189  0  0   65   39   220 0.880  11.27

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590f:78498385_78701818|GENSCAN_predicted_peptide_1|63_aa
CQHDDDDDDDDDDEDDDDDDEVTSRSESVVPIGIIRCIHSELRRPSFSVVLAVQNANPSE
TEP
>gi568815590f:78498385_78701818|GENSCAN_predicted_CDS_1|192_bp
tgccagcatgatgatgatgatgatgatgatgatgatgatgaggatgatgatgatgatgat
gaagttaccagcagatcagagtccgtggttcctattggaatcattcgctgtattcactcc
gagctccgacggccatcattctcggttgtattggctgtacaaaatgccaaccccagcgaa
acagaaccctag
>gi568815590f:78498385_78701818|GENSCAN_predicted_peptide_2|152_aa
MNNADKEDEEGGKPTGANGTVHAAGVTGERSEEGRYAEKAPLALLSQSIIQRCCSCKSIN
RKSIHNLAGQRSSSAVSRLPSLGSGAAPLPPWLARNVVIRPGAGMTSCSRSSAAGAEAAA
GRWGAGRRELTEHSAGAAGLRPVAACAGTCAD
>gi568815590f:78498385_78701818|GENSCAN_predicted_CDS_2|459_bp
atgaacaatgctgataaggaagatgaagagggtgggaaacccacaggggcaaatggaaca
gtccatgcagccggggtgacgggggagcgctcggaggaaggacgctatgcagagaaagcc
cccctggctcttctctctcaatccataatccagcgatgctgcagctgtaaaagcattaat
agaaaatcaatccacaacctcgcggggcagcgatcgtcgagcgccgtttccaggctgcct
tccctggggtcgggagcggccccgctccccccgtggctggcgcggaatgtggtgatccgt
cccggggcggggatgacttcatgcagccggagctccgcggcgggagcggaggctgctgct
ggcaggtggggcgcgggccggcgcgagctgaccgagcactcggcgggcgcggcgggactg
cggcccgtggcggcgtgcgcggggacctgcgctgactag
>gi568815590f:78498385_78701818|GENSCAN_predicted_peptide_3|197_aa
MAAARDDEEEAKAETPDKLIRSRESKPSLLASPSATGSLLQDLPPSPLATVLLVYNKYNS
QNNLPKKFKPGTKYDWHQIKISMTWQVIRHGYQGREMGGNIFSNMCHVNEQWSLLCGYLV
AMTDVETTYADFIASGRTGRRNAIHDILVSSASGNSNELALKLAGLDINKTEGEEDAQRS
STEQSGEAQGEAAKSES
>gi568815590f:78498385_78701818|GENSCAN_predicted_CDS_3|594_bp
atggcagcggcaagagacgatgaggaagaagcaaaagcggaaactcctgataaactcatc
agatctcgtgagagcaagccatcattgcttgcctcacctagtgcaacaggcagtctctta
caggatcttcctccttcccctcttgccacggtccttctagtttataacaagtacaatagc
cagaataatcttccaaagaaatttaaaccagggactaaatatgactggcatcaaataaaa
atcagtatgacctggcaagtgattagacatggatatcaagggagagaaatggggggaaat
atattctccaacatgtgtcatgttaatgagcagtggtccctgctatgtggatatttggta
gcaatgactgatgtggaaactacatatgcagattttattgcttcaggaagaacaggtaga
agaaatgcaatacatgatatcctggtttcctctgcaagtggcaacagcaatgaattagcc
ttgaaattagcaggtcttgatatcaacaagacagaaggtgaagaagatgcacaacgaagt
tctacagaacaaagtggggaagcccagggagaagcagcaaaatctgaaagctaa
>gi568815590f:78498385_78701818|GENSCAN_predicted_peptide_4|490_aa
MEPLGIGEEQVVQGKVGEEGCINNAGDQPASGDPDMIAFKLNAWSRMTHQSSGNTDRSQS
DRPFSSPEGNSEDLQGKETQEPEQQGGAPPPTLFPSVQHKEETFSRNDEDGPEPFPPPRE
KPLPSFPPPLRRPSFIGPVQATASPTHLEEIGGHPRDGLDLAVITDMVLKEQDGVQLIST
GIYGPLPRGTFGLIIGRSSSTLKGIQIFPGVTDSDYLGELKLMAQVSGVHTISKGTHLAQ
IILIPDLQDASGKMGKAAIVWQDAMQTGRKKIHKDFTTTQQAELVQQGPREAYPDLIAHL
QDTAQKAISDSHARNVIIQLLAYENANTECQAAIRPIKGEADQNYTYWAYIPFPPLIRPV
TWLDPQVEVNVNDSVWMPGPTDNRGPTHPEEEGMLMNVSIGYCFPPICLGLAARCLNYDK
QSWMVYVPANNGSKASIHAISGRTFQSLDTIKYLEHGYVMTHCQINKFKPNKKPCPRKAT
KWSGKLEVLT
>gi568815590f:78498385_78701818|GENSCAN_predicted_CDS_4|1473_bp
atggagcccctcggtattggcgaggaacaggtggtacaaggcaaagtaggagaggaggga
tgcataaataatgctggtgaccaacctgcaagtggtgaccctgacatgattgcctttaag
ttaaacgcatggagtaggatgactcaccagtcttcggggaataccgatcgtagtcagtca
gatcgcccattttcttcacctgaagggaattcagaagatttacagggaaaggaaacacag
gaaccagaacaacagggaggggctcctccccccactttattcccttcagtgcagcataaa
gaagagactttttctcgcaatgatgaggacggaccagagccttttcctcccccaagagaa
aagccattgccctcttttcctccacccttaagaagaccttcattcattggccccgttcag
gcaacagcatctcccacccaccttgaggagattggaggccacccaagagacgggctggac
ctcgcagtcataacggacatggtgcttaaagagcaagatggggtccagctgatttcaact
gggatctatggcccattacccagagggacatttggattgattattggaaggtcttctagt
acacttaaaggaattcaaattttccctggagttacagattcagattatttaggagaacta
aaacttatggcacaggtgtcaggggtccacaccatctctaaaggaactcaccttgctcag
attattcttattcctgatcttcaagatgcttcaggaaaaatgggaaaggcagctatagtg
tggcaagatgccatgcaaactggcagaaaaaaaatccacaaagatttcacaactacacaa
caggcagagttagttcaacagggtccaagggaggcttatccagatttgatcgcccatttg
caagacacagctcaaaaggctatttcggattctcatgctaggaatgtgatcattcagctg
cttgcttatgaaaatgctaatacagaatgtcaggcagcaattagacctattaagggagag
gcagatcaaaattacacttattgggcatacattccattcccaccactgattaggcctgtt
acatggttagacccccaggtggaggttaatgttaatgacagtgtctggatgcctggacca
acagataaccgaggtcctactcatccagaggaagaaggaatgttaatgaatgtttccatt
ggttattgctttcctcccatctgcctggggctggcagcaagatgtttaaattatgataaa
caaagttggatggtttatgtccctgcaaataatggatcaaaagcctctattcatgcaatc
agtggaagaacatttcaatctttggacactattaaataccttgagcatggctatgttatg
acacattgccagattaataaatttaaacctaataagaagccctgccctaggaaggccact
aaatggtctggaaagctagaggtgctaacctag
>gi568815590f:78498385_78701818|GENSCAN_predicted_peptide_5|260_aa
MGLIAVTATAAAAGIALHSSIQTVGFVDSWQKNSSKLWNSQSQIDQKLANQINDLHQTVI
WMGDRIMSLEHRIQMQCDGNTSDFYITPSSYNATEHHWEMLRRHVQGKEDNLILDIDKLK
KQVFEASQAHLTLLPGADILAGAADGLSNTNPLKWIKTIHGSTIANFILVCVCLCCLFLV
YRCRRCLGREARHRERAMIAMAVINQRKLIKTKKGDMSEGEFLGCQLIWSPQCETPMGSH
GWPLRRKVSLLPSCLYAPRA
>gi568815590f:78498385_78701818|GENSCAN_predicted_CDS_5|783_bp
atgggccttatagctgtcacagctactgctgctgctgctggtattgctttgcactcttct
attcaaactgtgggctttgtggatagttggcagaaaaattcttctaagctttggaattcc
caaagccaaatagatcaaaaattggcaaatcagattaatgatctccatcaaacagtaatt
tggatgggagatcggattatgagcttggagcatagaattcaaatgcaatgtgatgggaat
acttctgatttttatattactcctagctcttataatgccactgaacaccactgggagatg
cttagacgtcacgtacaaggaaaagaagataatttaatattagatattgataaactgaaa
aagcaagtttttgaggcatctcaggctcatctcaccctgttacctggagctgatattctt
gctggagccgctgatggcctttctaataccaatcctttaaagtggattaaaaccatacat
ggatcaacaattgcaaattttattttggtttgtgtctgtttatgctgtttgtttttagtc
tacagatgcagacggtgccttgggagagaagccagacaccgtgaacgagccatgatagca
atggctgttattaatcaaagaaaattaataaagacaaaaaagggggacatgtcagaagga
gagtttctggggtgccagttgatttggtctccccagtgtgagacacccatgggaagccat
gggtggcctctgaggagaaaagtctccttattgccttcatgtctttatgccccaagagca
taa
>gi568815590f:78498385_78701818|GENSCAN_predicted_peptide_6|148_aa
MSISSLLVNAFITQDKMGKQNREKVKELGTKQNLESWLQYKDKPKDKEPIQALPHTAAVN
KSSNPPASGGAAGRGEVRRTEPALAFLLLVSRVPSCFGPALAVLPSSLHRLTFQSLHRAT
SSSSARLLQQRPPPPSSGCSNPAAPPTR
>gi568815590f:78498385_78701818|GENSCAN_predicted_CDS_6|447_bp
atgtccatctcctcactgctggtgaatgctttcattacacaggataaaatgggtaaacag
aatagagaaaaagtgaaagaactgggaacaaagcaaaatctggagtcatggttacagtac
aaagataaacccaaagataaggaaccaatacaggctcttcctcacacagcggccgtgaat
aagtcatcaaatcctccggcctccggaggcgcggcgggacgcggcgaggtccgccgcacc
gaaccagccctcgccttcctgctactggtgtccagggtccccagctgtttcgggcccgct
ctagccgtcctgccctcatcccttcatcgcctcaccttccagtccctccatcgcgccacc
tcctcgagctcagcgagactccttcagcagcgcccgccaccgcccagctctggctgtagc
aacccagccgcacctcccacccgctaa
>gi568815590f:78498385_78701818|GENSCAN_predicted_peptide_7|203_aa
MALKKHGPICQKTATKKRKTFDSSRQRAEGTDIPTVKPLKPRPEPPKKPSNWRRKHEEFI
ATIRAAKGLDQALKEGGKLPPPPPPSYDPDYIQCPYCQRRFNENAADRHINFCKEQAARI
SNKGKFSTDTKGKPTSRTQVYKPPALKKSNSPGTASSGSSRLPQPSGAGKTVVGKVSSSS
SSLGNKLQTLSPSHKGIAAPHAG
>gi568815590f:78498385_78701818|GENSCAN_predicted_CDS_7|609_bp
atggcgctgaaaaaacatggacccatttgccagaagactgcaactaaaaaacggaagact
tttgattcaagcagacagagagctgaaggaactgatattccaacagtaaaacctctcaaa
ccgaggccagaaccaccaaagaaaccatctaattggagaaggaaacatgaagaattcatt
gctaccataagagcagctaaaggccttgatcaggccctcaaagagggtggcaaacttcct
cctcctcctccaccttcttatgatcctgattatattcaatgtccatattgtcagaggaga
ttcaatgaaaatgcagctgatagacatataaatttctgtaaagaacaggcagcacgtatt
agtaataaagggaaattttctacagataccaaaggaaaaccaacttctcggacacaggtg
tataagccacccgcacttaaaaagtcaaattctcctggaactgcatcatcaggatcttca
cgattaccgcagccaagtggcgctggcaaaactgttgtaggtaaagtgtcttcaagtagc
agctctttgggaaacaaacttcagaccttatctccctctcataaagggatagcagcccct
catgcaggn