GENSCAN 1.0 Date run: 4-Nov-116 Time: 09:43:47 Sequence gi568815582f:51545818_51746777 : 200960 bp : 40.08% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 1213 1252 40 -2.25 1.01 Init + 9772 9954 183 0 0 82 3 141 0.831 4.17 1.02 Term + 10219 10281 63 0 0 116 48 91 0.500 4.81 1.03 PlyA + 10910 10915 6 1.05 2.00 Prom + 15058 15097 40 -5.05 2.01 Init + 27120 27202 83 2 2 99 60 71 0.589 5.89 2.02 Term + 28087 28186 100 1 1 85 38 114 0.965 2.72 2.03 PlyA + 29731 29736 6 1.05 3.00 Prom + 30711 30750 40 -2.85 3.01 Init + 30967 31021 55 0 1 98 71 33 0.541 4.13 3.02 Intr + 53768 53838 71 0 2 126 89 69 0.174 8.78 3.03 Intr + 80579 80612 34 1 1 144 42 9 0.003 -1.22 3.04 Term + 83082 83200 119 1 2 102 42 65 0.153 1.02 3.05 PlyA + 84124 84129 6 1.05 4.00 Prom + 92829 92868 40 -3.75 4.01 Sngl + 100001 100963 963 1 0 83 32 1112 0.986 101.52 4.02 PlyA + 101431 101436 6 1.05 5.00 Prom + 101615 101654 40 -10.75 5.01 Init + 102417 102492 76 2 1 53 44 116 0.616 5.00 5.02 Intr + 104239 104299 61 2 1 81 100 55 0.652 2.87 5.03 Intr + 105210 105616 407 0 2 88 9 253 0.048 10.47 5.04 Intr + 108381 108589 209 1 2 65 81 105 0.058 5.37 5.05 Term + 112170 112331 162 2 0 70 43 73 0.237 -2.05 5.06 PlyA + 112493 112498 6 1.05 6.00 Prom + 114357 114396 40 -2.75 6.01 Init + 123154 123249 96 0 0 68 31 179 0.727 10.56 6.02 Intr + 127819 128045 227 0 2 88 59 120 0.280 4.96 6.03 Term + 136499 136574 76 2 1 106 44 64 0.035 0.13 6.04 PlyA + 136691 136696 6 1.05 7.08 PlyA - 137026 137021 6 1.05 7.07 Term - 140319 140149 171 0 0 38 47 115 0.525 -0.76 7.06 Intr - 142034 141863 172 1 1 -7 78 117 0.490 0.22 7.05 Intr - 142551 142444 108 2 0 88 103 27 0.460 2.68 7.04 Intr - 152148 151955 194 0 2 63 74 113 0.030 4.77 7.03 Intr - 164787 164566 222 0 0 56 70 161 0.511 8.50 7.02 Intr - 166174 165975 200 2 2 85 36 61 0.266 -1.25 7.01 Intr - 167035 166884 152 0 2 40 74 95 0.019 2.19 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 31885 32072 188 2 2 25 41 176 0.944 3.27 S.002 Init - 45995 45857 139 2 1 49 94 100 0.928 7.05 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:51545818_51746777|GENSCAN_predicted_peptide_1|81_aa MSIKTGFIKALCLQGTEICSEQLKHTRDIFERMQGYLPEGDMNGNQDWASQEQKYFSVSP RVLRQAALAIGEKTGEIRITG >gi568815582f:51545818_51746777|GENSCAN_predicted_CDS_1|246_bp atgtccatcaagactggcttcatcaaggctctttgcttacaaggaacagaaatctgctca gaacagctcaagcataccagagatatttttgaaaggatgcaaggatatctcccagaagga gatatgaatgggaatcaggactgggcaagtcaggaacagaagtacttcagtgtttctcct agggtgctcaggcaagctgcccttgccataggagagaaaactggagagattagaataacg ggttga >gi568815582f:51545818_51746777|GENSCAN_predicted_peptide_2|60_aa MAKGCVVQYGTSPHFFKASEKLLTEAVRLQATSDEEYHSLILPIDQESKTLISTAAKKTA >gi568815582f:51545818_51746777|GENSCAN_predicted_CDS_2|183_bp atggcgaaaggctgtgtggtgcaatatggcacctctccacacttcttcaaggcttcagag aaattactgaccgaggctgtcagactgcaagcaacatctgatgaggaatatcattccttg attttgcccatagatcaagaatccaaaactttaatttccacagctgccaaaaagacagca taa >gi568815582f:51545818_51746777|GENSCAN_predicted_peptide_3|92_aa MGHILFNGGRRYFLAESSASPLPDSASALSQLLSEVGSNLTVIESIVILLTGTRGCRVCG RVLQVFADYKVTSCTNTGYLNHHVEMHLLNIQ >gi568815582f:51545818_51746777|GENSCAN_predicted_CDS_3|279_bp atgggccatatcctgtttaatggtggccgcagatatttcctggctgagagctcagcatca cctctgccagactcagcaagtgccctgtctcagcttctcagcgaagttggatctaatctc acagtgatagagtccattgtcatccttttgactgggacaagaggatgccgagtatgtggt agagtactccaagtctttgcagattacaaagtcaccagttgtaccaacactgggtatttg aatcaccatgtagaaatgcaccttctgaacatccagtag >gi568815582f:51545818_51746777|GENSCAN_predicted_peptide_4|320_aa MSKSESPKEPEQLRKLFIGGLSFETTDESLRSHFEQWGTLTDCVVMRDPNTKRSRGFGFV TYATVEEVDAAMNARPHKVDGRVVEPKRAVSREDSQRPDAHLTVKKIFVGGIKEDTEEHH LRDYFEQYGKIEVIEIMTDRGSGKKRGFAFVTFDDHDSVDKIVIQKYHTVNGHNCEVRKA LSKQEMASASSSQRGRSGSGNFGGGRGGGFGGNDNFGRGGNFSGRGGFGGSRGGGGYGGS GDGYNGFGNDGSNFGGGGSYNDFGNYNNQSSNFGPMKGGNFEGRSSGPHGGGGQYFAKPR NQGGYGGSSSSSSYGSGRRF >gi568815582f:51545818_51746777|GENSCAN_predicted_CDS_4|963_bp atgtctaagtcagagtctcctaaagagcccgaacagctgaggaagctcttcattggaggg ttgagctttgaaacaactgatgagagcctgaggagccattttgagcaatggggaacgctc acggactgtgtggtaatgagagatccaaacacgaagcgctccaggggctttgggtttgtc acatatgccactgtggaggaggtggatgcagctatgaatgcaaggccacacaaggtggat ggaagagttgtggaaccaaagagagctgtctccagagaagattctcaaagaccagatgcc cacttaactgtgaaaaagatatttgttggtggcattaaagaagacactgaagaacatcac ctaagagattattttgaacagtatggaaaaattgaagtgattgaaatcatgactgacaga ggcagtggcaagaaaaggggctttgcctttgtaacctttgacgaccatgactccgtggat aagattgtcattcagaaataccatactgtgaatggccacaactgtgaagttagaaaagcc ctgtcaaagcaagagatggctagtgcttcatccagccaaagaggtcgaagtggttctgga aactttggtggtggtcgtggaggtggtttcggtgggaatgacaacttcggtcgtggagga aacttcagtggtcgtggtggctttggtggcagccgtggtggtggtggatatggtggcagt ggggatggctataatggatttggtaatgatggaagcaattttggaggtggtggaagctac aatgattttggcaattacaacaatcagtcttcaaattttggacccatgaagggaggaaat tttgaaggcagaagctctggcccccatggcggtggaggccaatactttgcaaaaccacga aaccaaggtggctatggcggttccagcagcagcagtagctatggcagtggcagaagattt taa >gi568815582f:51545818_51746777|GENSCAN_predicted_peptide_5|304_aa MQGPELLFLKPLTATGPLQLNVRSSARGQSAMSLCGALQSVVNALRAVRTFLRCVDQDAR WTSPHPPNQSVPPGAIQLTAAERTAPAEKGSIISLWQETAAQLWRGQLACSQLEQWGAAT DASRPRDAPNRSLLLMGGFLLPCPGNLAALPAARVGWGWVWLLSWKESSIISANQRSGAC RSCSQDSTMAGFLGSTDFVLDYFLSEVVGCKFGTAEGHGYSLSLKVEETHMAMIRDTDHC LETSGTKSQLPLLPVLASLLTEPKVHVGFAEQTEVIDQTSSRQLGFHPTDLRENALLVFC FLMQ >gi568815582f:51545818_51746777|GENSCAN_predicted_CDS_5|915_bp atgcaaggacctgagcttctcttcctcaagcccctcacagcgactgggcctttacaactc aatgttcgaagctctgcccggggacagtctgccatgtcactttgtggtgcattacaaagt gtagtcaatgcgctaagggctgtgcggactttccttcgctgcgtggatcaggatgcaaga tggacttccccccaccctccgaatcagtctgttcctcctggggccattcagctgacagct gcggaaaggacagcgcctgctgaaaaagggtccataatttccctctggcaagagacggca gcccaactttggaggggacaattagcgtgctctcagctggaacaatggggagcagcaaca gatgcttctcgtcccagggacgctccgaaccgcagcctcctgctaatgggaggctttctt cttccctgcccgggaaacctggcagccttgccagcagctcgggtagggtgggggtgggtt tggctgctgagctggaaggagagcagcattatcagcgcaaaccagaggagcggggcctgc aggagctgctctcaggactccaccatggcgggctttctgggaagcactgactttgtcttg gattacttcttgagtgaggttgtggggtgcaaatttgggacggcagaggggcatggttac tcactctcactgaaggtagaagagacccacatggcaatgatcagggatactgaccactgc ttggagacctctggcaccaagtctcagcttccattgcttccagtcttggcatccctgttg acagagccgaaagtacacgttggctttgctgagcaaacagaggtaattgatcagacaagt agccgacagcttggcttccatcccacagatcttcgggagaatgctttgcttgttttctgt tttctgatgcagtaa >gi568815582f:51545818_51746777|GENSCAN_predicted_peptide_6|132_aa MEGRKEGEKEREREREKKKKEKEEEEEEEEDEIGILTTHLHLWPKLTQTVPPFTLSPGTL YLVHSVFLTPTHSSKLRSSPTSLKCSLTILDLVDSESDLGGLPFGYIRKYYKAIIIGRSK ECDSQNRRKKSL >gi568815582f:51545818_51746777|GENSCAN_predicted_CDS_6|399_bp atggagggaaggaaggaaggagaaaaagagagagagagggagagagagaagaagaagaaa gagaaagaggaggaggaggaggaggaggaggacgaaattggtattctcactactcacctt catctctggcccaagctcacccaaactgtgccaccatttacgctgtctcctggaacactc tacctggtccactctgtctttcttactcctacccattcttcaaaactcagatctagtccc acttctctgaagtgttccctgacaattctagacctcgttgattctgagtctgacttaggt ggtctcccctttggctatatcaggaaatattacaaagccatcatcataggaagaagcaaa gagtgtgacagccaaaatcgtagaaaaaaatcactgtag >gi568815582f:51545818_51746777|GENSCAN_predicted_peptide_7|406_aa XTLGLREATFQQQSAGVESLCLLMEDNRKPSSGSPFNKIASCNGEKTNLESVGLASFVLS GCQDHIRAFIRGKSFRKEKTLRKAGDSEDPGGVFEYIPLLQDFFKWSFQPGVSPEGKQVR GGARNVSNQQSGEDVAAASGPHIEESRFQLIRPHPCCLPVLPQTGAQRLKSSSWLWPIRS CQCVLELRSHHLGSPALQEQSQSYNTAASIKLFSSTTSLLLNYFVSKAKNLPRLSPNVEI RLPCTINTPVSDVRLTWEYFTFFLIRDSDNLKDSKIWKQHLQVPALVYIKQHRNRRSSQY KESSKEQFEKSASKSEAFTSNPPLSNATCKNLSRIIFFVTEEGITQLSTEVGVKGPTILV CLEMKHFQGHGNFRANTSKVPGKAGQSVTLGGDLFLWRFESIESHD >gi568815582f:51545818_51746777|GENSCAN_predicted_CDS_7|1221_bp nngactctgggtctgagggaggccaccttccaacagcaatctgctggtgtggaaagcctt tgccttttgatggaggataacaggaagcccagcagtggcagcccctttaataagatagcc tcatgcaatggagagaaaacaaacttagaatcagtgggcttggcttcctttgttctgagt ggttgtcaggatcacatcagagcttttattcgcgggaagagctttagaaaggaaaagacc cttaggaaggcaggggattcagaagacccaggtggcgtctttgaatatatccctctcctt caggatttttttaagtggagttttcagccaggtgtctccccagagggcaagcaggtccgg ggaggggcccgaaatgtttctaaccagcaatcaggtgaggatgtagctgctgcttcagga ccacacattgaggagtcacgatttcagctcattcgccctcacccctgctgccttcctgtg ctgcctcagaccggggctcagaggcttaaatccagttcatggctctggccaatcaggtcc tgccaatgcgtgctggaattgcggtctcatcaccttggtagccctgctctgcaggagcag tcacagagctataacactgctgcctcaataaagctgttttcttctactaccagcttgctc ctgaattatttcgtgagcaaagccaagaaccttcccaggctaagccccaatgtggagatt cgcctgccctgtaccattaatacccctgtgtctgatgtgaggctgacttgggaatatttc actttctttcttattagagattctgataatttgaaagacagtaaaatctggaaacagcat ctgcaagtgccagcactggtgtatattaagcagcacaggaataggaggagttctcagtat aaggagagcagcaaagaacagtttgaaaaatctgcatccaagtctgaagcttttacatca aatccacctttatcaaatgctacctgtaaaaatctatctagaataattttctttgtaacc gaagaaggtattactcagctttccactgaggtgggagtaaaaggacctaccatcctggtg tgcctggaaatgaaacacttccagggacatgggaattttagggcaaataccagtaaggtc ccaggcaaagcagggcagtcagtcactctaggtggtgacctgttcttgtggagatttgaa tctatagaaagccatgattaa