GENSCAN 1.0 Date run: 7-Nov-116 Time: 03:12:34 Sequence gi568815585f:36719358_36927750 : 208393 bp : 40.68% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3345 3414 70 2 1 93 94 57 0.709 8.06 1.02 Intr + 7653 7753 101 1 2 93 3 85 0.056 -0.59 1.03 Term + 11273 11446 174 1 0 32 32 141 0.088 -0.32 1.04 PlyA + 13762 13767 6 1.05 2.00 Prom + 16812 16851 40 -2.35 2.01 Sngl + 21069 21413 345 2 0 68 49 271 0.599 16.99 2.02 PlyA + 25237 25242 6 1.05 3.00 Prom + 28236 28275 40 -5.35 3.01 Init + 31310 31363 54 1 0 72 72 55 0.868 3.63 3.02 Intr + 35944 36067 124 0 1 79 96 169 0.851 16.04 3.03 Intr + 58541 58662 122 0 2 59 79 69 0.099 2.39 3.04 Term + 62150 62254 105 1 0 57 41 120 0.247 1.63 3.05 PlyA + 63721 63726 6 1.05 4.00 Prom + 74157 74196 40 -4.65 4.01 Init + 75127 75376 250 0 1 95 10 223 0.978 12.87 4.02 Term + 75425 75693 269 0 2 73 48 189 0.965 8.07 4.03 PlyA + 78175 78180 6 1.05 5.00 Prom + 89516 89555 40 -4.75 5.01 Init + 100001 100600 600 1 0 93 54 535 0.080 43.93 5.02 Intr + 108660 108803 144 2 0 60 64 72 0.251 1.56 5.03 Intr + 116704 116818 115 0 1 70 29 85 0.052 -0.10 5.04 Intr + 121238 121417 180 0 0 49 66 110 0.008 3.82 5.05 Term + 124455 124612 158 1 2 32 48 129 0.020 0.41 5.06 PlyA + 125898 125903 6 1.05 6.08 PlyA - 126245 126240 6 1.05 6.07 Term - 129462 129319 144 0 0 114 42 123 0.996 7.23 6.06 Intr - 134318 134062 257 0 2 68 116 254 0.223 22.44 6.05 Intr - 146401 146180 222 2 0 98 90 125 0.460 10.88 6.04 Intr - 148026 147916 111 1 0 72 84 37 0.210 1.13 6.03 Intr - 153389 153297 93 0 0 77 52 90 0.264 3.32 6.02 Intr - 160288 159921 368 0 2 84 94 242 0.690 18.26 6.01 Init - 161174 161158 17 2 2 57 69 38 0.712 -1.46 6.00 Prom - 161816 161777 40 -8.45 7.00 Prom + 162598 162637 40 -5.15 7.01 Init + 168388 168573 186 0 0 74 97 95 0.523 8.10 7.02 Term + 175310 175402 93 1 0 155 51 46 0.573 4.45 7.03 PlyA + 176506 176511 6 1.05 8.00 Prom + 181420 181459 40 -6.85 8.01 Init + 188508 188558 51 2 0 33 93 86 0.770 4.81 8.02 Intr + 197373 197573 201 2 0 26 63 179 0.614 7.76 8.03 Intr + 197612 197907 296 1 2 29 96 124 0.412 2.18 8.04 Intr + 200264 200370 107 2 2 84 36 49 0.265 -1.76 8.05 Intr + 200493 200690 198 1 0 74 57 162 0.915 10.10 8.06 Intr + 200780 200910 131 0 2 45 0 278 0.364 14.19 8.07 Term + 201145 201378 234 0 0 91 49 82 0.460 -0.06 8.08 PlyA + 201643 201648 6 1.05 9.02 PlyA - 202248 202243 6 1.05 9.01 Sngl - 207428 207228 201 2 0 65 39 215 0.247 9.23 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 100001 100657 657 1 0 93 42 542 0.887 44.02 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:36719358_36927750|GENSCAN_predicted_peptide_1|114_aa MRPLMGSEDGDLKHQANKAQSGAGNRSQVIMENSTPHWADGSSKMMPVILVAVAFGCDWC PYKKRKRHKGWAQRKDHVRAQQKAPAASQEDQTHQKPICQHLDLGLRASKTVRK >gi568815585f:36719358_36927750|GENSCAN_predicted_CDS_1|345_bp atgaggccactgatgggcagtgaggatggtgacttgaagcatcaagcaaacaaagcacag agtggagcagggaacaggtcacaggtcatcatggagaattctaccccccattgggcagat ggatcttccaaaatgatgccggttattttggtggctgtggcttttggttgtgactggtgt ccttacaagaagaggaagagacacaaggggtgggcacagagaaaagaccatgtgagagca cagcaaaaagcaccagctgcaagccaggaagaccagactcaccagaaacccatctgccag caccttgaccttggacttcgagcctccaagactgtgaggaaataa >gi568815585f:36719358_36927750|GENSCAN_predicted_peptide_2|114_aa MRNISIPSRLSATTPEPPTVQPSSPRTSMELSQQGGPHCYKQCHQCHSQTVDGPSGKPWC SGHRAAQSSILTITGTAKTVMNLTCHLENIAKYNDIKNVVKQVMGASEEPPEQH >gi568815585f:36719358_36927750|GENSCAN_predicted_CDS_2|345_bp atgagaaacataagcattccctcaagactatcagcaacaacacctgaaccaccaactgtg cagccctcctccccaagaacatccatggaactctcccagcagggaggacctcactgttat aaacaatgccatcagtgccactcacagactgtggatggtccttctggcaaaccatggtgc agtggtcatcgtgctgcccagagcagcatcctcaccatcactggcacggccaagactgtc atgaatctgacttgccaccttgagaacattgccaagtacaatgacatcaagaatgtggtg aagcaggtgatgggggcttccgaagagcctcctgagcaacattga >gi568815585f:36719358_36927750|GENSCAN_predicted_peptide_3|134_aa MNHTQNLPILQISDLHDKTQTKPHVQTLGEASAKVKLDMQARDADLPLIEREREKERREG DNNFYSEIGQGGQLEAAAVHNTHRGIKGVSEYSTFNQNNQVEAAKPPSLLHSECLQAQQD VEAAKVYGLCPPEW >gi568815585f:36719358_36927750|GENSCAN_predicted_CDS_3|405_bp atgaatcatacccagaatctccccatcctgcagatcagtgacttacatgacaagactcaa acgaagccccatgtccagaccctcggagaagcaagtgccaaggtgaaattggacatgcaa gcccgtgatgcagatctgcccctgattgaaagagaaagagaaaaagaaagaagggaggga gataataatttctattcagagataggccaaggtggccaattagaagcagctgcggtccac aacactcacagaggaataaaaggggtgagtgaatacagcaccttcaaccaaaataatcag gtggaggcagccaagccgccttcactcttgcactctgaatgtctgcaggctcaacaggat gtagaagctgccaaggtttatggcttgtgccctccagagtggtag >gi568815585f:36719358_36927750|GENSCAN_predicted_peptide_4|172_aa MGALEEGFIVPFSSDSDAHFDAAVGYLEDIIMDEDFQLLQRNFINNYYQEFEDTKENKLT YTPIFNEYISLAGKYIEEQYIEERKYIEEQLLEQILGFTMATFTTLQHHKDEVVGDILQM LLKFTDFLAFKEMFLDCRAEKEGQRLDLSRGLVETSLCKSSSMTASQNNLQQ >gi568815585f:36719358_36927750|GENSCAN_predicted_CDS_4|519_bp atgggtgctttagaagaaggcttcattgtgcccttctcctctgactctgatgcacatttt gatgctgcggttggatatttagaggacattatcatggatgaagatttccagttattacag agaaatttcatcaacaactactaccaggagtttgaggacaccaaagagaataaactcacc tacacacctatttttaatgaatatatttctttggcaggaaagtatatagaagaacagtat atagaagaaagaaagtatatagaagaacagttgctggagcaaattcttggctttaccatg gcaactttcacaacgttacagcaccacaaagatgaagtggttggtgacatactccaaatg ctgctcaaatttacagattttctggcttttaaagaaatgtttctggactgcagagcagaa aaagaaggccagagactggacttaagccgtggtttagtggagacttcattgtgcaaatca tcttctatgacagcttctcagaacaatctgcagcagtag >gi568815585f:36719358_36927750|GENSCAN_predicted_peptide_5|398_aa MEAQGVAEGAGPGAASGVPHPAALAPAAAPTLAPASVAAAASQFTLLVMQPCAGQDEAAA PGGSVGAGKPVRYLCEGAGDGEEEAGEDEADLLDTSDPPGGGESAASLEDLEDEETHSGG EGSSGGARRRGSGGGSMSKTCTYEGCSETTSQVAKQRKPWMCKKHRNKMYKDKYKKKKSD QALNCGGTASTGSAGNVKLESLLLKVMKQLLSVEVIKLIDINLDSSSSVVLIKGQFDCHS DLENHWLLSFTLVAEKDVACLKMYSISTDADFYELFVAWALEVPRVGLVDVVLCGKRNFA EVLKLRILRWRDQPELSGWALNVITRVLIRGTLAYRRTRDVMMEATGPDVDVTRWSQQLV TADSTLLIIGLLVHLSPQNGKLLKSRNFVIVSTVPADA >gi568815585f:36719358_36927750|GENSCAN_predicted_CDS_5|1197_bp atggaggcgcagggtgtagcggagggcgcggggccgggcgccgccagcggcgtgccccac cccgcggccctagccccggctgcggctcccaccttggcgccagcctcggtggcggccgcg gcctctcaattcaccctgctagtgatgcaaccctgtgctgggcaggacgaggctgcggcc cccgggggcagcgttggggcgggcaagcccgttaggtacctgtgcgaaggggccggggat ggcgaagaggaggctggggaggacgaggcggacctgttagacacttcggaccctccgggg ggaggcgagagcgcggctagtttggaggatctagaggacgaggagactcactcggggggc gagggcagcagcgggggcgcccggaggcggggcagcggtgggggcagcatgagcaagacc tgcacctacgaaggctgcagcgagaccacgagccaggtggccaagcagcgcaaaccgtgg atgtgcaagaaacaccgcaacaagatgtacaaggacaagtataaaaagaagaagagcgac caggccctgaactgcggtgggactgcctcgactggcagcgcgggaaacgtcaaactcgag tcattgttactgaaggtaatgaagcagttactttctgtggaagtcataaagttaatagat attaatcttgactcatctagctcagtggttctcatcaagggtcaatttgattgtcatagt gaccttgaaaaccactggcttttaagctttaccctcgtagcagaaaaggatgtagcctgc cttaagatgtattccataagcacagatgctgatttttatgagctatttgtagcttgggct ttagaggttccaagagtaggacttgttgatgttgtcttatgtggcaaaaggaactttgca gaggtgcttaagttgaggattctgagatggagagatcaacctgaattatctggatgggct ctaaatgtaatcacacgtgtccttataagaggaacccttgcttacagaagaacaagagat gtgatgatggaagcaacagggcctgatgtagatgtcactcggtggtcccagcagcttgtc acagctgacagcacactgcttattattggtttacttgttcacctttctccacaaaatggt aagctccttaaaagtagaaactttgtcattgtatctacagtgcctgctgatgcataa >gi568815585f:36719358_36927750|GENSCAN_predicted_peptide_6|403_aa MAVQECPAVKRLLGWKQGDEEEKWAEKAVDSLVKKLKKKKGAMDELERALSCPGQPSKCV TIPRSLDGRLQVSHRKGLPHVIYCRVWRWPDLQSHHELKPLECCEFPFGSKQKEVCINPY HYRRVETPATRSPSPRARPATLTPQEVLLSQRVPINTQVIDTPPLPYHATEASETQSGQP VDATADRHVVLSIPNGDFRPVCYEEPQHWCSVAYYELNNRVGETFQASSRSVLIDGFTDP SNNRNRFCLGLLSNVNRNSTIENTRRHIGKGVHLYYVGGEVYAECVSDSSIFVQSRNCNY QHGFHPATVCKIPSGCSLKVFNNQLFAQLLAQSVHHGFEVVYELTKMCTIRMSFVKGWGA EYHRQDVTSTPCWIEIHLHGPLQWLDKVLTQMGSPHNPISSVS >gi568815585f:36719358_36927750|GENSCAN_predicted_CDS_6|1212_bp atggctgtccaggagtgccccgcagtgaagagactgctaggctggaagcaaggagatgaa gaggaaaagtgggcagagaaggcagtggactctctagtgaagaagttaaagaagaagaag ggagccatggacgagctggagagggctctcagctgcccggggcagcccagcaaatgcgtc acgattccccgctccctggacgggcggctgcaggtgtcccaccgcaagggcctgccccat gtgatttactgtcgcgtgtggcgctggccggatctgcagtcccaccacgagctgaagccg ctggagtgctgtgagttcccatttggctccaagcagaaagaagtgtgcattaacccttac cactaccgccgggtggagactccagccacgcgttctcccagtccccgtgcacggccagct accctcactccccaggaagtccttctgagccagagagtccctatcaacactcaggtcatt gacacaccacccctgccttatcatgccacagaagcctctgagacccagagtggccaacct gtagatgccacagctgatagacatgtagtgctatcgataccaaatggagactttcgacca gtttgttacgaggagccccagcactggtgctcggtcgcctactatgaactgaacaaccga gttggggagacattccaggcttcctcccgaagtgtgctcatagatgggttcaccgaccct tcaaataacaggaacagattctgtcttggacttctttctaatgtaaacagaaactcaacg atagaaaataccaggagacatataggaaagggtgtgcacttgtactacgtcgggggagag gtgtatgccgagtgcgtgagtgacagcagcatctttgtgcagagccggaactgcaactat caacacggcttccacccagctaccgtctgcaagatccccagcggctgcagcctcaaggtc ttcaacaaccagctcttcgctcagctcctggcccagtcagttcaccacggctttgaagtc gtgtatgaactgaccaagatgtgtactatccggatgagttttgttaagggttggggtgct gagtatcatcgccaggatgtcaccagcaccccctgctggattgagattcatcttcatggg ccactgcagtggctggacaaagttctgactcagatgggctctccacataaccccatttct tcagtgtcttaa >gi568815585f:36719358_36927750|GENSCAN_predicted_peptide_7|92_aa MHVAKDRSGTTEGKRSKREQIQAGLSIWSQDDEGVPSENSKVAEKLVRQAAGGVTDLRSK EKVQSPPWSPPDPLPDALYLSYVTYPVWHHAD >gi568815585f:36719358_36927750|GENSCAN_predicted_CDS_7|279_bp atgcatgtggcaaaagatagaagtgggacaacagaagggaagaggagtaaacgggagcag atacaggcaggtttgtctatctggtcacaggatgatgagggtgttccctctgaaaactcc aaggttgcagagaagttagtgagacaggcagcaggaggggtgacagatttgaggagcaag gaaaaggtccagtcacctccatggagtcctcctgatcccctgcccgatgctttgtactta tcatatgtcacttatcctgtgtggcatcatgctgattga >gi568815585f:36719358_36927750|GENSCAN_predicted_peptide_8|405_aa MCHRYLEALEENRIPDKVRVEKAKSHRSSKEDIPGTRVITESPVKAAPEQGIQDEEVIFE DPVGGGEAGKASAWGKGAQERKARHKIGKDREDKPGKVVLGQPNTKGSRHQRRAEYKSGK CGRGVRHGFGVGSVRMKTTKQEEHPVFMGIMTPNANRYLEILTTLVDTSFLRDKIYRNIY FKSYSKRPLRRLSWSLPTPRAPAPTRPPPPPRVEHPLTDRLHQKTREQRPPKAKPHAEEP PRLPRGRTTSPSTQCAHTHAHKQRAAEEGTSYLSPPCHSSLPHPAPAGPGGGGGDRDSGC SSGGGGGGGGGPSRRQSDWSREASSPVLPRAALSVPLRDPRLGPCPACRGGPGVGWEPFV CVGRLASLSPTPPLRFLASTWSPKPEELSAQQEYRITGPSVWRPF >gi568815585f:36719358_36927750|GENSCAN_predicted_CDS_8|1218_bp atgtgccatcggtacctggaagccttggaagaaaacagaatccctgataaggtaagagtt gaaaaggcgaagagccacaggagctcgaaggaggacatacctgggaccagggtcatcaca gaaagtcccgtgaaggcagcgcctgaacagggcatacaagatgaagaagtcatatttgag gatcctgtgggaggtggggaagctgggaaggcctcagcgtggggtaaaggtgcacaggag aggaaagccaggcataaaatagggaaagaccgggaagataagcctggaaaagtggtcttg ggtcagcctaacacaaaaggaagccgacaccaaaggagagcagaatacaagtcaggcaag tgcggccgaggtgtccgtcatggttttggggtaggaagtgtcaggatgaagaccaccaag caagaagaacatccagttttcatgggtataatgactcctaatgctaacagatatcttgaa atcctaacaactttagttgacacaagtttcttaagagacaaaatttaccggaatatttat tttaaaagctacagcaagcgacccctcaggcgcctatcctggtccctgcccacgccccgg gccccagccccgacccgaccgccgcccccgccgcgggtcgaacatcccctgacagacagg ctccaccaaaaaacccgggaacaacgaccgccaaaagccaagcctcacgccgaggagcct cccaggctgccccgaggacgaacaacttcgccctccactcaatgcgctcacacgcacgca cacaagcagcgcgcggccgaggagggcacgtcttacctgtccccgccgtgccactcatcc ctcccccacccagcgcctgcagggcccggcggcggcggcggggaccgagacagcggctgc agcagcggcggcggcggcggcggcggcggcggccccagccggcgtcagtcagactggagc cgcgaagcctcatcgcccgtattaccccgcgctgccctctcggtccccctgcgcgacccc aggctcggcccctgcccggcctgccggggtggcccgggggtggggtgggagccctttgtc tgcgtgggtcgcctcgcgtctctctctcccaccccacctctgagatttcttgccagcacc tggagcccgaaaccagaagagttgtcagcccaacaagaatataggatcaccggcccatca gtctggagacccttctga >gi568815585f:36719358_36927750|GENSCAN_predicted_peptide_9|66_aa MCFAKKHNKKGPKKMQANSAKAMSARAKAIEALVKPKEVKPKIPKGVSCKLDRLAYIALP QAWEVC >gi568815585f:36719358_36927750|GENSCAN_predicted_CDS_9|201_bp atgtgctttgccaagaagcacaacaagaagggcccaaagaagatgcaggccaacagtgcc aaggccatgagtgcacgtgccaaggctatcgaggccctcgtaaagcccaaggaggttaag cccaagatcccaaagggagtcagctgcaagcttgatcgacttgcctatattgccctaccc caagcttgggaagtatgctag