GENSCAN 1.0    Date run: 4-Nov-116    Time: 13:38:57

Sequence gi568815593f:57114131_57362749 : 248619 bp : 41.63% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  34161  34199   39  2  0   75   92    31 0.859   2.44
 1.02 Term +  36047  36262  216  1  0   72   39   219 0.871  11.56
 1.03 PlyA +  37698  37703    6                               1.05

 2.00 Prom +  53084  53123   40                              -2.95
 2.01 Init +  59127  59565  439  2  1   29   41   497 0.742  33.42
 2.02 Intr +  59915  60081  167  0  2  -13   87   138 0.436   2.36
 2.03 Intr +  61073  61177  105  1  0   70   38    75 0.084   0.19
 2.04 Intr + 116716 116839  124  0  1   60   83   149 0.800  10.84
 2.05 Intr + 116968 117191  224  2  2   15   78   187 0.983   7.32
 2.06 Intr + 121836 121893   58  2  1   60   89    53 0.506   0.14
 2.07 Intr + 132170 132354  185  0  2   66  106   118 0.942   9.99
 2.08 Intr + 132945 133085  141  2  0   85   64    64 0.908   3.33
 2.09 Intr + 135279 135446  168  2  0   60   86   224 0.988  18.62
 2.10 Intr + 145962 146020   59  2  2   84   77   -29 0.051  -7.64
 2.11 Intr + 147050 147152  103  2  1  110  110   108 0.689  14.46
 2.12 Term + 148464 148622  159  2  0   58   49   166 0.950   6.66
 2.13 PlyA + 148928 148933    6                               1.05

 3.03 PlyA - 149307 149302    6                               1.05
 3.02 Term - 169819 169568  252  1  0   63   49   142 0.636   2.45
 3.01 Init - 170079 169966  114  0  0   40   96   137 0.820   9.96
 3.00 Prom - 173738 173699   40                              -4.05

 4.00 Prom + 175079 175118   40                              -9.35
 4.01 Init + 177423 177517   95  2  2   49   94   101 0.387   6.70
 4.02 Intr + 184865 185006  142  2  1   66   55    83 0.041   2.23
 4.03 Intr + 194758 195010  253  0  1    2   60   181 0.011   2.78
 4.04 Term + 195160 195725  566  2  2   50   34   291 0.074  13.47
 4.05 PlyA + 196473 196478    6                               1.05

 5.00 Prom + 197286 197325   40                              -7.85
 5.01 Init + 199268 199427  160  1  1   65   37   182 0.461  10.83
 5.02 Term + 200278 200456  179  2  2   78   43   100 0.699   1.37
 5.03 PlyA + 203188 203193    6                               1.05

 6.02 PlyA - 203638 203633    6                               1.05
 6.01 Sngl - 204498 203953  546  0  0   44   44   400 0.485  27.05
 6.00 Prom - 204641 204602   40                             -13.87

 7.03 PlyA - 204765 204760    6                              -3.74
 7.02 Term - 205638 205130  509  1  2   12   48   450 0.907  26.98
 7.01 Init - 205890 205665  226  0  1   79    4   208 0.460  10.18
 7.00 Prom - 206349 206310   40                              -6.85

 8.05 PlyA - 207953 207948    6                               1.05
 8.04 Term - 214180 213979  202  0  1   70   43   134 0.083   2.98
 8.03 Intr - 220511 220433   79  0  1  105   42    67 0.039   1.49
 8.02 Intr - 232881 232688  194  2  2   61   95    90 0.256   5.21
 8.01 Init - 241145 239962 1184  2  2   77   72   520 0.058  42.15
 8.00 Prom - 241636 241597   40                             -11.44

 9.02 PlyA - 242077 242072    6                               1.05
 9.01 Sngl - 242490 242101  390  0  0   88   54   424 0.686  34.97

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr - 189599 189434  166  1  1   14   75   213 0.881  11.64

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:57114131_57362749|GENSCAN_predicted_peptide_1|84_aa
MVADFPVGAVLMIACGEYYCRCLLKAQGLFSQLVVNAARTGIHHSGQWAPLWPRADPEIP
SKSQNLELGTPRARLVLYLTGRGR

>gi568815593f:57114131_57362749|GENSCAN_predicted_CDS_1|255_bp
atggtggcagatttcccagttggtgctgttctcatgatagcctgtggggagtactactgc
cgatgtttacttaaggcccaagggctcttcagtcagcttgtggtgaatgctgccaggact
ggtatccatcattcagggcagtgggctcccctctggcccagggcagatccagaaatacca
tccaagagccaaaacctggaactggggaccccaagagctcgcttggtgctttacctcact
gggaggggaagataa

>gi568815593f:57114131_57362749|GENSCAN_predicted_peptide_2|643_aa
MPRWSIPSACPTAPAPARQMFRLRRAEHRRRVSADSPRPEREAQAGYPAKGVRRSVFRSL
KLGGAAAQLRELRAGEDAAARATRACAAASRRSTPLSPSAQPHQPGLRGSPGRGEVREEG
LLRERRGIPPSTERSRLFTVLTSAAGDLAQNIADRVGAILGLRLVVGEGKAAKGDYSKYR
KPSPGIRRGGTPRPGAPLVGQKVKSALALHNTVTYRFSLPLLRCLVHSVDFYRVLAWSSL
NFEKHSENFAWTENRYDVNRRRHNSSDGFDSAIGRPNGGNFGRKEKNGWRTHGRNGTENI
NHRGGYHGGSSRSRSSIFHAGKSQGLHENNIPDNETGRKEDKRERKQFEAEDFPSLNPEY
EREPNHNKSLAAEYPPNPKSRAPRMLVIKKGNTKDLQLSGFPVVGNLPSQPVKNGTGPSV
YKGLVPKPAAPPTKPTQWKSQTKENKVGTSFPHESTFGVGNFNAFKSTAKNFSPSTNSVK
ECNRSNSSSPVDKLNQQPRLTKLTRMRTDKKSEFLKALKRDRVEEEHEDESRAGSEKMMW
AHIEWKLFNFNFSFRIILLKEMGWQEDSENDETCAPLTEDEMREFQVISEQLQKNGLRKN
GILKNGLICDFKFGPWKNSTFKPTTENDDTETSSSDTSDDDDV

>gi568815593f:57114131_57362749|GENSCAN_predicted_CDS_2|1932_bp
atgccacggtggagcatcccctcagcgtgccctaccgctccggcgccggctcggcagatg
ttccgcctgcgcagagctgagcataggcgccgtgtgagcgccgattcacccaggcccgag
cgcgaggcccaagcggggtacccggcgaagggcgtgcgccgcagcgtcttccgcagcttg
aagctcggaggagctgcggcgcagcttcgggagcttagagctggggaagacgctgcggcg
cgcgccactcgagcctgcgcggccgcttcccggcgcagcacccccctctcgccctctgcg
cagccccaccagcccggactccgaggcagccccggacggggggaggtgcgggaggagggg
ctcctcagggagcggcggggaattcccccttccaccgaacgttcccgattgttcaccgtc
ctcacgtcagcagcaggggacttggcccagaacattgcggatcgggtcggcgccattttg
ggactgagactggttgtgggggagggaaaagcggcaaaaggggattattcaaagtaccga
aaaccttctcccgggatcaggcgcggcggcacccccaggccaggggcacctctggtgggg
cagaaggtgaaatctgcccttgccctccataataccgtcacctaccgtttttccctccct
ctactccgatgtcttgttcattcagtagacttttatcgggtgcttgcttggtcgtcattg
aattttgagaagcattctgaaaactttgcatggacagagaatcgttatgatgtgaaccgt
cgacgacacaactcttcagatggctttgattctgctattgggcgtcctaatggaggtaac
tttggaaggaaagaaaaaaatggatggcgtacacatggaagaaatggtacagaaaacata
aatcatcgaggtggataccatggtggaagttcccgttctcgtagcagtattttccatgca
ggaaaaagccaaggactacatgaaaacaacatacctgacaatgaaaccgggaggaaagaa
gacaagagagaacgcaaacagtttgaagctgaggattttccgtctttaaatcctgagtat
gagagagaaccaaatcacaataagtctttagctgcagaatatcctccgaatcctaaatct
agagctccaaggatgctggtcattaagaaaggtaatacaaaagacttacagctatctgga
ttcccagtagtaggaaatcttccgtcacagccagttaagaatggaactggtccaagtgtt
tataaaggtttagtccctaaacctgctgctccacctacaaaacctacacaatggaaaagc
caaacaaaagaaaataaagttggaacttctttccctcatgagtccacatttggcgttggc
aactttaatgcttttaaatcaactgccaagaactttagtccatctacaaattcagtgaaa
gagtgtaatcgctcaaattcctcttctcctgttgacaaacttaatcagcagcctcgtcta
accaaactgacacgaatgcgcactgataagaagagtgaatttttgaaagcattgaaaaga
gacagagtagaagaggaacatgaagatgaaagccgtgctggctcagagaagatgatgtgg
gcacacattgagtggaaattattcaactttaatttttcattcagaatcatattgttaaag
gaaatgggctggcaggaagacagtgaaaatgatgaaacatgtgctcccttaactgaggat
gaaatgagagaattccaagttattagtgaacagttacagaagaatggtctgagaaaaaat
ggtattttgaaaaatggcttgatctgtgacttcaagtttggaccgtggaagaacagcact
ttcaaacccacaactgagaatgatgacacagagacaagtagcagtgatacatcagatgac
gacgatgtgtga

>gi568815593f:57114131_57362749|GENSCAN_predicted_peptide_3|121_aa
MAYVQGPGAQGPGALQSADGKARQAYALLKAANSLRPQIREPPPMITTNTGPKGVLPGYH
QCPLKVQVPFSQLVVNAAWPGTHLDGNGSSLVQGSSRNPIQESSPGIKDLQELICVLLPR
G

>gi568815593f:57114131_57362749|GENSCAN_predicted_CDS_3|366_bp
atggcctatgttcaaggccctggagctcaaggtcctggggctctacaatcagcagatggc
aaagccagacaggcctatgctctccttaaggcagcgaattccctcaggccccagataagg
gagcctcccccaatgattaccaccaacactggcccaaagggagtactgccaggctaccac
caatgtccccttaaggtccaagtgcctttcagtcagcttgtggtgaatgctgcctggcct
gggactcaccttgatggcaatggctcctctctggtccagggcagttccagaaatcccatc
caagagtcaagtcctggaatcaaagacctccaagagctcatttgtgttctactcccccgt
ggctga

>gi568815593f:57114131_57362749|GENSCAN_predicted_peptide_4|351_aa
MGSGRGGAYAEGGLAKAGGAEGLRRVNLQGSSDSQEMILFGEAEKKRKETNQGVLNFFNK
KRTLGGSKPLFGGASAQNLRFVELELERDDLGYLAEEISKQQSIQEVTWVLLKAFSFNRE
AEHESLENLKPDNVIENKIPFSEEKFKLAADICISNKELNVNQAAPAMAERGQYRAWAMV
SEGESPNPWQLPHGVEPGGAKKSRIGVWEPPPRFQKMYGNAWMPREKFAAGAGTPWRTSA
RTMQKGNVGLEPLHRVPTGALCSGAVRRGPPSSRPQNGRSTYNLHCVPGKAADTQHQPMK
AAGRESVPCKATGAELSKAMGTYLLQQCDLDVRPLFLPSLRYVFISSMKTN

>gi568815593f:57114131_57362749|GENSCAN_predicted_CDS_4|1056_bp
atgggatcgggcagaggaggagcctatgcagagggtgggctggcaaaagcagggggtgca
gagggcctgagaagagtgaacctgcagggcagcagtgacagtcaagagatgattctcttt
ggagaggctgagaagaaaaggaaagagactaatcagggtgtgttaaacttttttaataaa
aaacgaacactagggggcagcaaaccactgttcggaggagcgtctgcacaaaatttaaga
tttgtggaacttgaacttgagagagatgatttagggtatttggcagaagaaatttctaag
cagcaaagcattcaagaggtgacttgggtgctattaaaggcattcagttttaatagggaa
gcagagcatgaaagtttggaaaatttgaagcctgacaatgtgatagaaaataaaatccca
ttttctgaggagaaattcaaactggctgcagatatttgcataagtaacaaggagctgaat
gttaatcaggctgctccagctatggctgaaaggggccaatatagagcttgggccatggtt
tcagagggtgaaagccccaacccttggcagcttccacatggtgttgagcctgggggtgca
aagaagtcaagaattggggtttgggaacctccacctagatttcagaagatgtatggaaat
gcctggatgcccagagagaagtttgctgcaggggcagggacaccgtggagaacctctgct
aggacaatgcagaagggaaatgtggggttggagcccctacacagagtccctactggggca
ctgtgtagtggagctgtgagaagagggccaccatcctccagaccccagaatggtagatcc
acctacaacttgcactgtgtgcctggaaaagccgcagacactcaacaccagcccatgaaa
gcagctgggagggagtctgtaccctgcaaagccacaggggcagagctgtccaaggccatg
ggaacctacctcttgcagcagtgtgacttggatgtgagacctctttttcttcccagtctc
aggtatgtctttattagtagcatgaaaacgaactaa

>gi568815593f:57114131_57362749|GENSCAN_predicted_peptide_5|112_aa
MKMGDGATTSQGMPKTAGNHQKKLGRDKKDFPPGFRKSMALPTPSSETSSPQNSPLMMSV
GIGRVLRTLAWLMDVLHPLTAVARNPNGAQTVLPGMDGEALGEKALSSHWYC

>gi568815593f:57114131_57362749|GENSCAN_predicted_CDS_5|339_bp
atgaagatgggagatggtgcaactacaagccaaggaatgccgaagactgctggcaaccac
cagaagaagctaggaagagacaagaaggatttccctccaggtttcagaaagagtatggct
ctgccaacaccttcctctgagacttctagcccccagaactcccctttaatgatgagtgtg
gggataggcagggtgctcaggaccctggcttggctaatggatgtgttgcatcctctgaca
gcagtggcacggaacccaaacggagcccaaacggttcttcctggaatggatggagaggca
ctgggagagaaagcgctttcttcccactggtactgctaa

>gi568815593f:57114131_57362749|GENSCAN_predicted_peptide_6|181_aa
MSVLNMVVRHDPGSTESAKAKEELIFRCRSRHFRASPLFSQHAAADKHKFQRFLTADMAL
AVTVYAPTTFPPSSLLLFKQKSNGMHNLIATGHLLLVDPNRMVIKRVVLSDPPFKICTKM
AVVCYMFFNREDVQWFKPVELRTKWSRRGHIQEPSGTHGRMKCSFDRKLKSQNTELMNLY
K

>gi568815593f:57114131_57362749|GENSCAN_predicted_CDS_6|546_bp
atgtcagtattgaatatggtggtgaggcatgaccctggcagcactgaatctgcgaaagcc
aaggaggagctcatattccgctgtagatccaggcacttccgagcctcacctttattctct
cagcatgctgcagcagacaaacataaatttcagagattcctgactgctgatatggccctg
gcggtgacagtatacgcaccaaccacttttcctccttcatctttgctgcttttcaagcag
aaaagcaatggaatgcacaatctcattgctacaggccatcttttgctggtggatccaaac
agaatggtcatcaagagagttgtcctgagtgatcctcctttcaaaatttgtactaagatg
gcagtggtatgttacatgttcttcaatagagaggatgtgcagtggtttaaaccagtagaa
ctgagaacaaagtggagccgcagaggacacatccaggagccttcaggtacccatggccgc
atgaagtgcagctttgataggaagctaaaatctcagaacacagaactgatgaacctttac
aaatga

>gi568815593f:57114131_57362749|GENSCAN_predicted_peptide_7|244_aa
MAAHHSSPLKMQNKAHKSGQHRGRGSAQQDGKGHLALKTLSKKVRKELSRVDQRHSASQL
QKQKKEVVLAEKRQLALAVQGLSGLPLKKQIDARKKLSKAVEKHFLDDKLLLLDTQQEAG
MLLRQLANQKQWHLAFRDQQAYLFAHTADFVPSEENNLVGTLEISGYVRGQTLNVNRLLH
ITGRGDFQMKHIDDPMDPFPLNPTVIKSQKDPDMAIEICAMDTVDDMEEDLKVLMKADPA
RQES

>gi568815593f:57114131_57362749|GENSCAN_predicted_CDS_7|735_bp
atggcagcccaccactccagcccactcaagatgcagaataaagctcataaaagtgggcag
catcggggtcggggatctgcacagcaggatggcaagggccatctagcactgaaaacccta
agcaagaaggtgagaaaagaactcagcagagtagaccagaggcattccgccagtcagctc
caaaagcagaagaaggaggtggttctggcggagaagagacagctggcactagctgtccag
ggattatctggcctcccactgaagaaacaaatagatgccagaaagaagctaagtaaagca
gtggagaagcactttctggatgacaaactcctcttgttagacactcaacaggaggcaggg
atgctgcttaggcagttggctaaccagaagcaatggcatcttgcttttcgagatcagcag
gcctacctatttgcccatactgctgattttgttcctagtgaagagaataatttggtgggc
accttggaaatttcaggctatgttcgtgggcagactctgaatgtaaataggttgctgcat
atcactggacgtggtgatttccagatgaaacacatagatgaccccatggaccctttccct
ttaaatcctacagtaattaaatcccaaaaggacccagacatggcaatagagatttgtgct
atggatactgtcgatgatatggaagaagaccttaaagtcctaatgaaggcagatcctgct
agacaggaatcctga

>gi568815593f:57114131_57362749|GENSCAN_predicted_peptide_8|552_aa
MNRDKEGHYIMVKGSVQQEELTIPNIYAPNTGAPILIKQVLSDLQGHLDSHTLIMGDFNT
PLSTLDRSTRQKANKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYSKIDHIL
GSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQNRSTTWKLNNLLLNDYWVHNEMKA
EIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQE
QTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIKKKREKN
LIDAIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVES
LNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRHTVSILISYCQGGSLMSQVYWNQQE
KPPSSSNAPPGLSVNKAQHRHVCCRGEMLKRVQVHYVRAVSCVWPHHPAVRRDVVRGGQP
PAQTAGPLHSGDQLTLPGLVTHDSTGPEASTQKLTQCKRTIFHTSMISSPTNQHFPFPLP
LPAKLSLKNPSL

>gi568815593f:57114131_57362749|GENSCAN_predicted_CDS_8|1659_bp
atgaacagagacaaagaaggccattacataatggtaaagggatcagttcaacaagaagag
ctaactatcccaaatatatatgcacccaatacaggagcaccaatattgataaagcaagtc
ctgagtgacctacaaggacacttagactcccacacattaataatgggagactttaacaca
ccactgtcaacattagacagatcaacaagacagaaagccaacaaggatacccaggaattg
aactcagctctgcaccaagcagacctaatagacatctacagaactctccaccccaaatca
acagaatatacatttttttcagcaccacaccacacctattccaaaattgaccacatactt
ggaagtaaagctctcctcagcaaatgtaaaagaacagaaattataacaaactatctctca
gaccacagtgcaatcaaactagaactcaggattaagaatctcactcaaaaccgctcaact
acatggaaactgaacaacctgctcctgaatgactactgggtacataacgaaatgaaggca
gaaataaagatgttctttgaaaccaacgagaacaaagacacaacataccagaatctctgg
gacgcattcaaagcagtgtgtagagggaaatttatagcactaaatgcccacaagagaaag
caggaaagatccaaaattgacaccctaacatcacaattgaaagaactagaaaagcaagag
caaacacattcaaaagctagcagaaggcaagaaataactaaaatcagagcagaactgaag
gaaatagagacacaaaaaacccttcaaaaaattaatgaatccaggagctggttttttgaa
aggatcaacaaaattgatagaccgctagcaagactaataaagaaaaaaagagagaagaat
ctaatagatgcaataaaaaatgataaaggggatatcaccaccgatcccacagaaatacaa
actaccatcagagaatactacaaacacctctacgcaaataaactagaaaatctagaagaa
atggataaattcctcgacacatacactctcccaagactgaaccaggaagaagttgaatct
ctgaatagaccaataacaggatctgaaattgtggcaataatcaatagcttaccaaccaaa
aagagtccaggaccagatggattcacagccgaattctaccagaggcacacagtatcaata
ctcatcagctattgtcagggtggaagcctgatgagccaagtatactggaaccagcaagag
aaacccccttcctccagcaacgcccctccagggctctctgttaacaaagctcagcatcgt
catgtttgctgcagaggagaaatgctaaaaagggtccaggttcattatgtcagagcagtc
tcctgtgtttggccacatcatccagcagtcagaagggatgtggtccgtggaggtcagccc
ccagcacagacagcaggacctttgcattctggtgaccagctgactctacctggactggtg
actcatgactcaaccggtcctgaggcctccacccagaagctgactcagtgcaagaggacc
attttccacacctctatgatttcatccccaaccaatcagcattttccattccctctccct
cttcctgccaaactatctttgaaaaaccctagcctctga

>gi568815593f:57114131_57362749|GENSCAN_predicted_peptide_9|129_aa
MGKKQNRKTGNSRKQSASPPPKEHSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQ
TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVSA
MEDEMNEMK

>gi568815593f:57114131_57362749|GENSCAN_predicted_CDS_9|390_bp
atggggaaaaaacagaacagaaaaactggaaactctagaaagcagagcgcctctcctcct
ccaaaggaacacagttcctcaccagcaacggaacaaagctggatggagaatgactttgac
gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa
accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata
accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta
cgtgaagaatgcagaagcctcaggagccgatgtgatcaactggaagaaagggtatcagca
atggaagatgaaatgaatgaaatgaagtga