GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:04:45 Sequence gi568815597f:192616183_192816650 : 200468 bp : 37.01% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 1886 1881 6 1.05 1.03 Term - 2391 2279 113 1 2 66 38 63 0.036 -3.16 1.02 Intr - 7482 7252 231 1 0 51 72 143 0.162 5.82 1.01 Init - 12777 12705 73 0 1 65 92 63 0.910 5.70 1.00 Prom - 16788 16749 40 -6.45 2.03 PlyA - 17714 17709 6 1.05 2.02 Term - 22492 22405 88 0 1 83 44 62 0.335 -2.55 2.01 Init - 25777 25707 71 1 2 87 64 100 0.808 8.07 2.00 Prom - 34093 34054 40 -5.05 3.03 PlyA - 36607 36602 6 1.05 3.02 Term - 44770 44633 138 1 0 67 34 141 0.306 3.58 3.01 Init - 48794 48363 432 2 0 82 86 162 0.563 11.66 3.00 Prom - 50174 50135 40 -3.35 4.03 PlyA - 50646 50641 6 1.05 4.02 Term - 51820 51585 236 2 2 48 54 214 0.905 9.50 4.01 Init - 68204 67283 922 2 1 74 4 262 0.009 10.57 4.00 Prom - 70238 70199 40 -6.15 5.03 PlyA - 70406 70401 6 1.05 5.02 Term - 71599 71267 333 1 0 73 54 340 0.665 22.73 5.01 Init - 77822 77547 276 2 0 48 87 100 0.347 2.93 5.00 Prom - 88830 88791 40 -4.35 6.00 Prom + 96185 96224 40 -7.45 6.01 Init + 98559 98653 95 2 2 96 110 53 0.869 8.20 6.02 Term + 99985 100471 487 1 1 45 36 471 0.823 30.79 6.03 PlyA + 100483 100488 6 1.05 7.06 PlyA - 100533 100528 6 1.05 7.05 Term - 110656 110504 153 1 0 118 53 93 0.635 5.74 7.04 Intr - 112952 112805 148 1 1 40 78 51 0.285 -1.38 7.03 Intr - 129523 129442 82 2 1 97 94 71 0.317 6.38 7.02 Intr - 136016 135864 153 0 0 48 44 115 0.493 2.22 7.01 Init - 136246 136132 115 1 1 83 113 9 0.706 3.38 7.00 Prom - 141682 141643 40 -4.95 8.04 PlyA - 141851 141846 6 1.05 8.03 Term - 151113 150878 236 1 2 115 50 75 0.609 1.90 8.02 Intr - 159079 159011 69 2 0 30 121 58 0.081 1.54 8.01 Init - 167323 167116 208 1 1 30 88 162 0.798 9.43 8.00 Prom - 170315 170276 40 -6.95 9.00 Prom + 175065 175104 40 -3.65 9.01 Init + 180314 180393 80 1 2 33 85 98 0.317 4.59 9.02 Intr + 192865 192999 135 1 0 8 76 127 0.103 2.26 9.03 Intr + 193984 194085 102 1 0 62 108 52 0.886 2.97 9.04 Intr + 194188 194249 62 1 2 64 101 84 0.998 4.76 9.05 Intr + 194799 194965 167 1 2 122 94 78 0.999 10.56 9.06 Term + 195220 195414 195 0 0 97 44 150 0.995 7.93 9.07 PlyA + 196064 196069 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 164773 164613 161 2 2 31 54 124 0.946 0.42 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:192616183_192816650|GENSCAN_predicted_peptide_1|138_aa MRAFGCEVAMSGEALTLLLMVTFLEIQTAIREYYKHLYANKLQNLEEMDKFLDTYTLPRL NQEEIESLNRPITGSEIEAIINSLPTKKVQDQMDSQPNSTGGQSWKWRRVRDDRNVSYFL VIAMDHIKMPQSSRNSMA >gi568815597f:192616183_192816650|GENSCAN_predicted_CDS_1|417_bp atgagggcatttggctgtgaagttgcaatgagtggagaggcattgacattgttgctgatg gtgacgttcctggaaatacaaactgccatcagagaatactataaacacctctatgcaaat aaactacaaaatctagaagaaatggataaattcctggacacatacaccctcccaagacta aaccaggaagaaattgaatccctgaatagaccaataacaggctctgaaattgaggcaata attaatagcctaccaaccaaaaaagtccaggaccagatggattcacagccgaattctaca ggaggccaatcctggaagtggaggagggtgagggatgacagaaatgtgtcctattttttg gttattgccatggatcacatcaagatgccacagtcctcccggaactccatggcttaa >gi568815597f:192616183_192816650|GENSCAN_predicted_peptide_2|52_aa MEGRREIVDVLEEELLELVDPLDMKWNWRLDAVPVTRVLLLCLEPGAFPPVF >gi568815597f:192616183_192816650|GENSCAN_predicted_CDS_2|159_bp atggagggaaggagagagattgtagacgttttggaggaagaactattagaacttgttgac ccattggatatgaaatggaactggagactggacgctgttccagtgactagagttctgctc ctgtgcctggagcctggtgccttcccgcctgtattctaa >gi568815597f:192616183_192816650|GENSCAN_predicted_peptide_3|189_aa MAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFNKVSGYKINVP KSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKQDT NKWKNIPCSWVGRINITKMAILPKAFLVSELEARWFSDLALLSTCAKIGCVVTHIAHLLK VDSATEKRL >gi568815597f:192616183_192816650|GENSCAN_predicted_CDS_3|570_bp atggcaatcaggcaagagaaagaaataaagggtattcaattaggaaaagaggaagtcaaa ttgtctctgtttgcagatgacatgattgtatatttagaaaacccaatcgtctcagcccaa aatctccttaagctgataagtaacttcaacaaagtctcgggatacaaaatcaatgtgcca aaatcacaagcattcctatacaccaataacagacaaacagagagccaaatcatgagtgaa ctcccattcacaattgcttcaaagagaataaaatacttaggaattcaacttacaagggac gtgaaggacctcttcaaggagaactacaaaccactgctcaacgaaataaaacaggacaca aacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcactaaaatggcc atactgcccaaggcttttttggtttcagagctggaggcaaggtggtttagtgacttagcc cttcttagcacctgtgctaagattggctgtgtagttactcacattgctcatcttttaaaa gtggacagcgctacggaaaaacgcctgtga >gi568815597f:192616183_192816650|GENSCAN_predicted_peptide_4|385_aa MLPEFKLYYKAIVTKTAWYWYQNRYIDQWNRTQPSDIIPHIYNQLIFNKPDKTKKWRKDS LFNKWCWENGLAISRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIDM GKDFMTKTPKAKATKAKINKWDLIKLKSFCRAKETTIRVNRQPTEWEKIFAIYPSDKGLL SRIYKELKQIYKKKSNNPIKKWAKDMNRHFSEQDIYAANRHMRKCSSSLVIREMQIKTTM RFHLTPVRMAIIKKSGNNRYRRGCGEIGTLLPCWWACNLVQPLWKTVWQFLKDLELEIPF DPAIPLLGTQLLGSKEQNWMENEFDELTKLSFRRLVITNFTELKKHVLTHRKEAKNLDKR LDEWLTRINSVEKSLNDLMELKTTV >gi568815597f:192616183_192816650|GENSCAN_predicted_CDS_4|1158_bp atgctacctgagttcaaactatactacaaggctatagtaaccaaaacagcatggtactgg taccaaaacagatacatagaccaatggaacagaacacagccctcagatataataccacac atctacaaccaactgatctttaacaaacctgacaaaaccaagaaatggagaaaggattcc ctatttaataaatggtgctgggaaaacgggctagccataagcagaaagctgaaactggat cccttccttacaccttatacaaaaattaattcaagatggattaaagacttaaatgttaga cctaaaaccataaaaaccctagaagaaaacctaggaaataccattcaggacatagacatg ggcaaggacttcatgactaaaacaccaaaagcaaaggcaacaaaagccaaaattaacaaa tgggatctaattaaactaaagagcttctgcagagcaaaagaaactaccatcagagtgaac aggcaacctacagaatgggagaaaatttttgcaatctacccatctgacaaagggctgcta tccagaatctacaaagaacttaaacaaatttacaagaaaaaatcaaacaaccccatcaaa aagtgggcaaaggatatgaacagacacttctcagaacaagacatttatgcagccaataga cacatgagaaaatgttcatcatcactggtcatcagagaaatgcaaatcaaaaccacaatg agattccatctcacaccagttagaatggctattattaaaaagtcaggaaacaacaggtac cggagaggatgtggagaaataggaacacttttgccctgttggtgggcctgtaacctagtt caaccattgtggaagacagtgtggcaattcctcaaggatctagaactagaaataccattt gacccagccatcccattacttggaacccaactcctcggcagcaaggaacaaaactggatg gagaatgagtttgacgagttgacaaaactaagcttcagaaggttagtaataacaaacttc accgagctaaagaagcatgttctaacccatcgcaaggaagctaaaaaccttgataaaagg ttagatgaatggctaactagaataaacagtgtagagaagagcttaaatgacctgatggag ctgaaaaccacagtatga >gi568815597f:192616183_192816650|GENSCAN_predicted_peptide_5|202_aa MIQKKIPLSEEKFKSAAEVCISYEEPNVNPQDSGENVSRTCQRSSWQPLPSQTQKSRRKW FRGPGLRSPCCVQYRHLMPCIPATPAVTKVAKECSSSPATEQSWMENDFDELREEGFRRS VITNFSKLKEDVQTHHKEAKNLEKRLDEWLTRINSVEKSLNDLMELKTTAQELCDACTSF SSRFDHMEERVSVIEDQINEMK >gi568815597f:192616183_192816650|GENSCAN_predicted_CDS_5|609_bp atgatacaaaagaaaatcccactttctgaggagaaatttaagtcagctgcagaagtttgc ataagttacgaggagccaaatgttaatccccaagacagtggggaaaatgtctctaggaca tgtcagaggtcttcatggcagcccctcccatcacagacccagaagtctaggagaaaatgg tttcgtgggccaggcctcaggtccccatgctgtgtgcagtataggcacttgatgccctgc atcccagccactccagctgtgactaaagtggccaaggaatgcagctcctcaccagcaaca gaacaaagctggatggagaatgactttgatgagttgagagaagaaggcttcagacgatcg gtaataacaaacttctctaagctaaaggaggatgttcaaacccatcacaaggaagctaaa aaccttgaaaaaagattagacgaatggctaactagaataaacagtgtagagaagtcctta aatgatctgatggagctgaaaaccacggcacaagaactatgtgatgcatgcacaagcttc agtagccgatttgatcacatggaagaaagggtatcagtgattgaagatcaaatcaatgaa atgaagtga >gi568815597f:192616183_192816650|GENSCAN_predicted_peptide_6|193_aa MQALRVPDSRTWPLDGISGLALCQKGAHCPEGGAATKMQIFVKTLMGKTITLEVELSDTI DNVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGAKKRKKK SYTTPRKNKHKRKKVKLALLKYYKVDENGKISCLHRECPSDECGAGVFMASHFDRHYCGK CCLTYCFNKPEDK >gi568815597f:192616183_192816650|GENSCAN_predicted_CDS_6|582_bp atgcaggctcttagggttcctgattccaggacttggcccttggatggcatttctggactt gccctgtgccagaagggagcccattgccctgaaggtggagccgccaccaaaatgcagatt ttcgtgaaaacccttatggggaagaccatcaccctcgaggttgaactctcggatacaata gataatgtaaaggccaagatccaggataaggaaggaattcctcctgatcagcagagactg atctttgctggcaagcagttggaagatggacgtactttgtctgactacaatattcaaaag gagtctactcttcatcttgtgttgagacttcgtggtggtgctaagaaaaggaagaagaag tcttacaccactcccaggaagaataagcacaagagaaagaaggttaagctggctctcctg aaatattataaggtggatgagaatggcaaaattagttgccttcatcgagagtgcccttct gatgaatgtggtgctggggtgtttatggcaagccactttgacagacattattgtggcaaa tgttgtctgacttactgtttcaacaaaccagaagacaagtaa >gi568815597f:192616183_192816650|GENSCAN_predicted_peptide_7|216_aa MSNSFSISHNSILFSWLCVRNGWVLGLTDFKNKAADLYAIKSSKSAAVHSSHPESFASPS GFMVSLASGVKLQTFVVTVTAHKGGTNPKKTVKALELFFLEYEYSDKQAQCKISLYRSSS KPNYFPKTPSQFSSHWELGLQLTDFCRDTIASTAVSEHKDLSPYTFVITQSSLLSLWEQL LDLRIGPPQQNGDVLIPGWQENMVVHGKPVAGVVSI >gi568815597f:192616183_192816650|GENSCAN_predicted_CDS_7|651_bp atgagcaattccttttctatcagtcataactccatccttttctcatggttgtgtgtccgg aatgggtgggttcttggtcttactgacttcaagaataaagctgcggacctttacgctatt aaaagcagcaagtctgcagctgttcattcctcccatccagagtcgtttgcctctcccagt gggttcatggtctcgctggcttcaggagtgaagctgcagacctttgtggtgactgttaca gctcacaaaggcggcacgaacccaaagaaaacagtcaaagctcttgagctcttcttcctt gagtacgagtacagtgacaaacaggcccaatgcaaaattagtctttaccggagctcatct aaacctaattactttccaaagactccatcccagttctcatcacattgggaattagggctt caacttacagatttttgcagggatacaattgcgtctacagctgtgtctgagcacaaagac ctctctccctatacctttgttataactcagagctctctcctttctctgtgggagcaattg ctggacctcaggattggacccccacagcaaaatggagatgtgctaattcctgggtggcag gaaaatatggtagttcatgggaaacctgttgctggagtggtcagtatttga >gi568815597f:192616183_192816650|GENSCAN_predicted_peptide_8|170_aa MKEFEQQPPTLDLPFDRDYPNEKEPVLLPTLVIEQGSLTPPHPHKIKLVRQQWIQTKKKS LIYLKNNSGDFYKQEHNLTLKALLKETPTFRQGPHSKLWNGASYFKSHSFPFVFTSILIK VLTRSILVICSSAEACFHVLLTLKLKARQRTSSEQFQQNCHCSVTLLEAI >gi568815597f:192616183_192816650|GENSCAN_predicted_CDS_8|513_bp atgaaagaatttgaacaacagcctccaaccctagaccttccctttgacagagactaccca aatgagaaggaaccagttctattaccaactctggtaatagaacaaggctctttaacaccc ccacacccccacaaaatcaaactagttcgccagcaatggatccaaaccaagaagaaatcc ctgatttacctgaaaaataattcaggagatttctacaaacaggaacacaatctcacccta aaagcactgctaaaggagactccaacattcagacaaggaccacattcaaaactgtggaat ggtgcatcatatttcaagtctcatagcttcccttttgtcttcacttccatcctcatcaag gtccttactagaagcatcttggtcatttgttcctctgccgaggcttgttttcatgtgctg ctgacactaaaacttaaggccaggcaaagaacctcctctgaacaattccagcagaactgc cattgttccgtgaccctgctggaggcgatatag >gi568815597f:192616183_192816650|GENSCAN_predicted_peptide_9|246_aa MLAEILELLIAAGISGQDVMAYQLLASRGSSGRTIMQSAMFLAVQHDCRPMDKSAGSGHK SEEKREKMKRTLLKDWKTRLSYFLQNSSTPGKPKTGKKSKQQAFIKPSPEEAQLWSEAFD ELLASKYGLAAFRAFLKSEFCEENIEFWLACEDFKKTKSPQKLSSKARKIYTDFIEKEAP KEINIDFQTKTLIAQNIQEATSGCFTTAQKRVYSLMENNSYPRFLESEFYQDLCKKPQIT TEPHAT >gi568815597f:192616183_192816650|GENSCAN_predicted_CDS_9|741_bp atgttggctgaaattctggagctcttaattgctgctggtatttcaggacaagacgtaatg gcttatcagcttttggcaagccggggctccagcgggagaacgataatgcaaagtgctatg ttcttggctgttcaacacgactgcagacccatggacaagagcgcaggcagtggccacaag agcgaggagaagcgagaaaagatgaaacggacccttttaaaagattggaagacccgtttg agctacttcttacaaaattcctctactcctgggaagcccaaaaccggcaaaaaaagcaaa cagcaagctttcatcaagccttctcctgaggaagcacagctgtggtcagaagcatttgac gagctgctagccagcaaatatggtcttgctgcattcagggcttttttaaagtcggaattc tgtgaagaaaatattgaattctggctggcctgtgaagacttcaaaaaaaccaaatcaccc caaaagctgtcctcaaaagcaaggaaaatatatactgacttcatagaaaaggaagctcca aaagagataaacatagattttcaaaccaaaactctgattgcccagaatatacaagaagct acaagtggctgctttacaactgcccagaaaagggtatacagcttgatggagaacaactct tatcctcgtttcttggagtcagaattctaccaggacttgtgtaaaaagccacaaatcacc acagagcctcatgctacatga