GENSCAN 1.0 Date run: 3-Nov-116 Time: 19:15:55 Sequence gi568815592f:24257640_24457891 : 200252 bp : 40.33% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 Intr - 628 416 213 2 0 48 55 143 0.000 4.76 1.07 Intr - 20572 20410 163 1 1 53 103 161 0.906 12.73 1.06 Intr - 31267 31213 55 0 1 68 84 32 0.321 -1.14 1.05 Intr - 33439 33293 147 0 0 62 106 169 0.551 14.53 1.04 Intr - 44153 44058 96 1 0 113 89 34 0.279 4.21 1.03 Intr - 44405 44329 77 2 2 73 89 33 0.654 -0.71 1.02 Intr - 45036 44952 85 2 1 61 69 81 0.441 2.40 1.01 Init - 45537 45482 56 0 2 73 80 86 0.500 7.11 1.00 Prom - 47860 47821 40 -3.25 2.00 Prom + 48087 48126 40 -8.15 2.01 Init + 48531 48593 63 2 0 78 115 62 0.975 9.10 2.02 Intr + 58186 58249 64 0 1 21 105 67 0.044 -0.93 2.03 Intr + 64872 64899 28 1 1 129 80 9 0.086 0.46 2.04 Intr + 67429 67695 267 1 0 53 101 73 0.107 0.82 2.05 Intr + 77172 77326 155 0 2 96 46 143 0.486 9.69 2.06 Term + 81573 81784 212 1 2 89 38 68 0.106 -1.63 2.07 PlyA + 82368 82373 6 1.05 3.07 PlyA - 83611 83606 6 1.05 3.06 Term - 84890 84705 186 2 0 25 34 184 0.304 3.21 3.05 Intr - 87903 87762 142 2 1 24 82 85 0.103 0.93 3.04 Intr - 100208 99819 390 1 0 80 89 393 0.366 31.51 3.03 Intr - 100418 100231 188 2 2 38 14 200 0.134 5.17 3.02 Intr - 102890 102774 117 2 0 101 61 25 0.084 0.84 3.01 Init - 127044 126802 243 0 0 78 101 59 0.227 3.98 3.00 Prom - 127168 127129 40 -3.65 4.00 Prom + 131095 131134 40 -5.75 4.01 Init + 145408 145597 190 0 1 92 84 216 0.978 18.82 4.02 Intr + 151822 151934 113 2 2 90 83 20 0.296 0.88 4.03 Intr + 154583 154756 174 1 0 76 106 89 0.967 8.71 4.04 Intr + 155397 155554 158 2 2 44 94 74 0.503 1.49 4.05 Intr + 158758 158874 117 1 0 67 89 106 0.980 7.26 4.06 Intr + 160445 160597 153 2 0 24 110 127 0.965 6.77 4.07 Intr + 160822 160939 118 1 1 91 78 59 0.877 4.75 4.08 Term + 170768 170833 66 1 0 107 54 28 0.005 -1.74 4.09 PlyA + 170960 170965 6 -0.45 5.12 PlyA - 172546 172541 6 1.05 5.11 Term - 172788 172627 162 0 0 14 38 179 0.289 2.45 5.10 Intr - 179097 178937 161 1 2 97 115 154 0.999 17.79 5.09 Intr - 179650 179474 177 2 0 69 82 219 0.735 18.37 5.08 Intr - 188000 187907 94 2 1 82 90 69 0.872 5.12 5.07 Intr - 188192 188087 106 1 1 74 86 82 0.996 5.90 5.06 Intr - 189340 189199 142 2 1 14 94 149 0.998 6.59 5.05 Intr - 190394 190238 157 2 1 93 67 136 0.985 10.66 5.04 Intr - 190569 190495 75 0 0 93 91 35 0.893 3.09 5.03 Intr - 191699 191508 192 2 0 94 4 133 0.737 4.27 5.02 Intr - 192318 192150 169 2 1 40 78 211 0.578 14.33 5.01 Init - 196533 196376 158 0 2 82 94 146 0.490 12.03 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:24257640_24457891|GENSCAN_predicted_peptide_1|298_aa MAAVLESGHKVKGKIKGTSSWDPGSKKRGKGDHIYPVARAAGIEGAGVKPVIHSRINVSA RFRKPLQEPCTILKTLNQWDHVLQMVTEKITLRSGAVHRYVKTILLYTLEGKLVESGAEL ENGQFYVAVGRDKFKKLPYSELLFDKSTMRRPFGQKASSLPPIVGSRKSKGSGNDRHSKS TVGSSDNSSPQPLKRKGKKEDVNSEKLTKLKQNVKLKNSQETIPNSASSSNPLGSTCTLW KGFSFALHNKSCCCSLFGSAPPLGAVILTVKVSGVIFEVSETTNPPEGTHSEHTTSGX >gi568815592f:24257640_24457891|GENSCAN_predicted_CDS_1|894_bp atggcagctgtacttgagagcggccataaggtcaaggggaaaattaaaggcaccagcagc tgggaccctggaagcaaaaagagaggcaaaggtgatcacatctacccagtggcaagggct gcgggaattgagggtgctggcgtaaaaccagtaatccatagcaggatcaacgtgtcagct cgctttagaaaaccgcttcaggagccgtgcactatcttaaaaaccttgaatcagtgggat catgtactacaaatggtcacagaaaaaatcactctgaggagcggggctgttcacaggtat gtcaaaacaatcttgctttatactttagaaggaaaacttgttgagagtggagcagagttg gagaatgggcagttttatgtggctgttggcagagataagtttaagaaactgccttacagt gagttactttttgacaagtcaacgatgagaaggccttttggtcagaaagcttcttcacta cctcctattgtaggatccagaaagtctaaagggagtggaaatgatcgccactctaagtca acagttggatccagtgacaactcatctcctcagcccctgaagaggaaagggaaaaaagaa gacgtgaattcagaaaaactgacgaaattgaaacaaaatgtaaaattaaagaattcacaa gaaaccattccaaatagtgccagcagcagcaacccacttgggtccacttgcacgctgtgg aagggtttttcttttgctcttcacaataaatcttgctgctgctccctctttgggtccgca ccacctttaggagctgtaatactcactgtgaaggtcagtggcgtcatttttgaagtcagc gagaccacgaacccacccgaaggaacccactctgaacacactacttcaggagnn >gi568815592f:24257640_24457891|GENSCAN_predicted_peptide_2|262_aa MEFTGEQTDQRVICDYETNQMLPHGEETAESKNGSKETKKEADPGDNQLRARECLGIRGS LKRPSPLRIIQNQNLQSPVVKTKTFHLGVREHDIYFLTLLLGSQIKALTYICHLYSVIRF YVTANKQIMRSSGRKENQHGIPDMLVEDAAFETGFLVSVMYQDPGVAKGEKQHLFSRDKA GRAVTRGRHSKHGHNVYIISSRKLLRIPLAYGSHPSSSLCRLYPPTQHTSDSLRTAPSSP LHSQLQESLPCLIYSPLQIQLT >gi568815592f:24257640_24457891|GENSCAN_predicted_CDS_2|789_bp atggagttcactggtgagcagactgaccagcgtgtaatctgtgactacgaaaccaatcaa atgctgccacatggagaagagactgcagaaagcaagaatggaagcaaggagaccaaaaag gaggctgatccaggagataatcaactaagagccagagaatgccttggaatcagaggttcc ctgaagagaccctctcctcttagaataatccaaaaccagaatctccagagccccgtggtc aaaactaaaacgttccatctaggagtgagagagcacgatatctacttcctcacacttctc ctcggttctcaaataaaagcgctcacttacatttgccatctttattctgtgatccgtttt tatgttacagcaaataagcaaattatgaggtcctctgggcgaaaggaaaatcagcatgga ataccagacatgctggtggaagatgctgcatttgaaacgggctttctggtttcagtgatg tatcaggatcctggggtggcaaaaggagaaaaacagcacttattctccagagacaaagca ggcagagcagtcacaagagggcggcacagcaaacatggtcacaacgtatatatcatttct tccaggaagcttctgaggattcccctggcttatggtagccatccttccagcagcctgtgc cgcctgtatcctcccacacagcacacttcggatagcttgcgtacggccccatcttctcca ctacacagccagcttcaggaaagcttgccttgtctgatatattcaccattgcaaatccaa ctaacataa >gi568815592f:24257640_24457891|GENSCAN_predicted_peptide_3|421_aa MEYYAAIKKDEFMSFAGTWMKLKTIILSKLTQEQKTKHHVFSLISGSRTVRTHGHREGNI THWGLSGWECRGGMALGEIPNTELTIYTNSTDVGSWTASQTFSLLAAVKKSFPWRKAWHR KLSTLILPVAGEPGVVRALRVLIGCARHGEKLERLLRVLQAPSVSSGWTPRRTAVTSLSP VDSRRSRALGAVRPLGQRRCLPTRSSELRRRPAGKMSGSSARSSHLSQPVVKSVLVYRNG DPFYAGRRVVIHEKKVSSFEVFLKEVTGGVQAPFGAVRNIYTPRTGHRIRKLDQIQSGGN YVAGGQEAFKKLKENVDQPCFFLVGWRILEVPGEEERKEESETAKLCLAEKFVGTRVVLQ VFGVRAFLWKPRRSAAQLPHSTEKWLVQEDCRSDWQQSVLTPPVLISCFREFFHANKHAM L >gi568815592f:24257640_24457891|GENSCAN_predicted_CDS_3|1266_bp atggaatactatgcagccataaaaaaggatgagttcatgtcctttgcagggacatggatg aagctgaaaaccatcattctcagcaaactaacacaggaacagaaaaccaaacaccacgtg ttttcacttataagtgggagtcgaacagtgagaacacatggacacagggagggaaacatc acacactggggcctgtcggggtgggagtgtagaggagggatggcattaggagaaatacct aatacagagctaacaatctacacaaatagcacagatgtgggctcttggacagccagtcag acatttagccttcttgcggcagtgaaaaaatcctttccctggcgtaaagcttggcacagg aagctgtccaccttgattctgcccgtagcaggtgaacctggtgtggtgcgcgcactgcgg gtcctgattggctgtgcacgccacggtgaaaaacttgagcgtctactccgagtcctgcaa gctcctagtgtctcctctgggtggacgcctaggcgcacggccgttacttctctttcacct gtggattcgaggcgatcccgcgccctgggtgcggtgaggccgctgggacagcggaggtgc ctcccgacgcgaagcagcgagctgaggcggcggccagcggggaagatgagcggcagcagc gccaggtccagccacctgtctcagcccgtcgtgaagagcgtgcttgtgtaccgcaacggg gaccccttctacgcggggcgccgcgtcgtcatccatgagaagaaggtgtccagcttcgaa gtcttcctgaaggaggtgaccggcggcgttcaggcaccctttggggccgtcaggaacatc tacaccccgcggactggccaccgaatccggaagctagaccagatccagagcgggggcaat tacgtggctggaggccaggaagccttcaagaaactcaaggagaacgtggatcaaccttgt ttcttcttagtggggtggagaattttagaagtaccaggagaggaagagagaaaggaagag agtgaaactgcaaaactttgccttgctgaaaagtttgtgggaacccgggttgtacttcag gttttcggggtcagagccttcctgtggaagcccagaaggagcgctgcccagctgccacac tccacggagaagtggctggtgcaagaagactgccggagtgactggcagcagtccgtattg acacctcctgtcttgatttcctgtttcagagagttcttccatgcaaataaacatgcgatg ctgtga >gi568815592f:24257640_24457891|GENSCAN_predicted_peptide_4|362_aa MECLRSLPCLLPRAMRLPRRTLCALALDVTSVGPPVAACGRRANLIGRSRAAQLCGPDRL RVAERKKTELYQELGLQARDLRFQHVMSITVRNNRIIMRMEYLKAVITPECLLILDYRNL NLEQWLFRELPSQLSGEGQLVTYPLPFEFRAIEALLQYWNRWLHFLLGVINEHQLAPASE MISSGSPVTKQGESRREKEDVNSGKSWAPQAGLSELETDIKIFKESILEILDEEELLEEL CVSKWSDPQVFEKSSAGIDHAEEMELLLENYYRLADDLSNAARELRVLIDDSQSIIFINL DSHRNVMMRLNLQLTMGTFSLSLFGLMGVAFGMNLESSLEEAPVCVVSLYVSTCSHLAPT YK >gi568815592f:24257640_24457891|GENSCAN_predicted_CDS_4|1089_bp atggaatgcctgcgcagtttaccctgcctcctgccccgcgcgatgagacttccccggcgg acgctgtgtgccctggccttggacgtgacctctgtgggtcctcccgttgctgcctgcggc cgccgagccaacctgattggaaggagccgagcggcgcagctttgcgggcccgaccggctc cgcgtggcagaaaggaagaaaactgaattataccaagagttaggtcttcaagccagagat ttgagatttcagcatgtaatgagtatcacagtcagaaacaataggattatcatgagaatg gagtatttgaaagctgtgataactccagagtgtcttctgatattagattatcgtaattta aacttagagcaatggctgttccgggaactcccttcacagttgtctggagagggtcaactc gttacataccctttaccttttgagtttagagctatagaagcactcctgcaatattggaat agatggctacacttcctgcttggggtaattaatgagcaccagctagcaccagccagtgag atgataagtagtgggagcccagttaccaagcagggggaaagcagaagagagaaagaagat gtaaacagtggtaagtcatgggctccacaggctggtctatcagagttagaaacagatatt aaaattttcaaagagtcaattttggagatcttggatgaggaagagttgctagaagagctc tgtgtatcaaaatggagtgacccacaagtctttgaaaagagcagtgctgggattgaccat gcagaagagatggagttgctgttggaaaactactaccgattggctgacgatctctccaat gcagctcgtgagcttagggtgctgattgatgattcacaaagtattattttcattaatctg gacagccaccgaaacgtgatgatgaggttgaatctacagctgaccatgggaaccttctct ctttcgctctttggactaatgggagttgcttttggaatgaatttggaatcttcccttgaa gaggccccagtgtgtgttgtttccctctatgtgtccacgtgttctcatttagctcccact tacaagtga >gi568815592f:24257640_24457891|GENSCAN_predicted_peptide_5|530_aa MGTVTSWWAHQATAAPATSTSGACTSSTAMTWACHLLTWTWTRRPTGSLKASRSALSEAV AQPRGSKLVPLQPSGRFGSALAVLDFNVDGVPDLAVGAPSVGSEQLTYKMLPTIAGTTHP GTMCDCGGRGPSLASCPLQTQLLPLPPHTPFLDMPAVCHVLNSSGCLTLRPLLGAVYVYF GSKQGGMSSSPNITISCQDIYCNLGWTLLAADVNGDSEPDLVIGSPFAPGGGKQKGIVAA FYSGPSLSDKEKLNVEAANWTVRGEEDFSWFGYSLHGVTVDNRTLLLVGSPTWKNASRLG HLLHIRDEKKSLGRVYGYFPPNGQSWFTISGDKAMGKLGTSLSSGHVLMNGTLKQVLLVG APTYDDVSKVAFLTVTLHQGGATRMYALTSDAQPLLLSTFSGDRRFSRFGGVLHLSDLDD DGLDEIIMAAPLRIADVTSGLIGGEDGRVYVYNGKETTLGDMTGKCKSWITPCPEEKVGK YSRPTRPSSLVYDDQEPNEVRQQLQQKSLLVESGKGLQRSWARSCIWKDD >gi568815592f:24257640_24457891|GENSCAN_predicted_CDS_5|1593_bp atgggcacggtgacctcgtggtgggcgcaccaggctacagccgccccggccacatccaca tcgggcgcgtgtacctcatctacggcaatgacctgggcctgccacctgttgacctggacc tggacaaggaggcccacaggatccttgaaggcttccaggagtgctctttccgaggccgtg gctcagcccaggggaagtaaacttgtgcctcttcagccctcaggtcggtttggctcggcc ttggctgtgttggactttaacgtggacggcgtgcctgacctggccgtgggagctccctcg gtgggctccgagcagctcacctacaaaatgcttcctaccatagcagggaccacgcatccg ggcaccatgtgtgactgtggaggtcgagggccctcccttgccagctgccctctgcagact caactcttgccccttcctcctcacactccatttctagacatgccagcagtctgtcacgtc ctgaactcatcaggctgtctcactttgcgacctttgctcggtgccgtgtatgtctacttt ggttccaaacaaggaggaatgtcttcttcccctaacatcaccatttcttgccaggacatc tactgtaacttgggctggactctcttggctgcagatgtgaatggagacagtgaacccgat ctggtcatcggctccccttttgcaccaggtggagggaagcagaagggaattgtggctgcg ttttattctggccccagcctgagcgacaaagaaaaactgaacgtggaggcagccaactgg acggtgagaggcgaggaagacttctcctggtttggatattcccttcacggtgtcactgtg gacaacagaaccttgctgttggttgggagcccgacctggaagaatgccagcaggctgggc catttgttacacatccgagatgagaaaaagagccttgggagggtgtatggctacttccca ccaaacggccaaagctggtttaccatttctggagacaaggcaatggggaaactgggtact tccctttccagtggccacgtactgatgaatgggactctgaaacaagtgctgctggttgga gcccctacgtacgatgacgtgtctaaggtggcattcctgaccgtgaccctacaccaaggc ggagccactcgcatgtacgcactcacatctgacgcgcagcctctgctgctcagcaccttc agcggagaccgccgcttctcccgatttggtggcgttctgcacttgagtgacctggatgat gatggcttagatgaaatcatcatggcagcccccctgaggatagcagatgtaacctctgga ctgattgggggagaagacggccgagtatatgtatataatggcaaagagaccacccttggt gacatgactggcaaatgcaaatcatggataactccatgtccagaagaaaaggtgggaaaa tacagcaggcccacaagaccatccagtttggtgtatgatgaccaagagccaaacgaggtc agacagcaactgcagcagaagtctctgctggtggagtcagggaagggcctgcagagatca tgggcccggagctgcatctggaaagatgactag