GENSCAN 1.0 Date run: 3-Nov-116 Time: 18:40:06 Sequence gi568815594r:173288344_173491414 : 203071 bp : 38.63% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3765 3931 167 0 2 114 90 90 0.986 10.48 1.02 Intr + 7053 7183 131 1 2 86 91 193 0.999 18.89 1.03 Intr + 7421 7500 80 1 2 80 92 16 0.869 -1.37 1.04 Intr + 13704 13821 118 0 1 94 87 74 0.176 7.45 1.05 Intr + 15653 15775 123 1 0 92 113 77 0.999 10.46 1.06 Intr + 25615 25833 219 0 0 85 100 193 0.988 17.78 1.07 Intr + 29291 29389 99 1 0 88 97 29 0.889 3.19 1.08 Intr + 30088 30216 129 0 0 130 61 76 0.995 9.07 1.09 Term + 33237 33374 138 2 0 90 44 71 0.813 -0.12 1.10 PlyA + 33384 33389 6 1.05 2.07 PlyA - 33660 33655 6 1.05 2.06 Term - 43895 43737 159 2 0 56 36 371 0.975 25.66 2.05 Intr - 44652 44478 175 2 1 125 87 134 0.999 16.02 2.04 Intr - 44871 44726 146 0 2 91 103 184 0.999 18.36 2.03 Intr - 45326 45157 170 0 2 96 100 225 0.961 23.24 2.02 Intr - 45870 45701 170 2 2 88 34 118 0.745 5.07 2.01 Init - 54967 54882 86 1 2 50 34 111 0.324 0.24 2.00 Prom - 64034 63995 40 -5.15 3.03 PlyA - 64910 64905 6 1.05 3.02 Term - 71252 71056 197 0 2 46 38 177 0.764 5.09 3.01 Init - 71486 71333 154 2 1 60 73 88 0.862 4.70 3.00 Prom - 75326 75287 40 -4.05 4.00 Prom + 80692 80731 40 -9.35 4.01 Init + 81775 81855 81 0 0 72 58 57 0.541 2.12 4.02 Intr + 82223 82350 128 1 2 85 41 67 0.669 0.26 4.03 Intr + 82665 82968 304 0 1 67 40 380 0.680 26.77 4.04 Intr + 85047 85172 126 2 0 78 61 92 0.960 5.46 4.05 Term + 88862 88984 123 1 0 111 49 65 0.963 2.30 4.06 PlyA + 89170 89175 6 1.05 5.11 PlyA - 89211 89206 6 1.05 5.10 Term - 103085 102826 260 0 2 106 49 123 0.056 4.93 5.09 Intr - 110900 110783 118 2 1 119 81 6 0.117 2.12 5.08 Intr - 111983 111888 96 2 0 96 18 103 0.151 3.39 5.07 Intr - 117119 117047 73 1 1 59 60 74 0.156 0.19 5.06 Intr - 122050 121674 377 1 2 74 123 99 0.106 4.89 5.05 Intr - 125701 125629 73 0 1 76 77 28 0.048 -1.01 5.04 Intr - 131218 131002 217 2 1 -34 60 402 0.353 21.84 5.03 Intr - 131645 131324 322 2 1 14 68 265 0.030 11.61 5.02 Intr - 141397 141203 195 1 0 79 97 67 0.167 5.39 5.01 Init - 148435 148403 33 1 0 79 97 33 0.150 3.22 5.00 Prom - 156798 156759 40 -3.65 6.04 PlyA - 157725 157720 6 1.05 6.03 Term - 164001 163936 66 0 0 107 39 90 0.476 2.96 6.02 Intr - 177743 177561 183 2 0 69 116 65 0.136 6.46 6.01 Intr - 193771 193718 54 1 0 90 101 28 0.149 2.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 153906 154115 210 2 0 72 32 148 0.880 3.91 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:173288344_173491414|GENSCAN_predicted_peptide_1|401_aa XCKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIVLIDDFSNKEHLK EKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAPLV APISKDRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFVPCS RVGHIYRLEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDEYKDYFYASRPESQALPYGDIS ELKKFREDHNCKSFKWFMEEIAYDITSHYPLPPKNVDWGEIRGFETAYCIDSMGKTNGGF VELGPCHRMGGNQLFRINEANQLMQYDQCLTKGADGSKVMITHCNLNEFKEWQYFKNLHR FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNIHSV >gi568815594r:173288344_173491414|GENSCAN_predicted_CDS_1|1206_bp nnatgcaagtattggcattatgatgaaaacttgctcacttcgagcgttgtcattgtcttc cataatgaaggatggtcaaccctcatgagaacagtccacagtgtaattaaaaggactcca aggaaatatttagcagaaattgtgttaattgacgatttcagtaataaagaacacttaaaa gaaaaactggatgaatatattaagctgtggaatggcctagtgaaggtatttcgaaatgaa agaagggaaggtttaattcaagcacgaagtattggtgctcagaaggctaaacttggacag gttttgatataccttgatgcccactgtgaggtggcagttaactggtatgcaccacttgta gctcccatatctaaggacaggtccccagccatggctgggggattatttgccattgaacga gagttcttctttgaattgggtctctatgatccaggtctccagatttggggtggtgaaaac tttgagatctcatacaagatatggcagtgtggtggcaaattattatttgttccttgttct cgtgttggacatatctaccgtcttgagggctggcaaggaaatcctccgcccatttatgtt gggtcttctccaactctgaagaattatgttagagttgtggaggtttggtgggatgaatat aaagactacttctatgctagtcgtcctgaatcgcaggcattaccatatggggatatatcg gagctgaaaaaatttcgagaagatcacaactgcaaaagttttaagtggttcatggaagaa atagcttatgatatcacctcacactaccctttgccacccaaaaatgttgactggggagaa atcagaggcttcgaaactgcttactgcattgatagcatgggaaaaacaaatggaggcttt gttgaactaggaccctgccacaggatgggagggaatcagcttttcagaatcaatgaagca aatcaactcatgcagtatgaccagtgtttgacaaagggagctgatggatcaaaagttatg attacacactgtaatctaaatgaatttaaggaatggcagtacttcaagaacctgcacaga tttactcatattccttcaggaaagtgtttagatcgctcagaggtcctgcatcaagtattc atctccaattgtgactccagtaaaacgactcaaaaatgggaaatgaataacatccatagt gtttag >gi568815594r:173288344_173491414|GENSCAN_predicted_peptide_2|301_aa MEGIFAFLLAPASAMLCSTTLTPGVARIGASTRSRRSGSRGLTRRAAFGVRAGEGWVCGG PAGSRRRRKLPLTGPGSGSFQCRSRGGRGSVNMGKGDPNKPRGKMSSYAFFVQTCREEHK KKHPDSSVNFAEFSKKCSERWKTMSAKEKSKFEDMAKSDKARYDREMKNYVPPKGDKKGK KKDPNAPKRPPSAFFLFCSEHRPKIKSEHPGLSIGDTAKKLGEMWSEQSAKDKQPYEQKA AKLKEKYEKDIAAYRAKGKSEAGKKGPGRPTGSKKKNEPEDEEEEEEEEDEDEEEEDEDE E >gi568815594r:173288344_173491414|GENSCAN_predicted_CDS_2|906_bp atggaaggcatttttgcttttcttctggctccggcctctgcaatgctgtgctctactacc ctgactccaggagtggcccgaataggagcctctactcggtctcggcgcagtggctctcgg ggtctgacccggcgagcggcatttggggtgcgggccggcgagggctgggtctgtggaggg ccggcgggcagtcggaggaggcggaaactgcccctgaccgggcccggttctgggagtttt caatgtcggtcacgaggtggacgcggatctgtcaacatgggtaaaggagaccccaacaag ccgcggggcaaaatgtcctcgtacgccttcttcgtgcagacctgccgggaagagcacaag aagaaacacccggactcttccgtcaatttcgcggaattctccaagaagtgttcggagaga tggaagaccatgtctgcaaaggagaagtcgaagtttgaagatatggcaaaaagtgacaaa gctcgctatgacagggagatgaaaaattacgttcctcccaaaggtgataagaaggggaag aaaaaggaccccaatgctcctaaaaggccaccatctgccttcttcctgttttgctctgaa catcgcccaaagatcaaaagtgaacaccctggcctatccattggggatactgcaaagaaa ttgggtgaaatgtggtctgagcagtcagccaaagataaacaaccatatgaacagaaagca gctaagctaaaggagaaatatgaaaaggatattgctgcatatcgtgccaagggcaaaagt gaagcaggaaagaagggccctggcaggccaacaggctcaaagaagaagaacgaaccagaa gatgaggaggaggaggaggaagaagaagatgaagatgaggaggaagaggatgaagatgaa gaataa >gi568815594r:173288344_173491414|GENSCAN_predicted_peptide_3|116_aa MKLQTLVVSVTVLKDGVSGVCSFRCSDVSRVSSFRWACGLADLRSEATDLCRMKPQTLAV SVTAHKGGVDPKSEQQQDLLSRAKEQSFHGRGPKQVAAAGSGGQLLFPYVALPTSR >gi568815594r:173288344_173491414|GENSCAN_predicted_CDS_3|351_bp atgaagctgcagaccctcgtggtgagtgttacagttcttaaagatggtgtgtccggagtt tgttccttcagatgttcagatgtgtccagagtttcttccttccggtgggcttgtggtctc gctgacttaaggagtgaagccacagacctttgcagaatgaagccacagacactcgcagtg agtgttacagctcataaaggtggtgtggacccaaagagtgagcagcagcaagatttattg tcaagagcgaaagaacaaagcttccatggaaggggacctaagcaggttgccgctgctggc tcaggtggccagcttttattcccttatgtggccctgcccacgtcccgctga >gi568815594r:173288344_173491414|GENSCAN_predicted_peptide_4|253_aa MEKVGRGGEKEPCCQLVRTPPQEAANLESRNFSENQFYDIPAAKHSPNSLETERARENTV NPNHTLLFSRTVHCFGASRDQQARDSWCLAVPLSNLVCRVNCRCRSGERRSGQERGDFCQ RRPRELGDMNGFTPDEMSRGGDAAAAVAAVVAAAAAAASAGNGTGAGTGAEARHLYICDY HKNLIQSVRNRRKRKGSDDDGGDSPVQDIDTPEIVGCHFRSIPVNEKDTLTYFIYSVKND KNKSDLKVDSGVH >gi568815594r:173288344_173491414|GENSCAN_predicted_CDS_4|762_bp atggaaaaggttgggcgtggaggtgaaaaggagccctgttgtcaactggtccgtactcca cctcaggaggcggccaacttggagtcgaggaacttctctgaaaaccaattctacgatatt ccagcagctaaacacagcccgaattccctagaaacagagagagcgagagaaaacactgta aaccccaatcacacgctcctgttttcccgcactgtccactgtttcggtgccagcagagac cagcaggcccgggacagttggtgtttggccgtgccgctgtctaacttggtgtgcagagtg aattgccgctgccggagcggagagaggcggagcggccaggagagaggggatttctgtcag cgccggcctcgggagctcggagacatgaacggcttcacgcctgacgagatgagccgcggc ggggatgcggccgccgcagtggccgcagtggtcgctgccgcggccgccgccgcctcggcg gggaacgggaccggcgcgggcaccggggctgaggcaaggcatctttacatatgtgattat cataaaaacttaattcagagtgttcgaaacagaagaaagagaaaagggagtgatgatgat ggaggtgattcacctgttcaagatattgataccccagagatagttggttgccactttagg tctattccagtgaatgaaaaagacaccttaacatatttcatctactcagtgaagaatgac aagaacaaatcagatctcaaggttgatagtggtgttcactag >gi568815594r:173288344_173491414|GENSCAN_predicted_peptide_5|587_aa MKVSYSLTEKQTYIENVFLGKVKRKFVQLCACLNRQYNPYGESSSSFCSPLTNKVQNFLR MGSVAHSYNHKLWEAKDDRDIICQVAYARIEGAMIVCTKYGVKAGLTNYAAAYCTGLLLA RRLLNRLGMDKIYEGQVEVTGNEYNVESIDGQPGAFTCYLVADLARTTTGNKVFGAPKGA VDGEVHRKHIMGQKFADDLRYLIEEDENASKKQFSQYTKNSVTPDLMEEMYKKAHAAVRE NPVYEKKPKKEIKKKRYTISLSKDLQTCSLNLGHILPIIKEWPCCYGTLHRQQGSTVHMD AQDNSLLGLHSHNQPLFPKHDSFAKVAGYNWASNHSYTQDLEQAVPTNLYALVLNLSLSQ TLSSWETSVRAILALQKVLLRLNLKWDGDPFPRSSSLSSWRIFSTRVYYGKGEKKSKCAV EPHNENDLSQMINSIIVQRPKPLKVNKISYLYAKHKHLSGHQSWLEQRVCRSQARDKADK GSFVKKLLPAPPLFSFCPNSPRPKAKMKLMVLVFTIGLTLLLGVQAMPANRLSCYRKILK DHNCHNLPEGVADLTQIDVNVQDHFWDGKGCEMICYCNFSELLCCPK >gi568815594r:173288344_173491414|GENSCAN_predicted_CDS_5|1764_bp atgaaggtgtcttactcccttactgaaaaacagacttacattgaaaatgtcttcttaggc aaagtgaaaagaaaatttgtgcaattatgtgcctgtctcaatcgacaatacaatccatat ggagagagcagctcctccttctgctcacccctcacgaacaaggttcaaaattttctcagg atgggttcagtggctcattcctataatcacaagctttgggaagctaaggatgatagagat atcatttgtcaggttgcttatgcccgtatagagggggctatgatagtctgcacaaaatat ggtgtgaaggccggcctgacaaattatgctgcagcgtattgtactggcctgctgctggcc cgcaggcttctcaataggttaggcatggacaagatctatgaaggccaagtggaggtgact ggcaatgaatacaatgtggaaagcattgatggtcagccaggtgcctttacctgctatttg gtcgcagaccttgccagaactaccactggcaataaagtgtttggtgccccgaagggagct gtggatggagaagtacaccggaagcacatcatgggccagaagtttgcagatgacctgcgc tacttaatagaagaagatgaaaatgcttccaaaaaacagttctctcaatacacaaagaac agcgtaactccagacttgatggaggagatgtataagaaagctcatgctgctgtacgagag aatccagtctatgaaaagaagcccaagaaagaaattaaaaagaagaggtacacaataagc ctttccaaggatctccaaacttgttccctcaatcttggacacattctgcccattatcaag gaatggccttgctgctatggaacactgcacaggcagcagggtagcactgtgcatatggac gcccaagacaacagcttattgggtttacactcacacaatcagccactgttccctaaacat gacagctttgccaaagtggctgggtataattgggcttctaaccacagctacacacaggat ctagaacaagctgtccccacaaacctctacgctctggtgttgaacctgtcactgagccag actctctcttcttgggagacctcagtcagagctatcttagcccttcagaaagttttactt aggttgaatctcaaatgggacggagatcccttccccaggtcttcatccctttcctcctgg aggatctttagcactcgagtgtactatggaaagggggagaaaaagagtaaatgtgcagtg gagccacataacgaaaacgacttgagccagatgattaacagtatcatcgtccaaaggccc aagcctttgaaagttaataaaatttcctatctctatgctaaacataaacacttaagtggt catcagtcctggctggagcagcgagtctgtcgatcccaggccagagacaaggcagacaaa ggttcatttgtaaagaagctccttccagcacctcctctcttctccttttgcccaaactca cccaggccaaaagccaaaatgaaactgatggtacttgttttcaccattgggctaactttg ctgctaggagttcaagccatgcctgcaaatcgcctctcttgctacagaaagatactaaaa gatcacaactgtcacaaccttccggaaggagtagctgacctgacacagattgatgtcaat gtccaggatcatttctgggatgggaagggatgtgagatgatctgttactgcaacttcagc gaattgctctgctgcccaaagtaa >gi568815594r:173288344_173491414|GENSCAN_predicted_peptide_6|100_aa EMNSSNFIMITQPPNQDNTGHFVHKWSLNQGEGDLVPNLCQVSFIAGNYPLCLKGSFGEK EMTLPSYLGRRTTCPMCTKPTQHENKEDEDLSDDPPPFNE >gi568815594r:173288344_173491414|GENSCAN_predicted_CDS_6|303_bp gaaatgaattccagcaattttataatgatcactcaaccaccaaatcaagataatactggc cattttgtacacaaatggtctttaaaccaaggggaaggggatttggttccaaatttgtgc caagtatcctttattgctggtaattatcctctctgtttgaaaggaagttttggagaaaag gaaatgacactcccttcatacctgggaagacgaaccacttgcccgatgtgcacaaagccc actcaacatgaaaataaagaggatgaagacctctctgatgatccacctccattcaatgaa tag