GENSCAN 1.0 Date run: 5-Nov-116 Time: 11:13:58 Sequence gi568815580r:25932398_26190569 : 258172 bp : 37.83% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 120 202 83 2 2 76 107 27 0.228 3.89 1.02 Intr + 10098 10229 132 0 0 77 47 106 0.121 4.24 1.03 Term + 17274 17421 148 0 1 37 43 88 0.072 -4.41 1.04 PlyA + 17469 17474 6 1.05 2.04 PlyA - 18477 18472 6 1.05 2.03 Term - 21008 20819 190 1 1 51 42 227 0.921 10.34 2.02 Intr - 21972 21719 254 0 2 77 98 179 0.589 13.11 2.01 Init - 25079 24939 141 2 0 41 54 165 0.546 8.58 2.00 Prom - 33116 33077 40 -3.45 3.13 PlyA - 33128 33123 6 1.05 3.12 Term - 44442 44288 155 1 2 42 45 139 0.306 2.10 3.11 Intr - 49552 49522 31 1 1 89 75 35 0.052 -0.91 3.10 Intr - 90260 90174 87 2 0 123 63 33 0.450 3.55 3.09 Intr - 100135 100002 134 2 2 75 111 140 0.989 14.44 3.08 Intr - 102730 102608 123 2 0 88 55 194 0.989 15.74 3.07 Intr - 103526 103434 93 0 0 20 55 140 0.865 2.92 3.06 Intr - 106262 106158 105 0 0 76 44 100 0.912 3.67 3.05 Intr - 107059 106892 168 2 0 99 90 99 0.063 10.20 3.04 Intr - 120448 120227 222 2 0 78 88 115 0.153 7.68 3.03 Intr - 125345 125192 154 2 1 74 119 146 0.794 15.02 3.02 Intr - 126965 126826 140 0 2 87 75 32 0.785 1.06 3.01 Init - 130182 130122 61 0 1 78 68 29 0.735 1.36 3.00 Prom - 131284 131245 40 -4.55 4.04 PlyA - 131337 131332 6 1.05 4.03 Term - 158262 157849 414 0 0 22 34 344 0.590 16.68 4.02 Intr - 159038 158878 161 0 2 21 69 145 0.381 4.69 4.01 Init - 166721 166715 7 2 1 62 89 0 0.096 -1.42 4.00 Prom - 178993 178954 40 -6.15 5.00 Prom + 179521 179560 40 -4.35 5.01 Init + 189198 189480 283 2 1 83 74 246 0.370 19.95 5.02 Intr + 190867 191060 194 2 2 17 106 72 0.256 0.19 5.03 Term + 194015 194431 417 1 0 -7 42 219 0.264 2.09 5.04 PlyA + 196147 196152 6 1.05 6.00 Prom + 199593 199632 40 -4.45 6.01 Init + 201569 201670 102 1 0 74 77 96 0.324 7.39 6.02 Intr + 212162 212288 127 1 1 46 80 121 0.578 6.43 6.03 Intr + 219461 219585 125 0 2 57 115 101 0.937 9.08 6.04 Intr + 225725 225847 123 1 0 66 113 93 0.975 9.46 6.05 Term + 229137 229235 99 2 0 50 54 47 0.307 -5.35 6.06 PlyA + 230176 230181 6 1.05 7.02 PlyA - 230671 230666 6 1.05 7.01 Sngl - 238889 238326 564 2 0 73 34 422 0.907 31.19 7.00 Prom - 241479 241440 40 -2.35 8.00 Prom + 242155 242194 40 -9.85 8.01 Init + 242286 242288 3 2 0 56 95 0 0.450 -2.45 8.02 Intr + 246433 246552 120 0 0 70 100 147 0.977 13.87 8.03 Intr + 246671 246733 63 1 0 71 115 30 0.771 2.00 8.04 Term + 251084 251251 168 1 0 34 44 140 0.627 1.10 8.05 PlyA + 251320 251325 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815580r:25932398_26190569|GENSCAN_predicted_peptide_1|120_aa MKICWNILSTTEFIENEIKGFWEKIDVWFSQLQFPEEQSSFAYPANCVKSKGMNTKLPLR PLKVCRELTIDRKGSHMYLIQLHRALGSTEHHLKTTELDLHLHNILKVLRNQQEERQINQ >gi568815580r:25932398_26190569|GENSCAN_predicted_CDS_1|363_bp atgaagatttgctggaatatattgagtacaacagagtttatagaaaatgaaatcaagggc ttctgggagaaaattgatgtctggttctcacaactgcagtttccagaggaacagagcagt tttgcatatcctgccaactgtgtcaaatctaaaggaatgaatacaaaacttccccttcgc cctctgaaggtttgccgtgaactgacaatagacaggaagggcagccacatgtacctcata cagctccacagagccttgggctccacggaacaccatttgaaaaccacagaactggacttg catcttcacaatatattaaaagttcttagaaatcaacaagaagaaagacaaattaaccaa tag >gi568815580r:25932398_26190569|GENSCAN_predicted_peptide_2|194_aa MHKASFECEDEFHVWLGVEQRKTQEVQSEGSGELEEEEGRVRSGWGNGLGHTRVATQSAH SILSSTPPDFSPTNVCPLSLLRAGGVHTGGDAAALERLTWEGTMHRLWDPGQGILESRVP ETQSNIVAKDSGLQHRSLPEDTGHEGNEEEQAEQHAGIPAPATPHAALGGGKLKLTVPDA ANTRAGIVWHLQTH >gi568815580r:25932398_26190569|GENSCAN_predicted_CDS_2|585_bp atgcacaaagccagctttgagtgtgaagatgagtttcatgtttggcttggtgtggagcag aggaaaacccaggaggtgcagagtgagggaagtggagagctagaagaagaggaaggcaga gttagaagtggctgggggaatggccttgggcacacacgtgtggcaacccagtctgcacat tcaattctatcctcaactcccccggacttcagccccacaaacgtctgccccttgtcactt cttagggctgggggtgtgcacactggtggggatgcagctgcccttgagaggttgacttgg gaaggcaccatgcacaggctctgggaccctggacagggcattctggagtcccgggtgcct gaaacacagtctaatattgtggccaaggactcaggactgcaacacagaagccttccggaa gatactggacatgaagggaatgaagaggaacaagcagagcagcatgctggaattcctgca cctgccactccccatgctgcccttgggggtggaaagctcaagctcactgtccctgatgcc gccaacaccagagcaggaattgtctggcatctgcaaactcattaa >gi568815580r:25932398_26190569|GENSCAN_predicted_peptide_3|490_aa MGPFTFNVEHDIVGFVSYKAVQLDCTFYLSCISLLLSKLKEKAKIIVELVLYASVLSGVV APSVPLCPPTQNMPMGPGGMNQSGPPPPPRSHNMPSDGMVGGGPPAPHMQNQMNGQMPGP NHMPMQGPGPNQLNMTNSSMNMPSSSHGSMGGYNHSVPSSQSMPVQNQMTMSQGQPMGNY GPRPNMSMQPNQGPMMHQQPPSQQYNMPQGGGQHYQGQQPPMGMMGQVNQGNHMMGQRQI PPYRPPQQGPPQQYSGQEDYYGDQYSHGGQGPPEGMNQQYYPDGHNDYGYQQPSYPEQGY DRPYEDSSQHYYEGGNSQYGQQQDAYQGPPPQQGYPPQQQQYPGQQGYPGQQQGYGPSQG GPGPQYPNYPQGQGQQYGGYRPTQPGPPQPPQQRPYGYDQTLSLISLSQNIWQTQPQFPF VHCRLETLKEHKAMNFMQIVRPVLEKARAASHDQPRTEYTKGRSWLQYNHSNSIHCGKTH KQLENVKGES >gi568815580r:25932398_26190569|GENSCAN_predicted_CDS_3|1473_bp atgggtccatttacatttaatgtagaacatgatatagtaggatttgtgtcttacaaggca gtccaattggactgcactttctacttgagctgtatttccctgttactcagcaaactcaag gaaaaagccaagataattgtggagctggtcttatatgcttccgttctttcaggagttgta gccccttctgttcctctttgtccacccacacagaatatgcctatgggtcctggagggatg aatcagagcggccctcccccacctccacgctctcacaacatgccttcagatggaatggta ggtgggggtcctcctgcaccgcacatgcagaaccagatgaacggccagatgcctgggcct aaccatatgcctatgcagggacctggacccaatcaactcaatatgacaaacagttccatg aatatgccttcaagtagccatggatccatgggaggttacaaccattctgtgccatcatca cagagcatgccagtacagaatcagatgacaatgagtcagggacaaccaatgggaaactat ggtcccagaccaaatatgagtatgcagccaaaccaaggtccaatgatgcatcagcagcct ccttctcagcaatacaatatgccacagggaggcggacagcattaccaaggacagcagcca cctatgggaatgatgggtcaagttaaccaaggcaatcatatgatgggtcagagacagatt cctccctatagacctcctcaacagggcccaccacagcagtactcaggccaggaagactat tacggggaccaatacagtcatggtggacaaggtcctccagaaggcatgaaccagcaatat taccctgatggtcataatgattacggttatcagcaaccgtcgtatcctgaacaaggctac gataggccttatgaggattcctcacaacattactacgaaggaggaaattcacagtatggc caacagcaagatgcataccagggaccacctccacaacagggatatccaccccagcagcag cagtacccagggcagcaaggttacccaggacagcagcagggctacggtccttcacagggt ggtccaggtcctcagtatcctaactacccacagggacaaggtcagcagtatggaggatat agaccaacacagcctggaccaccacagccaccccagcagaggccttatggatatgaccag actctaagccttatctccttaagtcagaacatctggcagactcagcctcagttccccttc gtgcactgcagactggaaactctcaaggaacataaggcaatgaacttcatgcagatagta aggccagtcttggagaaagccagggcagccagccatgaccaaccacgaactgaatataca aaaggaaggagctggcttcagtacaaccacagtaattccatccactgtggcaagacacat aaacagttggaaaatgttaaaggagaaagttga >gi568815580r:25932398_26190569|GENSCAN_predicted_peptide_4|193_aa MSWFVPRGELFSVRSSQAGRLPLSTGHNLGVPVSGQGYFPIVRHDFVRACETVDSEVVQP ERPASLPQFAVHPERSGLADSGDGGNMSVAFAAPRQRGKGEITPAAIQKVKPRGVADARP LPPLGETDAGGRAGPGVDRAEAEAGLLGISSVWEHGLLAVPEWVAEEGEPVFLGVFHAFT RFSTDTKAQTAAA >gi568815580r:25932398_26190569|GENSCAN_predicted_CDS_4|582_bp atgtcttggtttgtcccaagaggcgagctcttctctgtccgctccagccaagctggccga ttgcccctgagcactggacacaaccttggtgtcccggtttcgggtcagggttactttcca attgttaggcacgactttgtgcgtgcgtgcgagactgtggactcggaggtggttcagccc gagaggccggcgtctctcccccagtttgccgttcacccggagcgctcgggacttgccgat agtggtgacggcggcaacatgtctgtggctttcgcggccccgaggcagcgaggcaagggg gagatcactcccgctgcgattcagaaggtgaaaccgcgcggggttgcggatgccaggccc ttaccgcctctgggagagacagacgcggggggaagggccgggcccggagtcgaccgggcc gaggcggaggcgggcctgctgggaatcagcagtgtttgggaacacggactgctggctgtg cctgagtgggtggcggaagagggtgaaccggtttttctcggagtctttcacgcatttacg cgtttttctacagacacaaaagcccaaacagccgccgcctga >gi568815580r:25932398_26190569|GENSCAN_predicted_peptide_5|297_aa MAEMTETGFRMWIKMNFSELKEHVATEGKEAKNHDKTMQELTAKIARIERNITDLIELKN TLQELHNAITSINSRIDQVEKRISELEDCLSEMTAYGTYSKIDHIIGNKTLLSKCKRTEI ITNSLLDHTRIKLEPKIKKFTQNLTTTWKLNKLLLNDFWKLNKNRKTLHKAKNEDHNYVL KEWIHWCGSKHMPLNGILIRKQAKIYHDELKTEEDCEYSTSWLQKFKKQHGIKILKLCGD KVSDDHKAVEKIIHNFAKVIADKNLIPELLLMKILSQNKSIMLIKHHCFGITAPERT >gi568815580r:25932398_26190569|GENSCAN_predicted_CDS_5|894_bp atggctgaaatgacagaaacaggcttcagaatgtggataaaaatgaacttcagtgagcta aaggagcatgttgcaaccgaaggcaaggaagccaagaatcatgataaaacaatgcaagag ctgacagccaaaatagccagaatagagaggaacataactgacctgatagagctgaaaaac acactacaagaacttcacaatgcaattacaagcattaacagcagaatagaccaagtggag aaaagaatctcagagcttgaagactgtctttctgaaatgacagcatatggcacttactct aaaattgatcacataatcggaaataaaacactcctcagcaaatgtaaaagaaccgaaatc ataacaaacagtctcttggaccacaccagaatcaaattagaacccaagattaagaaattc actcaaaaccttacaactacatggaaattaaacaagctgctcctgaatgacttttggaag ttaaacaaaaatagaaaaacgctacataaagctaaaaatgaggatcacaattatgtattg aaagagtggatccactggtgtggcagtaaacacatgcctcttaatggtatactgatcagg aaacaagcaaagatctatcatgatgaactgaaaactgaggaggactgtgaatactcaaca agctggttgcagaaatttaagaaacaacatggcattaaaattttaaagctctgtggtgat aaagtatctgatgaccacaaagcagtggaaaaaatcattcacaactttgccaaagtcatt gctgataaaaatcttatcccagagttactgctgatgaaaatcttatcccagaacaagtct ataatgctgataaaacatcattgttttggcattactgcaccagaaagaacctga >gi568815580r:25932398_26190569|GENSCAN_predicted_peptide_6|191_aa MASRYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGIRGTNIVVLGVEKKSVAKLQDERT VRKICALDDHVCMAFAGLTADARVVINRARVECQSHKLTVEDPVTVEYITRFIATLKQKY TQSNGRRPFGISALIVGFDDDGISRLYQTDPSGTYHAWKFLAQRLCFLEYYDGEESYSGQ LKWCMAVVLAT >gi568815580r:25932398_26190569|GENSCAN_predicted_CDS_6|576_bp atggcgtctcgatatgacagggcgatcactgtcttctccccagacggacacctttttcaa gttgaatatgcccaggaagcggtgaagaaaggatccaccgcggtcggaattcgaggtacc aatatagttgttcttggggtagaaaaaaaatctgttgccaagcttcaagatgaaagaact gtgaggaaaatttgtgcccttgatgaccatgtctgcatggcttttgcaggacttactgct gatgctagagtagtaataaacagagcccgtgtggagtgccagagccataagcttacggtt gaggacccagtcactgtagaatacataactcgcttcatagcaactttaaagcagaaatat acccaaagcaatggacgaagaccttttggtatttctgccttaattgtaggttttgatgat gatggtatctcaagattgtatcagacagatccttctggtacttatcatgcttggaagttc cttgctcagaggctatgttttctagaatactatgatggtgaagaaagctattcaggtcag ctaaagtggtgcatggcagtagtcctagctacttga >gi568815580r:25932398_26190569|GENSCAN_predicted_peptide_7|187_aa MVGSLNCIVAVSQNMGIGKNGDLPWPPLRNEFRYFQRMTTTSSVEGKQNLVIMGKKTWFS IPEKNRPLKGRINLVLSRELKEPPQGAHFLSRSLDDALKLTEQPELANKVDMVWIVGGSS VYKEAMNHPGHLKLFVTRIMQDFESDTFFPEIDLEKYKLLPEYPGVLSDVQEEKGIKYKF EVYEKND >gi568815580r:25932398_26190569|GENSCAN_predicted_CDS_7|564_bp atggttggttcgctaaactgcatcgtcgctgtgtcccagaacatgggcatcggcaagaac ggggacctgccctggccaccgctcaggaatgaattcagatatttccagagaatgaccaca acctcttcagtagaaggtaaacagaatctggtgattatgggtaagaagacctggttctcc attcctgagaagaatcgacctttaaagggtagaattaatttagttctcagcagagaactc aaggaacctccacaaggagctcattttctttccagaagtctagatgatgccttaaaactt actgaacaaccagaattagcaaataaagtagacatggtctggatagttggtggcagttct gtttataaggaagccatgaatcacccaggccatcttaaactatttgtgacaaggatcatg caagactttgaaagtgacacgttttttccagaaattgatttggagaaatataaacttctg ccagaatacccaggtgttctctctgatgtccaggaggagaaaggcattaagtacaaattt gaagtatatgagaagaatgattaa >gi568815580r:25932398_26190569|GENSCAN_predicted_peptide_8|117_aa MANAIGRSAKTVREFLEKNYTEDAIASDSEAIKLAIKALLEVVQSGGKNIELAIIRRNQP LKEFKTLVEEVTADVVQIARELKLEVEPEYATELLQSCDKTDDELLLKDEKRMWFLG >gi568815580r:25932398_26190569|GENSCAN_predicted_CDS_8|354_bp atggcaaatgcaataggccgaagtgctaaaactgttcgagaatttctagaaaagaattac acagaagatgccatagcaagtgacagtgaagctatcaagttagcaataaaagctttgcta gaagttgtccagtctggtggaaaaaacattgaacttgctataataagaagaaatcaacct ttgaaggagttcaagactttagtggaggaagtcactgcagatgtggtacaaatagcaaga gaactaaaattagaagtggagcctgaatatgcaactgaattgctgcaatcttgtgataaa acagatgatgagttgcttcttaaagatgagaaaagaatgtggtttcttggatga