GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:49:34 Sequence gi568815591r:31238258_31439268 : 201011 bp : 38.97% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 11762 11875 114 1 0 104 84 51 0.643 6.02 1.02 Intr + 13350 13476 127 2 1 -12 90 111 0.201 0.63 1.03 Intr + 24226 24363 138 2 0 45 91 82 0.270 3.71 1.04 Intr + 42562 43010 449 2 2 67 86 234 0.019 13.24 1.05 Intr + 53044 53119 76 0 1 18 84 76 0.012 -1.63 1.06 Intr + 58477 58573 97 2 1 38 78 90 0.033 1.15 1.07 Intr + 67552 67726 175 1 1 55 70 100 0.041 3.92 1.08 Intr + 79777 79900 124 0 1 15 101 41 0.013 -2.66 1.09 Intr + 85407 85704 298 1 1 122 31 160 0.172 8.71 1.10 Intr + 95365 95447 83 1 2 44 98 44 0.023 -0.64 1.11 Term + 98368 98684 317 2 2 79 54 165 0.247 6.32 1.12 PlyA + 99457 99462 6 1.05 2.02 PlyA - 99691 99686 6 1.05 2.01 Sngl - 101011 99998 1014 1 0 55 39 648 0.953 53.36 2.00 Prom - 101243 101204 40 -6.05 3.00 Prom + 101710 101749 40 -7.25 3.01 Init + 117870 117929 60 2 0 62 100 37 0.509 3.60 3.02 Intr + 119464 119597 134 0 2 69 51 99 0.378 2.82 3.03 Intr + 119688 119781 94 0 1 26 61 127 0.426 2.85 3.04 Term + 130526 130900 375 1 0 51 43 341 0.541 19.65 3.05 PlyA + 130970 130975 6 1.05 4.06 PlyA - 131006 131001 6 1.05 4.05 Term - 133089 132900 190 2 1 75 49 80 0.537 -1.26 4.04 Intr - 133770 133549 222 2 0 49 93 101 0.705 3.02 4.03 Intr - 134770 134333 438 0 0 55 70 372 0.547 23.80 4.02 Intr - 134968 134836 133 2 1 13 47 72 0.689 -5.62 4.01 Init - 135914 135617 298 2 1 68 82 198 0.806 14.66 4.00 Prom - 153860 153821 40 -5.25 5.06 PlyA - 154082 154077 6 1.05 5.05 Term - 167962 167812 151 0 1 92 49 178 0.933 10.70 5.04 Intr - 175658 175559 100 0 1 -20 75 111 0.020 -2.75 5.03 Intr - 182161 182031 131 0 2 22 88 300 0.984 22.82 5.02 Intr - 184840 184714 127 2 1 31 95 65 0.132 0.32 5.01 Init - 192657 192450 208 0 1 57 53 144 0.432 6.83 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 42333 43010 678 2 0 42 86 299 0.821 20.28 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:31238258_31439268|GENSCAN_predicted_peptide_1|665_aa THSALFMETTMLVAHSLSLAQVFSEELQSLDVDRFEVGKETHRLQELRRLLKLPPGGGLR EVEADVPREELSVDFVQVPQGISHMEKDVREGQSNWRYKWLGYGAVKHHNGSNTNSVLQL LQLALAVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISN FSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIHLTRDVKYLLKEN YKSPLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKTLELLQPSVNKTSGGKGGACRFPL TECGRKEQEPEHNGCKGKINAGQTKGTIVYEIQRKSETVQRHTIPLGRVNTKLAPPDQSL VLCAALSTESCNANAILTPWNQIQGLVYGGVLARLNTQWAYDHIARIWNISAKTPFLDEG ILLQERDSDTKQNGPALILSPSMPLSPQLQLRSLFNFRHRQLPFNRLVHNSVRTFHKRLS LDKFKTEGLIAVPPQPARSRSHRTFHKGLSLNKFKTEGLITVPPRPARSRSHIEPEIGDE SQREQFQSGVQMPKKMIEVKGQVSVGAAARVRRAQSPGPQGAHCGRDTGSAACARAATEA TQRRQQPLPAPPARPRRACAGSGGAHKVPEGQRWPGLPKTCPVPPPLHPLSAALIGLRSC PRSSP >gi568815591r:31238258_31439268|GENSCAN_predicted_CDS_1|1998_bp acccatagtgctttatttatggagactactatgctggtagcccacagcctaagtcttgct caggtgttttctgaagagttgcagtctctggatgtagataggtttgaggtggggaaagag acccatcggttgcaggagctgaggcgattgcttaaactgccacctggaggaggattgaga gaggtagaggctgatgttcccagagaagaactttctgtggattttgtccaagtgccccag ggaattagtcatatggagaaagatgtacgtgaaggacagtccaattggagatacaagtgg ttgggatatggagctgtgaaacatcataatggctcaaatacaaatagtgtattacaatta ttacaactagcacttgcagtgttggaagttctggccagggcaatcaggcaggagaaggaa ataaagggtattcaattaggaaaagaagaagtcaaattgtccctgtttgcagacgacatg attgtatatctagaaaaccccattgtctcagcccaaaatctccttaagctgataagcaac ttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacacc aataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttcaaag agaataaaatacctaggaatccaccttacaagggacgtgaagtacctcctcaaggagaac tacaaatcaccgctcaatgaaataaaagaggatacaaacaaatggaagaacattccatgc tcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaagactttagaacta ttacagccctctgtaaataagacttcaggtggaaaaggtggagcatgccggtttcctcta actgaatgtggaagaaaagaacaagaaccagaacataatgggtgcaaaggtaagataaat gcaggacagacaaaaggaaccattgtttatgaaatccagaggaaatcagaaacagtccaa aggcacactattcccttaggaagagtcaacacaaaattagcaccaccagaccaaagtcta gtgctttgtgctgccttaagcacagaatcctgcaatgcaaatgccattttaactccttgg aatcaaattcagggcctggtgtatggaggtgttctggctaggttgaacactcaatgggct tatgatcacatagccagaatctggaatattagtgcaaaaacaccattcttggacgaagga atattactccaagagagggatagtgacactaagcagaatggccctgcactgattttgagc ccatccatgcccctaagtccacaactgcagctcagatctctcttcaacttcagacataga cagctgccattcaaccgcctagtacataactcagtcaggactttccacaagagacttagt ctcgacaagttcaaaactgagggtctcatcgctgtcccaccccagcctgcccgatcgagg tcacacaggactttccacaagggacttagtctcaacaagttcaaaactgagggtctcatc accgtcccaccccgacctgcccgatctaggtcacacatagaaccagagataggtgatgag agccagagagagcagtttcaaagtggggtacaaatgcccaaaaagatgattgaggttaaa ggccaggtcagtgtgggcgccgcggcccgggttcgcagggcgcagtcccccggcccccag ggcgcccactgcgggcgggacacgggcagtgcagcctgtgcccgggcggccaccgaggcg acgcaacgccgccagcaacctctccccgcgccgcccgccaggccccgtcgggcctgcgcc ggctctggcggtgcccacaaagtccccgaggggcagcgttggccaggcctacccaagacc tgccccgtcccgccaccactgcacccactttctgctgccctcatcggtctgcgcagctgt ccccgctcctccccctga >gi568815591r:31238258_31439268|GENSCAN_predicted_peptide_2|337_aa MLTLPFDESVVMPESQMCRKFSRECEDQKQIKKPESFSKQIVLRGKSIKRAPGEETEKEE EEEDREEEDENGLPRRRGLRKKKTTKLRLERVKFRRQEANARERNRMHGLNDALDNLRKV VPCYSKTQKLSKIETLRLAKNYIWALSEILRIGKRPDLLTFVQNLCKGLSQPTTNLVAGC LQLNARSFLMGQGGEAAHHTRSPYSTFYPPYHSPELTTPPGHGTLDNSKSMKPYNYCSAY ESFYESTSPECASPQFEGPLSPPPINYNGIFSLKQEETLDYGKNYNYGMHYCAVPPRGPL GQGAMFRLPTDSHFPYDLHLRSQSLTMQDELNAVFHN >gi568815591r:31238258_31439268|GENSCAN_predicted_CDS_2|1014_bp atgttaacactaccgtttgatgagtctgttgtaatgccagaatcccagatgtgcagaaag ttttctagagaatgcgaggaccagaagcaaattaagaagccagaaagcttttccaaacag attgtccttcgaggaaagagcatcaaaagggcccctggagaagaaaccgagaaagaagaa gaggaggaagacagggaagaggaagatgaaaatgggttgcctagaaggaggggtcttagg aaaaaaaagacaacaaagctgcgattggaaagggtcaagttcaggagacaggaagcgaac gcgcgcgagaggaacaggatgcacggcctcaacgacgctctggacaacttaagaaaagtg gtcccctgttattctaaaacccagaaactgtccaaaatagagactttacgactggccaaa aactacatctgggcactttctgaaattctgagaatcggcaagagaccagatctgctcaca ttcgtccaaaacttatgcaaaggtctttcccagccaactacaaacttggtggcaggctgc ttgcagctcaacgccaggagtttcctgatgggtcagggtggggaggctgcacaccacaca aggtcaccctactctaccttctacccaccctaccacagccctgagctcaccactccccca gggcatgggactcttgataattccaagtccatgaaaccctacaattattgcagtgcgtat gaatccttctatgaaagtacttcccctgagtgtgccagccctcagtttgaaggtccctta agtcctcccccaattaactataatgggatattttccctgaagcaagaagaaaccttggac tatggtaaaaattacaattacggcatgcattactgtgcagtgccacccaggggtcccctt gggcagggtgccatgttcaggttgcccaccgacagccacttcccttacgacttacatctg cgcagccaatctctcacaatgcaagatgaattaaatgcagtttttcataattaa >gi568815591r:31238258_31439268|GENSCAN_predicted_peptide_3|220_aa MSNDCIAKVASLINTRVSLKTMQIPTLVQPWKPSCLECSNQVTEEYDTLPKRYQVEDPIG RGQQQLTRDETKLRAVVKLLQVDHDGACRATMEMKQGAGERMKRELECTSTGIGQPLQCQ WDQTPLTQTYCVPPTVGGSAQVSGCRSWSKCFWVLAGANYMQALPQRQGRVSATLEAPEG MLQCPFSSVIRGRLKYEQSVSPLPFHMRQLPSAIEGKGPV >gi568815591r:31238258_31439268|GENSCAN_predicted_CDS_3|663_bp atgagcaatgattgcattgcaaaagtggcctctctcattaatacccgagtcagtctgaag acaatgcagatccccaccttggttcagccctggaaaccctcttgcctggagtgcagtaac caagttacagaggaatatgacacattgccaaagaggtatcaagttgaagatcctataggg cgtggtcagcaacagcttaccagagatgaaacaaagttgagggcagtcgtgaagttattg caggtggaccatgatggtgcctgcagggccacgatggagatgaagcaaggagcaggtgag cgaatgaagcgggaactggagtgcacaagcactggaattggccagccacttcagtgccag tgggatcaaactccactcactcagacctactgcgttccacccaccgtgggagggagtgca caggtgagtggatgcaggagctggagcaagtgcttttgggtgctggcaggagcaaattac atgcaggccctgccgcagcgtcaaggcagggtgtctgcgactctggaagccccagagggc atgttacagtgcccttttagctctgtcatccgtggacggcttaagtatgaacagtcagtg agccctctgccctttcacatgaggcagctgccctctgccattgagggcaaagggccagtg tag >gi568815591r:31238258_31439268|GENSCAN_predicted_peptide_4|426_aa MVAPVLASQPFDSHLFSVTACLVSDSPTLSVYHVSPLSVLGTSPLCGKMHHPQRQPLLHM GQELADKWPSLPEALYPAEEVSLGSSLCRAQQPLRDIPEVREIPVYLMEHQQGAGNRSTS KQRSDNRALAEDQTALCWGILPPGNLCSPQAFIQEMFDGCLCLRHAHTFQPPRCQPYAPN EQPVFIITGSLYRACPVANCVTLSSKKLGYVESLFWAGWIKSYRRLCSVEETDGTDGDAV AEHKPKRRANGCWTSRTGCRSEKEAEFGFTHLFSQLVNPPSSPLPASYLRTLQTDFTMHC YHPGHIYCLIFNWGDLSKPLRINGCKLVELKGLWRKPFANNRETMALQAGPPEGKYTFGG LVACVVQSFQLSVPTMPIPTKIKMGDNLFTKAEKCYRIFYRHSADHQTEELNAQAEIFGR KKYIFL >gi568815591r:31238258_31439268|GENSCAN_predicted_CDS_4|1281_bp atggtggccccggtgcttgcatcacagcccttcgactcacatctgttttcagtgacagct tgtctggtttcagattcacccaccttaagtgtgtaccatgtgtctccactttctgttctg gggacttctccactctgtggtaaaatgcatcatcctcagaggcagcccttactccatatg ggacaggagctggcagacaaatggcccagcctcccggaggcactctaccctgctgaggag gtctccctgggctcgagcctctgccgtgctcagcagcccctccgtgacatacccgaagtg agggagatccctgtctacctgatggagcatcagcaaggggcagggaacaggtctacatca aaacagcgttcagacaatagggcattagctgaggaccaaactgccttgtgttggggcatc ctcccaccaggaaacctttgcagcccacaagcattcattcaagagatgtttgatgggtgt ctttgtcttcgacatgctcacacctttcagcccccacgatgtcaaccatatgctcctaat gagcaacccgtgtttattataacagggtctctttatcgtgcatgccctgtagccaactgt gtcacactctcttctaagaagcttgggtatgtggagtcattgttctgggcaggctggatc aagagctaccggaggttgtgcagtgtcgaggaaacagatggaacagatggggacgctgtg gcagagcacaaacccaagagaagagccaacgggtgctggaccagccgcacagggtgtagg agtgagaaggaggcagaatttggctttactcatcttttttcccagctggtgaatccacct tcttctcctttacctgcttcctacctaaggacacttcaaacagacttcacaatgcactgt taccatccgggccatatttattgtttaatatttaactggggagatttgtcaaaaccctta agaattaatggttgcaaacttgtggagttaaaaggcttgtggaggaaaccatttgctaac aacagggagacaatggccctgcaggctgggccaccagaaggcaaatacacatttgggggt ctagtggcctgtgtggtccaatcatttcagttaagtgtcccaactatgccgattcctacg aaaataaaaatgggtgataatctgttcactaaagccgagaaatgttatagaatattttac agacatagtgccgaccaccagactgaggaattgaatgcccaggctgaaattttcggcagg aaaaaatatattttcctgtga >gi568815591r:31238258_31439268|GENSCAN_predicted_peptide_5|238_aa MPVVHIQNSLNIVQDSCCFSVYKYKKVIDPHELVGQDPEQWSRGEPENFSPLLQENKKNM KAGRMVDREEGDAEDEQVPGLQPKFLAARHSSGHLLTHIQLLSPAQVHIFPRWNDDDDDD DDDDDDDDEMTIIMMKEKDQRKENRNESVPSNGKGGTYAGNLQAMPWDCVVERSHGTAVN TAGQNNMDSLPTFISGLEIPENSKVAKGVEEEVMVAQCILGNVPPHKVFSQRHKQEEL >gi568815591r:31238258_31439268|GENSCAN_predicted_CDS_5|717_bp atgcctgtagtacacattcagaattcactaaatattgtgcaggacagctgctgcttttca gtctacaagtataaaaaagtgattgacccacatgagcttgtgggccaggaccctgaacag tggagccgtggtgagcctgaaaacttctctccacttctccaggagaataagaagaatatg aaagcagggaggatggtggacagggaggagggtgatgcagaggacgaacaggtgccaggt ctccaacccaagttcctggctgccaggcacagctcgggccatctactcacacacatacag ctcctctccccagctcaggtccacatattcccaagatggaatgatgatgatgacgatgat gatgatgatgatgatgatgatgatgagatgacaatcattatgatgaaagagaaggaccaa aggaaagaaaacaggaatgagagtgttcctagtaatggaaaaggagggacctatgctggc aatcttcaggcaatgccctgggactgtgttgtggagagaagccatggaacagctgtgaac actgccgggcagaataacatggacagtcttcccacattcatatcgggactggaaattcca gaaaacagtaaggtagcaaaaggagttgaggaagaagtgatggtggcccagtgcattcta ggaaatgtccctccacacaaagtattcagccagaggcacaaacaagaagaactttag