GENSCAN 1.0 Date run: 7-Nov-116 Time: 23:53:03 Sequence gi568815575f:96784311_96985093 : 200783 bp : 35.79% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1040 1135 96 1 0 84 65 99 0.657 7.56 1.02 Intr + 2446 2476 31 0 1 85 116 -3 0.713 -1.01 1.03 Intr + 4040 4060 21 0 0 134 72 6 0.135 0.20 1.04 Term + 11644 11711 68 2 2 63 47 109 0.106 1.42 1.05 PlyA + 13181 13186 6 1.05 2.02 PlyA - 14279 14274 6 1.05 2.01 Sngl - 28826 28266 561 2 0 49 41 213 0.530 8.59 2.00 Prom - 28919 28880 40 -5.65 3.05 PlyA - 29090 29085 6 1.05 3.04 Term - 29994 29318 677 1 2 -33 43 282 0.390 4.49 3.03 Intr - 30599 30466 134 1 2 57 -20 136 0.377 -0.93 3.02 Intr - 40171 40042 130 2 1 76 100 51 0.025 3.93 3.01 Init - 56288 56198 91 2 1 80 86 41 0.106 3.80 3.00 Prom - 70229 70190 40 -2.55 4.00 Prom + 70250 70289 40 -6.05 4.01 Init + 74053 74286 234 0 0 62 77 109 0.516 5.39 4.02 Intr + 97269 97403 135 2 0 94 53 114 0.012 8.34 4.03 Term + 100001 100786 786 1 0 -20 54 853 0.007 63.44 4.04 PlyA + 101387 101392 6 1.05 5.03 PlyA - 101491 101486 6 1.05 5.02 Term - 115449 114374 1076 1 2 67 47 435 0.682 28.58 5.01 Init - 116094 115785 310 0 1 31 37 163 0.677 3.02 5.00 Prom - 119642 119603 40 -4.85 6.00 Prom + 130739 130778 40 -6.05 6.01 Init + 132206 132264 59 1 2 94 103 39 0.488 6.83 6.02 Intr + 134199 134307 109 0 1 104 109 118 0.999 14.87 6.03 Intr + 146423 146533 111 1 0 83 100 37 0.893 4.06 6.04 Intr + 152923 153080 158 0 2 49 31 191 0.718 7.39 6.05 Intr + 157708 157826 119 1 2 107 77 -5 0.758 -0.51 6.06 Intr + 161217 161281 65 1 2 55 121 81 0.887 5.62 6.07 Intr + 164625 164729 105 2 0 42 95 180 0.678 13.59 6.08 Term + 166520 166531 12 1 0 107 42 1 0.199 -5.37 6.09 PlyA + 166691 166696 6 1.05 7.04 PlyA - 167549 167544 6 1.05 7.03 Term - 169654 169320 335 2 2 91 41 282 0.975 17.59 7.02 Intr - 171983 171855 129 0 0 34 86 78 0.304 1.95 7.01 Init - 172512 172377 136 0 1 53 38 121 0.549 3.95 7.00 Prom - 173348 173309 40 -8.15 8.04 PlyA - 173471 173466 6 -0.45 8.03 Term - 173770 173514 257 2 2 37 44 253 0.871 10.56 8.02 Intr - 185691 185594 98 2 2 41 54 73 0.045 -2.07 8.01 Init - 194621 194488 134 2 2 66 42 168 0.385 9.66 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 33564 33488 77 0 2 52 80 74 0.865 3.61 S.002 Init - 175331 175328 4 2 1 74 106 0 0.869 0.61 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:96784311_96985093|GENSCAN_predicted_peptide_1|71_aa MKVMEFREGSLAASSGGCHGQCHVLNMVKGGRNPSSIRKNSKGHCGIFEVFTSGDTTNGE IVDAEPVDMEG >gi568815575f:96784311_96985093|GENSCAN_predicted_CDS_1|216_bp atgaaggtgatggaatttcgtgagggctctcttgctgcatcctctggagggtgccacgga caatgccatgtcctcaacatggtgaaaggtggaaggaatccttctagtattaggaagaat tcaaaaggacactgtgggatatttgaagttttcacatctggagacacaaccaacggagaa atagtggatgcagaacctgtggatatggagggctga >gi568815575f:96784311_96985093|GENSCAN_predicted_peptide_2|186_aa MGYFNTPLSTLDRSMRQQVNKDIQHLNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS KIDHIVGSKALLSKCKRTEITTNCLSDHSAIKLGHRIKKLTQNHTTTWKLNNLLLNDYWV HNKMKAEKKMFFETNKNKDTVYQNLYDTFNAVCRGKFIALNAHKRKQERSKIDTLISQLK ELEKRE >gi568815575f:96784311_96985093|GENSCAN_predicted_CDS_2|561_bp atgggatattttaacacccccctgtcaacattagacagatcaatgagacagcaggttaac aaggatatccagcacttaaactcagctctgcaccaagcagacctaatagacatctacaga actctccaccccaaatcaacagaatacacattcttctcagcaccacatcacacttattcc aaaattgaccacatagttggaagtaaagcactcctcagcaaatgtaaaagaacagaaatc acaacaaactgtctctccgaccacagtgcaatcaaattaggacacaggattaagaaactc actcaaaaccacacaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacaaaatgaaggcagagaaaaagatgttctttgaaaccaataagaacaaagacaca gtgtaccagaatctctatgacacatttaatgcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatctaaaattgacaccctaatatcacaattaaaa gaactagagaagcgagagtaa >gi568815575f:96784311_96985093|GENSCAN_predicted_peptide_3|343_aa MNKQHRKRLLQERKKCKRAKLIAEHGPWNEELSVMSLDSGTPTCNTKSRREDVTNYQTCF CFTLHICTKAFHFQTKLPDEGSGSNIGCSAVSAGDTQANRVCSGPPANTNRPASEGPDWE ERVSVIEDQINEIKREEKFREKRVKRNEQSLQEIWEYVKRPNLRLFGVPESDRENGTKLE NTLKDIIQENFPKLASQANIQIQETQRTPQRYSLRRATPRHIIVRFTKVEMKEKRLRAAR EKGWVTHKGKPIRLTVDLSAEILQDRRERGPIFKSLKEKNFQPRISYPAELSFISEGQIK SFTDKQMLRDFVTTRPALQELLKEALSMERNNQYQPLQKHAKL >gi568815575f:96784311_96985093|GENSCAN_predicted_CDS_3|1032_bp atgaacaaacaacacaggaaaaggctacttcaagagagaaagaaatgtaagagggccaag cttattgcagaacatggaccctggaatgaagagcttagtgtcatgagcttggacagtggg actcctacttgtaatacgaagagcagacgggaagatgtaaccaactaccagacttgtttt tgttttactcttcacatctgtaccaaagcatttcatttccagacaaagcttccagacgaa ggatcaggcagcaatattggctgttctgcagtctccgctggtgatacccaggcaaacagg gtgtgcagtggacctccagcaaacaccaacagacctgcatctgagggacctgactgggaa gaaagggtatcagtgattgaagatcaaatcaatgaaataaagcgagaggagaagtttaga gaaaaaagagtaaaaagaaatgaacaaagcctccaagaaatatgggaatatgtgaaaaga ccaaatctacgtttgtttggggtacctgaaagtgacagggagaatggaaccaagttggaa aacactctgaaggatattatccaggagaacttccccaaactagcaagtcaggccaacatt caaattcaggaaacacagagaacaccacaaagatactccttgagaagagcaaccccaaga cacataattgtcagattcaccaaggttgaaatgaaggaaaaaaggctaagggcagccaga gagaaaggttgggttacccacaaagggaagcccatcagactgacagtggatctttcagca gaaatcctacaagacagaagagagagggggccaatattcaaaagtcttaaagaaaagaat tttcaacccagaatttcatatccagccgaactaagcttcataagtgaaggacaaataaaa tcctttacagacaagcaaatgctgagggattttgtcaccaccaggcctgccttacaagag ctcctgaaggaagcactaagcatggaaaggaacaaccagtaccagccactgcaaaaacat gccaaattgtaa >gi568815575f:96784311_96985093|GENSCAN_predicted_peptide_4|384_aa MAVDEMKAQTDVVKWHFIWPIDNIQGVCSGHENAAACSFRQTFGRIVVPGDQSGLEQKFL PSFGIHHAGCYIQFLEGKGGLKNSKHECTLSSQEYVHELRSGISDEKLLNCLESLRVSLT SNPMSKSGFGSYGSISAADGASGGSDQLCERDATPAIKTQRPKVRIQDVVPCNVNQLLSS TVFDPVFKVRGIIVSQVSIVGVIRGAEKASNHICYKIDDMTAKPIEARQWFGREKVKQVT PLSVGVYVKVFGILKCPTGTKSLEVLKIHVLEDMNEFTVHILETVNAHMMLDKARRDTTV ESVPVSPSEVNDAGDNDESHRNFIQDEVLRLIHECPHQEGKSIHELRAQLCDLSVKAIKE AIDYLTVEGHIYPTVDREHFKSAD >gi568815575f:96784311_96985093|GENSCAN_predicted_CDS_4|1155_bp atggctgtagatgaaatgaaggctcaaacagatgtggtgaaatggcattttatctggcct attgacaatatccaaggagtctgctctggtcatgagaatgcagctgcatgttctttcagg cagacctttggcagaattgtggttccaggagatcagtctggtttggagcaaaaattcttg ccaagctttgggattcaccatgcaggatgctatattcaattcctagagggaaagggtggg ctgaaaaacagcaaacatgaatgcaccctgtcttcacaagaatatgttcatgaattacga tcgggtatatcagatgagaaacttcttaattgcctagaatccctcagggtttctttaacc agcaatccgatgagtaagagtgggtttgggagctatggcagcatttctgctgctgatgga gcgagtggaggcagtgaccaactgtgtgagagagatgcaactcctgctattaagacccaa agacctaaggtccgaattcaggacgttgtaccgtgtaatgtgaaccagcttctcagctct actgtgtttgaccctgtgttcaaggttaggggaattatagtttcccaggtctccatcgtg ggggtaatcagaggggcagagaaggcttcaaatcacatttgttacaaaattgatgatatg accgcgaaaccaatcgaggcccgacagtggtttggtagagagaaagtcaagcaggtgact ccattgtcagtcggagtatatgtcaaagtgtttggtatcctcaaatgtcccacgggaaca aagagccttgaggtattgaaaattcatgtcctagaggacatgaacgagttcaccgtgcat attctggaaacggtcaatgcacacatgatgctggataaagcccgtcgtgataccactgta gaaagtgtgcctgtgtctccatcagaagtgaatgatgctggggataacgatgagagtcac cgcaatttcatccaggacgaagtgctgcgtttgattcatgagtgtcctcatcaggaaggg aagagcatccatgagctccgggctcagctctgcgaccttagcgtcaaggccatcaaggaa gcgattgattatctgaccgttgagggccacatctatcccactgtggatcgggagcatttt aagtctgctgattga >gi568815575f:96784311_96985093|GENSCAN_predicted_peptide_5|461_aa MIAYQENPKDSSKRLLDLINEFSKVSGYKINVHKSVAPSRQSNEELDPLYNSCKTKTNKQ TKKHSGIYLTKEAKYLYKENYETLLKEITDNTNTWNHIPCSWKVLEVLARAIRQEKEIKG IQLGKEEIKLSLFADDMIVHLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTKNR RTESQIMSELPFTIASKRRKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWV GRISIVKMAILPKVIYRFNAIPIKLPMTFFTGLEKTTLKFIWNQKRAHIAKSILSQKNKA GGITLPDFKLYYKPTVTKRAWYWYQNRDIDQRNRTEPSEIMPHIYNYLIFDKPEKNKQWG KDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQD IGMGKDFVSKTPKAMATKDKIDKWDLIKLKSFCTAKELPSE >gi568815575f:96784311_96985093|GENSCAN_predicted_CDS_5|1386_bp atgattgcataccaagaaaaccctaaagactcatccaaaaggctcctagatctgataaat gaattcagtaaagtctcaggttacaaaatcaatgtacacaaatcagtggcaccaagcaga cagtcaaatgaagaactcgaccctctttacaacagctgcaaaacgaaaacaaataaacaa acaaaaaagcactcaggaatatacttaaccaaggaggcaaaatatctctataaggaaaac tacgaaacactgctgaaagaaatcacagataacacaaacacatggaaccacatcccatgc tcatggaaggtgttggaagttctggccagggcaattaggcaggagaaggaaataaagggt attcaattaggaaaagaggaaatcaaattgtccctgtttgcagacgacatgattgtacat ctagaaaaccccattgtctcagcccaaaatctccttaagctgataagcaacttcagcaaa gtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaaaaacaga cgaacagagagccaaatcatgagtgaactcccattcacaattgcctcaaagagaagaaaa tacctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaacca ctgctcaatgaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggta ggaagaatcagtatcgtgaaaatggccatactgcccaaggtaatttacagattcaatgcc atccccatcaagctaccaatgactttcttcacaggattggaaaaaactactttaaagttc atatggaaccaaaaaagagcccacatcgccaagtcaatcctaagccaaaagaacaaagct ggaggcatcacgctacctgacttcaaactatactacaagcctacagtaaccaaaagagca tggtactggtaccaaaacagagatatagatcaacggaacagaacagagccctcggaaata atgccgcatatctacaactatctgatctttgacaaacctgagaaaaacaagcaatgggga aaggattccctatttaataaatggtgctgggaaaactggttagccatatgtagaaagctg aaactggatcccttccttacaccttatacaaaaattaattcaagatggattaaagactta aacgttagacctaaaaccataaaaaccctagaagaaaacctaggcattaccattcaggac ataggcatgggcaaggacttcgtgtctaaaacaccaaaagcaatggcaacaaaagacaaa attgacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaactaccatca gagtga >gi568815575f:96784311_96985093|GENSCAN_predicted_peptide_6|245_aa MMTEIVKILSAICIVGEENILDKLLGAITTAAERNNRERFSPIVEGLENQEALQLQVACM QFINALVTSPYELDFRIHLRNEFLRSGLKTMLPDLKEKENDELDIQLKVFDENKEDDLTE LSHRLNDIRAEMEYPLTNHKTILFALPQYYKIIEECVSQIVLHCSGMDPDFKYRQRLDID LTHLIDSCVNKAKVEESEQKAAEFSKKFDEEFTARQEAQAELQKRDEKIKELEAEIQQLR TQPRL >gi568815575f:96784311_96985093|GENSCAN_predicted_CDS_6|738_bp atgatgactgaaatagtaaaaatactttctgctatttgcattgttggagaagagaacatt ctagataaacttttaggggctataacaacagcagcagaaagaaataacagggaacgattt tcaccaattgtggaaggtttagaaaatcaggaagccttgcaattacaggtggcctgcatg cagtttataaatgcccttgtcacttctccttatgagcttgattttcgaatacatttaagg aatgaattcctccgttcaggactaaaaacaatgttaccagatctaaaagaaaaagagaat gatgagcttgatattcagttgaaagtatttgatgaaaacaaagaagatgacctaactgaa ttatcacaccgtctcaatgacattcgagcagaaatggaatatcctttgacaaaccacaaa acaattttgtttgcattgccacaatattataaaataattgaggaatgtgtttcacagata gtgctacactgcagtggtatggatccagacttcaaatacaggcaaagattagacatcgat ttaactcatctgatagattcttgtgtgaacaaggcgaaagttgaagaaagtgaacaaaaa gctgcagagttttcaaagaagttcgatgaagaattcacagctcgacaggaagctcaagca gagcttcaaaaaagagatgagaaaatcaaagaacttgaagcagaaatccagcaacttcga acccagcctaggctctga >gi568815575f:96784311_96985093|GENSCAN_predicted_peptide_7|199_aa MWESLELPRDLLNGFNQNADNDINNKVQDEVVSDGDVKLVGNWNKVSLVPCILATSAMDK RSHGTFQDIALEGASPKLWQLPSGVEPAGIPVAIGETLVNVFIGETELVGIPEGYIPEQW AYYKHPILRWIVHTFYDDPENNYERTMAILHTETEKAELQIKELEVRRLMWKRRDGPWYQ YVTIDRAIMDHSPKATPSK >gi568815575f:96784311_96985093|GENSCAN_predicted_CDS_7|600_bp atgtgggaaagtttggaacttcctagagacttgttgaatggctttaaccaaaatgctgat aatgatataaacaataaagtccaggatgaggtggtctcagatggagatgtgaaacttgtt gggaactggaataaagtctctttggtgccctgcatcctagccacttcagccatggataaa aggagtcatggtacatttcaagacatagccttagagggtgcaagccccaagctttggcaa cttccaagcggtgttgagcctgcagggattccagtggcaattggagaaactctagtgaat gtattcattggtgaaactgaactagtaggaattccagaaggctatatcccagaacaatgg gcatattataagcatcctatattgagatggattgtccatactttctatgatgatcctgaa aataattacgaaagaacaatggctatccttcacactgaaactgaaaaggctgaactgcag ataaaggagctggaagtacgaagattaatgtggaagagaagagatggaccctggtatcaa tatgtgaccattgatagggcaattatggatcattctccaaaagcaactcccagtaaataa >gi568815575f:96784311_96985093|GENSCAN_predicted_peptide_8|162_aa MKALSPITLEELNPANHMNELRNKSFTSQAFTQDHNPDNIFYAALIKNKNHMIISIDMEK AINKIQHPFTIKIHKKLGRLSDIPGGGGTPPRGGGGGGPPNKSGGGGGGGIPIIPGKGGG GGGRGAPPGRGGAGGGGGGPTPGNGGAAGGPGIPDELESTCA >gi568815575f:96784311_96985093|GENSCAN_predicted_CDS_8|489_bp atgaaggccctcagtccaatcactctggaggaactgaatcctgccaatcatatgaatgag cttaggaacaaatcattcaccagtcaagccttcacacaggaccacaaccctgacaacatc ttctatgcagccttaattaaaaacaaaaaccatatgatcatctccatagacatggaaaaa gccatcaataagatccagcatcccttcacgataaaaatccacaagaaactaggtagatta agtgatattcctgggggaggaggaactcctccaaggggtggtggtggaggaggtccccca aataaaagtggtggtgggggtggtggtggtatccccatcattccaggtaaaggaggtggt ggaggaggaagaggagctcctccgggtagaggtggcgcgggtggtggtggaggcggccct acacctggcaatggaggtgctgcaggaggacctggaattcctgatgaacttgagagtact tgtgcctga