GENSCAN 1.0	Date run: 6-Nov-116	Time: 00:27:22

Sequence gi568815586r:7832817_8033904 : 201088 bp : 45.28% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   9760   9799   40                              -2.96
 1.01 Init +  29636  29797  162  1  0   97   90   165 0.827  15.54
 1.02 Intr +  32070  32151   82  2  1   77   76    10 0.495  -2.09
 1.03 Intr +  38194  38508  315  2  0  -22   17   449 0.254  22.84
 1.04 Intr +  39937  40181  245  2  2   61   68   118 0.454   4.32
 1.05 Term +  55196  55306  111  1  0   84   33    47 0.004  -2.64
 1.06 PlyA +  55494  55499    6                               1.05

 2.00 Prom +  56136  56175   40                              -3.36
 2.01 Init +  57933  58034  102  2  0   70  109    85 0.990   9.04
 2.02 Intr +  62699  62923  225  1  0   70   96   110 0.909   8.18
 2.03 Intr +  65449  65535   87  0  0  121   92    80 0.989  11.87
 2.04 Term +  65719  66087  369  0  0   75   49   211 0.935  10.55
 2.05 PlyA +  66715  66720    6                               1.05

 3.10 PlyA -  66782  66777    6                               1.05
 3.09 Term -  88815  88597  219  0  0   98   41   272 0.999  20.54
 3.08 Intr -  90208  90005  204  1  0  137   87   135 0.999  17.70
 3.07 Intr -  93132  93028  105  0  0  120   84   128 0.999  16.01
 3.06 Intr -  97055  96868  188  0  2   78   68   165 0.893  12.91
 3.05 Intr -  97826  97664  163  2  1   63   70    14 0.863  -3.35
 3.04 Intr -  98669  98429  241  1  1   84   78   208 0.997  16.95
 3.03 Intr - 100331 100171  161  2  2  106   72   162 0.998  15.19
 3.02 Intr - 101086 100994   93  1  0  135  113   131 0.999  20.46
 3.01 Init - 103218 103204   15  0  0   89  116     1 0.971   3.44
 3.00 Prom - 103883 103844   40                              -3.36

 4.00 Prom + 104987 105026   40                              -1.46
 4.01 Init + 117595 117644   50  0  2  100   45    90 0.732   4.32
 4.02 Intr + 124174 124256   83  1  2   91   90    17 0.753   1.48
 4.03 Term + 148636 148721   86  2  2   80   41    89 0.190   1.22
 4.04 PlyA + 149354 149359    6                               1.05

 5.00 Prom + 168297 168336   40                              -4.56
 5.01 Init + 168720 168812   93  2  0   62   84    82 0.496   5.58
 5.02 Intr + 177280 177395  116  0  2   43   60   161 0.568   8.05
 5.03 Intr + 182192 182423  232  2  1   62   31    74 0.186  -3.12
 5.04 Intr + 182474 182862  389  1  2  -34   36   455 0.352  21.69
 5.05 Intr + 185963 186062  100  2  1   55   69    97 0.546   4.61
 5.06 Intr + 186554 186615   62  1  2   71   86    20 0.292  -2.37
 5.07 Term + 193544 193679  136  2  1   32   37   139 0.273   0.69
 5.08 PlyA + 194911 194916    6                               1.05

 6.02 PlyA - 195005 195000    6                               1.05
 6.01 Term - 200051 199522  530  0  2   45   47   181 0.611   4.12

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  48397  48771  375  0  0   58   52   210 0.911   8.54

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:7832817_8033904|GENSCAN_predicted_peptide_1|304_aa
MPEPSHPLCGLLCGRASPTNAAPCSTAPSPIDHPRAKECEHMARDWQAAPPAAPISSWPA
LYVTLVSAQIYRKEAFPDHTTGGQHDKLASPRKNPMDEYLQGMMSEAPGPINFTTSLTMF
GEKLNGTDPQDVIQNAFTCFNEESSGFIHEDHLQELLTTMGDSFTDEEVDEMYWEAPIDK
KRQVQLAQVLIPAPPGRRDGAAHPPPRPRDCGWAPRLRSTTHLPGRCAPAGPACSRWDPL
TASKSLGFDFEWSVNVVTKRDSVHREPSHHEVGPSLIQEEGRKSPESGLLPPMLSLWFFL
QKST

>gi568815586r:7832817_8033904|GENSCAN_predicted_CDS_1|915_bp
atgcctgagccttcccaccccctctgtggcctcctgtgcggacgagcctccccgacgaat
gccgccccctgctccacggcgcccagtcccatcgaccacccaagagctaaggaatgcgag
cacatggcgcgggactggcaggcagctccacctgcagcaccgatctcctcatggccggct
ctttatgtcactctggtttctgctcaaatttaccgtaaagaggccttccctgaccacact
acagggggacagcacgacaagctggcttcaccgcggaagaaccccatggatgaatacctg
cagggcatgatgagcgaggccccagggcccatcaacttcaccacttccctcaccatgttt
ggggagaagctgaatggaacagacccccaggatgtgatccaaaacgccttcacctgcttc
aacgaggaatcctcaggtttcattcatgaggaccacctccaggagctgctcaccaccatg
ggtgacagcttcacagatgaagaagtggatgagatgtactgggaggcacccattgataaa
aaaagacaagttcaactagctcaggtcctcattcccgcaccccccggccgcagggacggc
gcggcgcacccacctcccagaccccgcgactgcggttgggccccgcggcttcgctcaacc
acgcacctcccgggccgctgcgcccccgccggccccgcctgcagccgttgggacccacta
actgcctcgaaaagcctaggattcgactttgaatggtccgttaatgtggttacaaaacgt
gactcggttcatcgggagccctcccaccatgaagttggcccctcattgatccaggaagaa
ggcaggaaatctccagaatcagggcttttgccaccaatgttgtctctctggttcttcctc
cagaaatctacctag

>gi568815586r:7832817_8033904|GENSCAN_predicted_peptide_2|260_aa
MGEFSSGFYAKSPFSKEQSFWYLPFGQLRASSASDSHDSSTSPKGKQPTTAEKSATKKED
KVPVKKQKTRTVFSSTQLCVLNDRFQRQKYLSLQQMQELSNILNLSYKQVKTWFQNQRMK
SKRWQKNNWLKNSNGVTQGCLVNPTGNLPMWSNQTWNNSTWSNQTQNIQSWSNHSWNTQT
WCTQSWNNQAWNSPFYNCGEESLQSCMQFQPNSPASDLEAALEAAGEGLNVIQQTTRYFN
TPQTMDLFLNYSMNMQPEDV

>gi568815586r:7832817_8033904|GENSCAN_predicted_CDS_2|783_bp
atgggggaattcagctcaggcttttatgcaaagtcccccttcagcaaagaacaaagcttc
tggtacctgccctttggacagctgcgggcaagctcagcctcggacagccatgattcttcc
accagtcccaaaggcaaacaacccactactgcagagaagagtgccacaaaaaaggaagac
aaggtcccggtcaagaaacagaagaccagaactgtgttctcttccacccagctgtgtgta
ctcaatgatagatttcagagacagaaatacctcagcctccagcagatgcaagaactttcc
aacatcctgaacctcagctacaaacaggtgaagacctggttccagaaccagagaatgaaa
tctaagaggtggcagaaaaacaactggctgaagaatagcaatggtgtgacgcagggatgc
ctggtgaacccgactgggaaccttccaatgtggagcaaccagacctggaacaattcaacc
tggagcaaccagacccagaacatccagtcctggagcaaccactcctggaacactcagacc
tggtgcacccaatcctggaacaatcaggcctggaacagtcccttctataactgtggagag
gaatctctgcagtcctgcatgcagttccagccaaattctcctgccagtgacttggaggct
gccttggaagctgctggggaaggccttaatgtaatacagcagaccactaggtattttaat
actccacaaaccatggatttattcctaaactactccatgaacatgcaacctgaagacgtg
tga

>gi568815586r:7832817_8033904|GENSCAN_predicted_peptide_3|462_aa
MGTQKVTPALIFAITVATIGSFQFGYNTGVINAPEKIIKEFINKTLTDKGNAPPSEVLLT
SLWSLSVAIFSVGGMIGSFSVGLFVNRFGRRNSMLIVNLLAVTGGCFMGLCKVAKSVEML
ILGRLVIGLFCGLCTGFVPMYIGEISPTALRGAFGTLNQLGIVVGILVAQIFGLEFILGS
EELWPLLLGFTILPAILQSAALPFCPESPRFLLINRKEEENAKQILQRLWGTQDVSQDIQ
EMKDESARMSQEKQVTVLELFRVSSYRQPIIISIVLQLSQQLSGINAVFYYSTGIFKDAG
VQEPIYATIGAGVVNTIFTVVSDNYNGMSFVCIGAILVFVAFFEIGPGPIPWFIVAELFS
QGPRPAAMAVAGCSNWTSNFLVGLLFPSAAHYLGAYVFIIFTGFLITFLAFTFFKVPETR
GRTFEDITRAFEGQAHGADRSGKDGVMEMNSIEPAKETTTNV

>gi568815586r:7832817_8033904|GENSCAN_predicted_CDS_3|1389_bp
atggggacacagaaggtcaccccagctctgatatttgccatcacagttgctacaatcggc
tctttccaatttggctacaacactggggtcatcaatgctcctgagaagatcataaaggaa
tttatcaataaaactttgacggacaagggaaatgccccaccctctgaggtgctgctcacg
tctctctggtccttgtctgtggccatattttccgtcgggggtatgatcggctccttttcc
gtcggactcttcgtcaaccgctttggcaggcgcaattcaatgctgattgtcaacctgttg
gctgtcactggtggctgctttatgggactgtgtaaagtagctaagtcggttgaaatgctg
atcctgggtcgcttggttattggcctcttctgcggactctgcacaggttttgtgcccatg
tacattggagagatctcgcctactgccctgcggggtgcctttggcactctcaaccagctg
ggcatcgttgttggaattctggtggcccagatctttggtctggaattcatccttgggtct
gaagagctatggccgctgctactgggttttaccatccttcctgctatcctacaaagtgca
gcccttccattttgccctgaaagtcccagatttttgctcattaacagaaaagaagaggag
aatgctaagcagatcctccagcggttgtggggcacccaggatgtatcccaagacatccag
gagatgaaagatgagagtgcaaggatgtcacaagaaaagcaagtcaccgtgctagagctc
tttagagtgtccagctaccgacagcccatcatcatttccattgtgctccagctctctcag
cagctctctgggatcaatgctgtgttctattactcaacaggaatcttcaaggatgcaggt
gttcaagagcccatctatgccaccatcggcgcgggtgtggttaatactatcttcactgta
gtttctgataactataatgggatgagctttgtctgtattggggctatcttggtctttgta
gccttctttgaaattggaccaggccccattccctggtttattgtggccgaactcttcagc
cagggcccccgcccagctgcgatggcagtggccggctgctccaactggacctccaacttc
ctagtcggattgctcttcccctccgctgctcactatttaggagcctacgtttttattatc
ttcaccggcttcctcattaccttcttggcttttaccttcttcaaagtccctgagacccgt
ggcaggacttttgaggatatcacacgggcctttgaagggcaggcacacggtgcagataga
tctggaaaggacggcgtcatggagatgaacagcatcgagcctgctaaggagaccaccacc
aatgtctaa

>gi568815586r:7832817_8033904|GENSCAN_predicted_peptide_4|72_aa
MTRCILELLAQARSGLQCWEYTDKKDVQAGTVAHTYNPSSLGGREQRQGWKLENLGISKD
YGTDCATDFAKT

>gi568815586r:7832817_8033904|GENSCAN_predicted_CDS_4|219_bp
atgactcgctgcatcctagagctcctggctcaagcgaggtcggggctgcaatgctgggaa
tatacagataaaaaagatgttcaggccggcacagtggctcacacgtataatcctagcagt
ttgggaggccgagaacaaagacaaggttggaagttagaaaacctgggtatcagcaaagac
tatggtacagactgtgctacagacttcgcaaaaacttaa

>gi568815586r:7832817_8033904|GENSCAN_predicted_peptide_5|375_aa
MIGKLVTKKFEEDMQMDLSEWSKTVKIFVYHKLLPPPPAFGDHDPDQSAAINVEARPSTS
KKIVTRSRLSRPAEMLLPLPTVFPQMRLLSRVLAPHLTRAYAKDVKFGADARALMLQGVD
LLADAVAVTMGPKGRTVIIEQSWGSPQIQIRAKLVQDIANNTNEEAGNGTTSATVLARSI
AKEGFKKISKGANPVEIRKGVMLAVDAVIAELRKQSKFVTTPEEIAQVATTSANGDKEIG
NIISNAMKKVGRKGVITVKDGKTLNDELEIIESGWASRWDPLTASKSLGFDFEWSVNVVA
KRDSVHRALPGKAPADSTKVFWRDCSPPTQGFQKASVHADFGGSKQETIDPVQEMTLFEG
NEGVADFPRFPSVPF

>gi568815586r:7832817_8033904|GENSCAN_predicted_CDS_5|1128_bp
atgattggaaaattggtgacaaagaaatttgaggaagacatgcagatggacctctctgag
tggtcaaaaactgtgaagatatttgtataccataaattgctaccaccacccccagccttc
ggcgaccacgaccctgatcagtcagcagccatcaacgttgaggcaagaccctccaccagc
aaaaagattgtgacacgctcaaggctcagccgccccgcagaaatgcttcttccgttaccc
acagtctttccccagatgagactgctgtccagggtactggcccctcatctcactcgggct
tatgccaaagatgtaaaatttggcgcagatgcccgagccttaatgcttcaaggtgtagac
cttttagccgatgctgtagctgttacgatggggccaaagggaagaacagtgattattgag
cagagctggggaagtccccaaatacaaattagagctaaacttgttcaagatattgctaat
aacacaaatgaagaggctggaaatggcaccacctctgctaccgtactggcacgctctatt
gccaaggagggcttcaagaagattagcaaaggtgctaatccagtggaaatcaggaaaggt
gtgatgttggctgttgatgctgtaattgctgaacttagaaagcagtctaaatttgtgacc
acccctgaagaaattgcacaggttgctacaacttctgcaaacggagacaaagaaattggc
aatatcatctccaatgcaatgaaaaaggttggaagaaagggtgtcatcacagtaaaggat
ggaaaaacactgaatgatgaattagaaattattgaaagtggctgggcgagccgctgggac
ccactaactgcctcgaaaagcctaggattcgactttgaatggtccgttaatgtggtcgca
aaacgtgactcggttcatcgggcgctccctggtaaggcccctgcagacagcacgaaggtg
ttttggagagattgttctcccccgacccaaggattccagaaagccagtgtacacgcagac
ttcggaggcagtaaacaggaaaccatcgatcccgtgcaggaaatgaccttgtttgaaggc
aacgaaggtgtcgcggatttccctcgttttccttcagttcctttctaa

>gi568815586r:7832817_8033904|GENSCAN_predicted_peptide_6|176_aa
XPSCRVRSLRRVGLPALPPLGLRPVWAPGEPEGGQTVSGLGAGSGSARLRCRPRLLPGLR
AALRAPPPPPPAGRCRHRLPARVKEEPGDAEPRAAPSREPRYPVCAPPGEPRPLSRRRGI
SPPSPTCGRVAPGVRRLPGMTTHCSPSGESLVRTVYRQFQLTPQLRRFILLNSVYR

>gi568815586r:7832817_8033904|GENSCAN_predicted_CDS_6|531_bp
nggccgtcctgtcgggtccgctcccttcggcgcgtggggctccccgcgctgccaccgctc
gggctccgaccagtctgggctcccggggagccagaaggcgggcagacagtgtcagggctc
ggcgcgggctcgggctcggctcggctccgctgcaggccccggctcctccccggcctccgc
gccgcgctccgcgcccctccccctcccccgcccgccggccgctgccgccaccgcctcccg
gcgcgggttaaggaggagccgggagacgccgagccgcgggcagcgccgagccgcgagcct
cgctacccggtgtgcgctccgcctggagagccacggccgctctcacgccggcgagggatc
tcgccgccctcgcccacctgcggccgcgtggctccgggagttcgccgtctacccgggatg
acaactcactgctccccgtcgggagagtccctagtgcgcaccgtataccgtcaattccag
ctgacaccccaactccggcgtttcattctgttaaactccgtatacagatga