GENSCAN 1.0 Date run: 3-Nov-116 Time: 12:11:10 Sequence gi568815586f:112346274_112604761 : 258488 bp : 44.27% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 6807 6935 129 2 0 53 48 143 0.875 4.98 1.02 PlyA + 8602 8607 6 1.05 2.03 PlyA - 8658 8653 6 1.05 2.02 Term - 15456 15301 156 0 0 27 49 88 0.019 -3.27 2.01 Init - 35855 35562 294 2 0 100 89 623 0.965 58.79 2.00 Prom - 47300 47261 40 -5.56 3.09 PlyA - 51912 51907 6 1.05 3.08 Term - 59103 58951 153 0 0 88 36 174 0.999 10.12 3.07 Intr - 59764 59580 185 2 2 104 107 296 0.999 32.61 3.06 Intr - 60069 60021 49 0 1 85 99 23 0.999 1.35 3.05 Intr - 60617 60474 144 2 0 56 100 132 0.998 11.68 3.04 Intr - 62065 61967 99 1 0 97 95 161 0.567 18.01 3.03 Intr - 62383 62147 237 1 0 26 116 294 0.993 23.81 3.02 Intr - 63149 63046 104 0 2 56 68 84 0.991 3.09 3.01 Init - 63407 63314 94 2 1 82 116 31 0.884 6.16 3.00 Prom - 67465 67426 40 -5.46 4.03 PlyA - 67761 67756 6 1.05 4.02 Term - 72792 72449 344 1 2 65 49 182 0.658 6.67 4.01 Init - 76532 76520 13 2 1 84 98 8 0.441 1.66 4.00 Prom - 86765 86726 40 -3.36 5.00 Prom + 87316 87355 40 -5.86 5.01 Init + 89166 89200 35 2 2 66 76 6 0.172 -3.46 5.02 Intr + 100003 100125 123 1 0 83 97 50 0.893 5.10 5.03 Intr + 104045 104239 195 2 0 43 86 213 0.963 15.13 5.04 Intr + 106922 107114 193 2 1 49 99 172 0.999 13.89 5.05 Intr + 108291 108407 117 2 0 49 99 129 0.998 10.76 5.06 Intr + 109677 109790 114 2 0 7 107 62 0.591 0.64 5.07 Intr + 126671 126767 97 1 1 142 95 41 0.997 9.78 5.08 Intr + 131378 131457 80 0 2 123 103 74 0.999 11.67 5.09 Intr + 131584 131742 159 0 0 36 94 152 0.993 10.78 5.10 Intr + 135801 135932 132 2 0 101 108 160 0.981 20.14 5.11 Intr + 140202 140356 155 2 2 59 61 203 0.999 13.57 5.12 Intr + 142170 142237 68 0 2 118 98 95 0.966 12.05 5.13 Intr + 142751 142902 152 0 2 93 81 178 0.961 17.48 5.14 Intr + 155029 155061 33 0 0 130 98 -15 0.461 2.12 5.15 Intr + 155871 155983 113 2 2 67 89 9 0.384 -1.92 5.16 Term + 158422 158491 70 1 1 127 54 55 0.480 3.41 5.17 PlyA + 160967 160972 6 1.05 6.10 PlyA - 162664 162659 6 1.05 6.09 Term - 173747 173666 82 1 1 86 55 135 0.737 7.17 6.08 Intr - 187485 187323 163 1 1 90 77 46 0.085 2.73 6.07 Intr - 194140 194055 86 0 2 53 96 13 0.133 -1.94 6.06 Intr - 195213 195131 83 0 2 97 89 62 0.645 5.64 6.05 Intr - 196477 196405 73 0 1 77 84 27 0.400 0.61 6.04 Intr - 202768 202711 58 2 1 97 101 39 0.626 4.04 6.03 Intr - 217755 217560 196 0 1 -22 101 180 0.067 7.29 6.02 Intr - 237207 237019 189 0 0 90 53 47 0.056 1.18 6.01 Intr - 246573 246411 163 2 1 84 23 117 0.115 4.78 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 160727 160660 68 0 2 10 43 225 0.864 8.30 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:112346274_112604761|GENSCAN_predicted_peptide_1|42_aa TKTEHQLYGNQQKNFIRTKELTDSHLTGSHHYQVKTMSDDVS >gi568815586f:112346274_112604761|GENSCAN_predicted_CDS_1|129_bp accaaaacagaacatcaactctatggaaaccagcagaagaatttcataagaaccaaggag ctgactgacagccacctgactggcagccaccattaccaggttaaaactatgtctgatgat gtttcttga >gi568815586f:112346274_112604761|GENSCAN_predicted_peptide_2|149_aa MGSSAAAAAAAAAAADSAQWLSVKEETIFLHDGLIRVTDLAELPSEILGAPEAADTDLEV SEAAPGPRHRTDPLARHPRVGPPPRLSRGLASRRLPGQGPSGDPYEHRVSLVVKTVGFGI DTSRQLLVQFYHLSEFQFLGYKMPPFKGK >gi568815586f:112346274_112604761|GENSCAN_predicted_CDS_2|450_bp atgggctcgtcggcggccgcggcggcggcggcggcggcggccgctgactcggcgcagtgg ctctcggtgaaggaagagaccatcttcctgcacgacgggctgatccgggtcaccgacctg gccgagctgcccagcgagatcctcggggccccagaggccgcggacaccgacctggaggtg agcgaggcggcccccggcccccgccatcggactgacccactcgcccggcacccccgggtc gggccccctcctcgcctcagccgcggcctcgcctcccggcggctgccggggcagggtcct tcaggggatccttatgaacacagagtgagcctagtggttaagactgtaggctttggcatt gacacatctcgccagctgttagttcagttctaccacctatcagaatttcaatttcttggt tataaaatgccccccttcaagggtaaataa >gi568815586f:112346274_112604761|GENSCAN_predicted_peptide_3|354_aa MPQIRKEVRLALPMIAYRPEAGTLILFPILQGHKEGEVECRSGSLRFFATLLGWGNEALS GTRSRLMAGEKVEKPDTKEKKPEAKKVDAGGKVKKGNLKAKKPKKGKPHCSRNPVLVRGI GRYSRSAMYSRKAMYKRKYSAAKSKVEKKKKEKVLATVTKPVGGDKNGGTRVVKLRKMPR YYPTEDVPRKLLSHGKKPFSQHVRKLRASITPGTILIILTGRHRGKRVVFLKQLASGLLL VTGPLVLNRVPLRRTHQKFVIATSTKIDISNVKIPKHLTDAYFKKKKLRKPRHQEGEIFD TEKEKYEITEQRKIDQKAVDSQILPKIKAIPQLQGYLRSVFALTNGIYPHKLVF >gi568815586f:112346274_112604761|GENSCAN_predicted_CDS_3|1065_bp atgccccagatccggaaggaagtgagactcgcacttcccatgattgcttatagaccggaa gccgggaccttaattctctttcccatcttgcaagggcacaaagagggtgaggtagaatgc cgtagtggctccctccggttcttcgccactctcttgggctgggggaacgaggccctttcg gggacccgtagcagactgatggcgggtgaaaaagttgagaagccagatactaaagagaag aaacccgaagccaagaaggttgatgctggtggcaaggtgaaaaagggtaacctcaaagct aaaaagcccaagaaggggaagccccattgcagccgcaaccctgtccttgtcagaggaatt ggcaggtattcccgatctgccatgtattccagaaaggccatgtacaagaggaagtactca gccgctaaatccaaggttgaaaagaaaaagaaggagaaggttctcgcaactgttacaaaa ccagttggtggtgacaagaacggcggtacccgggtggttaaacttcgcaaaatgcctaga tattatcctactgaagatgtgcctcgaaagctgttgagccacggcaaaaaacccttcagt cagcacgtgagaaaactgcgagccagcattacccccgggaccattctgatcatcctcact ggacgccacaggggcaagagggtggttttcctgaagcagctggctagtggcttattactt gtgactggacctctggtcctcaatcgagttcctctacgaagaacacaccagaaatttgtc attgccacttcaaccaaaatcgatatcagcaatgtaaaaatcccaaaacatcttactgat gcttacttcaagaagaagaagctgcggaagcccagacaccaggaaggtgagatcttcgac acagaaaaagagaaatatgagattacggagcagcgcaagattgatcagaaagctgtggac tcacaaattttaccaaaaatcaaagctattcctcagctccagggctacctgcgatctgtg tttgctctgacgaatggaatttatcctcacaaattggtgttctaa >gi568815586f:112346274_112604761|GENSCAN_predicted_peptide_4|118_aa MDTRGSAGLGHIGLARGRPARDRTRGRASRPRTDPPPGLGIPETVQLPPGPARSSSAPAL RRRRPRDVTARDARRHFLLPVSGGPSDGREWQRGGSAEAPRTVSKASLLPPGNRGRQV >gi568815586f:112346274_112604761|GENSCAN_predicted_CDS_4|357_bp atggacacgaggggctccgctgggctcggtcacatcgggctggcccgagggaggcccgct cgggaccggacgcggggcagagccagccggccgcgcacagacccccctccaggcctgggg atcccggagactgtgcagctgccccccggcccggctcgctcctcctccgcccccgccctt cgccgccgtcgcccccgtgatgtcaccgctcgcgacgcccgccgccacttcctgcttccc gtcagcggaggaccgagcgacggccgggaatggcagcgcgggggctccgcggaggcgcca cgcacagtgtccaaagcatccttgcttcctcctggaaaccgcggccgccaggtgtga >gi568815586f:112346274_112604761|GENSCAN_predicted_peptide_5|611_aa MTVVTVREQISRWFHPNITGVEAENLLLTRGVDGSFLARPSKSNPGDFTLSVRRNGAVTH IKIQNTGDYYDLYGGEKFATLAELVQYYMEHHGQLKEKNGDVIELKYPLNCADPTSERWF HGHLSGKEAEKLLTEKGKHGSFLVRESQSHPGDFVLSVRTGDDKGESNDGKSKVTHVMIR CQELKYDVGGGERFDSLTDLVEHYKKNPMVETLGTVLQLKQPLNTTRINAAEIESRVREL SKLAETTDKVKQGFWEEFETLQQQECKLLYSRKEGQRQENKNKNRYKNILPFDHTRVVLH DGDPNEPVSDYINANIIMPEFETKCNNSKPKKSYIATQGCLQNTVNDFWRMVFQENSRVI VMTTKEVERGKSKCVKYWPDEYALKEYGVMRVRNVKESAAHDYTLRELKLSKVGQGNTER TVWQYHFRTWPDHGVPSDPGGVLDFLEEVHHKQESIMDAGPVVVHCSAGIGRTGTFIVID ILIDIIREKGVDCDIDVPKTIQMVRSQRSGMVQTEAQYRFIYMAVQHYIETLQRRIEEEQ GIIRTKNKVLKKSKRKGHEYTNIKYSLADQTSGDQSPLPPCTPTPPCAEMREDSARVYEN VGLMQQQKSFR >gi568815586f:112346274_112604761|GENSCAN_predicted_CDS_5|1836_bp atgacagtggtgacagtcagggaacagataagcagatggtttcacccaaatatcactggt gtggaggcagaaaacctactgttgacaagaggagttgatggcagttttttggcaaggcct agtaaaagtaaccctggagacttcacactttccgttagaagaaatggagctgtcacccac atcaagattcagaacactggtgattactatgacctgtatggaggggagaaatttgccact ttggctgagttggtccagtattacatggaacatcacgggcaattaaaagagaagaatgga gatgtcattgagcttaaatatcctctgaactgtgcagatcctacctctgaaaggtggttt catggacatctctctgggaaagaagcagagaaattattaactgaaaaaggaaaacatggt agttttcttgtacgagagagccagagccaccctggagattttgttctttctgtgcgcact ggtgatgacaaaggggagagcaatgacggcaagtctaaagtgacccatgttatgattcgc tgtcaggaactgaaatacgacgttggtggaggagaacggtttgattctttgacagatctt gtggaacattataagaagaatcctatggtggaaacattgggtacagtactacaactcaag cagccccttaacacgactcgtataaatgctgctgaaatagaaagcagagttcgagaacta agcaaattagctgagaccacagataaagtcaaacaaggcttttgggaagaatttgagaca ctacaacaacaggagtgcaaacttctctacagccgaaaagagggtcaaaggcaagaaaac aaaaacaaaaatagatataaaaacatcctgccctttgatcataccagggttgtcctacac gatggtgatcccaatgagcctgtttcagattacatcaatgcaaatatcatcatgcctgaa tttgaaaccaagtgcaacaattcaaagcccaaaaagagttacattgccacacaaggctgc ctgcaaaacacggtgaatgacttttggcggatggtgttccaagaaaactcccgagtgatt gtcatgacaacgaaagaagtggagagaggaaagagtaaatgtgtcaaatactggcctgat gagtatgctctaaaagaatatggcgtcatgcgtgttaggaacgtcaaagaaagcgccgct catgactatacgctaagagaacttaaactttcaaaggttggacaagggaatacggagaga acggtctggcaataccactttcggacctggccggaccacggcgtgcccagcgaccctggg ggcgtgctggacttcctggaggaggtgcaccataagcaggagagcatcatggatgcaggg ccggtcgtggtgcactgcagtgctggaattggccggacagggacgttcattgtgattgat attcttattgacatcatcagagagaaaggtgttgactgcgatattgacgttcccaaaacc atccagatggtgcggtctcagaggtcagggatggtccagacagaagcacagtaccgattt atctatatggcggtccagcattatattgaaacactacagcgcaggattgaagaagagcag ggtatcatcagaaccaaaaataaagttttaaagaaaagcaagaggaaagggcacgaatat acaaatattaagtattctctagcggaccagacgagtggagatcagagccctctcccgcct tgtactccaacgccaccctgtgcagaaatgagagaagacagtgctagagtctatgaaaac gtgggcctgatgcaacagcagaaaagtttcagatga >gi568815586f:112346274_112604761|GENSCAN_predicted_peptide_6|364_aa XPLTGISRVAPLNYKGLGMEKCRDHLRNIIVSATEETHTQVSTQTGYTVGHQYLQMFLGE QIWGQGEEYSASEVKLLDKEKLGANPIPYLTLRKELNFSVPQFPRLQNVENSDFRGAVKR ATKRAFQKDRRIQRKALRPQNLNVSEEVKAQCNWSVEGGSSQEPGHTRLVDHDKKVGYHL ECNDGTIYLQKNKLRAPTDSTLCCYQKAVPIQAPREGSWISRKKEFGVINFNKVEIVSIL FTAEVAAPSTVPGTCCGTWRSERVLMCQRPHRQAVQGWDWCQPEWEVPVRCKKPINYAGW RSAPTPTLTPQTVHFNHPPNDEVRASRLKDIRPQPRARLLCGQSLAYEHRPLLQGAQSHR PPKG >gi568815586f:112346274_112604761|GENSCAN_predicted_CDS_6|1095_bp nncccattgacaggaataagtcgtgtggccccgcttaactacaagggtctgggaatggag aagtgccgggaccacctgaggaacattattgtctctgccacagaggagacacacactcaa gtcagcacacaaacaggatatacagtcggccatcagtatctgcagatgtttctaggagag caaatatgggggcaaggagaggaatacagtgcttcagaggtcaaactcttggataaagag aaacttggtgcaaatcctattccctatctgaccttgagaaaagaactcaacttttctgtg cctcagtttcctcgtctgcaaaatgttgaaaatagtgacttcagaggggctgttaaaaga gcaacaaagagagccttccagaaagacagacgaatccagagaaaggccctgaggccacag aatctcaacgtatctgaggaggtgaaagcccagtgcaactggagtgttgaaggaggcagc agccaggaaccaggtcatacccgactcgtagaccatgataagaaggttggatatcatctc gaatgcaatgatgggaccatctacttgcagaaaaacaagctcagggctcccactgattct acattatgctgttatcagaaagcggtcccaatccaggccccaagagagggttcttggatc tcacgcaagaaagaattcggggttataaacttcaacaaggtagagatcgtgtccatcttg ttcacagccgaagttgcagcacctagcacagtgcctggcacatgttgtggaacttggcgc tcagagagagtcctgatgtgccagagaccacacagacaagcagtccagggttgggactgg tgccagccagaatgggaggtgcctgtccgatgcaaaaagcccattaattacgctggctgg aggtcagctcccacacccacactcactccacagacagtacattttaaccacccacccaat gatgaggtgcgggcatcacgactaaaggacatcagaccccagcctcgagcaaggctcctg tgcggccagagcctcgcctacgagcaccgccctctgctccagggtgcccagtcccatcga ccacccaagggctga