GENSCAN 1.0 Date run: 3-Nov-116 Time: 14:54:31

Sequence gi568815578f:32680324_32907900 : 227577 bp : 48.32% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.09 PlyA -   5611   5606    6                               1.05
 1.08 Term -  10456   9857  600  1  0  -44   41   737 0.004  50.33
 1.07 Intr -  24166  24117   50  2  2  122  108     7 0.870   4.50
 1.06 Intr -  24581  24491   91  2  1  102   78    61 0.974   6.07
 1.05 Intr -  26297  26260   38  0  2  132   70    25 0.681   3.08
 1.04 Intr -  26437  26381   57  2  0  121   69    29 0.581   3.16
 1.03 Intr -  47672  47570  103  2  1  106  101    34 0.935   6.25
 1.02 Intr -  63223  62985  239  2  2   22   80   243 0.355  14.23
 1.01 Init -  70154  70007  148  2  1   97   73   111 0.780  10.75
 1.00 Prom -  76758  76719   40                              -3.66

 2.00 Prom +  77429  77468   40                              -9.85
 2.01 Init +  80964  81040   77  2  2   81   78     3 0.459  -0.83
 2.02 Intr +  82106  82376  271  2  1   57   99   220 0.196  17.54
 2.03 Intr +  99995 100142  148  1  1  123   24   238 0.792  20.71
 2.04 Intr + 101030 101091   62  0  2   96   80    22 0.761   0.65
 2.05 Intr + 104435 104536  102  1  0   52   87   108 0.976   7.47
 2.06 Intr + 106179 106304  126  2  0  122    7    87 0.938   4.88
 2.07 Intr + 106907 107128  222  1  0  104   80   138 0.949  12.92
 2.08 Intr + 108531 108689  159  2  0   84  105   134 0.999  14.88
 2.09 Intr + 111278 111385  108  1  0  100   77   121 0.999  12.68
 2.10 Intr + 112288 112447  160  0  1   89   75   137 0.661  12.06
 2.11 Intr + 115086 115211  126  1  0   45   45   183 0.998  10.35
 2.12 Intr + 115327 115371   45  2  0  122   92    46 0.990   6.78
 2.13 Intr + 116467 116546   80  2  2  102  114   -24 0.973   0.87
 2.14 Intr + 116864 116976  113  1  2   64  105   132 0.880  11.68
 2.15 Intr + 118137 118320  184  0  1   52   66   208 0.998  14.79
 2.16 Intr + 118921 119005   85  0  1  119  109   110 0.957  15.59
 2.17 Intr + 119830 119975  146  2  2  110   54   195 0.965  18.30
 2.18 Intr + 120512 120602   91  1  1  127   89    18 0.671   5.37
 2.19 Intr + 120824 120897   74  0  2  112   98    36 0.926   6.13
 2.20 Intr + 121014 121103   90  2  0   31   78   139 0.456   7.39
 2.21 Intr + 122062 122147   86  0  2  120   92    82 0.965  10.42
 2.22 Intr + 125015 125084   70  2  1   41  116    80 0.974   5.28
 2.23 Intr + 125886 126004  119  2  2   90  105    79 0.975   9.06
 2.24 Term + 127439 127580  142  2  1   89   32   184 0.993  10.30
 2.25 PlyA + 131386 131391    6                               1.05

 3.00 Prom + 131905 131944   40                              -3.26
 3.01 Init + 145151 145162   12  1  0   75  100    20 0.476   1.44
 3.02 Intr + 145602 145725  124  2  1   66  110    93 0.787   9.46
 3.03 Intr + 153394 153539  146  2  2  116  106   106 0.999  15.20
 3.04 Intr + 156311 156518  208  1  1   99   98   131 0.653  13.85
 3.05 Intr + 159412 159556  145  2  1   97   94   134 0.948  14.24
 3.06 Intr + 159646 159700   55  1  1   82   58    67 0.890   2.08
 3.07 Intr + 166295 166447  153  1  0   80   42   142 0.944   9.07
 3.08 Term + 168349 168405   57  0  0  105   42    94 0.942   4.19
 3.09 PlyA + 170058 170063    6                               1.05

 4.00 Prom + 172399 172438   40                              -4.96
 4.01 Init + 178117 178173   57  0  0   98   78     9 0.480   2.21
 4.02 Intr + 183215 183511  297  1  0   53  116    47 0.380   1.07
 4.03 Intr + 187259 187424  166  1  1   61   94   100 0.541   7.43
 4.04 Intr + 195603 195721  119  1  2   51   68   265 0.243  20.98
 4.05 Intr + 198381 198484  104  2  2   29   67   136 0.703   4.57
 4.06 Intr + 205182 205317  136  0  1  113   96   140 0.999  17.87
 4.07 Intr + 208978 209083  106  0  1   76   95    85 0.979   7.79
 4.08 Intr + 211890 211974   85  1  1  101  105   136 0.987  15.48
 4.09 Intr + 212851 212975  125  1  2  123   22   128 0.893   9.93
 4.10 Intr + 216131 216204   74  0  2   85  113    59 0.994   7.23
 4.11 Intr + 218170 218300  131  0  2   77  110   135 0.984  14.09
 4.12 Intr + 226239 226306   68  0  2   81  106     6 0.596   0.25
 4.13 Intr + 226520 226671  152  0  2   76   68   241 0.521  20.78

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  10492   9857  636  1  0   76   41   692 0.939  58.89
S.002 Term + 146672 146727   56  0  2   77   43    64 0.808  -1.48
S.003 Init + 151813 151852   40  0  1   83  108    29 0.917   4.81

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578f:32680324_32907900|GENSCAN_predicted_peptide_1|441_aa
MGDRCCHGNVLRLDCINVNILVMTLWDLCIEGNRVKATRNVSVSFLKTVSQGGRGRSALR
RPWRKCGCGCAERPAAVAAAAEEEAEQGGGGGGTCGAGARAMGRLHCTEDPVPEAVGGDM
QQLNQLGAQVERFLAQLSEFATTNQISLGSLRSIVKSLLLVPNGALKKSLTAKQVQADFI
TLGLSEEKATYFSEKWKQNAPTLARWAIGQTLMINQLIDMEWKFGVTSGSSELEKVGSIF
LQECGEPALPSASEEQVAQDTEEVFRSYVFYHHQQEQEAEGAAAPADPEMVTLPLQPSST
MGQVGRQLAIIGDDINRRYDSEFQTMLQHLQPTAENAYEYFTKIASSLFESGINWGRVVA
LLGFSYRLALHIYQRGLTGFLGQVTRFVVDFMLHHCIARWIAQRGGWVAALNLGNGPILN
VLVVLGVVLLGQFVVRRFFKS

>gi568815578f:32680324_32907900|GENSCAN_predicted_CDS_1|1326_bp
atgggagatcgttgctgccatggaaatgttctgcgtcttgactgtatcaatgtcaatatc
ctggtgatgacactgtgggatctgtgcatcgagggaaaccgggtaaaggccacaaggaat
gtctctgtatcatttcttaaaactgtatcccagggcggccgggggaggagtgcgctgcga
cggccgtggcgaaagtgcggttgtggatgcgcggagaggccggcagcggtggcagcggca
gcggaggaggaagctgagcagggcggcggcggcggtggaacctgcggggctggggcgcgc
gccatgggccgcctgcactgcactgaggacccggtgccggaggccgtgggcggcgacatg
cagcagctgaaccagctgggcgcgcaggtggaaagatttctggctcagctctctgaattt
gccaccaccaatcagatcagtcttggctccctcagaagcatcgtgaaaagcctccttctg
gttccaaatggtgctttgaagaagagtctcacagccaagcaggtccaggcggatttcata
actctgggtcttagtgaggagaaagccacttacttttctgaaaagtggaagcagaatgct
cccacccttgctcgatgggccataggtcagactctgatgattaaccagctcatagatatg
gagtggaaatttggagtgacatctgggagcagcgaattggagaaagtgggaagtatattt
ttacaagagtgcggagagcctgccctgccctctgcttctgaggagcaggtagcccaggac
acagaggaggttttccgcagctacgttttttaccaccatcagcaggaacaggaggctgaa
ggggcggctgcccctgccgacccagagatggtcaccttacctctgcaacctagcagcacc
atggggcaggtgggacggcagctcgccatcattggggacgacatcaaccgacgctatgac
tcagagttccagaccatgttgcagcacctgcagcccacggcagagaatgcctatgagtac
ttcaccaagattgcctccagcctgtttgagagtggcatcaattggggccgtgtggtggct
cttctgggcttcagctaccgtctggccctacacatctaccagcgtggcctgactggcttc
ctgggccaggtgacccgctttgtggtggacttcatgctgcatcactgcattgcccggtgg
attgcacagaggggtggctgggtggcagccctgaacttgggcaatggtcccatcctgaac
gtgctggtggttctgggtgtggttctgttgggccagtttgtggtacgaagattcttcaaa
tcatga

>gi568815578f:32680324_32907900|GENSCAN_predicted_peptide_2|961_aa
MDRGDFRWSHREKPLHKVTFWEKTEGSCSRLRGRSPRGRSERPPTDGTGSLAVGRAGGNA
ARPAALGLSGPSKPSSAIGAGDSRAQRPARPPAGLPPASPDPRLRRPAAPQPALRQESMK
GDTRHLNGEEDAGGREDSILVNGACSDQSSDSPPILEAIRTPEIRGRRSSSRLSKREVSS
LLSYTQDLTGDGDGEDGDGSDTPVMPKLFRETRTRSESPAVRTRNNNSVSSRERHRPSPR
STRGRQGRNHVDESPVEFPATRSLRRRATASAGTPWPSPPSSYLTIDLTDDTEDTHGTPQ
SSSTPYARLAQDSQQGGMESPQVEADSGDGDSSEYQDGKEFGIGDLVWGKIKGFSWWPAM
VVSWKATSKRQAMSGMRWVQWFGDGKFSEVSADKLVALGLFSQHFNLATFNKLVSYRKAM
YHALETLPLQKARVRAGKTFPSSPGDSLEDQLKPMLEWAHGGFKPTGIEGLKPNNTQPEN
KTRRRTADDSATSDYCPAPKRLKTNCYNNGKDRGDEDQSREQMASDVANNKSSLEDGCLS
CGRKNPVSFHPLFEGGLCQTCRDRFLELFYMYDDDGYQSYCTVCCEGRELLLCSNTSCCR
CFCVECLEVLVGTGTAAEAKLQEPWSCYMCLPQRCHGVLRRRKDWNVRLQAFFTSDTGLE
YEAPKLYPAIPAARRRPIRVLSLFDGIATGYLVLKELGIKVGKYVASEVCEESIAVGTVK
HEGNIKYVNDVRNITKKNIEEWGPFDLVIGGSPCNDLSNVNPARKGLYGGHRLVGITLNC
QEAIGNLLVSGNKEGDDRPFFWMFENVVAMKVGDKRDISRFLECNPVMIDAIKVSAAHRA
RYFWGNLPGMNRPVIASKNDKLELQDCLEYNRIAKLKKVQTITTKSNSIKQGKNQLFPVV
MNGKEDVLWCTELERIFGFPVHYTDVSNMGRGARQKLLGRSWSVPVIRHLFAPLKDYFAC
E

>gi568815578f:32680324_32907900|GENSCAN_predicted_CDS_2|2886_bp
atggacaggggagacttcaggtggagtcatcgggaaaagcctctccataaagtgaccttc
tgggagaaaaccgagggcagctgctcccggctccgcggccgcagcccgcgtggacgctcc
gagcgccccccgacggacgggaccggctccctggcggtcgggcgagcgggcggcaacgct
gcccggccggcagcgctggggttaagtggcccaagtaaacctagctcggcgatcggcgcc
ggagattcgcgagcccagcgccctgcacggccgccagccggcctcccgccagccagcccc
gacccgcggctccgccgcccagccgcgccccagccagccctgcggcaggaaagcatgaag
ggagacaccaggcatctcaatggagaggaggacgccggcgggagggaagactcgatcctc
gtcaacggggcctgcagcgaccagtcctccgactcgcccccaatcctggaggctatccgc
accccggagatcagaggccgaagatcaagctcgcgactctccaagagggaggtgtccagt
ctgctaagctacacacaggacttgacaggcgatggcgacggggaagatggggatggctct
gacaccccagtcatgccaaagctcttccgggaaaccaggactcgttcagaaagcccagct
gtccgaactcgaaataacaacagtgtctccagccgggagaggcacaggccttccccacgt
tccacccgaggccggcagggccgcaaccatgtggacgagtcccccgtggagttcccggct
accaggtccctgagacggcgggcaacagcatcggcaggaacgccatggccgtcccctccc
agctcttaccttaccatcgacctcacagacgacacagaggacacacatgggacgccccag
agcagcagtaccccctacgcccgcctagcccaggacagccagcaggggggcatggagtcc
ccgcaggtggaggcagacagtggagatggagacagttcagagtatcaggatgggaaggag
tttggaataggggacctcgtgtggggaaagatcaagggcttctcctggtggcccgccatg
gtggtgtcttggaaggccacctccaagcgacaggctatgtctggcatgcggtgggtccag
tggtttggcgatggcaagttctccgaggtctctgcagacaaactggtggcactggggctg
ttcagccagcactttaatttggccaccttcaataagctcgtctcctatcgaaaagccatg
taccatgctctggagactctgcctttgcagaaagctagggtgcgagctggcaagaccttc
cccagcagccctggagactcattggaggaccagctgaagcccatgttggagtgggcccac
gggggcttcaagcccactgggatcgagggcctcaaacccaacaacacgcaaccagagaac
aagactcgaagacgcacagctgacgactcagccacctctgactactgccccgcacccaag
cgcctcaagacaaattgctataacaacggcaaagaccgaggggatgaagatcagagccga
gaacaaatggcttcagatgttgccaacaacaagagcagcctggaagatggctgtttgtct
tgtggcaggaaaaaccccgtgtccttccaccctctctttgagggggggctctgtcagaca
tgccgggatcgcttccttgagctgttttacatgtatgatgacgatggctatcagtcttac
tgcactgtgtgctgcgagggccgagagctgctgctttgcagcaacacgagctgctgccgg
tgtttctgtgtggagtgcctggaggtgctggtgggcacaggcacagcggccgaggccaag
cttcaggagccctggagctgttacatgtgtctcccgcagcgctgtcatggcgtcctgcgg
cgccggaaggactggaacgtgcgcctgcaggccttcttcaccagtgacacggggcttgaa
tatgaagcccccaagctgtaccctgccattcccgcagcccgaaggcggcccattcgagtc
ctgtcattgtttgatggcatcgcgacaggctacctagtcctcaaagagttgggcataaag
gtaggaaagtacgtcgcttctgaagtgtgtgaggagtccattgctgttggaaccgtgaag
cacgaggggaatatcaaatacgtgaacgacgtgaggaacatcacaaagaaaaatattgaa
gaatggggcccatttgacttggtgattggcggaagcccatgcaacgatctctcaaatgtg
aatccagccaggaaaggcctgtatggtggacatagactggtaggcatcaccctgaactgt
caggaggccattgggaacctgctggtctcagggaataaggagggtgatgaccggccgttc
ttctggatgtttgagaatgttgtagccatgaaggttggcgacaagagggacatctcacgg
ttcctggagtgtaatccagtgatgattgatgccatcaaagtttctgctgctcacagggcc
cgatacttctggggcaacctacccgggatgaacaggcccgtgatagcatcaaagaatgat
aaactcgagctgcaggactgcttggaatacaataggatagccaagttaaagaaagtacag
acaataaccaccaagtcgaactcgatcaaacaggggaaaaaccaacttttccctgttgtc
atgaatggcaaagaagatgttttgtggtgcactgagctcgaaaggatctttggctttcct
gtgcactacacagacgtgtccaacatgggccgtggtgcccgccagaagctgctgggaagg
tcctggagcgtgcctgtcatccgacacctcttcgcccctctgaaggactactttgcatgt
gaatag

>gi568815578f:32680324_32907900|GENSCAN_predicted_peptide_3|299_aa
MGKKKMAVNVYSTSVTSDNLSRHDMLAWINESLQLNLTKIEQLCSGAAYCQFMDMLFPGS
IALKKVKFQAKLEHEYIQNFKILQAGFKRMGVDKIIPVDKLVKGKFQDNFEFVQWFKKFF
DANYDGKDYDPVAARQGQETAVAPSLVAPALNKPKKPLTSSSAAPQRPISTQRTAAAPKA
GPGVVRKNPGVGNGDDEAAELMQQVGTPVFSTACDTCEPQVLCGEHERGDVNVLKLTVED
LEKERDFYFGKLRNIELICQENEGENDPVLQRIVDILYATDEGFVIPDEGGPQEEQEEY

>gi568815578f:32680324_32907900|GENSCAN_predicted_CDS_3|900_bp
atggggaagaagaagatggcagtgaacgtatactcaacgtcagtgaccagtgataaccta
agtcgacatgacatgctggcctggatcaatgagtctctgcagttgaatctgacaaagatc
gaacagttgtgctcaggggctgcgtattgtcagtttatggacatgctgttccctggctcc
attgccttgaagaaagtgaaattccaagctaagctagaacacgagtacatccagaacttc
aaaatactacaagcaggttttaagagaatgggtgttgacaaaataattcctgtggacaaa
ttagtaaaaggaaagtttcaggacaattttgaattcgttcagtggttcaagaagtttttc
gatgcaaactatgatggaaaagactatgaccctgtggctgccagacaaggtcaagaaact
gcagtggctccttcccttgttgctccagctctgaataaaccgaagaaacctctcacttct
agcagtgcagctccccagaggcccatctcaacacagagaaccgctgcggctcctaaggct
ggccctggtgtggtgcgaaagaaccctggtgtgggcaacggagacgacgaggcagctgag
ttgatgcagcaggtgggcacccctgtgtttagcacagcatgtgacacgtgtgagcctcag
gtgctgtgcggggagcatgaacgtggagatgtcaacgtattgaaacttactgttgaagac
ttggagaaagagagggatttctacttcggaaagctacggaacattgaattgatttgccag
gagaacgagggggaaaacgaccctgtattgcagaggattgtagacattctgtatgccaca
gatgaaggctttgtgatacctgatgaagggggcccacaggaggagcaagaagagtattaa

>gi568815578f:32680324_32907900|GENSCAN_predicted_peptide_4|540_aa
MDQDCSGCIKSRPALKSGFDNSSLSGEERLKCKLGKSFLLEKSLGKGMLIHCSLGVSMGK
GKPPSPLTLTSFPPFCDLAKSAFHVVLTTTGVKLTMIPYSRSRLMSSEDLAEIPQLQKLS
IPHGFQNKEAASSPTPSITLSQVPDLQPGSQLFTEIHLAKIEKMFEEDINSTGALGMDAF
IKAMKKVLSSVSDEMLKELFLKVDSDCEGFVTWQKYVDYMMREFQGKEDMRKSQYRLHFY
LPMTVVPLNHGCEVVKVVFLIHRFKKIGCFLTVTKDGILQFWSESFSLMSSFRLNQTQQL
YNQPMWVIDMVCLHNMNLVAVASTRQKIDFFDISDHKCVRAFTFVDLDSCALVMDYWSDY
HRGVFCYGDAKGNVIVFTSENMTSGLFNPRILPRASKWDHWIKVSLQKLLNEKSALHRSY
RLKALHPNWCEQVKFIPQMNVVVSCSAIEKSSLVLTILPAKASKKPRLSVLRLRKGILCF
DYCPDRNFLVTGGYDAFIRLWNPFVSKRPVWLMKGHQTSVTHILVDSRNNSILISVSKDK

>gi568815578f:32680324_32907900|GENSCAN_predicted_CDS_4|1620_bp
atggaccaagattgcagtggctgcatcaagtctaggcctgccctgaagtctggatttgac
aactcctctctatcaggggaagaaaggctcaagtgcaagcttggcaaaagctttttgctg
gagaaatctctgggaaaaggaatgctcatccactgctcgctgggagtgtccatgggcaag
gggaagccaccctctcccttgaccttgacttcttttccacctttctgtgacctggctaag
tcagccttccacgtggtccttacaacgactggagttaagttgactatgattccctattct
aggtcaaggctaatgtcttctgaagacttagcagagatccctcaactccaaaagctgtcc
atcccacatggcttccagaacaaggaggctgctagctccccaacaccatccatcaccctt
agccaggtgcctgacctccagcctgggtcccagctgtttactgagatacacctggccaag
atagagaaaatgtttgaggaggacatcaactcgactggagccctgggcatggacgccttc
atcaaggccatgaagaaggttctgagcagtgtgtcggacgagatgctaaaggagctgttt
ttgaaggtggactcggactgtgaaggctttgtcacctggcaaaagtatgtggattacatg
atgcgtgagttccagggaaaagaggacatgcgaaagagccagtaccgcctgcacttctac
cttcccatgacggtcgtccccctgaaccatggctgtgaggtggtgaaggtggtgttttta
atccaccggttcaagaagatcgggtgtttcctgactgtcaccaaagacgggatcctgcag
ttctggtctgagtccttctcgctgatgagctcctttaggcttaaccagacccagcagctc
tacaaccagccgatgtgggtcattgacatggtatgtctgcacaatatgaacctcgttgca
gttgcgtctaccaggcaaaagatagatttctttgatattagtgaccacaaatgtgtccgg
gccttcacctttgttgatctggacagctgtgctctggtcatggactactggtctgactat
cacagaggtgtgttctgctatggagacgccaaaggcaacgtcattgtcttcacctccgaa
aacatgaccagtgggctgttcaacccccgtatcctccccagggcctccaagtgggatcac
tggatcaaagtttccttgcagaaactcttaaatgagaagtctgctttgcatagaagctac
cggctgaaggctctccatcccaactggtgtgagcaggtcaagttcatcccccagatgaat
gtggtagtctcctgttcagccatcgagaagtcctctctggtgctgacaatattgccagcc
aaagcctctaagaaacccaggttgtcagtgctgcgtttaaggaaagggattctttgcttt
gattactgcccagacaggaacttcctggtgactggtggctacgatgccttcatccgcctg
tggaacccctttgtctcaaagaggcccgtgtggctgatgaagggacaccagacctcagtg
acgcacatccttgtggatagcaggaacaacagcatcctcatcagtgtctccaaggacaag