GENSCAN 1.0 Date run: 5-Nov-116 Time: 18:18:37 Sequence gi568815596f:112960851_113162674 : 201824 bp : 42.80% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init - 1104 932 173 0 2 97 86 225 0.927 22.16 1.00 Prom - 5961 5922 40 -4.55 2.00 Prom + 6778 6817 40 -6.55 2.01 Init + 10173 10291 119 2 2 63 81 88 0.498 5.32 2.02 Intr + 14122 14261 140 1 2 16 38 125 0.074 -0.51 2.03 Intr + 18371 18475 105 0 0 32 109 141 0.934 9.87 2.04 Intr + 19159 19298 140 2 2 85 87 69 0.636 5.76 2.05 Term + 23990 24199 210 1 0 59 43 134 0.844 2.31 2.06 PlyA + 24756 24761 6 1.05 3.04 PlyA - 25648 25643 6 1.05 3.03 Term - 32132 31990 143 0 2 79 45 113 0.057 3.21 3.02 Intr - 34026 33947 80 2 2 61 86 76 0.059 2.98 3.01 Init - 36241 36186 56 1 2 102 84 35 0.092 5.31 3.00 Prom - 41757 41718 40 -7.15 4.00 Prom + 44570 44609 40 -5.25 4.01 Init + 45022 45031 10 0 1 81 98 -2 0.640 0.88 4.02 Intr + 45124 45237 114 2 0 81 109 105 0.993 11.40 4.03 Intr + 45748 45887 140 2 2 58 113 141 0.931 12.86 4.04 Term + 46982 47194 213 1 0 75 45 144 0.938 5.05 4.05 PlyA + 48649 48654 6 1.05 5.00 Prom + 54598 54637 40 -4.75 5.01 Sngl + 56756 56986 231 1 0 63 45 471 0.997 35.16 5.02 PlyA + 57650 57655 6 1.05 6.08 PlyA - 58829 58824 6 1.05 6.07 Term - 65382 65134 249 0 0 79 38 122 0.087 1.02 6.06 Intr - 68210 68089 122 0 2 47 113 71 0.533 4.79 6.05 Intr - 70305 70198 108 1 0 62 105 84 0.952 6.84 6.04 Intr - 70916 70847 70 2 1 51 111 13 0.255 -2.26 6.03 Intr - 73428 73283 146 1 2 36 73 151 0.107 7.48 6.02 Intr - 84644 84457 188 1 2 20 52 136 0.329 1.61 6.01 Init - 86410 86310 101 1 2 96 76 58 0.598 5.18 6.00 Prom - 86846 86807 40 -8.65 7.00 Prom + 87178 87217 40 -7.05 7.01 Init + 88122 88200 79 2 1 22 60 142 0.625 6.07 7.02 Intr + 90376 90545 170 2 2 49 37 190 0.398 8.74 7.03 Intr + 93970 94179 210 0 0 81 3 113 0.084 0.29 7.04 Intr + 98562 98617 56 2 2 82 117 42 0.239 3.36 7.05 Intr + 100002 100087 86 0 2 89 81 62 0.680 4.14 7.06 Intr + 101169 101401 233 1 2 60 78 184 0.486 11.17 7.07 Term + 101603 101827 225 1 0 126 49 220 0.988 17.70 7.08 PlyA + 103418 103423 6 1.05 8.00 Prom + 104900 104939 40 -5.95 8.01 Init + 106246 106376 131 0 2 75 11 137 0.490 4.37 8.02 Intr + 111861 111920 60 0 0 97 103 7 0.422 0.03 8.03 Intr + 113365 113564 200 1 2 5 99 185 0.689 9.37 8.04 Intr + 113873 114000 128 0 2 143 78 42 0.729 8.18 8.05 Intr + 114302 114510 209 1 2 111 58 184 0.568 14.65 8.06 Term + 115126 115324 199 1 1 21 42 152 0.450 -0.21 8.07 PlyA + 115868 115873 6 1.05 9.06 PlyA - 116028 116023 6 -0.45 9.05 Term - 116708 116545 164 0 2 -64 49 449 0.041 22.92 9.04 Intr - 124029 123885 145 0 1 92 17 108 0.006 3.03 9.03 Intr - 136332 136181 152 1 2 38 59 132 0.224 4.26 9.02 Intr - 137629 137521 109 1 1 62 42 57 0.196 -2.56 9.01 Init - 138807 138661 147 0 0 35 74 112 0.237 4.64 9.00 Prom - 140764 140725 40 -6.65 10.00 Prom + 150286 150325 40 -5.65 10.01 Init + 156233 156368 136 1 1 77 80 128 0.851 11.25 10.02 Intr + 159216 159278 63 1 0 76 61 139 0.048 7.77 10.03 Intr + 166839 166890 52 1 1 112 110 49 0.103 6.55 10.04 Intr + 167769 167891 123 0 0 76 90 59 0.763 3.68 10.05 Intr + 168726 168814 89 0 2 96 113 63 0.908 8.30 10.06 Intr + 170195 170307 113 0 2 150 83 74 0.987 12.28 10.07 Term + 171806 172021 216 1 0 107 38 305 0.987 23.56 10.08 PlyA + 172899 172904 6 1.05 11.04 PlyA - 173786 173781 6 1.05 11.03 Term - 174512 174398 115 1 1 92 36 59 0.006 -1.84 11.02 Intr - 196824 196725 100 1 1 107 1 213 0.252 12.75 11.01 Intr - 197018 196881 138 0 0 30 33 172 0.249 5.41 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 124515 124724 210 2 0 52 38 198 0.926 6.14 S.002 Term + 159216 159352 137 1 2 76 33 140 0.868 4.50 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:112960851_113162674|GENSCAN_predicted_peptide_1|58_aa MQVGAVKTQVPQAPQLASLCKAAAAAAEASARAEEQQEWTRCESMSTLKDGAPYKAES >gi568815596f:112960851_113162674|GENSCAN_predicted_CDS_1|174_bp atgcaagttggtgctgtgaagacacaggtgcctcaggcaccacagctggccagcctctgc aaagcagcagcagcagcagcagaagcctcagcaagagcagaggaacagcaggagtggact cgttgtgagtccatgtctacactcaaagacggggcaccttacaaggctgaaagn >gi568815596f:112960851_113162674|GENSCAN_predicted_peptide_2|237_aa MHTGSCENREKAATCEPRREPLGDTKTIDTLILDFQPPECLGFHRNQLQEDVMDQLIGSK SLIMGLGYVEEQRLSAMSMHIIPSTLVCKPITGTINDLNQQVWTLQGQNLVAVPRSDSVT PVTVAVITCKYPEALEQGRGDPIYLGIQNPEMCLYCEKVGEQPTLQLKEQKIMDLYGQPE PVKPFLFYRAKTGRTSTLESVAFPDWFIASSKRDQPIILTSELGKSYNTAFELNIND >gi568815596f:112960851_113162674|GENSCAN_predicted_CDS_2|714_bp atgcacacagggtcatgtgagaacagagagaaggcggccacctgtgagccaaggagagag cccttaggagacaccaaaactattgacaccttgatcttggacttccagcctccagaatgt ctggggtttcatagaaaccagcttcaggaagatgtaatggatcagctcataggaagcaag tctttgataatgggcttgggctatgtggaggagcagaggctttcggccatgtccatgcac atcattccatctacgttggtgtgtaaacctattactgggactattaatgatttgaatcag caagtgtggacccttcagggtcagaaccttgtggcagttccacgaagtgacagtgtgacc ccagtcactgttgctgttatcacatgcaagtatccagaggctcttgagcaaggcagaggg gatcccatttatttgggaatccagaatccagaaatgtgtttgtattgtgagaaggttgga gaacagcccacattgcagctaaaagagcagaagatcatggatctgtatggccaacccgag cccgtgaaacccttccttttctaccgtgccaagactggtaggacctccacccttgagtct gtggccttcccggactggttcattgcctcctccaagagagaccagcccatcattctgact tcagaacttgggaagtcatacaacactgcctttgaattaaatataaatgactga >gi568815596f:112960851_113162674|GENSCAN_predicted_peptide_3|92_aa MTNEKQLQCVALMENNKKGSFADTLLDRDSPNDDMSQTLLCHLAQGLLLKEAPQQSQIAK TRLGTYINAWTSTNGYKNHNKSGNVHFTKQTK >gi568815596f:112960851_113162674|GENSCAN_predicted_CDS_3|279_bp atgaccaatgagaagcagctgcagtgtgtggcactcatggaaaataacaaaaaggggtca tttgctgacaccctgctagatagggacagccctaatgatgacatgtctcagactttgctc tgccatcttgctcaaggcttactgctaaaggaagctccccaacaaagccagattgcaaag actagattaggtacttacatcaatgcatggacatctacaaatggctacaagaatcacaat aaatcagggaatgttcacttcaccaagcagacaaaataa >gi568815596f:112960851_113162674|GENSCAN_predicted_peptide_4|158_aa MEKALKIDTPQQGSIQDINHRVWVLQDQTLIAVPRKDRMSPVTIALISCRHVETLEKDRG NPIYLGLNGLNLCLMCAKVGDQPTLQLKEKDIMDLYNQPEPVKSFLFYHSQSGRNSTFES VAFPGWFIAVSSEGGCPLILTQELGKANTTDFGLTMLF >gi568815596f:112960851_113162674|GENSCAN_predicted_CDS_4|477_bp atggaaaaagcattgaaaattgacacacctcagcaggggagcattcaggatatcaatcat cgggtgtgggttcttcaggaccagacgctcatagcagtcccgaggaaggaccgtatgtct ccagtcactattgccttaatctcatgccgacatgtggagacccttgagaaagacagaggg aaccccatctacctgggcctgaatggactcaatctctgcctgatgtgtgctaaagtcggg gaccagcccacactgcagctgaaggaaaaggatataatggatttgtacaaccaacccgag cctgtgaagtcctttctcttctaccacagccagagtggcaggaactccaccttcgagtct gtggctttccctggctggttcatcgctgtcagctctgaaggaggctgtcctctcatcctt acccaagaactggggaaagccaacactactgactttgggttaactatgctgttttaa >gi568815596f:112960851_113162674|GENSCAN_predicted_peptide_5|76_aa MKKKKERKKKKKKKKKKKKKKKKKKKKEKKKERKKERKKEKRRKEEEGEGGEEEEKGAGG GGGGRRREKEEKKSES >gi568815596f:112960851_113162674|GENSCAN_predicted_CDS_5|231_bp atgaagaagaagaaggagaggaagaagaagaagaagaagaagaagaagaagaagaagaag aagaagaagaagaagaagaaggagaaaaagaaagaaagaaagaaagagagaaagaaagaa aagagaagaaaagaagaagaaggagaaggaggagaagaagaagaaaaaggagctggagga ggaggaggagggagaaggagggagaaggaggagaagaagagtgaatcatga >gi568815596f:112960851_113162674|GENSCAN_predicted_peptide_6|327_aa MAEFGTGTENTQEQPKHVVVTKSKEILKKIHSDWFKKIDDPVHMKYEENYSSKIINKLLK TSDTEKILKVAGGKRTQCIQIHKKKDDGRFFVRINTKEQEKLPAGEDKLAGQENQQNPVH PEQGSQMGRDRRQQKDKEDPDHVGLPPHHHLIYLVLFTKGSEDIMNPQREAAPKSYAIRD SRQMVWVLSGNSLIAAPLSRSIKPACRDTEFSDKEKGNMVYLGIKGKDLCLFCAEIQGKP TLQLKLQGSQDNIGKDTCWKLVGIHTCINLDVRESCFMGTLDQWGIGVGEEQILLSIPQV DNSEMRSFFMVPQRIPAVLRHNHPWQV >gi568815596f:112960851_113162674|GENSCAN_predicted_CDS_6|984_bp atggctgaatttgggacgggcacagaaaatacacaagagcagcctaagcatgttgttgta acaaaaagtaaggaaatacttaaaaaaatccacagtgattggttcaagaagattgatgac cccgtgcacatgaaatatgaagaaaactacagcagcaaaatcataaacaaattgctcaaa accagtgatactgagaaaatcttaaaagtagctggaggaaaaaggacacaatgcatacaa atacataaaaagaaggatgacggcagattttttgtcagaatcaatacaaaggaacaggaa aaactccctgcaggcgaagataagcttgcagggcaagaaaatcagcagaatccagtacat ccagagcagggaagtcagatgggcagggatagaaggcagcagaaagataaagaggaccca gatcatgtgggcttgcctcctcaccaccatctgatctatcttgttctcttcacaaaaggc tctgaagacatcatgaacccacaacgggaggcagcacccaaatcctatgctattcgtgat tctcgacagatggtgtgggtcctgagtggaaattctttaatagcagctcctcttagccgc agcattaagcctgcctgtagagacacagaattcagtgacaaggaaaagggtaatatggtt tacctgggaatcaagggaaaagatctctgtctcttctgtgcagaaattcagggcaagcct actttgcagcttaagcttcagggctcccaagataacatagggaaggacacttgctggaaa ctagttggaattcacacatgcataaacctggatgtgagagagagctgcttcatgggaacc cttgaccaatggggaataggagtgggtgaggagcaaattctcttatccatcccacaggtg gacaattctgagatgcgttcattcttcatggttcctcagaggatcccagcagtattgaga cataatcacccatggcaggtctaa >gi568815596f:112960851_113162674|GENSCAN_predicted_peptide_7|352_aa MLAIVLLNDDEEEASHNNDTDDGSCVVSCWRAMVKLHSPAEASPQALEASEAQTRKAPEC GSGVQKLPVALALFMGLSQQKLWVMLMQEVGSHSLWKLHSCDFAGYNPPPGCFHGLTLSV AFPGTQHKLLVDLPFWDLEDSGPLLTVPLDNAPGSLHPVELKMVLSGALCFRMKDSALKV LYLHNNQLLAGGLHAGKVIKAWGQAVLQQSCALEPRTGEEGGKGIQGPASGLFPTGEEIS VVPNRWLDASLSPVILGVQGGSQCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRR DMGLTSSFESAAYPGWFLCTVPEADQPVRLTQLPENGGWNAPITDFYFQQCD >gi568815596f:112960851_113162674|GENSCAN_predicted_CDS_7|1059_bp atgctagctattgttcttcttaatgatgatgaagaggaagcatcacacaataatgatact gatgatggcagttgtgttgtttcctgctggagagcaatggtcaaactacacagtccggcg gaggcgagtccgcaggctctggaagcctctgaggcacagacgaggaaagcgcctgaatgt ggatctggggtacagaagttgccggtggctctggccctgttcatgggactcagtcagcag aaactgtgggtcatgcttatgcaagaggtgggttcccatagtctttggaagctccactcc tgtgactttgcagggtacaaccctcctcctggctgctttcatgggctgacactgtctgtg gcttttccaggcacacagcataagctgttggtggatctaccattctgggatctggaggac agtggccctcttctcacagttccactagacaatgccccagggagtctacaccctgtggag ctcaagatggtcctgagtggggcgctgtgcttccgaatgaaggactcggcattgaaggtg ctttatctgcataataaccagcttctagctggagggctgcatgcagggaaggtcattaaa gcctgggggcaggccgtcttacagcagtcctgtgccctagagcccaggacaggggaagaa ggagggaaaggcatccagggccctgcatctggcctctttcccacaggtgaagagatcagc gtggtccccaatcggtggctggatgccagcctgtcccccgtcatcctgggtgtccagggt ggaagccagtgcctgtcatgtggggtggggcaggagccgactctaacactagagccagtg aacatcatggagctctatcttggtgccaaggaatccaagagcttcaccttctaccggcgg gacatggggctcacctccagcttcgagtcggctgcctacccgggctggttcctgtgcacg gtgcctgaagccgatcagcctgtcagactcacccagcttcccgagaatggtggctggaat gcccccatcacagacttctacttccagcagtgtgactag >gi568815596f:112960851_113162674|GENSCAN_predicted_peptide_8|308_aa MDGWMDGWMDEWMDERVDRRVDGWAHGWMDGWMDDGWMEEWTDGEDQTPLIAGMCSLPMA RYYMRENRPNAQGGDSEHGSGRAWGPVVHKGNGPSALSYCFRIKYADQKALYTRDGQLLV GDPVADNCCAEKICILPNRGLARTKVPIFLGIQGGSRCLACVETEEGPSLQLEDVNIEEL YKGGEEATRFTFFQSSSGSAFRLEAAAWPGWFLCGPAEPQQPVQLTKESEPSARTKFYFE QSCPSAMEPLETTKKPQAELSVLCTGTKPWAAYNLNSSSSLVIFYQILHSESTIPWLTVG TVRYLPGA >gi568815596f:112960851_113162674|GENSCAN_predicted_CDS_8|927_bp atggatggatggatggatggatggatggatgaatggatggatgaaagggtggacagaagg gtggatggatgggcacatgggtggatggatggttggatggatgatggatggatggaagaa tggacagatggtgaggaccagacaccactgattgcaggaatgtgttccctccccatggca agatactacatgagagaaaatcgcccaaatgctcaaggtggtgattcagagcatggaagt ggaagggcttgggggccagtggtgcataaagggaatgggccatcagcactgtcatactgt ttcagaattaaatatgcagaccagaaggctctatacacaagagatggccagctgctggtg ggagatcctgttgcagacaactgctgtgcagagaagatctgcatacttcctaacagaggc ttggcccgcaccaaggtccccattttcctggggatccagggagggagccgctgcctggca tgtgtggagacagaagaggggccttccctacagctggaggatgtgaacattgaggaactg tacaaaggtggtgaagaggccacacgcttcaccttcttccagagcagctcaggctccgcc ttcaggcttgaggctgctgcctggcctggctggttcctgtgtggcccggcagagccccag cagccagtacagctcaccaaggagagtgagccctcagcccgtaccaagttttactttgaa cagagctgtccatctgctatggagcctctagaaacaaccaagaagccccaggcagaactc agtgtcctctgcacagggacaaagccctgggctgcttacaacttaaattctagctcaagt cttgtgattttctaccagatacttcacagtgaatccacaattccgtggttgactgttggt acagtccgctatctgccaggtgcttaa >gi568815596f:112960851_113162674|GENSCAN_predicted_peptide_9|238_aa MNRAVYGDPHRELLLQELPQEHTRKAEKVHRPFEGGGLPLQAPWDSRGTQVLESMADRPE DGSYHGILQILPSTSPEPSSSTGWPVLEVLARKIRQEKAIKDIQTSKEEVKLLLFADDMT VYLENPKDSSRKFLELLRDDECLTILEGNHLSGCANNPCSSWQPEATILGGQSDRVTPLL EIIHGGGGGGGEGGGGGEGEEGGGGGRRRGGRGGGEEEEEEEEEEEEEEEEGEEEEEE >gi568815596f:112960851_113162674|GENSCAN_predicted_CDS_9|717_bp atgaacagagcagtgtatggagacccacatcgtgaacttttgctccaagaacttccacag gaacataccaggaaagccgagaaagtccacagaccctttgaaggaggcggattgccactg caggctccatgggacagtcgaggaactcaagtgctggaatccatggctgacagacctgaa gacggatcatatcacgggattctgcagatactccccagtactagcccagagcccagtagc tccactgggtggccagtactggaagtcctagccagaaaaatcagacaagagaaagcaata aaggacatccaaacaagtaaagaggaagtcaaactgttgctgtttgctgatgacatgact gtatacttagaaaaccctaaagactcatccagaaagttcctagaactgttgagagatgat gagtgtctaacaattcttgaaggaaaccacttgagtggctgtgccaataatccatgctcc agttggcagccagaagcaactattttaggaggtcaatctgatcgtgtcactcccctgctt gaaatcatccatggaggaggaggaggaggaggagaaggaggaggaggaggagaaggagaa gaaggaggaggaggaggaagaagaagaggaggaagaggaggaggagaggaggaagaagag gaagaagaagaagaggaagaagaagaagaagaaggggaggaggaggaggaggaatag >gi568815596f:112960851_113162674|GENSCAN_predicted_peptide_10|263_aa MQRHEATSRFAPNGKKEEGCSEHQVLLEKAGREYLALWTMSRQRPADLYEEGGGGGGEGE DNADSKETICRPSGRKSSKMQAFRLRLNLETRIITNRPMLPYAKHFGLPGHPSSSMWALQ SHLMRIWDVNQKTFYLRNNQLVAGYLQGPNVNLEEKIDVVPIEPHALFLGIHGGKMCLSC VKSGDETRLQLEAVNITDLSENRKQDKRFAFIRSDSGPTTSFESAACPGWFLCTAMEADQ PVSLTNMPDEGVMVTKFYFQEDE >gi568815596f:112960851_113162674|GENSCAN_predicted_CDS_10|792_bp atgcagagacatgaagctacaagcaggttcgctcccaacggcaaaaaggaggaggggtgt tcagaacatcaggtgcttctagagaaagcagggagagagtatctggccttgtggacaatg tcacggcagaggccagctgacttgtatgaagaaggaggtggaggaggaggagaaggtgaa gacaatgctgactcaaaggagacgatctgccgaccctctgggagaaaatccagcaagatg caagccttcagactgaggctaaatctagaaactaggataatcacaaacaggccaatgctg ccatatgcaaagcactttggtttgcctggccacccctcgtcgagcatgtgggctcttcag agccacctgatgagaatctgggatgttaaccagaagaccttctatctgaggaacaaccaa ctagttgctggatacttgcaaggaccaaatgtcaatttagaagaaaagatagatgtggta cccattgagcctcatgctctgttcttgggaatccatggagggaagatgtgcctgtcctgt gtcaagtctggtgatgagaccagactccagctggaggcagttaacatcactgacctgagc gagaacagaaagcaggacaagcgcttcgccttcatccgctcagacagtggccccaccacc agttttgagtctgccgcctgccccggttggttcctctgcacagcgatggaagctgaccag cccgtcagcctcaccaatatgcctgacgaaggcgtcatggtcaccaaattctacttccag gaggacgagtag >gi568815596f:112960851_113162674|GENSCAN_predicted_peptide_11|117_aa XLLLLASQRGLEPGSNSAAGVARGERAQADSFSGDSVPLPSQSPPLGTLTLSDSRSAGAR RRGESAARHTVTVEGGDGRGIKNISTTHALPCQLLPFYIQERENSEMLDDLLNVIGS >gi568815596f:112960851_113162674|GENSCAN_predicted_CDS_11|354_bp nggctcctcctgctcgccagccagcggggcttggagccaggatccaacagcgcggctgga gtcgcccgcggcgagagagcccaggctgactccttcagtggagacagcgtccccttgccg tcccagtctccgccgctcgggaccctaaccctcagcgacagccgctccgcaggtgctcgc cggcgtggcgagagcgctgcacggcacacggtcaccgtcgagggcggcgacggccggggg attaaaaatatatcaacgacacatgccctgccctgccagctacttccattttatatacaa gaaagagaaaactcagagatgttggatgacctcctcaacgtcataggatcttag