GENSCAN 1.0    Date run: 3-Nov-116    Time: 14:49:33

Sequence gi568815577f:33995199_34197352 : 202154 bp : 42.51% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  14411  14560  150  1  0   69   81   101 0.139   7.59
 1.02 Intr +  17502  17592   91  2  1  -16   93    86 0.023  -2.65
 1.03 Intr +  27287  27411  125  0  2   55   78   129 0.237   7.98
 1.04 Intr +  43571  43672  102  1  0   64  115    93 0.653   9.05
 1.05 Term +  44964  45047   84  2  0   84   43    98 0.679   1.57
 1.06 PlyA +  45154  45159    6                               1.05

 2.00 Prom +  54098  54137   40                              -5.55
 2.01 Init +  57120  57251  132  2  0   48   31   172 0.790   7.69
 2.02 Intr +  59735  59863  129  1  0   19   97    96 0.279   3.57
 2.03 Intr +  64362  64488  127  2  1   57   77    92 0.157   4.33
 2.04 Intr +  74034  74156  123  1  0  104   17   125 0.132   6.64
 2.05 Intr +  78339  78547  209  1  2   30  114   293 0.603  23.97
 2.06 Intr +  83136  83188   53  2  2   21   82    72 0.016  -3.31
 2.07 Term +  97117  97249  133  1  1   86   48    87 0.029   1.08
 2.08 PlyA +  98482  98487    6                               1.05

 3.00 Prom +  98799  98838   40                              -7.65
 3.01 Sngl + 100001 102157 2157  1  0   56   43  1591 0.969 145.25
 3.02 PlyA + 104989 104994    6                               1.05

 4.00 Prom + 106084 106123   40                              -3.85
 4.01 Init + 130185 130282   98  2  2   68  114    61 0.984   6.54
 4.02 Intr + 131029 131148  120  1  0   74   92   114 0.771   9.09
 4.03 Term + 132723 132822  100  0  1   29   52   162 0.553   3.32
 4.04 PlyA + 133352 133357    6                               1.05

 5.03 PlyA - 133527 133522    6                              -1.75
 5.02 Term - 135414 135117  298  2  1   80   48   141 0.201   3.05
 5.01 Init - 140770 140463  308  1  2   43   80   379 0.121  29.41
 5.00 Prom - 143189 143150   40                              -5.65

 6.00 Prom + 143667 143706   40                              -3.65
 6.01 Init + 146522 146586   65  1  2   72   55    31 0.631  -0.92
 6.02 Intr + 147210 147398  189  0  0   96   44   151 0.133   9.28
 6.03 Term + 148220 148562  343  2  1   18   43   205 0.179   2.00
 6.04 PlyA + 148723 148728    6                               1.05

 7.10 PlyA - 148773 148768    6                               1.05
 7.09 Term - 149824 149525  300  1  0   99   48   113 0.389   2.64
 7.08 Intr - 151582 151456  127  0  1   63   69   130 0.779   8.36
 7.07 Intr - 158707 158640   68  1  2  104   72    72 0.139   3.98
 7.06 Intr - 159898 159689  210  1  0   83  -46   211 0.269   5.39
 7.05 Intr - 166316 166248   69  2  0  125   26    84 0.077   4.36
 7.04 Intr - 166617 166474  144  0  0  101   65   125 0.752  11.06
 7.03 Intr - 167949 167882   68  1  2   95   37    23 0.394  -4.39
 7.02 Intr - 171320 171193  128  1  2  131   32    35 0.456   1.60
 7.01 Init - 172360 172179  182  1  2   71   89   101 0.552   7.23
 7.00 Prom - 173475 173436   40                              -6.95

 8.08 PlyA - 174038 174033    6                               1.05
 8.07 Term - 186023 185847  177  2  0   10   48   213 0.681   6.20
 8.06 Intr - 187414 187264  151  0  1   71  100   124 0.796  11.14
 8.05 Intr - 188452 188232  221  1  2  -35   77   164 0.277  -0.82
 8.04 Intr - 191374 191227  148  0  1  142   59    70 0.853   8.82
 8.03 Intr - 192589 192402  188  1  2   42   78   132 0.836   5.17
 8.02 Intr - 196471 196359  113  2  2   24   75   149 0.703   6.38
 8.01 Intr - 199384 199146  239  0  2   59   78   143 0.007   6.74

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 121852 121725  128  2  2   81   40   137 0.857   5.66
S.002 Term - 166316 166236   81  2  0  125   55    81 0.916   5.21
S.003 Init - 196932 196848   85  0  1   66   64   109 0.833   7.33

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815577f:33995199_34197352|GENSCAN_predicted_peptide_1|183_aa
MPRLAKLTWKPGKHHLHGSAPLTSTGEEALDLTGSVVRKEYGRTDSWLTKEAPPEALVKM
KGRTCGMSSPVSAEPGTANAEVQLFEPSEPEEWKGIEVFLPPSRKLRVAEFEHHKPQREV
TETTSVADDMVCSQMLLQSEGTHSPAVRKLTGDGSQHHISCEIQQPAKRPVTTGESPAVT
GRA

>gi568815577f:33995199_34197352|GENSCAN_predicted_CDS_1|552_bp
atgccccgcttagccaaactcacctggaagccaggaaaacaccatctgcatgggtcagcc
ccactgaccagcacaggggaagaagcactggatctgactggctcagtggtccgtaaggaa
tacggcaggaccgactcctggctgacaaaggaagcccctccagaagccctggtgaagatg
aaagggaggacttgtggaatgtccagcccagtgtctgcagagccaggcacagcaaatgct
gaagtccagctgtttgaaccatctgaaccggaagagtggaaaggaattgaggtgtttctc
cctccttctagaaagttaagggtagcagaatttgaacaccacaagcctcagagagaagtg
acagagacaacttctgtagcagatgatatggtgtgctctcagatgctgcttcagagtgaa
ggcactcattccccggctgtcaggaagctgactggtgatgggtcacagcaccacattagc
tgtgaaatccaacagcctgcaaaacgacctgtgaccacaggggaaagtccggcagtcact
ggaagggcttag

>gi568815577f:33995199_34197352|GENSCAN_predicted_peptide_2|301_aa
MKTKPQGDSPGTVERETHSCTSRLEADVNDGPGVTIMVELDAGHYPSFQEAVRVLRKQKC
DLINMLFEKDHSGSPRVEEGRPGSVRQMRKLNHRVQLLVQGYTVIGHRRSGFGSMLLTTK
LYCCQLATFEQRIQGQKGKKEPDTGTKETKENPKSTGSWMAKKRPSGLQRAASAFKPGLA
LSDRAFAAWEPSGAAVSRSPLSPPSRPFASREPAGFRAALADPPGMPRYELALILKAMQR
QAIEEGRSPVPAPELQLSLYQVLVSACVLFKLEKSRHDPQSGMVSSEKGTVRSGFEGYEE
A

>gi568815577f:33995199_34197352|GENSCAN_predicted_CDS_2|906_bp
atgaaaacgaaaccccaaggagacagcccaggcacagtagaaagagagacacacagttgt
accagtcgtttagaagcagatgtgaacgatggcccaggtgtgacgatcatggtagaactt
gatgcaggacattacccctcgttccaggaggcagttagagttttacgtaaacagaaatgt
gatctgatcaacatgctgtttgagaaggaccactctgggagcccgagagtagaagaggga
agacccgggtcagtgaggcagatgaggaaactgaaccacagagttcagttacttgttcaa
gggtacacagtcattggacacaggcggtctggctttggctccatgctcttgaccaccaag
ctgtactgctgccaactagccacctttgaacagcgcattcagggacagaaggggaaaaag
gagccagacacagggaccaaggagacaaaggaaaaccccaagagtacaggatcatggatg
gccaagaagagaccctcaggactgcagagagctgccagcgcctttaagcccgggctcgcg
ctctcggaccgtgctttcgccgcctgggagccgtccggcgcagcagtttctaggtcccca
ctgtccccgccgtcccgccccttcgcgtcccgggaaccggctggcttccgagccgcactc
gccgatcctccaggcatgccccgctacgagctggctttaatcctgaaagccatgcagcgg
caagctattgaagaaggaaggagtcctgttcctgcccctgagctacagctaagtctttac
caggtgctggtctctgcatgtgttttgtttaagttggaaaaatctagacacgacccacag
agtggtatggtatccagtgagaagggaactgtccgcagtggctttgaaggatatgaggaa
gcctag

>gi568815577f:33995199_34197352|GENSCAN_predicted_peptide_3|718_aa
MRAVLDTADIAIVALYFILVMCIGFFAMWKSNRSTVSGYFLAGRSMTWVAIGASLFVSNI
GSEHFIGLAGSGAASGFAVGAWEFNALLLLQLLGWVFIPIYIRSGVYTMPEYLSKRFGGH
RIQVYFAALSLILYIFTKLSVDLYSGALFIQESLGWNLYVSVILLIGMTALLTVTGGLVA
VIYTDTLQALLMIIGALTLMIISIMEIGGFEEVKRRYMLASPDVTSILLTYNLSNTNSCN
VSPKKEALKMLRNPTDEDVPWPGFILGQTPASVWYWCADQVIVQRVLAAKNIAHAKGSTL
MAGFLKLLPMFIIVVPGMISRILFTDDIACINPEHCMLVCGSRAGCSNIAYPRLVMKLVP
VGLRGLMMAVMIAALMSDLDSIFNSASTIFTLDVYKLIRKSASSRELMIVGRIFVAFMVV
ISIAWVPIIVEMQGGQMYLYIQEVADYLTPPVAALFLLAIFWKRCNEQGAFYGGMAGFVL
GAVRLILAFAYRAPECDQPDNRPGFIKDIHYMYVATGLFWVTGLITVIVSLLTPPPTKEQ
IRTTTFWSKKNLVVKENCSPKEEPYKMQEKSILRCSENNETINHIIPNGKSEDSIKGLQP
EDVNLLVTCREEGNPVASLGHSEAETPVDAYSNGQAALMGEKERKKETDDGGRYWKFIDW
FCGFKSKSLSKRSLRDLMEEEAVCLQMLEETRQVKVILNIGLFAVCSLGIFMFVYFSL

>gi568815577f:33995199_34197352|GENSCAN_predicted_CDS_3|2157_bp
atgagagctgtactggacacagcagacattgccatagtggccctgtattttatcctggtc
atgtgcattggtttttttgccatgtggaaatctaatagaagcaccgtgagtggatacttc
ctggcggggcgctctatgacctgggtagcaattggtgcctctctgtttgtgagcaatatt
gggagtgagcacttcattgggctggcaggatctggagctgcaagtggatttgcagtgggc
gcatgggaattcaatgccttactgcttttacaacttctgggatgggttttcatcccaatt
tacatccggtcaggggtatataccatgcctgaatacttgtccaagcgatttggtggccat
aggattcaggtctattttgcagccttgtctctgattctctatattttcaccaagctctcg
gtggatctgtattcgggtgccctttttatccaggagtctttgggttggaatctttatgtg
tctgtcatcctgctcattggcatgactgctttgctgactgtcaccggaggccttgttgca
gtgatctacacagacactctgcaggctctgctcatgatcattggggcacttacacttatg
attattagcataatggagattggcgggtttgaggaagttaagagaaggtacatgttggcc
tcacccgatgtcacttccatcttattgacatacaacctttccaacacaaattcttgtaat
gtctcccctaagaaagaagccctgaaaatgctgcggaatccaacagatgaagatgttcct
tggcctggattcattcttgggcagaccccagcttcagtatggtactggtgtgctgaccaa
gtcatcgtgcagagggtccttgcagccaaaaacattgctcatgccaaaggctctactctt
atggctggcttcttaaagctcctgccaatgtttatcatagttgtcccaggaatgatttcc
aggatactgtttactgatgatatagcttgcatcaacccagagcactgcatgctggtgtgt
ggaagcagagctggttgctccaatattgcttacccacgcctggtgatgaagctggttcct
gtgggccttcggggtttaatgatggcagtgatgattgcagctctgatgagtgacttagac
tctatctttaacagtgccagtaccatattcaccctcgatgtgtacaaacttatccgcaag
agcgcaagctcccgggagttaatgattgtggggaggatatttgtggcatttatggtggtg
atcagcatagcatgggtgccaatcatcgtggagatgcaaggaggccagatgtacctttac
attcaggaggtagcagattacctgacacccccagtggcagccttgttcctgctggcaatt
ttctggaagcgctgcaatgaacaaggggctttctatggtggaatggctggctttgttctt
ggagcagtccgtttgatactggcctttgcctaccgtgccccagaatgtgaccaacctgat
aataggccgggcttcatcaaagacatccattatatgtatgtggccacaggattgttttgg
gtcacgggactcattactgtaattgtgagccttctcacaccacctcccacaaaggaacag
attcgaaccaccaccttttggtctaagaagaacctggtggtgaaggagaactgctcccca
aaagaggaaccatacaaaatgcaagaaaagagcattctgagatgcagtgagaataatgag
accatcaaccacatcattcccaacgggaaatctgaagacagcattaagggccttcagcct
gaagatgttaatctgttggtaacctgcagagaggagggcaacccagtggcatccttaggt
cattcagaggcagaaacaccagttgacgcttactccaatgggcaagcagctctcatgggt
gagaaagagagaaagaaagaaacggatgatggaggtcggtactggaagttcatagactgg
ttttgtggctttaaaagtaagagcctcagcaagaggagtctcagagacctgatggaagag
gaggctgtttgtttacagatgctagaagagactcggcaagttaaagtaatactaaatatt
ggactttttgctgtgtgttcacttggaattttcatgtttgtttatttctccttatga

>gi568815577f:33995199_34197352|GENSCAN_predicted_peptide_4|105_aa
MDRGAIVRDLENLGERALPYRISAHSQQHNRGGTQVMGLPSALPQGPSKTVSADVLCDEN
AACSYVSRAVLCSPRGYDTYPKLRQGADLKPDSGVREPAFGWLME

>gi568815577f:33995199_34197352|GENSCAN_predicted_CDS_4|318_bp
atggacagaggagcaatagtgagggacttggaaaacctgggtgaacgagcgcttccttat
aggatctctgcccacagtcagcagcacaacagaggcggaactcaggttatgggtctccca
tctgcattgccacaaggacccagcaagacagtctctgcagatgtgctctgcgacgagaat
gctgcttgtagttatgtgagtcgagctgtcttgtgcagccctagaggctatgacacgtat
ccgaaattacggcaaggcgctgatttgaaacctgacagtggtgtgcgagagccagctttt
ggctggctaatggagtga

>gi568815577f:33995199_34197352|GENSCAN_predicted_peptide_5|201_aa
MQISDISLQDSIAVKEKYAKYLPHNAGPYAAKRFCKAQCPIVESLTNSMIMHGHNNSKKL
MTVRIIKYAFHFIHLLTGENPHLQVLVNTIINGGPQEDSTGFGHSHATKLSTTWPKPSRD
LKTLREMASPRQTVPQYWKTMPPKTLHSQQHGCKRASSSHVPSAPPLERPSPSTIQSPCR
LPGAASPSPTPLVVQGENCIF

>gi568815577f:33995199_34197352|GENSCAN_predicted_CDS_5|606_bp
atgcagatcagtgacatttccctgcaggattccattgcagtgaaggagaagtatgccaag
tacctgcctcacaatgcagggccttatgctgccaaacgcttctgcaaagctcagtgcccc
attgtggagtccctcactaactccatgataatgcatggccacaacaacagcaagaagctc
atgactgtgcgcatcatcaagtatgccttccatttcatccacctgctcacaggcgagaac
cctcacctccaggtcctggtgaacaccatcatcaatggtggtccccaggaggactccaca
ggctttggacactcacatgctacaaaactctctactacatggcccaaaccctctagagat
ctgaaaacactcagagaaatggcttccccaagacagactgtgccccaatactggaaaaca
atgccaccaaaaacactccattcacagcagcatggttgtaagagagccagcagctctcat
gtcccttctgcccctcccctggagaggcctagcccctccaccattcagtcaccatgcaga
ctgcccggtgctgccagcccttccccgactcccctagttgtccagggggaaaactgtatt
ttctga

>gi568815577f:33995199_34197352|GENSCAN_predicted_peptide_6|198_aa
MGSFLALSASRLLRTLSIFEDQYFLVDFYAPTAAVESMVEHLSRDIDVIRGNIVKHPLTQ
ELKECEGIVPVPLAEKLYSTKKRKKMGEDEVVSQNSHLHLFLGLLGADWKDLTSPSPFPG
GALHKVSVTITDDGTSAHKMLGEVPVLGISEKQEGAESSGLCFVFNHGKTGGWRDVQEMP
KAPGGPGLPALAQRLLPL

>gi568815577f:33995199_34197352|GENSCAN_predicted_CDS_6|597_bp
atgggctctttccttgcactgtcagccagcaggctgctgaggaccttgtcaatctttgaa
gaccagtatttcttggtggatttttatgcacccaccgcagctgttgaaagcatggtggag
cacttgtctcgagatatagatgtgattagagggaatattgtcaaacaccctctgacccag
gaactaaaagaatgtgaagggattgtcccagtcccactcgcagaaaaattatattccaca
aagaagaggaagaagatgggtgaggatgaggtcgtttcccaaaacagtcacctgcatctc
ttcctcggtctcctcggggctgattggaaagacttgactagcccttcccccttcccaggg
ggtgccctgcacaaggtgtctgtcaccatcacagatgacggaacttctgcacataaaatg
ttgggggaagtgccggttctgggaatctctgaaaagcaggaaggagccgagagttctggc
ctctgctttgtttttaaccatggcaagaccggaggatggagagatgtgcaagagatgccc
aaagcgccaggagggcctggtctcccagctctggcccagcgcctcctccctctgtag

>gi568815577f:33995199_34197352|GENSCAN_predicted_peptide_7|431_aa
MVAWPVLPAPTLGGMFVGKHRNHTAADSCHHYRTLLCYPTSSAFCRKQHPSENATVAPAT
WTHHRAASPRPARAGKYGPRVLHSWLAGSQPHAAAVAFSIHGQGLTGLGAPSLSWDLKRR
KSSNSQVIVSLYYGTLPSDASSKLERVNFPNLKLVGLGLGSGEGNPEAQHAGKRAIKLQA
VMQPEPLTIAPSARDPYHLTVTIATNPIQHKSSSYVRTSPRSSSDRPRRQNEMDLPTLIH
LPEENQLRKWENNTATSTQQCSDRKAWEKSSFLLVGMTIKLHDHFDDAAGEEAWGLALGA
ASEGPRGAGHPEPAPRGIIRTAGIGNIYLQEPVSSLPDLAPQFLVTSHDWLLALQSSACT
CPPHSILCTPPSSSYTVHTLSASMPILSSTRPGPVFVYGSTASTLPLSSPEGGDQSLGPP
PAWAGRPDALL

>gi568815577f:33995199_34197352|GENSCAN_predicted_CDS_7|1296_bp
atggtggcatggcctgtgctccctgctcccacacttggaggaatgtttgttggaaaacat
cgaaaccacactgcagctgattcttgtcaccattaccgcaccctgctgtgctacccgacg
tcaagtgcattctgcaggaagcagcacccctctgaaaatgccactgtggcaccagctaca
tggacccaccacagagcagcatctccccgccctgccagggctgggaagtatggcccaaga
gtcctgcacagctggctggctggttcccagccacatgctgctgccgtagccttctccatc
catggtcaaggtctaacaggtctaggagctccaagtttatcttgggaccttaagaggaga
aaatcatccaactcacaggtaattgtgtctctgtactatgggaccctcccctcagatgca
tcctccaaactggaaagagttaatttcccaaaccttaaactggttggcttaggactgggt
tcaggggaaggaaacccagaagcccaacatgccggcaaaagggccatcaaactccaagcg
gtcatgcaaccagaacctctgacaatagccccttctgccagggacccttaccatctgact
gtcaccattgccacaaacccaattcaacacaaaagcagctcctacgtcagaaccagtcct
cgcagcagttcagataggcctaggaggcaaaatgaaatggatctacccacactgattcat
ttacctgaagagaatcagctcaggaagtgggagaacaacacagccacaagcacgcaacag
tgctcagacaggaaggcttgggaaaaatcaagttttttgctggtgggaatgaccatcaaa
ctgcatgaccattttgatgatgcagcaggcgaggaagcctggggtctggctctaggtgca
gcgtcggaagggcccagaggagcaggtcatccagagccagcccctcgtggcataatcagg
actgcagggattggcaacatctacctccaggagcctgtctccagtctcccagatctggcc
cctcagttccttgtgacatcacacgactggctcctcgcccttcagagctcagcttgcacg
tgccctccccactccatcctctgtactccccccagcagctcctacactgtacatactttg
tctgcctccatgccaattctcagctccacgaggccagggcctgtctttgtctatggctct
actgccagcactttgccattatcatcccctgaaggaggagaccagagcctcggaccacca
ccagcctgggcagggagaccggatgctctactctga

>gi568815577f:33995199_34197352|GENSCAN_predicted_peptide_8|412_aa
XQPTCTPEGQTNKHRQSGFRVNQISSNMILQSEQLSSAPFNTRFLSELGGPAKSDGCLKW
VKVWGGAKKAQYLQSDDLDRETKSSAFRSMKTTHYPIREMPVTEMPVEIGSFWLQAELVI
EVNWRRETMTLTSNLEIRLENSTQETPTLAFKSKSTVWIGSMDLIISLDSGPRGKNLNSR
WCVYIDTYSCLANEVQGHLHRLPAHRTAGPLRPGTRAISVTLFPELGTEEPASQDEQEQA
KGNGSQTSASIRVAWKIYRNRLPKIWGATQELAFLTFLLVENLPSEKDGPIQGRATGLEY
MGQRLHKPQYQDTVSERPRAEMERRSAAETRHLLVTPAQSWLLQQWLVWKMGQKWTLSMI
SATVTYNPSVKGRNFPMQKQQKVNRGNIAMSKGSSTQYLRLCYKATQRVDEK

>gi568815577f:33995199_34197352|GENSCAN_predicted_CDS_8|1239_bp
nnacaaccaacttgcacacctgaaggtcaaacgaacaaacatcgtcagagtggattcagg
gtaaaccaaatctcttctaatatgatactgcagagtgaacaacttagctctgcaccattc
aacacaagatttctgagtgagcttggaggccctgcaaaaagtgacggctgccttaaatgg
gttaaagtttggggaggggccaagaaagcccagtacctccaaagcgatgacctagacaga
gagacaaaaagcagtgcttttcgaagtatgaaaactacccattatcccattagagagatg
ccggttacagagatgccggtggaaatcggaagcttctggcttcaggcagaactagtcatt
gaagtcaactggagaagagaaacgatgactttaacatccaacctagaaatccgccttgaa
aatagcacccaagaaacgccaacgctggcattcaaaagcaaaagcactgtatggattggg
tctatggatctcataatttccttagactctgggccaaggggcaaaaacttaaattcaaga
tggtgtgtgtacatagatacatactcgtgtttagctaatgaagtacaaggtcacctgcac
aggctgcctgcccaccggactgctggccccttgaggccaggaaccagggccatctctgtc
accctcttcccagagcttggcaccgaggagcctgcaagccaggatgagcaggaacaggcc
aagggcaatggttctcaaacttcagccagcatcagagtcgcctggaagatctatcgcaac
agactgccaaagatctggggtgcaactcaagaactggcatttctaaccttcctcctagtg
gaaaatttgccttcagaaaaagatggcccaatccagggaagagccactggcctagagtat
atgggacagaggttgcacaagcctcaataccaggacactgtgtcagagaggcccagggca
gaaatggagagacgcagtgcagctgaaacaaggcaccttctggttacacctgcgcaaagc
tggctgctccagcaatggctggtgtggaagatgggtcagaagtggactctgtctatgatt
tcagcaacggtgacatataatcccagtgttaaaggacggaattttccaatgcagaagcag
cagaaagtgaacagaggaaacattgccatgagcaaaggctcctcaacacagtatctccga
ctgtgctacaaagccacccagcgtgtggatgaaaagtga