GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:48:04 Sequence gi568815597r:169753716_169988840 : 235125 bp : 37.59% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 Intr - 5254 4949 306 1 0 50 59 199 0.373 8.92 1.02 Intr - 31945 31671 275 2 2 20 16 233 0.033 5.63 1.01 Init - 33910 33775 136 1 1 100 86 42 0.735 5.55 1.00 Prom - 36462 36423 40 -7.75 2.04 PlyA - 38834 38829 6 1.05 2.03 Term - 39952 38862 1091 2 2 50 32 577 0.450 39.56 2.02 Intr - 41426 41189 238 2 1 54 66 139 0.227 4.56 2.01 Init - 43636 43631 6 1 0 97 110 0 0.476 4.13 2.00 Prom - 46502 46463 40 -5.55 3.00 Prom + 47977 48016 40 -7.95 3.01 Init + 48927 49010 84 2 0 66 108 38 0.326 4.47 3.02 Intr + 49454 49594 141 1 0 84 113 27 0.932 4.43 3.03 Intr + 50360 50525 166 1 1 68 95 97 0.860 7.01 3.04 Term + 51562 51674 113 2 2 8 44 131 0.497 -1.56 3.05 PlyA + 54100 54105 6 1.05 4.00 Prom + 63072 63111 40 -7.55 4.01 Init + 66430 66524 95 0 2 80 100 71 0.673 7.40 4.02 Intr + 73336 73462 127 1 1 100 37 86 0.353 4.46 4.03 Intr + 75624 75734 111 2 0 112 107 2 0.409 4.16 4.04 Intr + 83284 83397 114 0 0 85 99 66 0.686 7.12 4.05 Intr + 88702 88824 123 0 0 115 110 20 0.813 6.76 4.06 Intr + 89959 90054 96 0 0 96 110 23 0.958 4.59 4.07 Intr + 93975 94088 114 2 0 81 90 52 0.944 4.42 4.08 Intr + 98102 98221 120 1 0 50 80 136 0.878 8.77 4.09 Term + 99075 99272 198 2 0 82 47 60 0.799 -2.28 4.10 PlyA + 99527 99532 6 1.05 5.17 PlyA - 99596 99591 6 -0.45 5.16 Term - 101249 100534 716 0 2 87 37 605 0.794 47.96 5.15 Intr - 105497 105326 172 2 1 90 116 64 0.998 7.99 5.14 Intr - 109082 108898 185 0 2 55 71 234 0.668 16.99 5.13 Intr - 110793 110654 140 2 2 98 63 66 0.877 4.29 5.12 Intr - 113258 113181 78 1 0 73 97 47 0.687 1.85 5.11 Intr - 116642 116419 224 2 2 66 93 41 0.695 -1.60 5.10 Intr - 122376 122263 114 0 0 119 99 73 0.960 11.22 5.09 Intr - 125104 124919 186 1 0 65 89 125 0.886 9.26 5.08 Intr - 128863 128679 185 2 2 41 27 106 0.535 -1.61 5.07 Intr - 129367 129164 204 2 0 43 40 175 0.634 6.45 5.06 Intr - 129652 129468 185 0 2 53 101 99 0.083 6.21 5.05 Intr - 140286 140073 214 1 1 54 92 79 0.042 1.75 5.04 Intr - 143332 143241 92 0 2 54 60 79 0.103 0.42 5.03 Intr - 144727 144595 133 2 1 44 74 100 0.082 2.98 5.02 Intr - 154076 154002 75 0 0 78 70 75 0.045 3.27 5.01 Init - 161573 161477 97 2 1 99 103 67 0.851 7.98 5.00 Prom - 163533 163494 40 -7.45 6.10 PlyA - 163658 163653 6 1.05 6.09 Term - 168066 167961 106 2 1 101 44 161 0.859 9.90 6.08 Intr - 200395 200296 100 2 1 119 97 75 0.661 9.75 6.07 Intr - 207520 207331 190 1 1 84 59 210 0.742 15.94 6.06 Intr - 218883 218798 86 1 2 30 115 51 0.122 0.62 6.05 Intr - 224468 224370 99 0 0 78 20 90 0.150 0.36 6.04 Intr - 228382 228257 126 2 0 55 54 86 0.258 1.63 6.03 Intr - 229152 228987 166 0 1 54 72 197 0.973 13.31 6.02 Intr - 229667 229555 113 0 2 74 87 65 0.948 4.18 6.01 Intr - 230975 230867 109 2 1 70 86 70 0.493 3.94 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:169753716_169988840|GENSCAN_predicted_peptide_1|239_aa MAMGFCSSGEKIVLTAEYGQVGIYSRGARWVKESVDEKILRGSIKGSRERQADERQNSMA EKERRGASERREAGDDWRGDRLLDSQTPGEYHLPTPSPFQLPIHPTESHLHHSIKLPHSS FKSMCNLILPGCQTRTWMSRLKGVNVQLRPLLQRVQALTKHWWLTCGIEPVGAQESRIEV WEPPRLRFPRMYGNAWMSRQKFAAGVEPSWRTSTRAVQKGNVGSEPPHRVPTGALPGGA >gi568815597r:169753716_169988840|GENSCAN_predicted_CDS_1|717_bp atggcaatgggattttgcagtagtggggagaaaatagtcttaactgcagagtatgggcaa gtgggaatttatagccgaggagcaaggtgggtcaaggagtcagtggatgaaaaaatacta agaggaagcatcaagggctccagagagagacaggcagatgaacggcagaacagcatggca gagaaggagagaagaggagcatctgaacgtcgagaggctggggacgactggagaggagat cggctgctggacagccaaactccaggagaatatcatcttcccactccatcccccttccag ctccccatccatcccactgagagccacctccaccactccataaaactcccacattcatcc ttcaagtccatgtgtaacctgattcttcctggatgccagacaaggacctggatgtctagg ctaaaaggtgtcaacgtacagctcaggccattgcttcagagggttcaagccctgaccaag cattggtggcttacatgtggtattgagcctgtgggtgcacaggagtcaagaattgaagtt tgggaacctccgcggcttagatttccgaggatgtatggaaatgcctggatgtccaggcag aagtttgctgcaggggtagaaccctcatggagaacctctactagggccgtgcagaaggga aatgtggggtcagagcccccacacagagtccccactggggcactgcctggtggtgct >gi568815597r:169753716_169988840|GENSCAN_predicted_peptide_2|444_aa MQGNRASWDRTFLRHTFVKQESFKPDPPKPRPETRSPANPCALSAGSKTIERRAAGFLVS PGPSRPERKSGFSSPWRTRLTEDHLENELTPIRDGALTLDSSKELSVSESQKGEERDRKC SAEQFDLPQDHLWEHKSMENAAPSQDTDSPLSAASSSRNLEPHGKQPSLRAAKEHAMPKD LKKMLENKVIETLPGFQHVKLSVVKTILLKENFPGENIVSKSFSSHSDLITGVYEGGLKI WECTFDLLAYFTKAKVKFAGKKVLDLGCGSGLLGITAFKGGSKEIHFQDYNSMVIDEVTL PNVVANSTLEDEENDVNEPDVKRCRKPKVTQLYKCRFFSGEWSEFCKLVLSSEKLFVKYD LILTSETIYNPDYYSNLHQTFLRLLSKNGRVLLASKAHYFGVGGGVHLFQKFVEERDVFK TRILKIIDEGLKRFIIEITFKFPG >gi568815597r:169753716_169988840|GENSCAN_predicted_CDS_2|1335_bp atgcaggggaaccgcgctagctgggaccgcaccttcctgagacatactttcgtcaaacag gagagcttcaaaccagacccgccaaagcccagaccggaaactcgcagtcccgccaaccct tgcgcgctctccgccggctccaaaaccatagagaggagggcggctggtttcttggtgagc ccgggtccctcaaggccggaaagaaagtcgggcttctctagcccctggaggactcgactc actgaagaccatctggaaaatgaattaacacccattagagatggagctttgaccctggat tcctcaaaagagctgtcagtctcagaaagtcaaaaaggagaagagagggacagaaaatgt tctgcagaacaatttgacttgcctcaggatcacttgtgggaacataagtcaatggaaaat gcagctccctctcaagacacagacagtccactcagtgcagccagcagttcaaggaacttg gagccacatggaaaacagccctccttgagagctgccaaagagcatgctatgcctaaagat ttaaagaagatgttagaaaataaagtcatagaaacattaccaggtttccagcatgttaag ttatcagtagtgaaaaccatcttgttgaaagagaacttccctggagaaaacatagtttca aaaagcttttcttctcactctgatctgattacaggtgtttatgagggaggcttaaaaatc tgggaatgtacctttgacctcctggcttatttcacaaaggccaaagtgaaatttgctggg aaaaaagtcttggatcttggttgtggatcaggtttactaggtataactgcattcaaggga gggtccaaagaaattcactttcaagattataacagtatggtgattgatgaagtaacctta cctaatgtagtagctaactccactttggaagatgaagaaaatgatgtaaatgagccagat gtgaaaagatgcaggaaaccaaaagtaacacaactatataaatgccgatttttttctggt gagtggtctgagttttgtaagcttgtactaagtagtgaaaaactttttgtaaaatatgat ctcattctcacctcagaaaccatttacaacccagattattatagtaatttgcaccagact ttccttagactgttaagtaaaaatggacgtgtacttttggccagcaaagcacattatttt ggtgtaggtggaggtgttcatctctttcagaagtttgtagaagaaagagatgtttttaag accagaatactcaaaataattgatgaaggattgaagaggttcataattgaaataactttt aagtttcctggttaa >gi568815597r:169753716_169988840|GENSCAN_predicted_peptide_3|167_aa MMYELTSQARGLSSQNLEIQTTLRNILQTMVQLLGALTGCVQHICATQESIILENIQSLP SSVLHIIKSTFVHCKNSESVYSGCLHLVSDLLQALFKEAYSLQKQLMELLDMVCMDPLVD DNDDILNMVIAHPCDFSDFIHIWLLAEAEVLEKAMCIIQQVHPISDT >gi568815597r:169753716_169988840|GENSCAN_predicted_CDS_3|504_bp atgatgtatgaattaaccagtcaagccagaggactgtcaagccaaaatttggaaatccag accactctaaggaatattttacaaacaatggtgcagctcttaggagctctcacaggatgt gttcagcatatctgtgccacacaggaatccatcattttggaaaatattcagagtctcccc tcctcagtccttcatataattaaaagcacatttgtgcattgtaagaatagtgaatctgtg tattctgggtgtttacacctagtttcagaccttctccaggctcttttcaaggaggcctat tctcttcaaaagcagttaatggaactgctggacatggtttgcatggaccctttagtagat gacaatgatgatattttgaatatggtaatagctcacccatgtgacttcagtgacttcatt cacatctggctgttggcagaggcagaagtacttgagaaagccatgtgcatcatccagcag gttcaccctatctcagatacctga >gi568815597r:169753716_169988840|GENSCAN_predicted_peptide_4|365_aa MKEWTVDLSNRKDKKGKGENGRKALPVAGWGSKFPPSLYATRISKAHQEEIAGAFLVTLD PLISQLLTFQPFMQGLKSKGKAEVAVTLYQHVCVHLCTFITSFHPSLFAELALPPELREQ TVHEVTTVGTAECRKWLSRSRTLGELESLNTVLSALLAVCNSAGEALDTGKQTAIIEVVS QLWAFLNIKQVADQPYVQQTFSLLLPLLGFFIQTLDPKLILQAVTLQTSLLKLELPDYVR LAMLDFVSSLGKLFIPEAIQTGFVDETEAAKVERVKQEKGIFWEPFANVTVEEAKRSSLQ PYAKRARQEFPWEEEYRSALHTIAGALEATESLLQKGPAPAWLSMEMEALQERMDKLKRY IHTLG >gi568815597r:169753716_169988840|GENSCAN_predicted_CDS_4|1098_bp atgaaagagtggacagttgatctgagtaacagaaaagataagaaagggaaaggagagaat gggagaaaagcattgcctgtggcagggtggggaagcaagtttcctccaagcctttatgct accaggatttctaaagcacaccaagaggaaatagcaggtgctttcctagtgacactggat ccacttatcagtcagctgctcacatttcagcctttcatgcagggattaaagagtaagggg aaagctgaggtggctgtcaccttgtatcagcatgtttgtgttcatctgtgtacatttatt acttcctttcatccctcactgtttgctgaactggcgttacctcctgagcttagggaacaa actgtccatgaggtcaccacagtaggcactgcagaatgcaggaaatggctgagcaggagt cgtactttgggagaactagaatctctgaacacagtactgtctgctttgcttgcagtatgt aattctgctggtgaagctttggatacaggaaaacaaactgcaattatcgaagttgtgagt cagctttgggcttttttaaacattaaacaggtagcagatcaaccttatgttcaacagaca ttcagccttttacttccactgttgggatttttcattcaaactctagatcctaaactgata cttcaggcagtaactttgcagacctcgctacttaaattagagcttcctgactatgttcgt ttggcaatgttggattttgtatcttctttaggaaaactttttatacctgaagctatccag actgggtttgtagatgaaactgaagctgccaaagtggaacgtgtgaaacaggaaaaaggt attttctgggaaccctttgctaatgtgactgtagaagaagcaaagaggtcatctttacag ccttatgcaaaaagagctcgtcaggagttcccctgggaagaagagtacaggtcagcgctg catacaatagcaggggctttggaagcaactgagtcactactccaaaagggtcctgctcca gcctggctttcaatggaaatggaggcgctccaagaaaggatggataagctaaaacgttac atacatactctagggtga >gi568815597r:169753716_169988840|GENSCAN_predicted_peptide_5|999_aa MAPGGCGDSASFCILGLETTLTRSAEKKFTLPGSPLISVDTAILQNSGGPVSAMPPSYIF KMRTEESVREYKISMHKKGRRSSESSSRELRTGTRAIEGSVGTCDHPSGKVGSESDLFDF LTSLSKVNKGEGVGLIGGVGGRKVGLLSRFRTRLYGVGETRPPEDCGCSGHSLTGRQGWL VSTGGPTGYQSQDPRPPPPPPSCRLQTGTGSWPLGTGWHSRRLWAVGTRALPSEFCLLSD QQQYCVRNWWVLGLTDFKNEAADPRGVKPWTFAVSVTAHKGSVDPKSEQQQDLLQRAKGQ SFHNVEADLRGLPLLAGQPAFILLSGPTHILLTDSGAQLASPSGSRTGAAGGAACQSHAM CPHSSALWVVDGTGLRGAGGGAGQGGWGRTGAQGGHLKTLRHPCLLRFLSCTVEADGIHL VTERVQPLEVALETLSSAEVCAGIYDILLALIFLHDRGHLTHNNVCLSSVFVSEDGHWKL GGMETVCKVSQATPESPEFTTLPECHGHARDAFSFGTLVESLLTILNEQGELELYYCIYR ISRKIIHDVFCVEFKGLQSLSTVRNLLLTRNDFLEVVNFLKSLTLKSEEEKTEFFKFLLD RVSCLSEELIASRLVPLLLNQLVFAEPVAVKSFLPYLLGPKKDHAQGETPCLLSPALFQS RVIPVLLQLFEVHEEHVRMVLLSHIEAYVEHFTQEQLKKVILPQVLLGLRDTSDSIVAIT LHSLAVLVSLLGPEVVVGGERTKIFKRTAPSFTKNTDLSLEGDPFSQPIKFPINGLSDVK NTSEDSENFPSSSKKSEEWPDWSEPEEPENQTVNIQIWPREPCDDVKSQCTTLDVEESSW DDCEPSSLDTKVNPGGGITATKPVTSGEQKPIPALLSLTEESMPWKSSLPQKISLVQRGD DADQIEPPKVSSQERPLKVPSELGLGEEFTIQVKKKPVKDPEMDWFADMIPEIKPSAAFL ILPELRTEMVPKKDDVSPVMQFSSKFAAAEITEVSTFME >gi568815597r:169753716_169988840|GENSCAN_predicted_CDS_5|3000_bp atggcccctgggggatgcggggactcggcctccttctgcatcttaggattggaaaccaca ctcacccgttcagctgagaaaaagtttactttgccaggttcccctttgatctcagttgac acagccatccttcagaatagtggaggcccagtgtctgccatgccgccttcttacatattt aaaatgaggactgaggaatcagtgagggaatacaagatttctatgcataagaaaggcaga aggagctcagagagcagctccagggaactgagaactggaaccagagctatagagggctct gtgggaacttgtgaccaccctagcgggaaagtaggctcagaaagtgatctttttgatttc ttaacttctttatcgaaggtgaacaagggagaaggggttggcctcattggtggcgtcgga gggaggaaggtgggccttctgtcccgtttccggacccgtctctatggtgtaggagaaacc cggcccccagaagattgtgggtgtagtggccacagccttacaggcaggcaggggtggttg gtgtcaacaggggggccaacagggtaccagagccaagaccctcggcctcctcccccgccg ccttcctgcaggctgcagactggtactggttcgtggcctctaggaacaggatggcacagc aggaggctgtgggcagtgggcacgcgagcattaccatctgagttctgtctcctgtcagat cagcagcagtattgtgtccggaattggtgggttcttggtctcactgacttcaagaatgaa gccgcggaccctcgcggagtgaagccgtggaccttcgcggtgagtgttacagctcataaa ggcagtgtggacccaaagagtgagcagcaacaagatttattgcaaagagcgaaaggacaa agcttccacaatgtggaagcggacctgagagggttgccactgctggctgggcagcctgct tttattctcttatccggcccgacccacatcctgctgactgactcaggagcccagctggct tcacccagtggatcccgcactggggctgcaggtggagctgcctgccagtcccacgccatg tgcccgcactcctcagccctttgggtggtcgatgggactgggctccgtggagcagggggc ggcgctggtcagggaggctggggtcgcacaggagcccagggagggcatttgaagacactt cgtcacccttgcttgctaagatttttatcttgtactgtggaagcggatggcattcatctt gtcactgagcgagtacagcccctggaagtggctttggaaacattgtcttctgcagaggtc tgtgctgggatctatgacatattgctggctcttatcttccttcatgacagaggacaccta acacacaataatgtctgtttatcatctgtgtttgtgagtgaagatggacactggaagcta ggaggaatggaaactgtttgtaaagtttctcaggccacaccagagtctccagaattcaca actctcccagagtgtcatggacatgcccgggatgccttttcatttggaacattggtggaa agtttgctcacaatcttaaatgaacagggtgagttggagttatactactgcatctataga ataagtaggaaaatcatccatgacgtattctgtgtggaattcaaaggactacagtctttg agcacagtaaggaatcttttgcttaccagaaatgattttctggaagttgtgaatttcttg aaaagtttaacattgaagagtgaagaggagaaaacggaattctttaaatttctgctggac agagtcagctgcttgtcagaggaattgatagcttcaaggttggtgcctcttctgcttaat cagttggtgtttgcagagccagtggctgttaagagttttcttccttatctgcttggcccc aaaaaagatcatgcgcagggagaaactccttgcttgctctcaccagccctgttccagtca cgggtgatccccgtgcttctccagttgtttgaagttcatgaagagcatgtgcggatggtg ctgctgtctcacatcgaggcctacgtggagcacttcactcaggagcagctgaagaaagtc atcttgccacaggttttgctgggcctgcgtgatactagcgattccattgtggcaattact ctgcatagcctagcagtgctggtctctctgcttggaccagaggtggttgtgggaggagaa cgaaccaagatcttcaaacgcactgccccaagttttactaaaaatactgacctttctcta gaaggcgatccattttctcagcctattaaatttcccataaatggactctcagatgtaaaa aatacttcggaggacagtgaaaacttcccatcaagttctaaaaagtctgaggagtggcct gactggagtgaacctgaggagcctgaaaatcaaactgtcaacatacagatttggcctaga gaaccttgtgatgatgtcaagtcccagtgcactaccttggatgtggaagagtcatcttgg gatgactgcgagcccagcagcttagatactaaagtaaacccaggaggtggaatcactgct acaaaacctgttacctcaggggagcagaagcctattcctgctttgctttcactcactgaa gagtctatgccttggaaatcaagcttaccccaaaagattagccttgtacaaaggggggat gacgcagaccaaatcgagccgccaaaagtgtcatcacaagaaaggccccttaaggttcca tcagaacttggtttaggagaggaattcaccattcaagtaaaaaagaagccagtaaaagat cctgagatggattggtttgctgatatgatcccagaaattaagccttctgctgcttttctt atattacctgaactgaggacagaaatggtcccaaaaaaggatgatgtctccccagtgatg cagttttcctcaaaatttgctgcagcagaaattactgaggtgagtacttttatggagtaa >gi568815597r:169753716_169988840|GENSCAN_predicted_peptide_6|364_aa LMKMLFECSDERIDLELISFCINLAANKRNVQLICEGNGLKMLMKRALKFKDPLLMKMIR NISQHDGPTKNLFIDYVGDLAAQISNDEEEEFVIECLGTLANLTIPDLDWELVLKEYKLV PYLKDKLKPGAAEDDLVLEVVIMIGTVSMDDSCAALLAKSGIIPALIELLNAQQEDDEFV CQIIYVFYQMVFHQATRDVIIKETQAPAYLIDLMHDKNNEIRKVCDNTLDIIAEYDEEWA KKIQSEKFRWHNSQWLEMVESRQMDESEQYLYGDDRIEPYIHEGDILERPDLFYNSDGLI ASEGAISPDFFNDYHLQNGDVVGQHSFPGSLGMDGFGQPVGILGRPATAYGFRPDEPYYY GYGS >gi568815597r:169753716_169988840|GENSCAN_predicted_CDS_6|1095_bp ttaatgaagatgctgtttgaatgttcagatgaacgaattgacttggaactcatttctttc tgcattaatcttgctgctaacaaaagaaatgtacagcttatctgtgaaggaaatgggctg aagatgctcatgaagagggctctgaagtttaaggatccattgctgatgaaaatgattaga aacatttctcagcatgatggaccaactaaaaatctgtttattgattatgttggggacctt gcagcccagatctctaatgatgaagaagaggagtttgtgattgaatgtttgggaactctt gcaaacttgaccattccagacttagactgggaattggttcttaaagaatataagttggtt ccatacctcaaggataaactaaaaccaggtgctgcagaagatgatcttgttttagaagtg gttataatgattggaactgtatccatggatgactcttgtgctgcattgctagccaaatct ggcataatccctgcactcattgaattgctaaatgctcaacaagaagatgatgaatttgtg tgtcagataatttatgtcttctaccagatggttttccaccaagccacaagagacgtcata atcaaggaaacacaggctccagcatatctcatagacctaatgcatgataagaataatgaa atccgaaaggtctgtgataatacattagatattatagcggaatatgatgaagaatgggct aagaaaattcagagtgaaaagtttcgctggcataactctcagtggctggagatggtagag agtcgtcagatggatgagagtgagcagtacttgtatggtgatgatcgaattgagccatac attcatgaaggagatattctcgaaagacctgaccttttctacaactcagatggattaatt gcctctgaaggagccataagtcccgatttcttcaatgattaccaccttcaaaatggagat gttgttgggcagcattcatttcctggcagccttggaatggatggctttggccaaccagtt ggcattcttggacgccctgccacagcatatggattccgccctgatgaaccttactactat ggctatggatcttga