GENSCAN 1.0 Date run: 6-Nov-116 Time: 08:00:19 Sequence gi568815594r:67640483_67854335 : 213853 bp : 36.82% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 13089 13427 339 2 0 88 37 327 0.984 23.38 1.02 PlyA + 13473 13478 6 1.05 2.00 Prom + 14516 14555 40 -6.15 2.01 Sngl + 16035 16790 756 2 0 24 38 282 0.961 12.69 2.02 PlyA + 16966 16971 6 1.05 3.10 PlyA - 17439 17434 6 1.05 3.09 Term - 20425 20124 302 2 2 81 37 156 0.716 4.20 3.08 Intr - 23465 23348 118 2 1 101 64 70 0.726 5.02 3.07 Intr - 24810 24707 104 1 2 106 78 44 0.991 4.17 3.06 Intr - 28192 28069 124 1 1 48 114 73 0.991 5.14 3.05 Intr - 33295 33215 81 1 0 68 86 81 0.950 4.82 3.04 Intr - 37240 37129 112 0 1 72 92 62 0.999 4.46 3.03 Intr - 38051 37957 95 2 2 74 103 83 0.998 6.24 3.02 Intr - 41109 41081 29 1 2 108 115 12 0.985 2.82 3.01 Init - 41700 41637 64 0 1 74 99 1 0.457 1.26 3.00 Prom - 42922 42883 40 -5.95 4.00 Prom + 46936 46975 40 -4.45 4.01 Init + 58626 58681 56 2 2 87 92 39 0.830 5.01 4.02 Intr + 60486 60671 186 0 0 60 97 213 0.809 17.28 4.03 Term + 60781 60886 106 1 1 41 37 178 0.796 4.90 4.04 PlyA + 61032 61037 6 1.05 5.03 PlyA - 62507 62502 6 1.05 5.02 Term - 76411 76066 346 0 1 22 43 404 0.973 22.28 5.01 Init - 76906 76509 398 1 2 84 63 383 0.965 29.62 5.00 Prom - 79020 78981 40 -5.55 6.04 PlyA - 79201 79196 6 -1.75 6.03 Term - 80228 79603 626 0 2 58 37 483 0.430 33.76 6.02 Intr - 85881 85754 128 2 2 -24 9 154 0.030 -4.30 6.01 Init - 90818 90694 125 2 2 71 60 100 0.378 5.19 6.00 Prom - 90872 90833 40 -5.65 7.04 PlyA - 90985 90980 6 1.05 7.03 Term - 100242 99998 245 1 2 104 38 127 0.823 4.38 7.02 Intr - 104305 104086 220 1 1 34 92 64 0.020 -1.65 7.01 Init - 113853 113332 522 0 0 61 115 301 0.989 25.00 7.00 Prom - 125138 125099 40 -5.05 8.08 PlyA - 125211 125206 6 1.05 8.07 Term - 126167 126039 129 2 0 90 44 160 0.938 9.00 8.06 Intr - 130202 130086 117 2 0 18 111 73 0.194 2.34 8.05 Intr - 182016 181954 63 0 0 107 66 45 0.048 2.20 8.04 Intr - 185392 185250 143 2 2 53 115 47 0.961 3.05 8.03 Intr - 187038 186779 260 2 2 48 119 123 0.797 7.58 8.02 Intr - 202143 202076 68 0 2 77 105 32 0.839 0.58 8.01 Intr - 213704 213586 119 0 2 128 119 25 0.985 8.86 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 197847 197690 158 2 2 65 103 107 0.920 8.63 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:67640483_67854335|GENSCAN_predicted_peptide_1|112_aa MGRNQSRKAKNSKNQSASSPPKDWRSSPATEQNWMENDFDELTEVGFRRLVITNFSELKE HVLTHCKEAKNLEKGLEELLTRINSVEKTLNDLMELKTTARELHDACTSFNS >gi568815594r:67640483_67854335|GENSCAN_predicted_CDS_1|339_bp atggggagaaaccagagcagaaaagccaaaaattctaaaaaccagagtgcctcttctcct ccaaaggattggagatcctcgccagcaacggaacaaaactggatggagaatgactttgat gagctgacagaagtaggcttcagaaggttggtaataacaaacttctccgagctaaaggag catgttctaacccattgcaaggaagctaaaaaccttgaaaaagggttagaagaattgcta actagaataaacagtgtagagaagaccttaaatgacctgatggagctgaaaaccacagca cgagaacttcacgatgcatgcacaagcttcaatagctga >gi568815594r:67640483_67854335|GENSCAN_predicted_peptide_2|251_aa MKRYLKIIRAIYDKPTANIILNGQKLEAFPWKTGTRQGCPLSPLLFNIVLEVLARAIRQE KEIKGIQLGKEEVKLSLFADDLIVYLENPNISAQNLLKLISNFSKVSGYKINVQKSQAFL YTNYRQRDSQTVSELSFTIATKRIKYLGIQLTRNVKDLLKEKYNPLLNEIKEDTNKWKNI PCSWIGRINIIKTPILPKVIYRLNAIPIKLPMTFFTEVEKNYFKVHMEPQKSLHSQDNPK QREQSWKHHTI >gi568815594r:67640483_67854335|GENSCAN_predicted_CDS_2|756_bp atgaaacgttacctcaaaataataagagctatttatgacaaacccacagccaatatcata ctgaatgggcaaaaactggaagcattcccttggaaaactggtacaagacaaggatgccct ctctcaccactcctattcaacatagtcttggaagttctcgccagggcaatcaggcaagag aaagaaataaagggtattcaattaggaaaagaggaagtcaaattgtctctgtttgcagat gacctgattgtatatttagaaaaccccaacatctcagcccaaaatctccttaagctgata agcaacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattccta tacaccaattatagacaaagagatagccaaactgtgagtgaactttcattcacaattgct acaaagagaataaaatacctaggaatccaacttacaaggaatgtgaaggacctcctcaag gagaagtacaacccactgctcaacgaaataaaagaggacacaaacaaatggaagaacatt ccatgctcatggataggaagaatcaatatcattaaaacgcccatactgcccaaggtaatt tatagactcaatgccatccccatcaagctaccaatgactttcttcacagaagtggaaaaa aactactttaaagttcatatggaaccacaaaagagcctgcatagccaagacaatcctaag caaagagaacaaagctggaagcatcatactatctga >gi568815594r:67640483_67854335|GENSCAN_predicted_peptide_3|342_aa MQKMAKSHVFLSGMGGLGLEIAKNLVLAGIKAVTIHDTEKCQAWDLGTNFFLSEDDVVNK RNRAEAVLKHIAELNPYVHVTSSSVPFNETTDLSFLDKYQCVVLTEMKLPLQKKINDFCR SQCPPIKANPGIVTCLENHPHKLETGQFLTFREINGMTGLNGSIQQITVISPFSFSIGDT TELEPYLHGGIAVQVKTPKTVFFESLERQLKHPKCLIVDFSNPEVNKHFAGLREAAESEM RISHADARGGFPWSWAASPCGYARYSPPSDCFHELALSVAAFPDSWSKLSLDLPFRDLED CGTLLTALLGSANGDSLWGLQPHIILLHCLSKKFSMRAPPLQ >gi568815594r:67640483_67854335|GENSCAN_predicted_CDS_3|1029_bp atgcagaagatggccaagtcccatgttttcttaagtgggatgggtggtcttggtttggaa attgcaaagaatcttgttcttgcagggattaaggcagttacaattcatgatacagaaaaa tgccaagcatgggatctaggaaccaacttctttctcagtgaagatgatgttgttaataag agaaacagggctgaagctgtacttaaacatattgcagaactaaatccatacgttcatgtc acatcatcttctgttcctttcaatgagaccacagatctctcctttttagataaataccag tgtgtagtattgactgagatgaaacttccattgcagaagaagatcaatgacttttgccgt tctcagtgccctccaattaaggcaaatcctggcattgttacttgccttgaaaatcatcct cacaaactggagacaggacaattcctaacatttcgagaaattaatggaatgacaggttta aatggatctatacaacaaataacggtgatatcgccattttcttttagtattggtgacacc acagaactggaaccatatttacatggaggcatagctgtccaagttaagactcctaaaaca gttttttttgaatcactggagaggcagttaaaacatccaaagtgccttattgtggatttt agcaaccctgaggtaaataaacactttgcaggtttgagagaggcagcagaatcagagatg agaattagtcacgctgatgcaagaggtgggttcccatggtcttgggcagcttcaccctgt ggttatgcaaggtacagccccccttccgactgctttcatgagttggcattgagtgttgca gcttttccggattcatggtccaagctttcgttggatctaccattccgggatctggaggac tgtggcactcttctcacagctctactaggcagtgccaatggggactctctgtgggggctc caaccccacattatccttctgcactgccttagcaagaagttctccatgagggccccaccc ctgcagtaa >gi568815594r:67640483_67854335|GENSCAN_predicted_peptide_4|115_aa MAFKKRICTKHWQAEIIPSPPERKKQKPGAWVPPATPHLRPSRPLTCQSPRKNRTPLPPD GRPQARILPLPPETPPPATGRGVLVSGLGVAAWRAVVQVAEGGVTVATGIRIRQM >gi568815594r:67640483_67854335|GENSCAN_predicted_CDS_4|348_bp atggccttcaaaaaacgtatctgtactaaacactggcaagcagagataattcccagcccg ccggaaagaaagaagcagaaacccggagcctgggtcccacccgcgacccctcaccttcgc ccttctcgtcctctcacctgccagtcccccaggaagaacaggacgcctcttccccctgat gggcggccacaggctcggatccttccattgccgcctgagacaccgccgccggctactgga aggggcgtcctggtttccggtttgggtgtggccgcatggcgtgctgtggtgcaggtggcc gaagggggcgttactgttgcgactggcatccgcatccggcagatgtag >gi568815594r:67640483_67854335|GENSCAN_predicted_peptide_5|247_aa MVTLRKRTLKVLTFLVLFIFLTSFLNYSHAMVATTWFPKKMVLELLENLKRLIKHRPCTC THCIRQHGLSAWFDERFNQIVQLLLTAQNALLEDNTYQWWLRLQQEKKPNIINNTIKEFR AVPGNVDPMLEKRMNKAPTAGFEAAAGSKTAHHLVYPESFRELGDNVSMVLVPLKTMNLE WVVSTTTTGAISHTYTPVLVKIRVKQDKILIYHPAFIKYVFDNWLQSHRRYPLTSILSVI FSMHVCD >gi568815594r:67640483_67854335|GENSCAN_predicted_CDS_5|744_bp atggtgaccctgcggaagaggaccctgaaagtgctcaccttcctcgtgctcttcatcttc ctcacctccttcctgaactactcccacgccatggtggccaccacctggttccccaagaag atggtcctggagctcttggagaacctgaagagactgatcaagcacaggccctgcacttgc acccactgcatcaggcagcatgggctctcagcctggttcgatgagaggttcaaccagata gtgcagctgctgctgactgcccagaacgcgctcttggaggacaacacctaccaatggtgg ctgaggctccagcaggagaagaagcccaatatcatcaacaataccatcaaggaattcaga gcagtacctgggaatgtggacccaatgctggagaagaggatgaacaaggcacccacggca gggtttgaagctgctgccgggagcaaaaccgcccaccatctggtgtaccctgagagcttc cgggagctgggggacaatgtcagcatggtcctggtgcccttaaagaccatgaacttggag tgggtggtgagcaccaccaccacgggtgccatttcccacacctacaccccggtcctcgtg aagatcagagtgaaacaggataagatcctgatctaccacccagccttcatcaagtatgtc ttcgacaactggctgcagagccacaggcggtacccactcaccagcatcctctcggtcatc ttctcaatgcatgtctgcgattag >gi568815594r:67640483_67854335|GENSCAN_predicted_peptide_6|292_aa MGKDFMTKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNSPNPMPNNKSILREDDSFN TFFNGTIVVKHGSTKLFVDLEPIRAAHLKPSSLNLKVIAMDLAQMHACQGPGNDLYIEAS AALCAGSDFSVSGGLQWVQLVAHGSAGDDNGWLRCHRPPWQGLGDNELDGCSGEVNVSQD FCGKYTDVVNHITQRLDYVLCRTYAAQRTCHSGSVVKEVPQHLVPTPNAYLQPARDLQMC LKLFSILSKVLLRPKDTITSQRQFHSYPETVIKDICKGMWEDSHCVLPLGAP >gi568815594r:67640483_67854335|GENSCAN_predicted_CDS_6|879_bp atgggcaaggatttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaatagac aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg aacagcccaaatccaatgccaaacaataagagcattctgagagaagatgactcctttaat accttcttcaacgggactattgtggtcaagcatggctccacaaaattgtttgtggaccta gagccaatacgtgctgcccatctgaagccctctagcctcaacctgaaggttattgccatg gatctggcccagatgcacgcctgtcaaggacctggaaatgacctttacattgaagcatct gctgctctgtgtgcaggctctgatttcagtgtgtcaggaggactgcagtgggtccagctt gtagctcatggaagtgctggtgatgataatggctggctcaggtgccacaggcctccatgg caaggtttaggagacaatgaactcgatggttgcagtggagaagtcaatgtcagccaggat ttctgtgggaaatacacagatgtggtcaaccacatcacacaacgactggactatgtactc tgcagaacttatgcagctcaacgtacttgtcactcaggctctgttgtaaaagaagtgcca caacacttggtccccacacccaatgcctatctccagccagccagagacctgcagatgtgt ctaaaacttttttccatcctgtccaaagtgctgctgaggccaaaggacaccatcacctcc caaaggcagtttcacagctaccctgagacagtgataaaggacatctgcaaaggcatgtgg gaggacagccattgtgtcctgcctctgggtgctccctag >gi568815594r:67640483_67854335|GENSCAN_predicted_peptide_7|328_aa MANSASPEQNQNHCSAINNSIPLMQGNLPTLTLSGKIRVTVTFFLFLLSATFNASFLLKL QKWTQKKEKGKKLSRMKLLLKHLTLANLLETLIVMPLDGMWNITVQWYAGELLCKVLSYL KLFSMYAPAFMMVVISLDRSLAITRPLALKSNSKVGQSMVGLAWILSSVFAGPQLYIFRM IHLADSSGQTKVFSQCVTHCSFSQWWHQAFYNFFTFSCLFIIPLFIMLICNAKIIFTLTR VLHQDPHELQLNQSKNNIPRARLKTLKMTVAFATSFTVCWTPYYVLGIWYWFDPEMLNRL SDPVNHFFFLFAFLNPCFDPLIYGYFSL >gi568815594r:67640483_67854335|GENSCAN_predicted_CDS_7|987_bp atggcaaacagtgcctctcctgaacagaatcaaaatcactgttcagccatcaacaacagc atcccactgatgcagggcaacctccccactctgaccttgtctggaaagatccgagtgacg gttactttcttcctttttctgctctctgcgacctttaatgcttctttcttgttgaaactt cagaagtggacacagaagaaagagaaagggaaaaagctctcaagaatgaagctgctctta aaacatctgaccttagccaacctgttggagactctgattgtcatgccactggatgggatg tggaacattacagtccaatggtatgctggagagttactctgcaaagttctcagttatcta aagcttttctccatgtatgccccagccttcatgatggtggtgatcagcctggaccgctcc ctggctatcacgaggcccctagctttgaaaagcaacagcaaagtcggacagtccatggtt ggcctggcctggatcctcagtagtgtctttgcaggaccacagttatacatcttcaggatg attcatctagcagacagctctggacagacaaaagttttctctcaatgtgtaacacactgc agtttttcacaatggtggcatcaagcattttataactttttcaccttcagctgcctcttc atcatccctcttttcatcatgctgatctgcaatgcaaaaatcatcttcaccctgacacgg gtccttcatcaggacccccacgaactacaactgaatcagtccaagaacaatataccaaga gcacggctgaagactctaaaaatgacggttgcatttgccacttcatttactgtctgctgg actccctactatgtcctaggaatttggtattggtttgatcctgaaatgttaaacaggttg tcagacccagtaaatcacttcttctttctctttgcctttttaaacccatgctttgatcca cttatctatggatatttttctctgtga >gi568815594r:67640483_67854335|GENSCAN_predicted_peptide_8|299_aa XQKSYFYRSSFQLLNVEYNSQLNSPATQEYRTLSGRIESLITKTFKESNLRNQFIRAHVA KLSNSNPRDWIATSGISTTFPKLRMRVRNILIHNNYKSATHENDIALVRLENSVTFTKDI HSVCLPAATQNIPPGSTAYVTGWGAQEYAGHTVPELRQGQVRIISNDVCNAPHSYNGAIL SGMLCAGVPQGGVDACQGDSGGPLVQEDSRRLWFIVGIGLVKIIDNRTCNNGEADGRVIT SGMLCAGFLEPRVDACQGDSGGPLVGTDSKGILAKGSLLVLKAGEMNVLFQTSLVSTLK >gi568815594r:67640483_67854335|GENSCAN_predicted_CDS_8|900_bp natcaaaaatcttacttttataggagcagttttcaactcctaaatgttgaatataatagt cagttaaattcaccagctacacaggaatacaggactttgagtggaagaattgaatctctg attactaaaacattcaaagaatcaaatttaagaaatcagttcatcagagctcatgttgcc aaactgagcaactctaatcctcgtgactggattgccacgtctggtatttccacaacattt cctaaactaagaatgagagtaagaaatattttaattcataacaattataaatctgcaact catgaaaatgacattgcacttgtgagacttgagaacagtgtcacctttaccaaagatatc catagtgtgtgtctcccagctgctacccagaatattccacctggctctactgcttatgta acaggatggggcgctcaagaatatgctggccacacagttccagagctaaggcaaggacag gtcagaataataagtaatgatgtatgtaatgcaccacatagttataatggagccatcttg tctggaatgctgtgtgctggagtacctcaaggtggagtggacgcatgtcagggtgactct ggtggcccactagtacaagaagactcacggcggctttggtttattgtggggataggatta gtgaagattatagataataggacctgcaacaatggggaggcagatggcagagtcatcaca tctggaatgttgtgtgccgggttcctggagccacgtgtggatgcctgccagggtgactct ggtggaccactggttggtacagattctaaaggcatccttgctaaaggttccctgctggta ttgaaagctggagaaatgaacgtgctcttccaaacaagcctagtgtctacactcaagtga