GENSCAN 1.0 Date run: 5-Nov-116 Time: 13:22:43 Sequence gi568815586f:118036210_118244800 : 208591 bp : 44.15% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.13 Intr - 301 129 173 1 2 111 72 90 0.626 8.44 1.12 Intr - 2179 2079 101 2 2 107 109 36 0.967 7.43 1.11 Intr - 6763 6632 132 2 0 95 68 94 0.989 8.72 1.10 Intr - 7168 6924 245 0 2 45 96 131 0.558 6.74 1.09 Intr - 16269 16101 169 1 1 65 81 116 0.433 7.60 1.08 Intr - 32388 32168 221 2 2 32 121 301 0.177 25.75 1.07 Intr - 34858 34843 16 2 1 83 109 0 0.177 -3.80 1.06 Intr - 35260 35150 111 2 0 50 116 26 0.715 1.95 1.05 Intr - 37783 37490 294 2 0 59 113 263 0.704 22.88 1.04 Intr - 43397 43137 261 0 0 118 70 118 0.613 10.46 1.03 Intr - 46220 45918 303 0 0 116 36 409 0.644 35.06 1.02 Intr - 59605 59324 282 2 0 63 100 222 0.960 18.29 1.01 Init - 67462 67384 79 1 1 103 82 163 0.991 16.45 1.00 Prom - 68916 68877 40 -6.76 2.00 Prom + 78909 78948 40 -2.16 2.01 Init + 87870 87918 49 2 1 96 89 40 0.495 4.01 2.02 Term + 92267 92883 617 0 2 66 48 126 0.417 1.03 2.03 PlyA + 93022 93027 6 -0.45 3.00 Prom + 93290 93329 40 -2.36 3.01 Init + 100001 100135 135 1 0 95 86 278 0.997 28.44 3.02 Intr + 101830 101939 110 0 2 72 94 29 0.951 0.98 3.03 Intr + 103242 103342 101 0 2 91 92 137 0.994 14.15 3.04 Term + 108377 108594 218 0 2 118 50 324 0.998 28.91 3.05 PlyA + 109352 109357 6 1.05 4.16 PlyA - 113610 113605 6 1.05 4.15 Term - 114949 114788 162 1 0 69 50 148 0.994 7.04 4.14 Intr - 116200 116018 183 1 0 106 116 226 0.999 27.18 4.13 Intr - 124149 123937 213 0 0 88 115 114 0.997 13.01 4.12 Intr - 125818 125579 240 1 0 87 116 309 0.915 31.35 4.11 Intr - 136451 136248 204 2 0 84 53 245 0.960 20.00 4.10 Intr - 141120 140992 129 0 0 51 94 114 0.982 9.19 4.09 Intr - 145398 145162 237 0 0 73 78 521 0.987 47.41 4.08 Intr - 153732 153598 135 0 0 105 99 69 0.998 10.46 4.07 Intr - 163048 162815 234 1 0 90 83 220 0.557 19.59 4.06 Intr - 165254 165087 168 2 0 74 50 113 0.753 6.24 4.05 Intr - 176418 176283 136 2 1 82 40 -6 0.050 -5.43 4.04 Intr - 177901 177808 94 2 1 112 95 67 0.765 8.82 4.03 Intr - 197556 197465 92 2 2 68 92 60 0.781 4.04 4.02 Intr - 199462 199349 114 0 0 112 84 20 0.748 3.56 4.01 Intr - 201960 201864 97 1 1 113 82 31 0.770 4.07 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:118036210_118244800|GENSCAN_predicted_peptide_1|796_aa MAAGGSAPEPRVLVCLGALLAGWVAVGLEAVVIGEVHENVTLHCGNISGLRGQVTWYRNN SEPVFLLSSNSSLRPAEPRFSLVDATSLHIESLSLGDEGIYTCQEILNVTQWFQVWLQVA SGPYQIEVHIVATGTLPNGTLYAARGSQVDFSCNSSSRPPPVVEWWFQALNSSSESFGHN LTVNFFSLLLISPNLQGNYTCLALNQLSKRHRKVTTELLVYYPPPSAPQCWAQMASGSFM LQLTCRWDGGYPDPDFLWIEEPGGVIVGKSKLGVEMLSESQLSDGKKFKCVTSHIVGPES GASCMVQIRGPSLLSEPMKTCFTGGNVTLTCQVSGAYPPAKILWLRNLTQPEVIIQPSSR HLITQDGQNSTLTIHNCSQDLDEGYYICRADSPVGVREMEIWLSVKEPLNIGGIVGTIVS LLLLGLAIISGLLLHYSPVFCWKVGNTSRGQNMDDVMVLVDSEEEEEEEEEEEEDAAVGE QEGAREREELPKEIPKQDHIHRVTALVNGNIEQMGNGFQDLQEEPLLLAELKPGRPHQFD WKSSCETWSVAFSPDGSWFAWSQGHCIVKLIPWPLEEQFIPKGFEAKSRSSKNETKGRGS PKEKTLDCGQIVWGLAFSPWPSPPSRKLWARHHPQVPDVSCLVLATGLNDGQIKIWEVQT GLLLLNLSGHQDVVRDLSFTPSGSLILVSASRDKTLRIWDLNKHGKQIQVLSGHLQWVYC CSISPDCSMLCSAAGEKSVFLWSMRSYTLIRKLEGHQSSVVSCDFSPDSALLVTASYDTN VIMWDPYTGERLRSLH >gi568815586f:118036210_118244800|GENSCAN_predicted_CDS_1|2388_bp atggccgcaggcggcagtgcgcccgagccccgcgtcctcgtctgcctcggggcgctcctg gccggctgggtcgccgtaggattggaggctgttgtcattggagaagttcatgagaatgtt actctgcactgtggcaacatctcgggactgaggggccaggtgacctggtaccggaacaac tcggagcctgtcttccttctctcgtccaactctagcctccggccagctgagcctcgcttc tctctagtggatgccacctccctgcacattgaatcgctgagcctgggagatgagggaatc tacacctgccaggagatcctgaatgtgactcagtggttccaagtgtggctgcaggtggcc agcggcccctatcagattgaggtccacatcgtggccaccggcacactccccaacggcacc ctctacgcagccaggggctcccaggtggacttcagctgcaacagcagctccaggccacca cccgtggttgaatggtggttccaggccctgaattccagcagcgagtcctttggccacaac ctgacagtcaactttttctcactgttactgatatcgccaaacctccaagggaactacacc tgtttagccttgaatcagctcagcaagagacatcgaaaggtgaccaccgagctcctggtc tactatccccctccatcagctccccagtgctgggcacagatggcatcaggatcgttcatg ttgcagcttacctgtcgctgggatgggggataccctgaccctgacttcctgtggatagaa gagccaggaggtgtaatcgtggggaagtcaaagctgggggtggaaatgctgagcgagtcc cagctgtcggatggcaagaagttcaagtgtgttacaagccacatagttgggccagagtcg ggcgccagctgcatggtgcagatcaggggtccctcccttctctctgagcccatgaagact tgcttcactgggggcaatgtgacgcttacatgccaggtgtctggggcctacccccctgcc aagatcctgtggctgaggaaccttacccagcccgaggtgatcatccagcctagcagccgc catctcattacccaggatggccagaactccaccctcactatccacaactgctcccaggac ctggatgagggctactacatctgccgagctgacagccctgtaggggtgagggagatggaa atctggctgagtgtgaaagaacctttaaatatcggggggattgtgggaaccattgtgagc ctccttctgctgggactggccattatctcagggcttctgttgcattatagccctgtgttc tgctggaaagtaggaaacacttccaggggacaaaacatggatgatgtcatggttttggtg gattcagaagaggaagaggaggaggaggaggaggaggaggaagatgctgcagtaggggaa caggagggagcacgtgagagagaggagttgccaaaagaaatacctaagcaggaccacatt cacagagtgaccgccttggtgaatgggaacatagaacagatgggaaatggattccaggat cttcaagaggaaccgctgctgctggccgaactcaagcccgggcgcccccaccagtttgat tggaagtccagctgtgaaacctggagcgtcgccttctccccagatggctcctggtttgct tggtctcaaggacactgcatcgtcaaactgatcccctggccgttggaggagcagttcatc cctaaagggtttgaagccaaaagccgaagtagcaaaaatgagacgaaagggcggggcagc ccaaaagagaagacgctggactgtggtcagattgtctgggggctggccttcagcccgtgg ccttccccacccagcaggaagctctgggcacgccaccacccccaagtgcccgatgtctct tgcctggttcttgctacgggactcaacgatgggcagatcaagatctgggaggtgcagaca gggctcctgcttttgaatctttccggccaccaagatgtcgtgagagatctgagcttcaca cccagtggcagtttgattttggtctccgcgtcacgggataagactcttcgcatctgggac ctgaataaacacggtaaacagattcaagtgttatcgggccacctgcagtgggtttactgc tgttccatctccccagactgcagcatgctgtgctctgcagctggagagaagtcggtcttt ctatggagcatgaggtcctacacgttaattcggaagctagagggccatcaaagcagtgtt gtctcttgtgacttctcccccgactctgccctgcttgtcacggcttcttacgataccaat gtgattatgtgggacccctacaccggcgaaaggctgaggtcactccan >gi568815586f:118036210_118244800|GENSCAN_predicted_peptide_2|221_aa MGFLHVGQAGLELLTSELEKTTLKFIWNQKRACIAKTILSKKNKAGGITLPDFKLYYKAT VTKTAWYWYQNRDIDQWNRTEPSEITPHIYNHLIFDKPIKNKKWGKDFLFNKWCWENWLT ICRKLKLDPFLTPHTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDFMSKTPKAM ATKAKIEKWDLIKELLHSKRNYHQSEQATYRMGENFCHLPI >gi568815586f:118036210_118244800|GENSCAN_predicted_CDS_2|666_bp atggggtttctccatgttggtcaggctggtctcgaactcctgacctcagaattggaaaaa actactttaaagttcatatggaaccaaaaaagagcctgcattgccaagacaatcttaagc aaaaagaacaaagctggaggcatcacgctacctgacttcaaactatactacaaggctaca gtaaccaaaacagcatggtactggtaccaaaacagagatatagaccaatggaacagaaca gagccctcagaaataacaccacacatctacaaccatctgatctttgacaaacctatcaaa aacaagaaatggggaaaggatttcctatttaataaatggtgctgggaaaactggctaacc atatgtagaaagctgaaactggatcccttccttacacctcatacaaaaattaattcaaga tggattaaagacttaaatgttagacctaaaaccataaaaaccctagaagaaaacttaggc aataccattcaggacataggcatgggcaaggacttcatgtctaaaacaccaaaagcaatg gcaacaaaagccaaaatagaaaaatgggatctaattaaagagcttctgcacagcaaaaga aactaccatcagagtgaacaggcaacctacagaatgggagaaaatttttgccatctaccc atctga >gi568815586f:118036210_118244800|GENSCAN_predicted_peptide_3|187_aa MPVDLSKWSGPLSLQEVDEQPQHPLHVTYAGAAVDELGKVLTPTQVKNRPTSISWDGLDS GKLYTLVLTDPDAPSRKDPKYREWHHFLVVNMKGNDISSGTVLSDYVGSGPPKGTGLHRY VWLVYEQDRPLKCDEPILSNRSGDHRGKFKVASFRKKYELRAPVAGTCYQAEWDDYVPKL YEQLSGK >gi568815586f:118036210_118244800|GENSCAN_predicted_CDS_3|564_bp atgccggtggacctcagcaagtggtccgggcccttgagcctgcaagaagtggacgagcag ccgcagcacccgctgcatgtcacctacgccggggcggcggtggacgagctgggcaaagtg ctgacgcccacccaggttaagaatagacccaccagcatttcgtgggatggtcttgattca gggaagctctacaccttggtcctgacagacccggatgctcccagcaggaaggatcccaaa tacagagaatggcatcatttcctggtggtcaacatgaagggcaatgacatcagcagtggc acagtcctctccgattatgtgggctcggggcctcccaagggcacaggcctccaccgctat gtctggctggtttacgagcaggacaggccgctaaagtgtgacgagcccatcctcagcaac cgatctggagaccaccgtggcaaattcaaggtggcgtccttccgtaaaaagtatgagctc agggccccggtggctggcacgtgttaccaggccgagtgggatgactatgtgcccaaactg tacgagcagctgtctgggaagtag >gi568815586f:118036210_118244800|GENSCAN_predicted_peptide_4|812_aa XHKKPLQEVEIAAITHGALHGLAYLHSHALIHRDIKAGNILLTEPGQVKLADFGSASMAS PANSFVGTPYWMAPEVILAMDEGQYDGKVDIWSLGITCIELAERKPPLFNMNAMSALYHI AQNDSPTLQSNECHHCDVPGPNICININPVVSGQSWLVVQTPTLGFSVRLLPNPQAVKHD FVRRDRPLRVLIDLIQRTKDAVRELDNLQYRKMKKILFQETRNGPLNESQEDEEDSEHGT SLNREMDSLGSNHSIPSMSVSTGSQSSSVNSMQEVMDESSSELVMMHDDESTINSSSSVV HKKVGFLVPSTEDHVFIRDEAGHGDPRPEPRPTQSVQSQALHYRNRERFATIKSASLVTR QIHEHEQENELREQMSGYKRMRRQHQKQLIALENKLKAEMDEHRLKLQKEVETHANNSSI ELEKLAKKQVAIIEKEAKVAAADEKKFQQQILAQQKKDLTTFLESQKKQYKICKEKIKEE MNEDHSTPKKEKQERISKHKENLQHTQAEEEAHLLTQQRLYYDKNCRFFKRKIMIKRHEV EQQNIREELNKKRTQKEMEHAMLIRHDESTRELEYRQLHTLQKLRMDLIRLQHQTELENQ LEYNKRRERELHRKHVMELRQQPKNLKAMEMQIKKQFQDTCKVQTKQYKALKNHQLEVTP KNEHKTILKTLKDEQTRKLAILAEQYEQSINEMMASQALRLDEAQEAECQALRLQLQQEM ELLNAYQSKIKMQTEAQHERELQKLEQRVSLRRAHLEQKIEEELAALQKERSERIKNLLE RQEREIETFDMESLRMGFGNLVTLDFPKEDYR >gi568815586f:118036210_118244800|GENSCAN_predicted_CDS_4|2439_bp nttcataaaaaaccacttcaggaagtggagatcgctgccattactcatggagccttgcat ggactagcctacctacattctcatgcattgattcatagggatattaaagcaggaaatatt cttctaacagagccaggtcaggtaaaactagctgattttggatctgcttcaatggcttct cctgccaactccttcgtgggcacaccttactggatggctccagaggtgatcttagctatg gatgaaggacagtatgatgggaaagttgatatttggtcacttggcatcacttgtattgaa ttggcggaacggaagccgccccttttcaacatgaatgcaatgagtgccttatatcacatt gcccagaatgactccccaacgttacagtctaatgaatgtcatcattgtgatgttcctggg cctaacatctgcataaacattaatccagtggtctcaggacagagctggctggtggttcaa acaccaactctgggattttcagttaggttgttgccaaatcctcaggcagtgaagcatgac tttgttcgacgagaccggccactacgtgtcctcattgacctcatacagaggacaaaagat gcagttcgtgagctagataacctacagtaccgaaaaatgaaaaaaatacttttccaagag acacggaatggacccttgaatgagtcacaggaggatgaggaagacagtgaacatggaacc agcctgaacagggaaatggacagcctgggcagcaaccattccattccaagcatgtccgtg agcacaggcagccagagcagcagtgtgaacagcatgcaggaagtcatggacgagagcagt tccgaacttgtcatgatgcacgatgacgaaagcacaatcaattccagctcctccgtcgtg cataagaaagtaggtttcttggtaccctccacagaggatcatgtattcataagggatgag gcgggccacggcgatcccaggcctgagccgcggcctacccagtcagttcagagccaggcc ctccactaccggaacagagagcgctttgccacgatcaaatcagcatctttggttacacga cagatccatgagcatgagcaggagaacgagttgcgggaacagatgtcaggttataagcgg atgcggcgccagcaccagaagcagctgatcgccctggagaacaagctgaaggctgagatg gacgagcaccgcctcaagctacagaaggaggtggagacgcatgccaacaactcgtccatc gagctggagaagctggccaagaagcaagtggctatcatagaaaaggaggcaaaggtagct gcagcagatgagaagaagttccagcaacagatcttggcccagcagaagaaagatttgaca actttcttagaaagtcagaagaagcagtataagatttgtaaggaaaaaataaaagaggaa atgaatgaggaccatagcacacccaagaaagagaagcaagagcggatctccaaacataaa gagaacttgcagcacacacaggctgaagaggaagcccaccttctcactcaacagagactg tactacgacaaaaattgtcgtttcttcaagcggaaaataatgatcaagcggcacgaggtg gagcagcagaacattcgggaggaactaaataaaaagaggacccagaaggagatggagcat gccatgctaatccggcacgacgagtccacccgagagctagagtacaggcagctgcacacg ttacagaagctacgcatggatctgatccgtttacagcaccagacggaactggaaaaccag ctggagtacaataagaggcgagaaagagaactgcacagaaagcatgtcatggaacttcgg caacagccaaaaaacttaaaggccatggaaatgcaaattaaaaaacagtttcaggacact tgcaaagtacagaccaaacagtataaagcactcaagaatcaccagttggaagttactcca aagaatgagcacaaaacaatcttaaagacactgaaagatgagcagacaagaaaacttgcc attttggcagagcagtatgaacagagtataaatgaaatgatggcctctcaagcgttacgg ctagatgaggctcaagaagcagaatgccaggccttgaggctacagctccagcaggaaatg gagctgctcaacgcctaccagagcaaaatcaagatgcaaacagaggcacaacatgaacgt gagctccagaagctagagcagagagtgtctctgcgcagagcacaccttgagcagaagatt gaagaggagctggctgcccttcagaaggaacgcagcgagagaataaagaacctattggaa aggcaagagcgagagattgaaacttttgacatggagagcctcagaatgggatttgggaat ttggttacattagattttcctaaggaggactacagatga