GENSCAN 1.0 Date run: 4-Nov-116 Time: 04:36:32 Sequence gi568815581r:55620669_55821070 : 200402 bp : 40.02% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 101 251 151 1 1 57 63 73 0.156 1.96 1.02 Intr + 23495 23598 104 0 2 113 53 95 0.030 7.47 1.03 Intr + 37675 37776 102 0 0 93 106 30 0.417 4.75 1.04 Intr + 44556 44690 135 2 0 92 72 56 0.163 4.24 1.05 Term + 47067 47147 81 2 0 98 36 58 0.159 -1.69 1.06 PlyA + 49270 49275 6 1.05 2.05 PlyA - 50803 50798 6 1.05 2.04 Term - 51976 51563 414 1 0 44 48 171 0.171 2.98 2.03 Intr - 65890 65775 116 2 2 51 62 76 0.001 0.55 2.02 Intr - 81953 81756 198 0 0 76 73 111 0.489 6.80 2.01 Init - 87370 87199 172 1 1 53 0 187 0.127 6.05 2.00 Prom - 96244 96205 40 -2.15 3.02 PlyA - 96294 96289 6 1.05 3.01 Sngl - 100402 99998 405 1 0 74 53 394 0.676 30.53 3.00 Prom - 105157 105118 40 -3.95 4.03 PlyA - 105891 105886 6 1.05 4.02 Term - 108617 108493 125 0 2 74 48 89 0.241 1.07 4.01 Init - 111577 111259 319 1 1 64 72 198 0.286 13.36 4.00 Prom - 114608 114569 40 -6.55 5.00 Prom + 117145 117184 40 -3.05 5.01 Init + 129522 129602 81 2 0 87 67 77 0.911 6.52 5.02 Intr + 130313 130576 264 1 0 76 69 272 0.019 20.89 5.03 Intr + 146667 146784 118 2 1 58 113 90 0.557 7.62 5.04 Intr + 150438 150533 96 1 0 91 73 93 0.964 7.16 5.05 Intr + 150576 150697 122 1 2 105 -20 73 0.020 -2.51 5.06 Intr + 153056 153227 172 1 1 17 93 224 0.018 14.39 5.07 Term + 155882 155991 110 0 2 72 44 91 0.591 0.79 5.08 PlyA + 156685 156690 6 1.05 6.00 Prom + 158442 158481 40 -7.85 6.01 Sngl + 169388 170269 882 1 0 42 43 331 0.963 19.77 6.02 PlyA + 170402 170407 6 1.05 7.00 Prom + 171166 171205 40 -3.65 7.01 Init + 176765 176842 78 1 0 53 99 47 0.858 3.41 7.02 Term + 180225 180950 726 2 0 70 43 232 0.700 9.38 7.03 PlyA + 180974 180979 6 1.05 8.00 Prom + 187724 187763 40 -1.25 8.01 Init + 198098 198143 46 1 1 91 103 37 0.043 6.40 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 130313 130612 300 1 0 76 39 344 0.851 22.54 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:55620669_55821070|GENSCAN_predicted_peptide_1|190_aa MWIRTKLPLGTNHQQTLLSSGFSVWSLPSLGHESRNLLLRVFLLKFGQCPRCCVDEEGLS VQVSGEREKNSWSPGKEMGERKQPQWYLDSPFPQFLSYDLKELSSLKTCTPLSANKAPKD LIRCLVLHSLKKHLLKTEYGPATIADARSAKTETELMLSRAQNPVAPPPRDTSTIIASTM GAPNLGLVTS >gi568815581r:55620669_55821070|GENSCAN_predicted_CDS_1|573_bp atgtggatcaggactaaacttccactgggaacaaatcatcagcagactttgctgtcttct ggctttagtgtttggtctttgccatctcttggccatgaaagtaggaacctacttctaaga gtcttcttgctgaagtttggtcagtgtcccaggtgctgtgtggatgaagagggtttatca gtacaggtgagtggtgaacgtgaaaagaacagctggagccctggtaaggagatgggggag aggaagcagccacagtggtacttagattctcctttccctcaatttctgtcctatgatctg aaggaattaagttcactgaaaacttgcacacccctttcagcaaataaagccccaaaagat ctcattaggtgtctggtacttcactctctcaagaaacacttattaaaaaccgagtatggt ccagccacgatagcagatgccaggagtgcaaagacagagacagaattgatgctttcaaga gcccagaatccagtggcaccgccaccacgggacacctccaccatcattgctagtacaatg ggagccccaaaccttggtttggtcacctcctaa >gi568815581r:55620669_55821070|GENSCAN_predicted_peptide_2|299_aa MQSGEMFLIDESGGMKDFLEEIASDMGLEAGAEFQLVKLGLRERKFLQEEEKAKEKGKFI PSRGFLVSLTSRINLQTVAMSVTVLKDGVPRVCSFTCSDVSRVSSSRWVPGLADFRNEAT DLCIHFHTADKDIPETGQFTIERDLMDLQFHIAEEDSQSWQKLLQFQLWLKGAQVQLRLL LQRVQVSSLGSFHVVFVLRVRRRQELRFGNLHLDFKGCMETPGCLGRSLLQGQSPQGESL LGQCGRKVLGLEPTNRVPTGALPGRAMRRGPLSSRPQNGRSTNSLHCVPGKAADTQCQL >gi568815581r:55620669_55821070|GENSCAN_predicted_CDS_2|900_bp atgcaaagtggagaaatgtttcttatagatgagagtggaggaatgaaagacttcttggaa gaaatagcatctgacatgggccttgaagctggggcagaatttcagttggtaaaattggga ttaagagaaagaaaatttcttcaagaagaagaaaaggctaaagaaaaaggaaaattcatt ccttcccgtgggttcttggtctcacttacttcaagaatcaatctgcagaccgtcgcgatg agtgttacagttcttaaagatggtgtgcccagagtttgttccttcacatgttcagatgtg tccagagtttcttcttcccggtgggttcctggtctcgctgacttcaggaatgaagccaca gacctttgcatccattttcacaccgctgataaagatatacctgagactgggcagtttaca atagaaagagatttaatggacttacagtttcacatagctgaggaggactctcaatcatgg cagaagctgcttcagttccagctgtggctaaaaggggctcaggtacagcttaggctgttg cttcagagggtgcaagtctctagccttggcagcttccacgtggtgtttgtcctgagggta cgcagaagacaagaattgaggtttgggaacctccacctagatttcaaaggatgtatggaa acacctggatgtctaggcagaagcctgctgcaggggcagagccctcagggagaatctctg ctagggcagtgtggaaggaaagtgttggggttggagcccacaaacagagttcccactggg gcactgcctggcagagctatgagaagagggccattgtcctccagaccccagaatggtaga tccaccaacagcttgcactgtgtgcctggaaaagctgcagacactcaatgccagctgtga >gi568815581r:55620669_55821070|GENSCAN_predicted_peptide_3|134_aa MTEEPIKEILGAPKAHMAATMEKSPKSEVVITTVPLVSEIQLMAATGGTELSCYRCIIPF AVVVFIAGIVVTAVAYSFNSHGSIISIFGLVVLSSGLFLLASSALCWKVRQRSKKAKRRE SQTALVANQRSLFA >gi568815581r:55620669_55821070|GENSCAN_predicted_CDS_3|405_bp atgactgaagagcccatcaaggagatcctgggagccccaaaggctcacatggcagcgacg atggagaagagccccaagagtgaagttgtgatcaccacagtccctctggtcagtgagatt cagttgatggctgctacagggggtaccgagctctcctgctaccgctgcatcatccccttt gctgtggttgtcttcatcgccggcatcgtggtcaccgcggtggcttacagcttcaattcc catgggtctattatctccatctttggcctggttgttctgtcatctggactttttttacta gcctccagtgccttgtgctggaaagtgagacaaaggagcaagaaagccaagagacgggag agtcaaacagctctcgtggcaaatcagagaagcttgtttgcttga >gi568815581r:55620669_55821070|GENSCAN_predicted_peptide_4|147_aa MEVTQQHSSFFLDSVGWEGRCSKEMHSPCDQSTPSWQPQGSPVASTKPVKVLCPKALLVP RYMLPVPREIHPWAFLSVFPKVGCSCVLSHKTVNRGLRLTSCQSQQVISYLSAFANIRCP LPLTWTITEASQLVSQPAFPLPHIDFP >gi568815581r:55620669_55821070|GENSCAN_predicted_CDS_4|444_bp atggaggtcacacagcagcacagcagcttctttctggattccgtgggctgggaagggagg tgctctaaggaaatgcacagtccatgtgatcagagcacaccctcttggcagcctcagggg agcccagttgcttctacaaaacccgtgaaagttctctgtccaaaagccttgttggtcccg cggtacatgcttcctgttcccagagagattcacccttgggctttcctatcagtcttccct aaagttggctgctcctgtgtcctgtcacataaaactgtgaaccgaggtctccgacttacg tcatgtcagtcacagcaggtgatttcctacctgtccgcgtttgccaatatcaggtgccca ttgcctcttacctggacaatcacagaggcatcccaactggtgtctcaacctgctttccct cttccccatattgactttccctag >gi568815581r:55620669_55821070|GENSCAN_predicted_peptide_5|320_aa MSFQSLLALTWLLHRFDVQTTYFIQSQFPANDAGAAGAAGAARSPRPQAHTKGVRGLPSR RRSPDCGRMELAAGSFSEEQFWEACAELQQPALAGADWQLLVETSGISIYRLLDKKTGLY EYKVFGVLEDCSPTLLADIYMDSDYRKQWDQYVKELYEQECNGETVVYWEVKYPFPMSNR DVSFPQAAVYCREVELVSQQAQITLSLILAAAFAKDIASCMVFGELRYVYLRQRRDLDME GRKIHVILARSTSMPQLGERSGVIRVKQYKQSLAIESDGKKGSKAWEGLLEVAFRSELTF SAPKLDDVPMSICLMLCGAD >gi568815581r:55620669_55821070|GENSCAN_predicted_CDS_5|963_bp atgagtttccagagtttattggcactcacgtggctgttacatcgttttgatgttcaaacg acctactttattcaaagtcagtttcctgccaatgacgctggggcagccggggcagccggg gcagcccggtcaccccgcccccaggcccacactaagggtgtccgcggcctgccctccagg cggaggagcccggactgcggaaggatggagctggccgccggaagcttctcggaggagcag ttctgggaggcctgcgccgagctccagcagcccgctctggccggggccgactggcagctc ctagtggagacctcgggcatcagcatctaccggctgctggacaagaagactggactttat gagtataaagtctttggtgttctggaggactgctcaccaactctactggcagacatctat atggactcagattacagaaaacaatgggaccagtatgttaaagaactctatgaacaagaa tgcaacggagagactgtggtctactgggaagtgaagtacccttttcccatgtccaacaga gacgtatcctttccacaagctgcagtctattgcagagaggtagagctggtttctcagcag gcacagataacattgtctctgattcttgcagcagcatttgctaaagatattgcttcttgt atggtgtttggagagctcagatatgtctaccttcggcagcggcgagacctggacatggaa gggaggaagatccatgtgatcctggcccggagcacctccatgcctcagcttggcgagagg tctggggtgatccgggtgaagcaatacaagcagagcctggcgatcgagagtgacggcaag aaggggagcaaagcttgggaggggctgctggaagtggcatttcgttcagagctgactttc agtgcacccaaactggatgacgtgccaatgtccatttgccttatgctttgtggagctgat tag >gi568815581r:55620669_55821070|GENSCAN_predicted_peptide_6|293_aa MIISIDAEKALDKIQQPFMLKTLNKLGIDGMYLKIIRAICDKPTANIILNGQKLEAFPLK TGTRQECPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKAEVKLSLFANDMIVYLENPIVS AQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLT RDVKDLFKENYKPLLEEIKEDTNKWKNIPCTWVGRINIVKMAILPKVIYRFNAIPIKLPM TFFTELEKTTLKFIWNQKRARITKSILSQKNKAGGITLPDFKLYYKATVTKTA >gi568815581r:55620669_55821070|GENSCAN_predicted_CDS_6|882_bp atgattatctcaatagatgcagaaaaggccttggacaaaattcagcaacccttcatgcta aaaactctcaataaattaggtattgatggcatgtatctcaaaataataagagctatctgt gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa actggcacaagacaggaatgccctctctcaccactcctattcaacatagtgttggaagtt ctggccagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaagcagaa gtcaaattgtccctgtttgcaaacgacatgattgtatatctagaaaaccccattgtctca gcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaat gtacaaaaatcacaagcattcttatacaccaacaacagacaaacagagagccaaatcatg agtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttaca agggatgtgaaggacctcttcaaggagaactacaaaccactgctcgaggaaataaaagag gatacaaacaaatggaagaacattccatgcacatgggtaggaagaatcaatatcgtgaaa atggccatactgcccaaggtaatttacagattcaatgccatccccatcaagctaccaatg actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagggcc cgcatcaccaagtcaatcctaagccaaaagaacaaagctggaggcatcacgctacctgac ttcaaactgtactacaaggctacagtaaccaaaacagcatga >gi568815581r:55620669_55821070|GENSCAN_predicted_peptide_7|267_aa MWNLEPAKDKCLVNTTGFLLVPGLILVCWSLLEIHSRPCLPGYHQWSLQNSKDYCLFLPL EASSQRGTHQIPAGTLLYEVSVDPCWEVSPSLEAQGSGTYLMHYIMVKGSKQQEELTNLN IYAPNTGAPRCIKQILRELQRDLDSHTMTVGDCNTPLSILDRSVWQKINNDIQDLNSALD QVDLIDIYKTLHSKSTEYTFFSAPHHTYSKTDRIIGSKTLLSKCKRTEIITNSLSDHSAI KLELRIKKLTQKHRTTWKLNNLLLNDF >gi568815581r:55620669_55821070|GENSCAN_predicted_CDS_7|804_bp atgtggaatttagaacctgccaaagataaatgtctggtaaataccacaggatttctgttg gtaccagggttaatactggtctgctggagtttgctggagatccactccagaccctgtttg cctgggtatcaccagtggagtctgcagaatagcaaagattactgcctgttccttcctctg gaagcttcgtcccagaggggcacccaccagataccagctggaactctcctgtatgaagtg tctgttgacccctgctgggaggtgtctcccagtctggaggcacagggatcagggacctac ttgatgcattacataatggtaaaaggatcaaagcaacaagaagagctaactaacctaaat atatatgcacccaatacaggagcacccagatgtataaagcaaattcttagagaactgcaa agagacttagactcccacacaatgacagtgggagactgtaataccccactgtcaatatta gacagatcagtgtggcagaaaattaataatgatatccaggacttgaactcagctctggac caagtggacctaatagacatctacaaaactctccactccaaatcaacagaatatacattc ttctcagcaccacatcacacttattctaaaactgaccgcataattggaagtaaaacactc ctcagcaaatgtaaaagaacggaaatcataacaaacagtctctcagaccacagtgcaatc aaattagaactcaggattaagaaactcactcaaaagcacagaactacatggaaactgaac aacctgctcctgaatgacttctag >gi568815581r:55620669_55821070|GENSCAN_predicted_peptide_8|16_aa MKKSDRKPQTPKLEDX >gi568815581r:55620669_55821070|GENSCAN_predicted_CDS_8|48_bp atgaagaagagtgacaggaagccacagaccccaaaactagaagatgnn