GENSCAN 1.0 Date run: 8-Nov-116 Time: 00:46:57

Sequence gi568815586r:11085996_11286937 : 200942 bp : 37.18% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   6298   6455  158  1  2   67   96   145 0.146  11.93
 1.02 Term +  19366  19568  203  2  2   45   44   128 0.270   0.67
 1.03 PlyA +  21382  21387    6                               1.05

 2.00 Prom +  21482  21521   40                              -8.05
 2.01 Sngl +  25625  26830 1206  1  0   62   43   428 0.992  31.27
 2.02 PlyA +  27067  27072    6                               1.05

 3.00 Prom +  28185  28224   40                              -3.45
 3.01 Init +  28312  28515  204  0  0   66   72    99 0.530   5.00
 3.02 Intr +  43579  43677   99  0  0   57   40    98 0.200   1.29
 3.03 Term +  48518  48706  189  1  0  112   38    68 0.361   0.67
 3.04 PlyA +  48977  48982    6                               1.05

 4.03 PlyA -  49064  49059    6                               1.05
 4.02 Term -  62324  61792  533  0  2   67   40   231 0.039   9.62
 4.01 Init -  85604  85427  178  2  1   94   90   350 0.966  35.27
 4.00 Prom -  89855  89816   40                              -4.95

 5.07 PlyA -  90833  90828    6                               1.05
 5.06 Term -  95947  95779  169  0  1   17   43   159 0.180   0.67
 5.05 Intr - 103849 103718  132  0  0   93   26    89 0.116   2.04
 5.04 Intr - 110376 110202  175  1  1   85   19    91 0.175  -0.02
 5.03 Intr - 120636 120485  152  2  2   93   88    12 0.197   0.59
 5.02 Intr - 123423 123256  168  2  0   75   91    65 0.236   3.64
 5.01 Init - 126190 126153   38  1  2   41   81    54 0.220  -0.47
 5.00 Prom - 127096 127057   40                              -3.65

 6.00 Prom + 128403 128442   40                              -6.15
 6.01 Init + 132549 132601   53  2  2   78   82    63 0.667   5.38
 6.02 Term + 135522 136044  523  0  1  -16   48   338 0.978  12.16
 6.03 PlyA + 136375 136380    6                               1.05

 7.00 Prom + 137468 137507   40                              -5.85
 7.01 Init + 156205 156258   54  0  0   72   58    60 0.335   2.73
 7.02 Intr + 166290 166417  128  2  2   20   86   132 0.454   4.76
 7.03 Intr + 170820 171013  194  0  2   84    0   158 0.389   4.81
 7.04 Intr + 171377 171536  160  0  1   24   98   147 0.690   7.42
 7.05 Intr + 171895 172093  199  1  1    1   98   107 0.338   1.43
 7.06 Term + 172327 172470  144  0  0   45   39    98 0.346  -2.47
 7.07 PlyA + 172642 172647    6                               1.05

 8.03 PlyA - 176509 176504    6                               1.05
 8.02 Term - 182153 181198  956  0  2   82   37  1053 0.692  90.83
 8.01 Init - 183674 183611   64  2  1   61  116   125 0.995  12.00
 8.00 Prom - 184133 184094   40                              -1.85

 9.03 PlyA - 184801 184796    6                               1.05
 9.02 Term - 187447 187341  107  2  2   59   42    83 0.249  -1.61
 9.01 Init - 194601 194490  112  0  1   86  115    43 0.831   7.22

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:11085996_11286937|GENSCAN_predicted_peptide_1|120_aa
XWVRGLADFKREPLTFTVSVAALKDGVDPKSEQQQGLLRREKGQSFHRVKGNPGLDLNIY
VYLIPEFAGMSLFLFKFCDQCQTGKHLNMPVDEINAVFMENMMISKTAQINSYSNTTSLL

>gi568815586r:11085996_11286937|GENSCAN_predicted_CDS_1|363_bp
nngtgggttcgtggtctcgctgacttcaaaagggagccactgaccttcacggtgagtgtt
gctgctcttaaagatggtgtggacccaaagagtgagcaacagcaaggtttattgagaaga
gagaaaggacaaagcttccacagagtgaaaggcaacccaggtcttgaccttaacatctat
gtgtacctgattcctgaatttgcaggaatgtccttgttcctttttaaattctgtgaccaa
tgtcaaacaggaaagcatctcaatatgccagtggatgaaatcaatgctgtctttatggaa
aacatgatgatttccaaaacagctcaaattaactcctattcaaacacaacgtccttgctg
taa

>gi568815586r:11085996_11286937|GENSCAN_predicted_peptide_2|401_aa
MYQNLWDTAKAVFRGKFIALNAHRRKWERSKVDTLASQLKELEKQEQTNLKASRRQEITK
VRAQLKEIETPKTLQKINESRSWFFEKFNKIERQLARLVKKKRENNPIDTIKNNKGDITT
DFREIQTIIREYYKHLYANTLENLEEMDKILDTYNIPSLNQEEVKSLNRPITSSETETVI
NSLATKKSPGTDIFTTKFYQRYKEEMVPFFLKLFKTLEKEGLLLNSFYEASIILIPKPDR
DTAKRENFRPISLMNIDAKILNKILASQIQQHIKRLIHQDQVVFNPGMQGWFNICKSINI
IHHINRTNDKNHMIISTDAEKAFDKIQHAFMLKTLKKLGTDGMYLKILSAIYDKPTANIL
NGQKREAFPLKIGTKQGCHLSPLLFNIVLEILARTIRKRKK

>gi568815586r:11085996_11286937|GENSCAN_predicted_CDS_2|1206_bp
atgtaccagaatctctgggacacagctaaagcagtgtttagagggaaatttatagcacta
aatgcccacaggagaaagtgggaaagatctaaagttgacaccctagcatcacaattaaaa
gaactggagaagcaagagcaaacaaatttaaaagctagcagaagacaagagataactaag
gtcagagcacaactgaaggagatagaaacacctaaaacccttcagaaaatcaatgaatcc
aggagctggttttttgaaaagttcaacaaaatagaaagacagctagccagactagtaaag
aagaaaagagagaataatccaatagacacaataaaaaataataaaggagatattaccact
gacttcagagaaatacaaactatcatcagagaatactataaacacctctatgcaaatacg
ctagaaaatttagaagaaatggataaaatactggacacatacaacatcccaagtctaaac
caggaagaagtcaaatccctgaatagaccaataacaagttctgaaactgagacagtaatt
aatagcctagcaaccaaaaaaagtccaggaacagacatattcacaaccaaattctaccag
aggtacaaagaggagatggtaccattctttctgaaactattcaaaacactggaaaaagag
ggactactccttaactcattttatgaggccagcatcatcctgattccaaaacctgataga
gacacagcaaaaagagaaaattttaggccaatatccttgatgaacatcgatgcgaaaatc
ctcaataaaatactggcaagccaaatccagcagcatataaaaaggcttatccaccaagat
caagtcgtcttcaaccctgggatgcaaggctggttcaacatatgcaaatcaataaacata
atccatcacataaacagaaccaatgacaaaaaccacatgattatctcaacagatgcagaa
aaggcctttgataaaattcaacatgccttcatgctaaaaacactcaagaaactaggtact
gatggaatgtatctcaaaatattaagtgctatttatgacaaacccacagccaatatactg
aatgggcaaaaacgggaagcattccctttgaaaatcggcacaaaacaaggatgccatctc
tcaccactcctgttcaacatagtattggaaattctggccaggacaatcaggaagagaaag
aaataa

>gi568815586r:11085996_11286937|GENSCAN_predicted_peptide_3|163_aa
MEYYAAIKKDEFMSFAGTWMKLETIILSKLTQEQKTKHHIITHNWELNNENTWTQGGHRE
RHTTGPFRMGYRMNADYHKLKSTLTNAFQDVLSSLSRAELLAGIIHTEIDMKPEFSFASM
QTRTCSLPVFAIFPCVTSPSFVFSDFSCLGSFITRYVDRIVNV

>gi568815586r:11085996_11286937|GENSCAN_predicted_CDS_3|492_bp
atggaatactatgcagccataaaaaaggatgagttcatgtcctttgcagggacctggatg
aagctggaaaccatcattctcagcaaactaacacaggaacagaaaaccaaacaccacatt
atcactcataattgggagctaaacaatgagaacacatggacacagggaggacacagggaa
cgtcacacaactgggccttttaggatgggttatcgaatgaatgcagactaccataaactt
aaatcaacactcacaaatgctttccaggatgtgctatcttcactgagcagagcagagctt
ctggctggaattattcatactgaaattgacatgaaacctgaattctcatttgctagtatg
caaacaaggacatgttcacttccagtgtttgcaatttttccttgtgtaacctctccatca
tttgtctttagcgacttcagttgcttgggaagttttataacccgatatgtagatcgtata
gtaaatgtctaa

>gi568815586r:11085996_11286937|GENSCAN_predicted_peptide_4|236_aa
MLAPEAVSLVMALNMYLQSDVEDHRARDVEVREVHAQLPGQLEEGEQGAGEPLAEDAVRV
LEVVTRAMRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLVKLISNFSKVSGY
KINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGVQLTRDVKDLFKENYKPLLNE
IKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLTMTFFTELENYFKVH

>gi568815586r:11085996_11286937|GENSCAN_predicted_CDS_4|711_bp
atgctggccccggaggccgttagcttagtcatggctctaaatatgtacctgcaatcggat
gttgaggatcaccgagcccgcgacgtagaagtacgggaagttcatgcgcagctgccaggc
cagctcgaagaaggcgagcagggtgcgggagagccccttgcagaagacgccgtacgagtg
ttggaagttgtgaccagggcaatgaggcaggagaaggaaataaagggtattcaattagga
aaagaggaagtcaaattgtccctgtttgcagacgacatgattgtatatctagaaaatccc
attgtctcagcccaaaatctcgttaagctgataagcaacttcagcaaagtctcaggatac
aaaatcaatgtacaaaaatcacaagcattcttatacaccaataacagacaaacagagagc
caaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggagtc
caacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaa
ataaaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaat
atcgtgaaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaag
ctaacaatgactttcttcacagaattggaaaactactttaaagttcactga

>gi568815586r:11085996_11286937|GENSCAN_predicted_peptide_5|277_aa
MTKIQNTDNTKRWSEQSKVPADIKIWRHRVEIQFTDSGLPDCAGIGLLDLTQSTMGMWHL
GHRGRLFPSISLFCIPKKNQELACAFTAIGLKSAIFPGIPRSFGIYLECRMLFRKQYLGG
ETLYSCSDRRLAPETQDRGLCESHEHYDCFLLPPSLQFGNFPFIHQLLSMQEENSGTRSS
IAKDTNGNSLGKENAKRVTISVCVCIIVKEKLSCYPSPHMEKLMNNNKKVAYVVVKRVGG
QFASQRQNWLNEKHIVREQGRQNPVLSLQNHRFAAFG

>gi568815586r:11085996_11286937|GENSCAN_predicted_CDS_5|834_bp
atgaccaaaatccagaacactgacaacacaaaacgctggagtgaacaaagcaaagtgcca
gcagacatcaagatctggaggcacagagtagaaatccagttcactgactctggcttgcct
gattgtgcagggattggtctattagatctgacacaaagcacaatgggaatgtggcattta
ggacacagagggcgcctgttccccagtatatccttattttgcatcccaaaaaagaatcaa
gaattagcttgtgcttttactgccataggcctgaaatcagctatttttccagggatccct
agatcctttgggatctatttagagtgcagaatgctatttcgaaaacagtatctgggaggg
gaaacactgtacagctgtagtgataggcgactggctccagagacccaggacagaggccta
tgtgagagccatgaacattatgattgctttcttcttccaccttctctacagtttgggaac
ttccctttcatccatcagctactttctatgcaagaggaaaatagtggcactagatcttcc
atagctaaagatacaaatggaaattcattaggcaaagaaaatgcaaaaagagtaacaata
tcggtctgtgtttgcataatagtgaaggagaaactctcttgttacccatcaccccacatg
gaaaaactaatgaacaacaacaaaaaagtcgcttatgtggttgttaagagagttggtggc
cagtttgccagtcagcgtcaaaattggctgaatgagaaacacattgttagagaacaaggc
aggcaaaatcctgtcctgtccttgcagaatcatagatttgcagcttttgggtag

>gi568815586r:11085996_11286937|GENSCAN_predicted_peptide_6|191_aa
MDPNKKEIRDLPEKEFKRKISIQGSYLNVVKAIYDKPTTKLILNEEKLKAFSLRTGTRQG
YPLSPLLFNIVLEVLARAIRQEKEIKGIQIGKEQHKLSLFADDMIVYLENPKDSSKKLLD
LINEFSKVSGYKINVCKPVAFLYPNSDQTENQIKNSTPFTIAADVYRNIPNQGDKRPLQG
KLQNTAERNHR

>gi568815586r:11085996_11286937|GENSCAN_predicted_CDS_6|576_bp
atggatccaaacaaaaaagaaatccgtgatttacctgaaaaagaattcaaaagaaaaatc
agcatacaaggatcgtacctcaatgtagtaaaagccatctatgacaaacccacaaccaag
ttaatactgaatgaggaaaaattgaaagcattctctctgagaactggaacaagacaagga
tacccactttcaccacttttattcaatatagtactggaagttttagccagagcaattaga
caagagaaagaaataaagggcatccagattggtaaagaacaacacaaactgtcactgttt
gctgatgatatgattgtatacctagaaaaccctaaagactcctccaaaaagctcctagat
ctcataaatgaattcagcaaagtttcaggatacaaaattaatgtgtgcaaaccagtagcc
ttcctataccccaacagcgaccaaactgagaatcaaatcaagaactcaactccttttaca
atagctgcagatgtatataggaatatacctaatcaaggagataaaagacctctacaagga
aaactacaaaacaccgctgaaagaaatcatagatga

>gi568815586r:11085996_11286937|GENSCAN_predicted_peptide_7|292_aa
MLSETLMGIVAQNIGSLQQLQWSTEVQVGGIRCHQLQGRYQGPIVIRKHEVRDDLLDVFG
SASLKERPQTHSGPYRSKSHLPGTEYLGEGAAVGTASADLNVPAWQPISQLSARALQKDR
LPPQVDKSTMMGRNQCKKPENCKNQNASSPPNDHNSLTAREQNWTENEFDKLTEVGFRSD
GENGTKLENTLQDIIQEKFPNQARQANIQIQEIQRTPQRYSSRRATPKHIIIRFTKAEMK
EKMLRACLTRAPEGSTKHEKEKLVPTTTKTYQIVKTTDIMKKPHQLMGKIIS

>gi568815586r:11085996_11286937|GENSCAN_predicted_CDS_7|879_bp
atgctttctgaaacactgatggggatagtggcccaaaacattggcagtttacaacagtta
caatggtccactgaggttcaagtgggaggaatccgatgtcaccagttgcagggaagatat
caaggtcccattgtcatcagaaagcatgaggtgagagatgatcttttggacgtttttgga
agtgcatctctgaaagaaaggccgcagacccactcggggccttacagatcaaagtcccat
ctccctgggacagagtacctgggggaaggggcagctgtgggcacagcttcagcagactta
aatgttcctgcctggcagccgatctcccagctcagtgctcgagctctgcagaaggacaga
ctgcctcctcaagtagataaatccacgatgatggggagaaaccagtgcaaaaagcctgaa
aattgcaaaaaccagaatgcctcttctcctccaaatgatcataactccttgacagcaagg
gaacaaaactggacagagaatgagtttgacaaattgacagaagtaggcttcagaagtgac
ggggagaatggaaccaagttggaaaacactctgcaggatattatccaggagaaattcccc
aaccaagcaagacaggccaacattcaaattcaagaaatacagagaacacctcaaagatac
tcctcgagaagagcaaccccaaaacatataatcatcagattcaccaaggctgaaatgaag
gaaaaaatgttaagggcctgccttacaagagctcctgaaggaagcactaaacatgaaaag
gaaaaactggtaccaacaactacaaaaacataccaaattgtaaagaccaccgacataatg
aagaaaccccatcagctaatgggcaaaataatcagctag

>gi568815586r:11085996_11286937|GENSCAN_predicted_peptide_8|339_aa
MLLILLSVALLALSSAQSLNEGKPEGRRPQGGNQPQRTPPPPGKPEGRPPQGGNQSQGPP
PRPGKPEGPPPQGGNQSQGPPPRPGKPEGQPPQGGNQSQGPPPRPGKPEGPPPQGGNQSQ
GPPPRPGKPEGPPPQGGNQSQGPPPRPGKPEGPPPQGGNQSQGPPPHPGKPEGPPPQGGN
QSQGPPPRPGKPEGPPPQGGNQSQGPPPRPGKPEGPPPQGGNKPQGPPPRPGKPEGPPPQ
GGNQSQGPPPRPGKPEGPPSQGGNKPQGPPPHPGKPQGPPPQEGNKPQRPPPPGRPQGPP
PPGGNPQQPLPPPAGKPQGPPPPPQGGRPHRPPQGQPPQ

>gi568815586r:11085996_11286937|GENSCAN_predicted_CDS_8|1020_bp
atgctactgattctgctgtcggtggccctgctggccctgagctcagctcagagcttaaat
gaaggaaagccagaaggacgacgcccacaaggaggaaaccagccccaacgtaccccacct
cctccaggaaagccagaaggacgacccccacaaggaggcaaccagtcccaaggtccccca
cctcgtccaggaaagccagaaggaccacccccacaaggaggaaaccagtcccaaggtccc
ccacctcgtccgggaaagccagaaggacaacccccacaaggaggaaaccagtcccaaggt
cccccacctcgtccgggaaagccagaaggaccacccccacaaggaggaaaccagtcccaa
ggtcccccgcctcgtccgggaaagccagaaggaccacccccacaaggaggaaaccagtcc
caaggtcccccgcctcgtccgggaaagccagaaggaccacccccacaaggaggaaaccag
tcccaaggtcccccgcctcatccgggaaagccagaaggaccacccccacaaggaggaaac
cagtcccaaggtcccccacctcgtccgggaaagccagaaggaccacccccacaaggagga
aaccagtcccaaggtcccccacctcgtccgggaaagccagaaggaccacccccacaagga
ggcaacaaacctcaaggtcccccacctcgtccaggaaagccagaaggaccacccccacaa
ggaggaaaccagtcccaaggtcccccacctcgtccaggaaagccagaaggaccaccttca
caaggaggcaacaaacctcaaggtcccccacctcatccaggaaagccacaaggaccaccc
ccacaagaaggtaacaaacctcaacgtccccctcctccaggaaggccacaaggaccaccc
ccaccaggaggcaatccccagcagcctctgccacctcccgctggaaagccccagggacca
cctccacctcctcaagggggcagaccacacagacctccccagggacagcctccccagtaa

>gi568815586r:11085996_11286937|GENSCAN_predicted_peptide_9|72_aa
MNSSKSKVSKNIKFMVKLGWKKMKSLMVSKKLSYLAHAPTTQDGTPALGPGHRRHLHAEY
ESCVSGVQFILS

>gi568815586r:11085996_11286937|GENSCAN_predicted_CDS_9|219_bp
atgaattcttctaaaagtaaagtgagtaaaaacatcaaatttatggtgaagcttgggtgg
aagaagatgaaatcattgatggtgtccaaaaagttgtcctacttggcacatgccccaacc
actcaggatggtacaccagctctggggccaggacacaggcgacatcttcatgcagaatat
gaaagctgtgtaagcggtgtacaatttatcctttcttag