GENSCAN 1.0 Date run: 7-Nov-116 Time: 01:28:06 Sequence gi568815596f:168728420_168964967 : 236548 bp : 39.56% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 1143 1312 170 1 2 43 39 155 0.383 3.16 1.02 PlyA + 1568 1573 6 1.05 2.00 Prom + 6026 6065 40 -3.75 2.01 Init + 10923 10994 72 2 0 94 81 38 0.110 4.82 2.02 Intr + 22623 22717 95 2 2 74 71 89 0.009 3.64 2.03 Intr + 37173 37329 157 0 1 107 121 -6 0.264 3.69 2.04 Term + 41091 41243 153 2 0 99 32 246 0.595 17.04 2.05 PlyA + 42323 42328 6 1.05 3.00 Prom + 51102 51141 40 -5.65 3.01 Init + 54133 54221 89 0 2 103 92 29 0.568 4.96 3.02 Intr + 54626 54713 88 2 1 96 14 53 0.253 -2.25 3.03 Term + 59933 60121 189 1 0 49 49 242 0.752 12.87 3.04 PlyA + 61814 61819 6 1.05 4.03 PlyA - 62583 62578 6 1.05 4.02 Term - 74281 74011 271 0 1 84 50 149 0.363 4.67 4.01 Init - 79626 79472 155 0 2 64 30 149 0.631 6.20 4.00 Prom - 83080 83041 40 -5.35 5.00 Prom + 83401 83440 40 -4.95 5.01 Init + 92034 92044 11 2 2 56 116 0 0.184 -0.48 5.02 Intr + 96215 96298 84 2 0 68 115 48 0.490 3.52 5.03 Intr + 99739 99801 63 1 0 88 70 50 0.332 0.11 5.04 Intr + 100001 100082 82 2 1 90 70 59 0.732 3.02 5.05 Intr + 105808 105906 99 0 0 93 121 97 0.746 12.89 5.06 Intr + 114573 114698 126 2 0 23 111 62 0.460 1.96 5.07 Intr + 115790 115924 135 1 0 34 52 98 0.494 0.64 5.08 Intr + 122665 122763 99 0 0 97 79 9 0.410 0.29 5.09 Intr + 122860 122985 126 0 0 83 59 135 0.878 10.06 5.10 Intr + 126933 127041 109 2 1 83 93 86 0.931 7.54 5.11 Intr + 128271 128359 89 1 2 68 84 66 0.947 2.97 5.12 Intr + 128908 128968 61 0 1 64 70 90 0.760 2.19 5.13 Intr + 133541 133630 90 0 0 45 90 60 0.446 0.95 5.14 Term + 136415 136551 137 0 2 86 41 117 0.719 4.00 5.15 PlyA + 136616 136621 6 1.05 6.00 Prom + 138734 138773 40 -3.65 6.01 Init + 140192 140371 180 1 0 70 41 161 0.670 8.83 6.02 Term + 140660 141283 624 1 0 -23 42 297 0.460 7.40 6.03 PlyA + 141703 141708 6 1.05 7.09 PlyA - 142499 142494 6 1.05 7.08 Term - 143136 143012 125 1 2 117 38 121 0.994 7.57 7.07 Intr - 145264 145166 99 2 0 99 95 19 0.879 2.86 7.06 Intr - 147757 147653 105 2 0 62 115 100 0.994 9.37 7.05 Intr - 148965 148819 147 1 0 65 106 132 0.956 11.99 7.04 Intr - 161114 160968 147 0 0 64 82 128 0.969 9.09 7.03 Intr - 161563 161316 248 0 2 22 88 52 0.720 -5.22 7.02 Intr - 162267 161899 369 2 0 47 116 189 0.294 10.90 7.01 Init - 163817 163675 143 2 2 80 92 29 0.553 2.15 7.00 Prom - 169897 169858 40 -4.35 8.00 Prom + 172839 172878 40 -2.85 8.01 Init + 179151 179369 219 2 0 65 43 176 0.232 9.58 8.02 Term + 184593 184910 318 2 0 20 38 278 0.693 9.90 8.03 PlyA + 185039 185044 6 1.05 9.11 PlyA - 185765 185760 6 1.05 9.10 Term - 195403 195203 201 1 0 113 46 287 0.994 23.41 9.09 Intr - 196384 196238 147 1 0 73 115 171 0.974 17.81 9.08 Intr - 198943 198737 207 1 0 116 105 179 0.991 20.75 9.07 Intr - 202443 202246 198 0 0 96 91 201 0.987 19.83 9.06 Intr - 204114 203958 157 2 1 108 106 133 0.997 16.19 9.05 Intr - 207006 206765 242 0 2 78 94 237 0.669 18.73 9.04 Intr - 208014 207811 204 0 0 98 89 98 0.672 9.37 9.03 Intr - 216347 216186 162 2 0 109 72 140 0.998 13.75 9.02 Intr - 229709 229545 165 2 0 70 98 155 0.975 13.94 9.01 Intr - 235889 235787 103 1 1 117 77 176 0.999 18.66 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 6076 6192 117 0 0 46 38 132 0.844 1.56 S.002 Term - 13624 13510 115 0 1 106 44 126 0.832 7.06 S.003 Term - 160872 160799 74 1 2 71 48 81 0.874 -0.51 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:168728420_168964967|GENSCAN_predicted_peptide_1|56_aa XPEPSQKPSLQNPRVQGSVVLSEQATLAVSFSWQNLFHASDGSVAKETEPQDHWHP >gi568815596f:168728420_168964967|GENSCAN_predicted_CDS_1|171_bp ntaccagaaccatcccagaaaccttccctgcagaatcccagagtgcagggctctgtggtg ctatcagagcaggccacgctggctgtgtcctttagttggcagaacctcttccatgcctct gatggcagtgttgccaaggagacggaaccacaggatcattggcatccctag >gi568815596f:168728420_168964967|GENSCAN_predicted_peptide_2|158_aa MSKSGEGRTGYNAEQNLTPGIESKMQNIAVVTELNIDDSSDFDQKPCYLKTYPKHKVLNT TLFESWEIVGPYPSWWVFNLLLLLVQGLNCFWSYLIVKIACKAVSRGKVSKDDRSDIESS SDEEDSEPPGKNPHTATTTNGTSGTNGYLLTGSCSMDD >gi568815596f:168728420_168964967|GENSCAN_predicted_CDS_2|477_bp atgagcaaaagcggagagggaagaacaggatacaatgcagaacaaaatcttactccggga atagagtctaagatgcaaaacatagctgtggtcactgagctaaatattgatgatagcagt gactttgatcaaaagccatgttacctcaaaacttatcctaaacacaaggtgttaaatacc acattatttgaaagctgggagatcgttggaccttacccttcctggtgggtttttaaccta ctgctattgctagtacaagggttgaactgcttctggtcttacttgattgtgaaaatagct tgcaaagctgtttcaagaggcaaggtgtccaaggatgatcgaagtgatattgagtctagc tcagatgaggaggactcagaacctccgggaaagaatccccacactgcgacaaccaccaat gggaccagtggtaccaacgggtatctcctgactggctcctgctccatggatgattaa >gi568815596f:168728420_168964967|GENSCAN_predicted_peptide_3|121_aa MVSRKGSLPTDRKKPETGYQQLPNKISGVGQILSSEKARIEGAAGLYRFADSNRTNKIRD RAEQISLLGEGITVARSGMMGAKLGMEMSAQEDSAAGRIVAQGGGEEESTQGGSEELEEE A >gi568815596f:168728420_168964967|GENSCAN_predicted_CDS_3|366_bp atggtgtcaaggaaaggcagtctcccaacagatagaaaaaaacctgaaactggttatcag cagcttcccaataagatctcaggagttggtcaaattctttcttctgagaaggcaagaatt gagggtgctgcaggcctgtacagatttgccgacagtaacagaaccaataaaatacgtgac agagcagagcagatatctctgctaggtgagggaatcactgtggcccggagtggaatgatg ggagccaaattgggcatggagatgtctgcccaggaggacagtgcagctggcagaatcgta gcccaaggtgggggtgaagaggaatccacacaaggtgggagtgaggagttggaggaggaa gcatga >gi568815596f:168728420_168964967|GENSCAN_predicted_peptide_4|141_aa MLKVYKPVIENSIAFSGSGEACGKGIHIQPLKVRGCWTKAQSMASIDPWATRHTNHTELL TGQSVSGSLMLKCVWLLSFHLRSSILDTVIKVGCVLDFRDSKGLEYVSPCQTFQGGSPSP GLSSRLCPEKVKNCSGSRRPV >gi568815596f:168728420_168964967|GENSCAN_predicted_CDS_4|426_bp atgctgaaagtttataagccagtgattgaaaactcaattgccttcagtggcagtggtgaa gcctgtggcaaaggcattcacattcaacctcttaaagtaaggggctgctggaccaaagca cagtctatggcttccattgacccatgggctaccaggcacacaaaccacactgagctactc accggacaatctgtcagtgggtccctcatgttgaaatgtgtctggcttttgtcctttcac ctacgctccagcattctggacactgtaatcaaggttgggtgtgtcctggacttccgggat tcaaagggcctagagtacgtcagtccttgtcaaacattccaaggaggaagtcccagccct ggactttcatcccgtttgtgtcccgagaaagtaaagaactgctctgggtcaagacggcca gtgtga >gi568815596f:168728420_168964967|GENSCAN_predicted_peptide_5|436_aa MSQRANLEISYAKGLQKLASKLSKALQNTRKSCVSSAWAWASEGMKSTADLHQKLGKAIE LEAIKPTYQVLNVQEKKRKSAKKKLMVSTKKHEALFQLVESSKQSMTEKEKRKLLNKLTK STEKLEKEDENYYQKNMAGYSTRLKWENTLENCYQKWSPWWEYNRTARNSHAAYARPLVK QYNPRLYINMTKLNNYQFWRSILELEKERIQLLCNNLNQYSQHISLFGQTLTTCHTQIHC AISKIDIEKDIQAVMEETAILSTENKSEFLLTDYFEEDPNSAMDKERRKSLLKPKLLRLQ RDIEKASKDKEGLERMLKTYSSTSSFSDAKSQKDTAALMDEGYRRITGERKDVVAEGIGQ SAPGAAQLSSRLCKALYSFQARQDDELNLEKGDIVIIHEKKEGGWWFGSLNGKKGHFPAA YVEELPSNAGNTATKA >gi568815596f:168728420_168964967|GENSCAN_predicted_CDS_5|1311_bp atgtcacaaagggcaaacctggaaattagctatgccaaaggacttcagaaactggcaagc aagctgagcaaagcattacagaacacgagaaaaagttgtgttagcagtgcctgggcctgg gcctcagagggaatgaaatccacagcggacctgcatcaaaaacttggcaaagcaattgaa ttggaagcaataaaaccgacttatcaagtcctaaatgtacaagagaagaagagaaaatca gccaagaagaaattaatggttagtaccaagaaacatgaagcacttttccagcttgtagaa agctccaagcaatctatgactgagaaggagaagcggaagctcctcaataaactgacaaaa tcaactgaaaagttggaaaaggaagatgaaaattactaccaaaaaaacatggcgggttat tctaccagactgaaatgggaaaacacactagagaactgctaccagaaatggagcccatgg tgggaatacaacagaactgccagaaacagccacgcagcttatgcacggcccctggtgaag cagtacaaccccaggctctacataaatatgaccaaattaaataactatcaattctggaga agcattctggagctggagaaggaaagaattcaacttttatgcaataacttaaaccagtac agccaacatatttctctttttggccaaaccctgaccacatgccacacgcagattcactgt gccatcagcaagattgacattgaaaaagatatccaggctgtaatggaagaaactgcaatt ttatctacagaaaacaaatctgagttcctgttaacggattactttgaagaagatcctaac agtgcaatggataaagagagacgaaagtctttactaaaaccaaaattattgagactgcag agagacattgaaaaagcctcaaaagacaaggaaggcctggaacgaatgcttaaaacgtac tccagcacctcctccttctctgatgcaaagagccagaaagacacagcagcgttaatggat gagggttatcggagaataacaggggaaaggaaagacgtggttgcagagggaatagggcag tctgcccctggtgcagcccagctcagcagcagactttgcaaggccttgtattcttttcaa gccaggcaagatgatgagttgaatttggaaaagggtgacattgtgattatacacgagaaa aaagaaggaggatggtggtttggatctttgaatgggaaaaaaggccattttcctgccgct tatgtggaggagttaccttcaaatgctggcaacacagctacaaaggcataa >gi568815596f:168728420_168964967|GENSCAN_predicted_peptide_6|267_aa MDKFLDTYTLPRLNKEEVESLNRPITGSEIEAIINSLPTKKSPGPDGFTAEFYQRYKEEL HINRTKDKNHIIISIDAEKAFDKIQQPFMLKTLNKLGIDGMYLKIIRAIYDKPTANIIWN GQKLEAFPLKTGTRQGCPLLPLLFNIVLEVLARAIRQKKEIKGIQLGKEELKLSLFADDM IVYLENPIVSAQNLLKLISNFSKLSGCKINVQKSQAFLYTNNRQTESQIMSELPFTIASK RIKYLGIQLTRDVKDLFKENYNHCSVK >gi568815596f:168728420_168964967|GENSCAN_predicted_CDS_6|804_bp atggataaattcctcgacacatacactctcccaagactaaacaaggaagaagttgaatct ctgaataggccaataacaggctctgaaattgaggcaataattaatagcttaccaaccaaa aaaagtccaggaccagatggattcacagctgaattctaccagaggtacaaggaggagctg catataaacagaaccaaagacaaaaaccacataattatctcaatagatgcggaaaaggcc tttgacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtattgatggg atgtatctcaaaataataagagctatctatgacaaacccacagccaatatcatatggaat ggacaaaaactggaagcattccctttgaaaaccggcacaagacagggatgccctctccta ccactcctattcaacatagtgttggaagttctggccagggcaatcaggcagaagaaggaa ataaagggcattcaattaggaaaagaggaactcaaattgtccctgtttgcagatgacatg attgtatatctagaaaaccccatcgtctcagcccaaaatctccttaagctgataagcaac ttcagcaaactctcaggatgcaaaatcaatgtgcaaaaatcacaagcatttttatacacc aataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttcaaag agaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaac tacaaccactgctcagtgaaataa >gi568815596f:168728420_168964967|GENSCAN_predicted_peptide_7|460_aa MWHLSIPHPAETQPWVTDRISQRRQCSAQYLAQNRNSINPGWFEMKGINLYLKPINDLSW ASPSGAINTFGSQRRLWLLFRKELRIPECPVPPFSPRESKGEFNQSCQGEPALSANQKAS LRLEAGGGKCTRFEIGKLAGLRELSLESGCWLEWARIWCGEGGGTQACLRSPYFTVNFYC SEVEERDRIDVWGLILWEGAGHLAILMISVQKGLELDQGFQKSTVLLVRKWTQGHSACEC PVQLHLPTNKLLGALSYIMVEDELALFDKSINEFWNKFKSTDTSCQMAGLRDTYKDSIKA FAEISRQNKLIQEKKDNLLKLIAEVKGKKQELEVLTANIQDLKEEYSRKKETISTANKAN AERLKRLQKSADLYKDRLGLEIRKIYGEKLQFIFTNIDPKNPESPFMFSLHLNEARDYEV SDSAPHLEGLAEFQENVRKTNNFSAFLANVRKAFTATVYN >gi568815596f:168728420_168964967|GENSCAN_predicted_CDS_7|1383_bp atgtggcatctttctattccccaccctgcagaaacccaaccctgggtcacagaccgtata tcccagagaaggcagtgctcagctcagtacctggcacaaaatagaaactcaataaaccct ggttggtttgaaatgaaaggtataaacctttatttgaagccgataaatgacttgtcgtgg gcgagcccttcgggtgctattaatactttcgggtcacagcggagactctggctgttgttc agaaaagaactacgaatcccagaatgccctgttcccccattttcccctcgagaatccaag ggtgagttcaaccaatcctgtcaaggagagcctgcgctatcggccaatcagaaggccagc ctgcgcctggaggcgggtggcgggaagtgcactaggtttgaaatcggaaagttggcgggg ctgcgggagctgagcctagagtccggctgttggctagagtgggcgcggatctggtgtggg gaaggcggcgggactcaggcctgcctgcgaagtccctattttactgttaacttttactgt tctgaggtggaggagagagacaggatagatgtgtggggtcttattctatgggagggtgcc ggccacctggctatcctgatgatctctgttcagaaagggttagagttggatcaggggttt cagaagtcaacagttttacttgtgagaaaatggacccagggtcatagtgcgtgtgagtgt cctgtgcagttacacttacctactaacaagctgctaggagcattgtcctacataatggta gaggacgaactggcacttttcgataaaagcataaatgaattttggaataaattcaaaagt acggacacctcctgtcagatggcgggactaagagatacctacaaggattccatcaaagca tttgcagagatcagcaggcaaaataagctcattcaagaaaaaaaggataacttgttaaaa ttgattgctgaagtaaaaggcaaaaagcaggaattggaagtactgactgcaaatatccag gatcttaaggaagaatattctaggaagaaggaaactatttctactgctaataaagcgaat gcagagaggttgaaaaggctgcagaaatctgcagacttgtataaagatcgacttggacta gaaattcgaaaaatttatggtgagaaattgcagtttattttcactaatattgaccctaag aatcctgagagcccatttatgttttccttacatctcaatgaagcaagggactatgaagtg tcagatagtgcccctcatcttgagggcctagcagaatttcaagagaatgtaaggaagacc aacaatttttcagcttttcttgccaatgttcggaaagcttttactgccacggtttataat taa >gi568815596f:168728420_168964967|GENSCAN_predicted_peptide_8|178_aa MLVAEAFEHTPGIQTASLGTYLKTNLFLFLFAVGFYLLLRVLNIDLLWSVPIAKKWCANP DWIHIDTTPFAGLRGEERRRWQHSSPFLALAKWKLKRPPWEDLGALQGEQAICVPGKETQ HYRRQRRASLLSHSPTPVRQLSRSHSNSCSKSWLFPGFRLCLMNSMDRIVICAHTSCC >gi568815596f:168728420_168964967|GENSCAN_predicted_CDS_8|537_bp atgctggtggcagaggcctttgaacacactccaggcatccaaacggccagtctgggcaca tacctgaagaccaacctctttctcttcctgtttgcagttggcttttacctgcttcttagg gtgctcaacattgacctgctgtggtccgtgcccatagccaaaaagtggtgtgctaacccc gactggatccacattgacaccacgccttttgctggactccgcggggaggaacgtaggcgc tggcagcacagcagcccgtttctggcgctggccaagtggaaactgaagcggcctccctgg gaggatctgggtgcgctgcagggggagcaggccatctgtgtgcctggaaaagaaacccag cactaccgaagacaaagaagagcttctttactctcgcactcacctacccctgtacgtcaa ctgagccgatcacactctaattcctgttcaaagagttggctttttccaggattcaggctt tgtctgatgaacagcatggaccggattgtcatctgtgctcatacatcctgctgctaa >gi568815596f:168728420_168964967|GENSCAN_predicted_peptide_9|595_aa XASIRQRSKSQLSYLVHEPPLAVVDHKSTYEEDRKDKDIPVQEEVEPAPVRRILKFSAPE WPYMLVGSVGAAVNGTVTPLYAFLFSQILGGYAFAKSGELLTKRLRKFGFRAMLGQDIAW FDDLRNSPGALTTRLATDASQVQGAAGSQIGMIVNSFTNVTVAMIIAFSFSWKLSLVILC FFPFLALSGATQTRMLTGFASRDKQALEMVGQITNEALSNIRTVAGIGKERRFIEALETE LEKPFKTAIQKANIYGFCFAFAQCIMFIANSASYRYGGYLISNEGLHFSYVFRVISAVVL SATALGRAFSYTPSYAKAKISAARFFQLLDRQPPISVYNTAGEKWDNFQGKIDFVDCKFT YPSRPDSQVLNGLSVSISPGQTLAFVGSSGCGKSTSIQLLERFYDPDQGKVMIDGHDSKK VNVQFLRSNIGIVSQEPVLFACSIMDNIKYGDNTKEIPMERVIAAAKQAQLHDFVMSLPE KYETNVGSQGSQLSRGEKQRIAIARAIVRDPKILLLDEATSALDTESEKTVQVALDKARE GRTCIVIAHRLSTIQNADIIAVMAQGVVIEKGTHEELMAQKGAYYKLVTTGSPIS >gi568815596f:168728420_168964967|GENSCAN_predicted_CDS_9|1788_bp nnggcttccatccggcaacgctccaagtctcagctttcttacctggtgcacgaacctcca ttagctgttgtagatcataagtctacctatgaagaagatagaaaggacaaggacattcct gtgcaggaagaagttgaacctgccccagttaggaggattctgaaattcagtgctccagaa tggccctacatgctggtagggtctgtgggtgcagctgtgaacgggacagtcacacccttg tatgcctttttattcagccagattcttgggggatatgcctttgctaaatctggggagctc ctaacaaaaaggctacgtaaatttggtttcagggcaatgctggggcaagatattgcctgg tttgatgacctcagaaatagccctggagcattgacaacaagacttgctacagatgcttcc caagttcaaggggctgccggctctcagatcgggatgatagtcaattccttcactaacgtc actgtggccatgatcattgccttctcctttagctggaagctgagcctggtcatcttgtgc ttcttccccttcttggctttatcaggagccacacagaccaggatgttgacaggatttgcc tctcgagataagcaggccctggagatggtgggacagattacaaatgaagccctcagtaac atccgcactgttgctggaattggaaaggagaggcggttcattgaagcacttgagactgag ctggagaagcccttcaagacagccattcagaaagccaatatttacggattctgctttgcc tttgcccagtgcatcatgtttattgcgaattctgcttcctacagatatggaggttactta atctccaatgaggggctccatttcagctatgtgttcagggtgatctctgcagttgtactg agtgcaacagctcttggaagagccttctcttacaccccaagttatgcaaaagctaaaata tcagctgcacgcttttttcaactgctggaccgacaacccccaatcagtgtatacaatact gcaggtgaaaaatgggacaacttccaggggaagattgattttgttgattgtaaatttaca tatccttctcgacctgactcgcaagttctgaatggtctctcagtgtcgattagtccaggg cagacactggcgtttgttgggagcagtggatgtggcaaaagcactagcattcagctgttg gaacgtttctatgatcctgatcaagggaaggtgatgatagatggtcatgacagcaaaaaa gtaaatgtccagttcctccgctcaaacattggaattgtttcccaggaaccagtgttgttt gcctgtagcataatggacaatatcaagtatggagacaacaccaaagaaattcccatggaa agagtcatagcagctgcaaaacaggctcagctgcatgattttgtcatgtcactcccagag aaatatgaaactaacgttgggtcccaggggtctcaactctctagaggggagaaacaacgc attgctattgctcgggccattgtacgagatcctaaaatcttgctactagatgaagccact tctgccttagacacagaaagtgaaaagacggtgcaggttgctctagacaaagccagagag ggtcggacctgcattgtcattgcccatcgcttgtccaccatccagaacgcggatatcatt gctgtcatggcacagggggtggtgattgaaaaggggacccatgaagaactgatggcccaa aaaggagcctactacaaactagtcaccactggatcccccatcagttga