GENSCAN 1.0 Date run: 2-Nov-116 Time: 18:05:39 Sequence gi568815590f:108343659_108586595 : 242937 bp : 36.44% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 315 425 111 2 0 78 32 110 0.511 4.66 1.02 Term + 8688 8969 282 2 0 38 47 210 0.196 6.34 1.03 PlyA + 9453 9458 6 1.05 2.00 Prom + 11971 12010 40 -5.45 2.01 Init + 15305 15365 61 1 1 100 92 67 0.527 7.83 2.02 Term + 30045 30121 77 1 2 72 40 78 0.015 -1.58 2.03 PlyA + 30305 30310 6 1.05 3.03 PlyA - 31939 31934 6 1.05 3.02 Term - 32963 32492 472 1 1 33 42 224 0.028 5.72 3.01 Init - 35887 35691 197 1 2 102 37 140 0.030 8.75 3.00 Prom - 43213 43174 40 -4.65 4.00 Prom + 43344 43383 40 -3.65 4.01 Init + 43521 43669 149 2 2 83 59 78 0.591 4.01 4.02 Intr + 51978 52078 101 0 2 107 -13 60 0.017 -3.37 4.03 Intr + 64507 64620 114 2 0 122 106 20 0.188 6.70 4.04 Term + 64779 64798 20 1 2 107 36 35 0.180 -2.30 4.05 PlyA + 65050 65055 6 1.05 5.04 PlyA - 65570 65565 6 1.05 5.03 Term - 66775 66707 69 1 0 108 42 65 0.579 0.86 5.02 Intr - 88435 88283 153 1 0 49 16 182 0.628 6.45 5.01 Init - 90364 90347 18 1 0 60 91 -5 0.309 -2.75 5.00 Prom - 96074 96035 40 -3.95 6.00 Prom + 97051 97090 40 -2.55 6.01 Init + 99342 99587 246 2 0 100 42 183 0.966 12.55 6.02 Intr + 99844 100040 197 0 2 58 88 122 0.869 6.49 6.03 Intr + 104343 104427 85 0 1 33 119 74 0.881 3.90 6.04 Intr + 104924 105032 109 1 1 69 87 -29 0.317 -5.96 6.05 Intr + 106165 106278 114 2 0 84 92 71 0.763 6.60 6.06 Intr + 106770 106834 65 1 2 71 74 56 0.779 0.02 6.07 Intr + 109404 109489 86 2 2 89 86 70 0.969 4.60 6.08 Intr + 112215 112272 58 0 1 67 93 68 0.966 3.17 6.09 Intr + 126168 126253 86 2 2 68 79 136 0.994 8.40 6.10 Intr + 126404 126463 60 2 0 81 80 71 0.842 2.53 6.11 Intr + 132224 132305 82 2 1 91 100 59 0.989 6.12 6.12 Intr + 133124 133234 111 1 0 66 101 27 0.720 1.46 6.13 Intr + 133905 133988 84 2 0 112 29 45 0.464 0.10 6.14 Intr + 135348 135452 105 2 0 65 92 57 0.809 3.29 6.15 Term + 142854 142940 87 2 0 67 45 74 0.566 -2.32 6.16 PlyA + 143231 143236 6 1.05 7.04 PlyA - 143251 143246 6 1.05 7.03 Term - 147118 147097 22 0 1 98 40 24 0.065 -4.59 7.02 Intr - 158166 157299 868 1 1 75 53 182 0.440 2.87 7.01 Init - 159077 158222 856 2 1 37 86 286 0.615 18.13 7.00 Prom - 160348 160309 40 -6.15 8.02 PlyA - 160517 160512 6 1.05 8.01 Sngl - 161759 161430 330 2 0 88 44 306 0.945 21.97 8.00 Prom - 169202 169163 40 -4.65 9.00 Prom + 176019 176058 40 -1.65 9.01 Init + 199299 199356 58 2 1 56 78 56 0.282 2.92 9.02 Intr + 207373 207496 124 2 1 123 83 28 0.614 4.52 9.03 Term + 208678 208852 175 1 1 32 55 141 0.600 1.45 9.04 PlyA + 208960 208965 6 1.05 10.00 Prom + 209555 209594 40 -2.05 10.01 Init + 213818 213836 19 1 1 102 69 12 0.308 0.97 10.02 Intr + 222423 222481 59 1 2 122 116 -23 0.568 1.58 10.03 Term + 227244 227381 138 2 0 117 42 102 0.912 5.48 10.04 PlyA + 227743 227748 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 33121 32492 630 1 0 43 42 256 0.934 12.54 S.002 Init + 123319 123367 49 0 1 64 101 23 0.840 2.36 S.003 Intr + 123481 123530 50 2 2 103 56 49 0.834 0.68 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590f:108343659_108586595|GENSCAN_predicted_peptide_1|130_aa MDTNQEEIPDLPEKEFGRVVIKLIKEAPKKGKVQFKKVAQQRQRDSICLGESKGKEQEFL PDNPENSSGSYPRPPRWYLYESAKTTALLGLKSLQIPGKPSQERWAQTGPDCEDYNKCQT LQCPDTDKHP >gi568815590f:108343659_108586595|GENSCAN_predicted_CDS_1|393_bp atggatacaaaccaagaagaaatccctgatttacctgaaaaagaatttggaagggtggtt attaagctaatcaaggaggcaccaaagaaaggcaaagtccaatttaagaaagtggctcag caaagacagagagattccatttgtttgggagaaagtaagggaaaagaacaagagtttctg cctgataatccagagaattcttctggatcttatccaaggccaccaaggtggtacctctat gagtctgcaaaaaccacagcattattgggcttgaagtcccttcaaatacctggaaaacct tctcaagaaagatgggcacaaacaggtccagactgcgaagactacaataaatgccaaact cttcaatgcccagacactgacaaacatccataa >gi568815590f:108343659_108586595|GENSCAN_predicted_peptide_2|45_aa MARAVHWPLLVMAGVAGTQGSFLRPSLEADAGTWLPVQPEEQRAN >gi568815590f:108343659_108586595|GENSCAN_predicted_CDS_2|138_bp atggcccgagctgtacattggccccttttagtcatggctggagtggctgggacacagggc agcttcctcaggccctcactagaagcagatgctggcacctggcttcctgtacagcctgaa gaacagcgagccaattaa >gi568815590f:108343659_108586595|GENSCAN_predicted_peptide_3|222_aa MGHIRTKTMRKVAWVIIEKYYTCLGNDFHTNKHVCEEIAIMPCKNLWNKIAACVTHLMEQ TQRVPVTNDKNHMIISIDAEKAFDKIQQPFLLKPLNKLGIDGTYLKIIRAIHDKPTANII LSRQKLEAFPLKTGKRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGREEVKLSLFAD DMIVYLENPIASAHNLLKLISNFSSLRIQNQCAKITSIPIHQ >gi568815590f:108343659_108586595|GENSCAN_predicted_CDS_3|669_bp atgggccacattcgcaccaaaaccatgagaaaggtggcctgggtcattatagaaaagtac tacacgtgcctgggcaatgacttccacaccaacaagcatgtgtgcgaggaaattgccatt atgccctgcaagaatctctggaacaagatagcagcctgtgtaacccatctgatggagcag actcagagggtcccagtaaccaatgacaaaaaccacatgattatctcaatagatgcagaa aaggccttcgacaaaattcaacagcccttcctgctaaaacctctcaataaactaggcatt gatggaacatatctcaaaataataagagctattcatgacaaacccacagccaatatcata ctgagtaggcaaaagctggaagcattccctttgaaaaccggcaaaagacaaggatgccct ctctcaccactcctattcaacatagtattggaagttctggccagggcaatcagacaagag aaagaaataaagggtattcagttaggaagagaggaagtcaaattgtctctgttcgcagat gacatgattgtatatttagaaaaccccatcgcctcagcccacaatctccttaagctgata agcaactttagcagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattcctata caccaataa >gi568815590f:108343659_108586595|GENSCAN_predicted_peptide_4|127_aa MDEAGNHDSQKTITRTKNQTPHVLTHRWEPNNENTWTQEGEHHTLGPVVSGLSQEPPSPS IGFLPQIFKFISVLEVTVLKKIIGYTFPSASRLPQTHLLPKTSKPKCEENHNDFQSKSHK VGLSSDD >gi568815590f:108343659_108586595|GENSCAN_predicted_CDS_4|384_bp atggatgaagctggaaaccatgattctcagaaaactatcacaaggacaaaaaaccaaaca ccgcatgttctcactcataggtgggaaccgaacaatgagaacacatggacacaggaaggg gaacatcacacactggggcctgttgtgagtggcctttcccaagaacctccttcaccctct ataggctttctcccacaaatattcaagtttatttccgtcctggaagtaacggtcctcaaa aagataataggatatacttttccttctgcatcaaggctcccccaaactcaccttctccca aaaacttctaaacctaaatgtgaagaaaatcataatgactttcaatctaaatctcacaag gtggggctttctagtgatgattag >gi568815590f:108343659_108586595|GENSCAN_predicted_peptide_5|79_aa MNIGVQRNESPLTKELLFGLLHDPCITVQQILIEAICSNNMMIKQPFDLPSEEPKYHPTQ HEDVEEDKDLCSDSFPVKE >gi568815590f:108343659_108586595|GENSCAN_predicted_CDS_5|240_bp atgaacataggggtgcagagaaatgaatctccgcttactaaagaactgttgtttgggctg ctgcatgatccttgcatcacagttcagcagattctcatcgaagccatttgctctaataac atgatgataaagcaaccctttgatttgccttccgaagaacctaaataccatcctactcaa catgaagatgttgaagaggataaagacctttgtagtgattcatttccagttaaggaataa >gi568815590f:108343659_108586595|GENSCAN_predicted_peptide_6|524_aa MDEAMTGTTSVNEGYKLARGHSRELQSAKEESHLVSQNFGVYPFSRLKSERKANPITEGG KLGTNKGFVQKAALRPAMKRIRPLRRAACTCVADTAPSALLANEKKLRNSPPPGGGSRDC VSPPSHPAASRFWEDGEGLRALRCHLGSVWCPKLKGQADENTCSWVKDNSGACEELDMTF SFLSSAIIVRPPQPRGTASPVKPLSFVNCPVSEMRDKMRKWREENSRNSEQIVEVGEELI NEYASKLGDDIWIIYEQVMIAALDYGRDDLALFCLQELRRQFPGSHRVKRLTGMRFEAME RYDDAIQLYDRILQEDPTNTAARKRKIAIRKAQGKNVEAIRELNEYLEQFVGDQEAWHEL AELYINEHDYAKAAFCLEELMMTNPHNHLYCQQYAEVKYTQGGLENLELSRKYFAQALKL NNRNMRALFGLYMLSEKELSEEMEIGRNKNETIFKKEWQNMSASHIASNPKASAKTKKDN MKYASWAASQINRAYQFAGRSKKETKYSLKAVEDMLETLQITQS >gi568815590f:108343659_108586595|GENSCAN_predicted_CDS_6|1575_bp atggatgaagcaatgacaggcacgacgagcgttaatgaagggtataagctagcgcgaggg cattcaagggaacttcagagcgcaaaggaagaatcccacttagtctctcaaaactttgga gtttatccgtttagtcgattaaaaagtgaacggaaagcaaatcccattacggaaggagga aagttggggacaaataagggattcgtgcagaaggctgcactgcggccggccatgaagaga ataaggcccctccgcagagccgcgtgcacgtgcgtggcggataccgccccctctgctctc ttggccaatgagaaaaagctacgtaactctccaccgcccggtggcgggtcacgtgactgc gtctccccgccctctcaccccgctgcctctaggttctgggaagatggcgaaggtctcaga gctttacgatgtcacttgggaagtgtttggtgtcctaaactcaagggccaagcagatgaa aatacatgtagttgggtgaaagacaatagtggtgcctgtgaagaactggatatgactttt tccttcttgtcttctgccatcattgtgaggcctccccagccacgtggaactgcaagtcca gttaaacctctttcttttgtaaattgcccagtctcggaaatgagagataaaatgagaaaa tggagagaagaaaactcaagaaatagtgagcaaattgtggaagttggagaagaattaatt aatgaatatgcttctaagctgggagatgatatttggatcatatatgaacaggtgatgatt gcagcactagactatggtcgggatgacttggcattgttttgtcttcaagagctgagaaga cagttccctggcagtcacagagtcaagcgattaacaggcatgagatttgaagccatggaa agatatgatgatgctatacagctatatgataggattttacaagaagatccaactaacact gctgcaagaaagcgtaagattgccattcgaaaagcccaggggaaaaatgtggaggccatt cgggagctgaatgagtatctggaacaatttgttggagaccaagaagcctggcatgaactt gcagaactttacatcaatgaacatgactatgcaaaagcagccttttgtttagaggaacta atgatgactaatccacacaaccacttatactgtcagcagtatgctgaagttaagtatacc caaggtggacttgaaaacctcgaactttcaagaaagtattttgcacaggcattgaaactg aacaacagaaatatgagagctttgtttggactttatatgctcagtgaaaaggaactgagt gaagaaatggaaataggaagaaacaaaaatgaaacaatttttaaaaaggaatggcaaaat atgtcggcaagtcatattgcttctaatccaaaagcaagtgcaaaaacgaaaaaggacaac atgaaatatgctagttgggcagctagtcaaataaacagagcttatcagtttgcaggtcga agtaagaaggaaaccaaatattctcttaaggctgtcgaagacatgttggaaacattgcag atcacccagtcttaa >gi568815590f:108343659_108586595|GENSCAN_predicted_peptide_7|581_aa MNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNICKSINVIQHINRTKDKNHM IISIDAEKAFDKIQQCFMLKTFNKLGIDGMYLKIIRAIYDKPTANIILNGQKLEALPLKT GTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDIIVYLENPIVPA QNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTQSKIMSELPFTIASKKNKIPRNPTYK GCEGSLQGELQTTAQGNKRGHKQMEEHSMLMGRKNQYHENGHTAQELEKTTLKFIWNQKR ARIAKSILSQKNKAGGIMLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEVSEIMPHIYN YLIFDKPEKNKQWGKDSLFNKWCWEHWLAICRKLKLDPFLTPYTKINSRWIKDLYVRPKT IKTLEENLGNTIQDTGMGKDFMSKTPNPMATEAKIDKWDLMKLKSFCTAKETTIRVNRQP TEWEKTFATYSSDKGLISRIYNELKQIYKKKANPIKQRAKDMNRHFSKEDIYAAKKHMKK CSSSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRMYKAKL >gi568815590f:108343659_108586595|GENSCAN_predicted_CDS_7|1746_bp atgaacattgatgcaaaaatcctcaataaaatactggcaaaccgaatccagcaacacatc aaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaac atatgcaaatcaataaacgtaatccagcatataaacagaaccaaagacaaaaaccacatg attatctcaatagacgcagaaaaggcctttgacaaaatacaacaatgcttcatgctaaaa actttcaataaattaggtattgatgggatgtatctcaaaataataagagctatctatgac aaacccacagccaatatcatactgaatggacaaaaattggaagcactccctttgaaaact ggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagttctg gccagggcaatcaggcaggagaaggaaataaaaggcattcaattaggaaaagaggaagtc aaattgtccctgtttgcagatgacatcattgtgtatctagaaaaccccatcgtcccagcc caaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgta caaaaatcacaagcattcttatacaccaataacagacaaacacagagcaaaatcatgagt gaactcccattcacaattgcttcaaagaaaaataaaatacctaggaatccaacttacaag ggatgtgaaggatctcttcaaggagaactacaaaccactgctcaaggaaataaaagagga cacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcatgaaaat ggccatactgcccaagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga gcccgcattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcatgctacct gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac agagatatagaccaatggaacagaacagaggtctcagaaataatgccacatatctacaac tatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaat aaatggtgctgggaacactggctagccatatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaattaattcaagatggattaaagacttgtatgttagacctaaaacc ataaaaaccctagaagaaaacctaggcaataccattcaggacacaggcatggggaaggac ttcatgtctaaaacaccaaacccaatggcaacagaagccaaaattgacaaatgggatcta atgaaactcaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacct acagaatgggagaaaacttttgcaacctactcatctgacaaagggctaatatccagaatc tacaatgaactcaaacaaatttacaagaaaaaagcaaaccccataaaacagagggcgaag gatatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacacatgaaaaaa tgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctc acaccagttagaatggcaatcattaaaaagtcaggaaacaacagaatgtataaagccaag ttgtaa >gi568815590f:108343659_108586595|GENSCAN_predicted_peptide_8|109_aa MGKKQSRNTRNSKNQSASPPPKEHSSSPAMEQSWTENDFDELKEEGFRRSNYSELKEEVR TNGKEVKNLEKKLDEWIPRITNAEKSLKDLMKLKTTARELRDKCTSLSS >gi568815590f:108343659_108586595|GENSCAN_predicted_CDS_8|330_bp atggggaaaaaacagagcagaaacaccagaaactctaaaaatcagagcgcctctcctcct ccaaaggaacacagctcctcaccagcaatggaacaaagctggacggagaatgacttcgac gagttgaaagaagaaggcttcagacgatcaaactactccgagctaaaggaggaagttcga accaatggcaaagaagttaaaaaccttgaaaaaaaattagatgaatggatacctagaata accaatgcagagaagtccttaaaggacctgatgaagctgaaaaccacggcacgagaacta cgtgacaaatgcacaagcctcagtagctga >gi568815590f:108343659_108586595|GENSCAN_predicted_peptide_9|118_aa MNLQRKQRIRDGEAKPSDWGITAIVQLTGIWWKLEEFTSHSNLQFNKCNNCMTGAASSTV SIGSAARVIRKEKEIKGIQIGKEEVKLFLFADSVTVYLENPIISAQKILQLMNNVSKV >gi568815590f:108343659_108586595|GENSCAN_predicted_CDS_9|357_bp atgaacctgcagaggaaacagaggatcagagatggagaagcaaaaccaagtgactgggga atcacagctattgttcaactcactggaatttggtggaagttggaagagtttacatctcat agtaatctacaattcaataaatgtaataattgcatgacaggagctgcttcctcaacagtg agtattggaagtgctgccagggtaatcaggaaagagaaagaaataaagggcatccaaata ggaaaagaggaagtcaaactattcctgtttgcagacagtgtgactgtatatctagaaaac cctataatttcagcccaaaagatccttcagctgatgaacaacgtcagcaaagtttga >gi568815590f:108343659_108586595|GENSCAN_predicted_peptide_10|71_aa MAVMRGESFMFVLHNVKGFSLYIAGKVFQGWDDACPHYGEQIFFTSSTDSNVNLFQKHPF HTSRTNVSPAI >gi568815590f:108343659_108586595|GENSCAN_predicted_CDS_10|216_bp atggctgtaatgaggggtgaatcttttatgtttgttctacataatgtcaaaggtttttct ttgtatatagcaggaaaggtcttccaaggatgggatgatgcctgcccacattatggagag cagatcttctttacttcgtctactgattcaaatgttaatctcttccagaaacaccctttt cacacatccagaactaacgtttcaccagctatctaa