GENSCAN 1.0 Date run: 5-Nov-116 Time: 14:29:23 Sequence gi568815592f:135085900_135317977 : 232078 bp : 40.11% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 217 212 6 1.05 1.03 Term - 429 271 159 0 0 37 43 123 0.104 -0.34 1.02 Intr - 5326 5169 158 2 2 75 84 99 0.359 7.01 1.01 Init - 6735 6732 4 0 1 87 106 0 0.698 1.91 1.00 Prom - 19183 19144 40 -4.05 2.00 Prom + 25850 25889 40 -6.25 2.01 Sngl + 26638 27174 537 0 0 39 42 246 0.899 10.93 2.02 PlyA + 27978 27983 6 1.05 3.05 PlyA - 28427 28422 6 1.05 3.04 Term - 43336 43242 95 2 2 89 43 95 0.461 2.11 3.03 Intr - 43507 43390 118 1 1 127 42 47 0.463 3.12 3.02 Intr - 50364 50072 293 1 2 93 28 92 0.144 -0.37 3.01 Init - 61666 61555 112 1 1 70 82 67 0.287 4.73 3.00 Prom - 65583 65544 40 -5.65 4.03 PlyA - 65700 65695 6 1.05 4.02 Term - 69365 69020 346 1 1 35 45 287 0.422 12.08 4.01 Init - 81703 81672 32 1 2 85 97 15 0.045 1.36 4.00 Prom - 87478 87439 40 -6.95 5.00 Prom + 93208 93247 40 -4.75 5.01 Init + 95615 95637 23 1 2 107 97 30 0.880 4.87 5.02 Intr + 97483 97635 153 1 0 102 -3 95 0.242 0.07 5.03 Intr + 100004 100121 118 2 1 98 78 150 0.970 14.55 5.04 Intr + 101935 102006 72 0 0 34 131 93 0.963 6.88 5.05 Intr + 103892 103984 93 1 0 25 89 111 0.916 4.14 5.06 Intr + 106038 106120 83 2 2 73 57 29 0.783 -4.18 5.07 Intr + 106425 106659 235 0 1 55 75 260 0.624 18.17 5.08 Intr + 107939 108019 81 1 0 106 110 53 0.993 8.22 5.09 Intr + 108457 108561 105 0 0 73 63 129 0.887 8.39 5.10 Intr + 109849 110103 255 0 0 104 84 91 0.982 7.02 5.11 Intr + 111062 111424 363 1 0 67 75 202 0.984 11.26 5.12 Intr + 113009 113151 143 1 2 57 75 58 0.973 -0.37 5.13 Intr + 114186 114300 115 0 1 61 94 84 0.791 5.83 5.14 Intr + 114391 114516 126 0 0 53 82 176 0.856 13.46 5.15 Intr + 115740 115850 111 2 0 60 131 52 0.973 6.36 5.16 Intr + 117318 117425 108 2 0 112 91 61 0.964 8.36 5.17 Term + 131965 132081 117 0 0 79 53 179 0.652 11.06 5.18 PlyA + 132276 132281 6 1.05 6.00 Prom + 135664 135703 40 -4.45 6.01 Init + 158498 158712 215 1 2 102 -10 156 0.836 5.54 6.02 Intr + 160019 160167 149 2 2 77 97 113 0.806 10.06 6.03 Intr + 162599 162755 157 0 1 45 45 107 0.568 0.25 6.04 Intr + 166305 166375 71 0 2 104 115 85 0.569 10.71 6.05 Term + 167309 167412 104 0 2 100 47 46 0.391 -0.84 6.06 PlyA + 171561 171566 6 1.05 7.03 PlyA - 172835 172830 6 1.05 7.02 Term - 178043 177639 405 2 0 -17 42 219 0.363 1.00 7.01 Init - 180251 180159 93 2 0 72 75 100 0.491 7.53 7.00 Prom - 203190 203151 40 -4.65 8.00 Prom + 203900 203939 40 -5.85 8.01 Init + 205546 205780 235 0 1 72 97 88 0.084 6.45 8.02 Intr + 219135 219155 21 1 0 99 98 16 0.020 0.30 8.03 Intr + 226018 226136 119 2 2 120 66 11 0.324 1.36 8.04 Term + 226776 226925 150 2 0 75 45 102 0.836 1.53 8.05 PlyA + 227034 227039 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:135085900_135317977|GENSCAN_predicted_peptide_1|106_aa MASRGCPHSMAPFLHLQSQQQQVEDSHRIYGTLLLSTHLPLTLTWERFSSLMTHSICGST SVSLSAAMGNFGKRCAGDVKWPFQTPLPRGFPAGLKERYSTFPGID >gi568815592f:135085900_135317977|GENSCAN_predicted_CDS_1|321_bp atggcttctagaggctgcccacattccatggcccctttcctccatcttcaaagccagcaa cagcaggtagaagactcacatcgcatctatggaacccttctgctgtctacacatctccct ctgactctgacctgggaaaggttctcaagtttaatgactcattcaatctgtggcagcaca tcagtctctctgtcagcagccatgggtaactttggcaagaggtgtgctggggatgtcaag tggccattccagacacccttgccaaggggctttcctgctggcctcaaggaaagatacagt acatttcctggaattgattag >gi568815592f:135085900_135317977|GENSCAN_predicted_peptide_2|178_aa MEQYREPRNKAKYLQPTDLRQSKHNIKWGKDTLFNKWCWDNWQATCRMKLDLHFSPFTKI NSRWIKDLNLRPETINTLEDNIGKTLLVFGLGKAFMTQNPKANAIKTKINRWDVVKLKGF CTAKEIISRVNRQPTEWEKIFTIYTSDKRLISRIYKELKEISKKKNEQSHQEVGQGHE >gi568815592f:135085900_135317977|GENSCAN_predicted_CDS_2|537_bp atggaacagtatagagaacccagaaataaagccaaatacttacagcctactgatcttcga caaagcaaacacaatataaagtggggaaaggacacgctattcaacaaatggtgctgggat aattggcaagccacatgtagaatgaaactggatcttcatttctcaccttttaccaaaatc aactcaagatggatcaaagacttaaatctaagaccagaaaccataaatactttagaagat aacattggaaaaacccttctagtctttggcttaggcaaagcgttcatgacccagaaccca aaagcaaatgcaataaaaacaaagataaatagatgggacgtagttaaactaaaaggcttc tgcacagcaaaagaaataatcagcagagttaacagacaacccacagagtgggagaaaatc ttcacaatctatacatctgacaaaagactaatatccagaatctacaaagaactcaaagaa atcagcaagaaaaaaaatgaacaatcccatcaagaagtgggccaaggacatgaatag >gi568815592f:135085900_135317977|GENSCAN_predicted_peptide_3|205_aa MDGPTDLQLAILANNTLRKWLVLDSTIRECVLGNRRAAACVAHDHHSTRHSCWALGMATT AAGVLGSSWSLRALCSLSFQSTTTKLCMELELLVEAGRDFPLYRLHLSPWLFFLCPDVTK LSCNGFMTHIALGGWVKVDLWSWNSTTCKSCISWPSGQLLGKPESPQGQSSQCAATAATA AVASQPQEQSELYCWSLDSSRYSQT >gi568815592f:135085900_135317977|GENSCAN_predicted_CDS_3|618_bp atggatgggcccacagacctacagttggcaatcttggccaataacacactccgaaaatgg ttagttctggatagcacgatcagggagtgtgttctggggaacaggagagctgctgcctgt gttgcccatgatcatcattccaccaggcacagttgttgggctctaggcatggccaccact gccgctggtgtgctgggttcctcatggagcctaagagccctgtgttctctgagttttcaa agtactacaaccaagctgtgtatggagctggaattgctggtggaggctggcagggacttt cctctctacaggttacatctttctccgtggcttttcttcctgtgtcctgatgtcacaaag ttatcgtgtaatggcttcatgacccacatagcacttgggggctgggtcaaagtagaccta tggagttggaactccaccacctgtaagagctgcattagctggccctctgggcagctgcta ggaaaaccagaatctccacaaggccagtcaagccagtgtgcagctacagcggctacggca gctgtggcttcacagccacaggaacaatctgagctgtattgttggagcctggattcttct agatattctcaaacctag >gi568815592f:135085900_135317977|GENSCAN_predicted_peptide_4|125_aa MLKSASKKSKREATNLEKRLEESLTRITSLEKNINDLMELKNTAREFREAYTSINSQIDQ VEERISEMEDQLNEIKCEDKIREKRMKKNEQSLQEIWDYVKRPNLRLIGVPESDGENGTK MENTL >gi568815592f:135085900_135317977|GENSCAN_predicted_CDS_4|378_bp atgctgaaatcagcttccaaaaaatccaaaagggaagctacgaaccttgaaaaaaggtta gaggaatcgctaactagaataaccagtttagagaagaatataaatgacctgatggagctg aaaaacacagcacgagaatttcgtgaagcatacacaagtatcaatagccaaattgatcaa gtggaagaaaggatatcagagatggaagatcaacttaatgaaataaagtgtgaagacaag attagagaaaaaagaatgaaaaagaatgaacaaagcctccaagaaatatgggactatgtg aaaagaccaaacctacgtttgattggtgtacctgaaagtgatggggagaatggaaccaag atggaaaacactctttag >gi568815592f:135085900_135317977|GENSCAN_predicted_peptide_5|766_aa MARRPRHSTSFLFRFFARFVSLTECNIHSCTLVTTPGLAGGESSASTLKSAFENPLTVGI YSSDEDDEDFEMCDHDYDGLLPKSGKRHLGKTRWTREEDEKLKKLVEQNGTDDWKVIANY LPNRTDVQCQHRWQKVLNPELIKGPWTKEEDQRALLCHIQRSNFPKAAGTLHCVGVLYRD RTDNAIKNHWNSTMRRKVEQEGYLQESSKASQPAVATSFQKNSHLMGFAQAPPTAQLPAT GQPTVNNDYSYYHISEAQNVSSHVPYPVALHVNIVNVPQPAAAAIQRHYNDEDPEKEKRI KELELLLMSTENELKGQQVLPTQNHTCSYPGWHSTTIADHTRPHGDSAPVSCLGEHHSTP SLPADPGSLPEESASPARCMIVHQGTILDNVKNLLEFAETLQFIDSDSSSWCDLSSFEFF EEADFSPSQHHTGKALQLQQREGNGTKPAGEPSPRVNKRMLSESSLDPPKVLPPARHSTI PLVILRKKRGQASPLATGDCSSFIFADVSSSTPKRSPVKSLPFSPSQFLNTSSNHENSDL EMPSLTSTPLIGHKLTVTTPFHRDQTVKTQKENTVFRTPAIKRSILESSPRTPTPFKHAL AAQEIKYGPLKMLPQTPSHLVEDLQDVIKQESDESGIVAEFQENGPPLLKKIKQEVESPT DKSGNFFCSHHWEGDSLNTQLFTQTSPVADAPNILTSSVLMAPASEDEDNVLKAFTVPKN RSLASPLQPCSSTWEPASCGKMEEQMTSSSQARKYVNAFSARTLVM >gi568815592f:135085900_135317977|GENSCAN_predicted_CDS_5|2301_bp atggcccgaagaccccggcacagcacctcttttctttttcgtttctttgcacggtttgtt tccttgacagaatgtaacattcatagttgtactcttgtaaccacaccgggtctagcaggt ggggagtcgagtgcatcaaccctgaaatcagcctttgaaaatcctctgactgttggcata tatagcagtgacgaggatgatgaggactttgagatgtgtgaccatgactatgatgggctg cttcccaagtctggaaagcgtcacttggggaaaacaaggtggacccgggaagaggatgaa aaactgaagaagctggtggaacagaatggaacagatgactggaaagttattgccaattat ctcccgaatcgaacagatgtgcagtgccagcaccgatggcagaaagtactaaaccctgag ctcatcaagggtccttggaccaaagaagaagatcagagagccctgctgtgccacattcaa aggagtaatttccccaaggcagctggaactttacattgtgttggagtcttgtaccgcgac cgaactgataatgctatcaagaaccactggaattctacaatgcgtcggaaggtcgaacag gaaggttatctgcaggagtcttcaaaagccagccagccagcagtggccacaagcttccag aagaacagtcatttgatgggttttgctcaggctccgcctacagctcaactccctgccact ggccagcccactgttaacaacgactattcctattaccacatttctgaagcacaaaatgtc tccagtcatgttccataccctgtagcgttacatgtaaatatagtcaatgtccctcagcca gctgccgcagccattcagagacactataatgatgaagaccctgagaaggaaaagcgaata aaggaattagaattgctcctaatgtcaaccgagaatgagctaaaaggacagcaggtgcta ccaacacagaaccacacatgcagctaccccgggtggcacagcaccaccattgccgaccac accagacctcatggagacagtgcacctgtttcctgtttgggagaacaccactccactcca tctctgccagcggatcctggctccctacctgaagaaagcgcctcgccagcaaggtgcatg atcgtccaccagggcaccattctggataatgttaagaacctcttagaatttgcagaaaca ctccaatttatagattctgattcttcatcatggtgtgatctcagcagttttgaattcttt gaagaagcagatttttcacctagccaacatcacacaggcaaagccctacagcttcagcaa agagagggcaatgggactaaacctgcaggagaacctagcccaagggtgaacaaacgtatg ttgagtgagagttcacttgacccacccaaggtcttacctcctgcaaggcacagcacaatt ccactggtcatccttcgaaaaaaacggggccaggccagccccttagccactggagactgt agctccttcatatttgctgacgtcagcagttcaactcccaagcgttcccctgtcaaaagc ctacccttctctccctcgcagttcttaaacacttccagtaaccatgaaaactcagacttg gaaatgccttctttaacttccacccccctcattggtcacaaattgactgttacaacacca tttcatagagaccagactgtgaaaactcaaaaggaaaatactgtttttagaaccccagct atcaaaaggtcaatcttagaaagctctccaagaactcctacaccattcaaacatgcactt gcagctcaagaaattaaatacggtcccctgaagatgctacctcagacaccctctcatcta gtagaagatctgcaggatgtgatcaaacaggaatctgatgaatctggaattgttgctgag tttcaagaaaatggaccacccttactgaagaaaatcaaacaagaggtggaatctccaact gataaatcaggaaacttcttctgctcacaccactgggaaggggacagtctgaatacccaa ctgttcacgcagacctcgcctgtggcagatgcaccgaatattcttacaagctccgtttta atggcaccagcatcagaagatgaagacaatgttctcaaagcatttacagtacctaaaaac aggtccctggcgagccccttgcagccttgtagcagtacctgggaacctgcatcctgtgga aagatggaggagcagatgacatcttccagtcaagctcgtaaatacgtgaatgcattctca gcccggacgctggtcatgtga >gi568815592f:135085900_135317977|GENSCAN_predicted_peptide_6|231_aa MVEFSYCSVVISKLCVAVSLAAKKNPSNNPRNHCPGLESMGSSVSELTVSGGRDSRFSAV PALASSTVKRSKASALNLDVMKRYYFSPLQFLVPSVGQWRHHTPSEACRKDPETTDKKPT EDKRSLVSQEQPQDVPMMSWQPFLVASSVRADEYLLLLGNLTADLGLPGSKLLISGSGMT SDCHSAAMENVHSEEESVSRSFPFSLTQKSSPTSSGNHPDDAQVEGIFPTF >gi568815592f:135085900_135317977|GENSCAN_predicted_CDS_6|696_bp atggtggaattttcctattgttcagttgtcatttctaagctctgtgttgctgtcagcctg gcagccaagaaaaacccctctaacaaccctagaaaccactgccctggcttggaatcaatg ggcagcagtgtttctgagctgacagtttcaggtggaagagacagcagattttcagctgta cctgctttggcctcctcaactgtgaaacgctccaaggcctcagccttgaatcttgatgtg atgaaaagatattacttttctcctctacagtttctggtgcccagcgtggggcaatggaga catcacactccctctgaagcctgcagaaaagatcctgagacaaccgacaaaaagccaaca gaagacaaaagaagtttggtttcccaggagcaaccacaggatgtgcccatgatgtcttgg cagcctttcctggtggcttcttcagtgagggctgatgagtatctgcttttactggggaac ttgacagctgacttgggccttccaggatccaaactccttatatcaggtagtggaatgaca agtgactgtcattctgctgctatggagaatgtgcactcagaggaagagagcgtttcaagg tctttccctttctctctaacccaaaaatcatctccgacttcctctgggaaccacccagat gatgcccaagttgaaggcatcttccccacgttctga >gi568815592f:135085900_135317977|GENSCAN_predicted_peptide_7|165_aa MGPNPTDWCPFKKRKFYGQEHSKARALRKDHGTASVEIKKTKSESFGQTVLPTWKALGLV LWKEENTSPKYEEIIQNAKAPWSRSSRSPPPSAKGFLNSSTSIYRGLAMCQMLCRCWKHN NEEASPSPQRASVSRQILMLLFVDSAWAWALPPSSPSIASQRLIQ >gi568815592f:135085900_135317977|GENSCAN_predicted_CDS_7|498_bp atgggccctaatccaactgactggtgtccttttaagaagaggaaattttacggacaagaa cactcaaaggctcgtgcactgaggaaagaccatggaacagcaagtgttgaaataaaaaag acaaagtcagagtcatttggccagacagtccttcccacctggaaggctctgggacttgtc ttatggaaggaggaaaatacttctcccaagtatgaagaaatcatccaaaatgcaaaggct ccctggtcccgcagtagccggtcccctccaccttcagccaagggattccttaattcatca accagtatttacagaggacttgccatgtgccaaatgctgtgtcgctgctggaaacacaac aacgaagaggcatcaccttcaccccaaagagcttctgtctcccggcagatcctgatgctg ctctttgtagactcagcctgggcctgggccttgccacccagctccccctccattgcctca cagagactcattcagtga >gi568815592f:135085900_135317977|GENSCAN_predicted_peptide_8|174_aa MKPERALFAQRRGHVRTQQEGSHLQARKRGLTGNQSCWHLDLGTSSPQNCVKISFCSLSN EVCGILCGTLNRLMKKCVRKYLQEAVKSQQQMLPSPTNISMNQMKCWKVPGTISNRAGVF HLEEHVVEENGHWPENMEAMLLISSIILNELLIASRISISCRMKELIRITKDVL >gi568815592f:135085900_135317977|GENSCAN_predicted_CDS_8|525_bp atgaaaccagagcgtgctctttttgcacagagaagaggccatgtgaggacacaacaagaa ggcagccacctccaagccaggaagagaggcctcacgggaaaccaatcctgctggcacctt gatcttgggacttctagcccccagaactgtgtgaaaataagtttctgtagtttaagcaat gaagtctgtggtattctctgtggcaccctgaatagactaatgaagaaatgtgtaagaaag tatcttcaggaggcagtgaagagccagcaacaaatgctgccatcacccactaacatttca atgaaccagatgaaatgctggaaggtacctggcacaattagcaacagagcaggtgtgttc cacttggaagaacatgtggtggaggaaaatgggcactggcctgagaatatggaggccatg ctcctaattagcagcatcattttaaacgaacttcttatagcttcccggatctcaatttcc tgtagaatgaaggagttgatcagaatcaccaaggatgttctctaa