GENSCAN 1.0 Date run: 4-Nov-116 Time: 14:19:52 Sequence gi568815586f:104165309_104365836 : 200528 bp : 42.90% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 6605 6728 124 1 1 65 116 145 0.854 15.38 1.02 Term + 7388 7665 278 0 2 18 39 164 0.657 -0.86 1.03 PlyA + 7725 7730 6 1.05 2.03 PlyA - 8718 8713 6 1.05 2.02 Term - 15545 15378 168 2 0 83 40 111 0.370 2.70 2.01 Init - 19110 18925 186 0 0 45 43 135 0.165 3.81 2.00 Prom - 26495 26456 40 -3.45 3.00 Prom + 27830 27869 40 -3.65 3.01 Init + 39774 39921 148 2 1 87 64 126 0.732 10.40 3.02 Intr + 49279 49421 143 2 2 68 110 14 0.064 0.75 3.03 Intr + 50432 50585 154 1 1 22 96 130 0.009 5.92 3.04 Term + 54660 54724 65 1 2 116 46 31 0.252 -1.23 3.05 PlyA + 57111 57116 6 1.05 4.03 PlyA - 58566 58561 6 1.05 4.02 Term - 70088 69896 193 1 1 119 43 132 0.993 7.71 4.01 Init - 70763 70702 62 2 2 103 77 58 0.895 6.97 4.00 Prom - 90873 90834 40 -3.25 5.00 Prom + 99196 99235 40 -5.75 5.01 Sngl + 100001 100531 531 1 0 87 43 840 0.999 75.21 5.02 PlyA + 100561 100566 6 1.05 6.00 Prom + 101901 101940 40 -8.85 6.01 Init + 107828 108014 187 1 1 98 -9 225 0.455 13.07 6.02 Term + 118968 119101 134 1 2 88 48 97 0.253 2.97 6.03 PlyA + 119328 119333 6 1.05 7.03 PlyA - 120905 120900 6 1.05 7.02 Term - 121896 121712 185 1 2 30 51 209 0.566 8.12 7.01 Init - 123717 123594 124 0 1 95 58 182 0.923 16.28 7.00 Prom - 124375 124336 40 -5.95 8.00 Prom + 126686 126725 40 -7.35 8.01 Init + 128343 128586 244 2 1 68 82 130 0.612 6.36 8.02 Intr + 138496 138601 106 2 1 11 24 305 0.059 14.75 8.03 Term + 138677 139628 952 2 1 6 43 810 0.527 58.85 8.04 PlyA + 139872 139877 6 1.05 9.00 Prom + 143261 143300 40 -9.25 9.01 Init + 144630 144791 162 2 0 93 119 48 0.910 8.19 9.02 Intr + 145982 146128 147 1 0 102 -46 168 0.162 4.31 9.03 Intr + 147937 148009 73 0 1 93 86 62 0.242 4.56 9.04 Intr + 150469 150588 120 2 0 76 80 45 0.379 2.05 9.05 Intr + 153605 153747 143 0 2 39 86 131 0.492 7.15 9.06 Intr + 154162 154277 116 0 2 76 103 75 0.999 6.13 9.07 Intr + 155783 156008 226 2 1 112 106 140 0.993 15.26 9.08 Intr + 160029 160121 93 2 0 70 103 83 0.897 7.24 9.09 Intr + 161039 161115 77 1 2 64 69 33 0.909 -3.61 9.10 Intr + 162207 162363 157 0 1 57 113 196 0.999 18.09 9.11 Intr + 166226 166333 108 1 0 78 107 138 0.999 14.26 9.12 Intr + 173831 173965 135 1 0 85 90 182 0.999 17.94 9.13 Intr + 177518 177812 295 1 1 8 19 245 0.324 5.26 9.14 Term + 183045 183277 233 1 2 122 42 121 0.657 6.55 9.15 PlyA + 185168 185173 6 1.05 10.06 PlyA - 187390 187385 6 1.05 10.05 Term - 190430 190316 115 1 1 73 38 91 0.205 -0.34 10.04 Intr - 190652 190614 39 1 0 121 95 14 0.163 1.82 10.03 Intr - 191398 191157 242 1 2 88 19 125 0.196 1.03 10.02 Intr - 197100 196902 199 2 1 107 89 103 0.924 10.73 10.01 Init - 199433 199357 77 2 2 88 100 33 0.717 5.11 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 58334 58213 122 0 2 109 42 83 0.800 3.46 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:104165309_104365836|GENSCAN_predicted_peptide_1|133_aa MAKLGSFARSRCCEHQGSELSLQHQTSAMGFRIRGPVPFSEGSLDALGVIGSRDGSTQPL PATRQGGRGYHTEQQSQSCNQNSLTDANLGRWLIADGVPTSEIDGKPTKFFLDLHKQKTS RSSEQKPKLNHKK >gi568815586f:104165309_104365836|GENSCAN_predicted_CDS_1|402_bp atggcgaagctaggcagttttgccagaagtcgttgctgtgagcatcagggttcagagttg agtcttcaacaccagacctctgcgatgggcttccggattcgaggccccgtccctttctca gaaggaagcctagatgcattaggggtaattggatcccgggatggcagcactcaacccctc cccgcaacaaggcaaggtgggcgtggttatcatacagaacagcagagtcaaagctgcaat cagaatagcttgacagatgcaaacctagggcgttggctgattgctgatggtgttcctaca agtgaaatagatgggaagcctactaaattcttcctggatctgcataagcagaaaacttct aggtcaagtgaacaaaagcctaagctgaatcataaaaagtag >gi568815586f:104165309_104365836|GENSCAN_predicted_peptide_2|117_aa MLPPTPPSYAHKNSKLHWQRSRAARQRRTRTEASERGEKKKQLDIGDSGWRGVWMGWWRE VRGRDDWFNGIPVPNIGLSKKYALSGIFVEWFILAMVKSGELRQASCNLASLMPFNV >gi568815586f:104165309_104365836|GENSCAN_predicted_CDS_2|354_bp atgttgccccccacacccccatcatatgcccataaaaactccaagctccactggcagagg agcagagcagcacggcagagaaggacaaggacagaagcatctgaacgtggagagaagaag aagcagctggacattggagactctggttggagaggagtttggatgggatggtggagagaa gttcggggcagggatgactggttcaatggtatcccagtgcctaatatagggcttagcaaa aaatatgcattatcgggaatatttgttgaatggttcatcctagcaatggtgaagagtgga gagctgaggcaggcatcttgcaatctagcttcactgatgcccttcaatgtctaa >gi568815586f:104165309_104365836|GENSCAN_predicted_peptide_3|169_aa MDGKPECEGCFPLKPGRPADSPPTAPGRIPLGVHVVLLSLACWCLLVSVEVGEATWPKLK AKIQEKNAKVLFWQRLHGQGVVGRLAEGVGAGDREYRGARGLRQASFGSVSSHRALCDMG CAEGKAVAAAAPTELQTKGKNGDGRRRSDLLNQPQSEGSFIFGVSSDSI >gi568815586f:104165309_104365836|GENSCAN_predicted_CDS_3|510_bp atggatgggaagccagagtgtgaaggttgttttcccctgaagccaggccgcccagcggac tctcctccgactgcccctggccgaattccccttggtgtccatgtcgttctgctgtcactg gcctgctggtgtctgctggtgtctgttgaggtgggggaggccacctggcccaagctgaag gccaagatacaagaaaaaaatgcaaaggttctattttggcagcgtttacatgggcagggt gtcgtggggaggctggcggagggagtgggagcaggtgacagagaatataggggggctcgc ggcctccgccaggcgtccttcggctccgtcagttcccacagggccttgtgcgacatgggc tgcgccgagggcaaggcagtggcggcggccgccccaacggagctgcagacgaaaggcaag aacggcgatggccgccgtaggtcagacctactaaatcagcctcagagtgaaggaagcttc atttttggtgtctcaagtgattctatttga >gi568815586f:104165309_104365836|GENSCAN_predicted_peptide_4|84_aa MAQEPAAGGYKIFTLPKPLLRYGKQPSSHSQSVTHIRSQSVTTHICSLFGTLTFVAPTPA PFQVANQDQLRLCGPTPTNGDRTQ >gi568815586f:104165309_104365836|GENSCAN_predicted_CDS_4|255_bp atggcccaggagcctgcagctggaggctacaagattttcaccctccctaaaccgctctta aggtacggtaaacaaccttccagccattcccaatctgtaacgcacatccgttcccaatct gtaacaacccacatctgttccttatttggcacccttacttttgtagcccccacccctgct ccatttcaagtagcaaatcaggatcagcttagattgtgcggtccaaccccaaccaatggg gatcggacacagtag >gi568815586f:104165309_104365836|GENSCAN_predicted_peptide_5|176_aa MKASGTLREYKVVGRCLPTPKCHTPPLYRMRIFAPNHVVAKSRFWYFVSQLKKMKKSSGE IVYCGQVFEKSPLRVKNFRIWLRYDSRSGTHNMYREYRDLTTAGAVTQCYRDMGARHRAR AHSIQIMKVEEIAASKCRRPAVKQFHDSKIKFPLPHRVLRRQHKPRFTTKRPNTFF >gi568815586f:104165309_104365836|GENSCAN_predicted_CDS_5|531_bp atgaaggcctcgggcacgctacgagagtacaaggtagtgggtcgctgcctgcccaccccc aaatgccacacgccgcccctctaccgcatgcgaatctttgcgcctaatcatgtcgtcgcc aagtcccgcttctggtactttgtatctcagttaaagaagatgaagaagtcttcaggggag attgtctactgtgggcaggtgtttgagaagtcccccctgcgggtgaagaacttcaggatc tggctgcgctatgactcccggagcggcacccacaacatgtaccgggaataccgggacctg accaccgcaggcgctgtcacccagtgctaccgagacatgggtgcccggcaccgcgcccga gcccactccattcagatcatgaaggtggaggagatcgcggccagcaagtgccgccggccg gctgtcaagcagttccacgactccaagatcaagttcccgctgccccaccgggtcctgcgc cgtcagcacaagccacgcttcaccaccaagaggcccaacaccttcttctag >gi568815586f:104165309_104365836|GENSCAN_predicted_peptide_6|106_aa MPNNLKACNCFHYNGLIQRKTVGVEPAAHGKGFIVVKQRSSQWKPATFYMQTTINKNARA GRAMFPALSGVQNMWTARFSKGGPCHVNATEKLGLHIEGPRELLKV >gi568815586f:104165309_104365836|GENSCAN_predicted_CDS_6|321_bp atgcccaataacttgaaggcctgcaactgcttccactacaatgggcttattcaacgcaag actgtgggcgtggagccggcagcccacggcaaaggtttcatagttgtgaagcagagatcc agccagtggaagcctgccaccttctacatgcagaccaccattaacaagaatgctcgggca gggcgggcaatgtttcctgctttaagtggagtacagaatatgtggacagctaggttcagc aagggtgggccttgccatgtcaacgcgactgagaagcttggacttcatattgagggccct agagagctactgaaggtttga >gi568815586f:104165309_104365836|GENSCAN_predicted_peptide_7|102_aa MAAYLSLFHKHNGQIGFRGQFRERSFQGPTVLCVAQKQSGACAMLSSPPAGRPPRAPIPG LAALSGFTRRRAYKNLRMMKTSGPPSGLDHLSLCCKCRSEEN >gi568815586f:104165309_104365836|GENSCAN_predicted_CDS_7|309_bp atggccgcctatctttctctgtttcacaaacacaacgggcagatcggtttccgcggccaa ttccgagagcgttccttccagggcccgaccgtcctctgtgtggcacaaaagcagagcgga gcgtgcgcaatgctgtcgtccccgccggcaggccgtcccccgcgtgctcccatccctggg ctcgcggctttgtctggtttcacccgacgcagagcttacaagaatttgagaatgatgaag acatcaggccctccttcaggactcgaccacctttcgctctgctgcaaatgccggagtgaa gaaaactga >gi568815586f:104165309_104365836|GENSCAN_predicted_peptide_8|433_aa MLSSVGTSPTGSVGLSLCAATRECRNKDTRQRDKRKGSWARGTATTNARRPVVAPNVGLH YYLLDTRQKGQGKECESPPMIEIPVAGMLVEKASRKGDEEKGEEQLVIIPGSEYAANSGE ADVDPKLLELTADEEKCRSIRRQYRQLMYCVRQNREDIVSSANNSLTEALEEANVLFDGV SRTREAALDARFLVMASDLGKEKAKQLNSDMNFFNQLAFCDFLFLFVGLNWMEGDPDKLS DCDDSIALSFWKAIEKEATSWMVKAETFHFVFGSFKLERSAPKPRLEHQKKVRKMEENGN MPTKLQKLDLSSYPEATEKNVERILGLLQTYFRKYPDTPVSYFEFVIDPNSFSRTVENIF YVSFIVRDGFARIRLDEDRLPILEPMNVNQMGEGNDSSCHGRKQGVISLTLQEWKNIVAA FEISEAMITYSSY >gi568815586f:104165309_104365836|GENSCAN_predicted_CDS_8|1302_bp atgttgtccagtgtagggaccagccccacagggtcggtgggtctctccctgtgtgcggcg acgagagagtgtagaaataaagacacaagacaaagagataagagaaaaggcagctgggcc cgggggaccgctaccaccaatgcgcggagaccggtagtggccccgaatgtcgggctgcac tattatttattggatacaaggcagaaggggcagggtaaagaatgtgagtcacctccaatg atagagatacccgtggccggcatgttggttgaaaaagcttcccggaagggagacgaagag aaaggagaggagcagctcgtgatcatccccggtagcgagtacgcggcgaactctggggag gccgacgtagacccaaagctcctggagctcaccgctgacgaggagaagtgccgcagcatc cgcaggcagtaccggcagctcatgtactgcgtgcggcagaaccgggaggacatcgtgagc tcggcgaacaactccttaaccgaggctctggaggaagccaacgtcctctttgatggcgtg agccgaaccagagaagcagccctcgacgcccggtttcttgttatggcttctgatttgggt aaagaaaaggcaaagcagttaaactcagatatgaacttctttaatcagttagcattttgt gactttctgtttctgttcgtgggtctgaattggatggaaggcgatcctgacaagttgagt gattgtgatgatagcatagctctttccttctggaaggcaatagaaaaggaagcaacatcc tggatggtaaaagctgagacattccattttgtttttggttcattcaagctagaacgttct gcaccaaagccccgacttgaacaccagaaaaaagttcgcaagatggaagaaaatggcaac atgcctacaaagttgcagaagttggacctgagtagttatccagaagcgacagaaaaaaac gtagaaaggattttgggattgttgcaaacctactttcgaaagtatcctgatactcctgtg tcctattttgagtttgtgattgatccaaactctttttctcgtactgtggagaatatattt tatgtttcttttattgtaagagatggttttgcaagaataaggcttgatgaagacaggctg ccaatattagagccgatgaatgttaaccaaatgggtgagggaaatgattccagttgccat ggcaggaaacagggagttatatctttgactttacaggagtggaaaaacattgtggcagct tttgaaatttctgaggctatgattacatactcctcatactaa >gi568815586f:104165309_104365836|GENSCAN_predicted_peptide_9|694_aa MLPTGSHSAVLPPSHCSTAPPSTSQEPSSSADPKLCLSPPTSDSRQERNVQFGLAYQEGR LQKLLKMNGPEDLPKSYDYDLIIIGGGSGGLAAAKARLLVLSVEAAQYGKKVMVLDFVTP TPLGTRWGLGGTCVNVGCIPKKLMHQAALLGQALQDSRNYGWKVEETVKHDWDRMIEAVQ NHIGSLNWGYRVALREKKVVYENAYGQFIGPHRIKATNNKGKEKIYSAERFLIATGERPR YLGIPGDKEYCISSDDLFSLPYCPGKTLVVGASYVALECAGFLAGIGLDVTVMVRSILLR GFDQDMANKIGEHMEEHGIKFIRQFVPIKVEQIEAGTPGRLRVVAQSTNSEEIIEGEYNT VMLAIGRDACTRKIGLETVGVKINEKTGKIPVTDEEQTNVPYIYAIGDILEDKVELTPVA IQAGRLLAQRLYAGSTVKCDYENVPTTVFTPLEYGACGLSEEKAVEKFGEENIEERVVGF HVLGPNAGEVTQGFAAALKCGLTKKQLDSTIGIHPVCAELHRGSGTERASEVADPVMWLV RCMRGDDSGRRCGSGNGVTRLYLRAELIVLGSWLGMGRGRAEDHSGLCPAAGSNSKSRLE VTARTKPLMCVRACCVLGIHNIVCDQALWGKHPPGWLLRLSPSVDAVAKTANHWLVSVPK SKAKFSRGFLGSWHLRVLCLPPPKAPLDLLDRSW >gi568815586f:104165309_104365836|GENSCAN_predicted_CDS_9|2085_bp atgctgccaacaggtagccacagtgctgtgcttcctccttcacattgctccaccgcaccc ccttccacatcccaagaaccttcttcttccgctgaccccaagctctgcctttcaccccct acatctgatagtaggcaagagagaaatgtgcagtttgggctggcttatcaggagggcaga cttcaaaagctactaaaaatgaacggccctgaagatcttcccaagtcctatgactatgac cttatcatcattggaggtggctcaggaggtctggcagctgctaaggcaaggctccttgtg ttgtctgttgaggcagcccaatatggcaagaaggtgatggtcctggactttgtcactccc acccctcttggaactagatggggtctcggaggaacatgtgtgaatgtgggttgcatacct aaaaaactgatgcatcaagcagctttgttaggacaagccctgcaagactctcgaaattat ggatggaaagtcgaggagacagttaagcatgattgggacagaatgatagaagctgtacag aatcacattggctctttgaattggggctaccgagtagctctgcgggagaaaaaagtcgtc tatgagaatgcttatgggcaatttattggtcctcacaggattaaggcaacaaataataaa ggcaaagaaaaaatttattcagcagagagatttctcattgccactggtgaaagaccacgt tacttgggcatccctggtgacaaagaatactgcatcagcagtgatgatcttttctccttg ccttactgcccgggtaagaccctggttgttggagcatcctatgtcgctttggagtgcgct ggatttcttgctggtattggtttagacgtcactgttatggttaggtccattcttcttaga ggatttgaccaggacatggccaacaaaattggtgaacacatggaagaacatggcatcaag tttataagacagttcgtaccaattaaagttgaacaaattgaagcagggacaccaggccga ctcagagtagtagctcagtccaccaatagtgaggaaatcattgaaggagaatataatacg gtgatgctggcaataggaagagatgcttgcacaagaaaaattggcttagaaaccgtaggg gtgaagataaatgaaaagactggaaaaatacctgtcacagatgaagaacagaccaatgtg ccttacatctatgccattggcgatatattggaggataaggtggagctcaccccagttgca atccaggcaggaagattgctggctcagaggctctatgcaggttccactgtcaagtgtgac tatgaaaatgttccaaccactgtatttactcctttggaatatggtgcttgtggcctttct gaggagaaagctgtggagaagtttggggaagaaaatattgaggaacgtgttgtgggcttt cacgtactgggtccaaatgctggagaagttacacaaggctttgcagctgcgctcaaatgt ggactgaccaaaaagcagctggacagcacaattggaatccaccctgtctgtgcagagctg caccgtggaagcgggactgagcgagcaagtgaagtggcagatccagttatgtggctggtg aggtgcatgagaggagatgatagtggccggaggtgtggcagtggaaatggagtgacacgg ttatatttgagggcagagttgatagtacttggcagctggctgggaatgggaagagggaga gctgaagaccactcaggtttgtgtcctgcagcaggaagtaatagcaagagcagactcgag gtgacagcgaggacgaaaccactgatgtgtgtgagggcctgctgtgtgctgggtattcac aacattgtctgtgaccaagcgctctggggcaagcatcctccaggctggctgctgaggtta agccccagtgtggatgctgttgccaagactgcaaaccactggctcgtttccgtgcccaaa tccaaggcgaagttttctagagggttcttgggctcttggcacctgcgtgtcctgtgctta ccaccgcccaaggcccccttggatctcttggataggagttggtga >gi568815586f:104165309_104365836|GENSCAN_predicted_peptide_10|223_aa MPPDITKCPLPGKIALVEDHCSGTVRKDRKTQLEAETLTKKAQKRLGPLCLYCYIQGNGG SGQVITDVYLHPERTYHLDAVPPHAAWGAWVQNGAQRWPAFERFSFCLHAPPSPEPGLHP CFRLPGVCLERPFLAPVFLDLNLTIPHHGQGPQDPQGILRQLCSSVGPSYSCRLPSAQPQ SAPENLPLHLLTSVPGMRFSPDICLIEEAKCHLLTEVFPDHLN >gi568815586f:104165309_104365836|GENSCAN_predicted_CDS_10|672_bp atgcctccagacatcactaaatgtcccctgccaggcaaaattgccctggttgaggatcac tgctctggtacagtaaggaaggacagaaagacccagttggaggcagagaccctgaccaaa aaggctcagaagcgtttggggcctctctgtctctattgctacatccaagggaatggcggt tcaggtcaggtcataactgatgtgtatttgcatccagaaagaacgtatcacctggatgcc gtccctccccatgctgcatggggggcgtgggtgcagaacggtgcccagcgttggcctgcg tttgagcgcttctctttctgtctccatgcccctccaagccccgaacctggactacatcca tgctttcgcttgccaggggtctgcctggagcgtcctttcctcgcccctgtgtttctagat ctaaacctcaccatccctcatcatggacagggtccccaggatccccagggaatcctcagg cagctctgttccagtgtgggcccctcctatagctgcaggctgccctcagcccagccccag agtgctcctgagaacctgcctttgcacttgctgacctctgtgcctggaatgcgcttttcc ccagatatctgcttgatagaagaagccaaatgtcaccttctcactgaggtcttccctgac caccttaattaa