GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:31:26 Sequence gi568815592f:3749812_3950786 : 200975 bp : 45.21% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init - 1720 1465 256 1 1 97 86 556 0.974 53.69 1.00 Prom - 3995 3956 40 -5.16 2.04 PlyA - 5346 5341 6 1.05 2.03 Term - 8725 8268 458 2 2 69 49 140 0.662 3.49 2.02 Intr - 9397 9270 128 0 2 16 113 63 0.243 1.92 2.01 Init - 11813 11728 86 2 2 83 48 77 0.174 2.18 2.00 Prom - 12954 12915 40 -4.46 3.00 Prom + 23780 23819 40 -5.16 3.01 Init + 25862 25914 53 1 2 75 60 52 0.033 0.46 3.02 Intr + 30991 31108 118 1 1 8 62 101 0.040 -0.03 3.03 Intr + 33838 33918 81 0 0 66 58 57 0.109 0.33 3.04 Intr + 34690 34765 76 0 1 78 96 56 0.198 4.49 3.05 Intr + 38693 38869 177 0 0 122 50 23 0.488 1.79 3.06 Intr + 44208 44331 124 1 1 77 82 33 0.069 1.24 3.07 Intr + 50005 50293 289 1 1 74 94 117 0.200 8.15 3.08 Intr + 59125 59249 125 0 2 34 51 105 0.035 0.78 3.09 Intr + 70895 71078 184 2 1 84 94 56 0.049 5.59 3.10 Intr + 71277 71324 48 2 0 68 66 69 0.086 1.58 3.11 Intr + 73560 73644 85 2 1 25 48 55 0.004 -5.41 3.12 Intr + 79631 79734 104 0 2 101 81 56 0.624 6.09 3.13 Term + 88106 88246 141 1 0 67 43 127 0.745 4.03 3.14 PlyA + 88558 88563 6 1.05 4.00 Prom + 97779 97818 40 -5.36 4.01 Sngl + 100001 100978 978 1 0 102 41 2394 0.964 232.89 4.02 PlyA + 101083 101088 6 1.05 5.03 PlyA - 101528 101523 6 1.05 5.02 Term - 103611 103522 90 0 0 123 43 157 0.920 12.42 5.01 Init - 113728 113684 45 1 0 78 50 42 0.092 0.08 5.00 Prom - 113852 113813 40 -3.06 6.00 Prom + 116014 116053 40 -2.96 6.01 Init + 122598 122797 200 2 2 89 64 107 0.061 6.77 6.02 Intr + 133187 133327 141 2 0 62 57 64 0.046 0.17 6.03 Intr + 137628 137697 70 0 1 84 103 31 0.111 3.38 6.04 Term + 143323 143424 102 0 0 67 42 139 0.693 5.48 6.05 PlyA + 144406 144411 6 1.05 7.03 PlyA - 144541 144536 6 1.05 7.02 Term - 150414 150173 242 1 2 48 48 169 0.918 5.09 7.01 Init - 152621 152489 133 2 1 98 95 38 0.498 5.80 7.00 Prom - 153883 153844 40 -4.66 8.00 Prom + 155114 155153 40 -7.16 8.01 Init + 156224 156331 108 1 0 71 84 160 0.914 12.12 8.02 Term + 161490 161780 291 2 0 108 39 121 0.661 4.54 8.03 PlyA + 163828 163833 6 1.05 9.05 PlyA - 164048 164043 6 1.05 9.04 Term - 164644 164383 262 0 1 63 42 152 0.239 3.10 9.03 Intr - 175924 175836 89 1 2 72 76 19 0.014 -2.13 9.02 Intr - 187565 187355 211 1 1 26 63 123 0.081 2.52 9.01 Init - 191389 191196 194 1 2 65 110 70 0.497 5.34 9.00 Prom - 195140 195101 40 -2.86 10.04 PlyA - 195709 195704 6 1.05 10.03 Term - 197222 197200 23 0 2 148 47 -8 0.422 -0.43 10.02 Intr - 198924 198782 143 2 2 63 44 91 0.546 2.20 10.01 Init - 199844 199615 230 2 2 65 87 130 0.726 8.44 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 1862 1972 111 1 0 26 50 287 0.976 16.91 S.002 Term + 2866 3084 219 0 0 121 33 89 0.918 3.74 S.003 Init + 119661 119712 52 2 1 61 115 26 0.842 4.00 S.004 Term + 119901 120046 146 1 2 57 55 118 0.938 3.47 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:3749812_3950786|GENSCAN_predicted_peptide_1|86_aa MASAVFEGTSLVNMFVRGCWVNGIRRLIVSRRGDEEEFFEIRTEWSDRSVLYLHRSLADL GRLWQRLRDAFPEDRSELAQGPLRQX >gi568815592f:3749812_3950786|GENSCAN_predicted_CDS_1|258_bp atggcctcggcggtgtttgagggcacgtcgctcgtgaacatgttcgtgcgcggctgctgg gtgaacggcatccgcaggctcatcgtcagccggcgcggcgacgaagaggagttcttcgag atccgcacggagtggtcggaccgcagcgtgctctacctgcaccgcagcctggcggacctg ggccgcctgtggcagcgcctgcgcgacgcctttcccgaggaccggtccgaactggcgcag gggccgctgcggcaagnn >gi568815592f:3749812_3950786|GENSCAN_predicted_peptide_2|223_aa MAGCRSRTLLRREAAKAQREVEHSSCWPRSETTNLGSECVTALEGGASGVVRSSRWVHGV TGFRSEAADLHGGAACQSRACVPTLVSPWVVDGTGRRRAGRGARRGGWGSAGAHGVGEAQ AWRAAGPEPCLAGRQLKPGEKLSTAAAGPGAKPLSARPAGPAGRSECGVRRAYARPELAL ARKRRAQSRFLPAPLPLQPPASRGSRLWSRLAQRRGPHGAAAG >gi568815592f:3749812_3950786|GENSCAN_predicted_CDS_2|672_bp atggcgggctgcaggtcccgaaccctgctccgcagggaggcagctaaggcccagcgagaa gtcgagcacagcagctgctggcccagaagtgaaactacaaaccttggcagtgagtgtgtt acagctctagaaggcggtgcatctggagttgttcgttcctcccgatgggttcatggtgtc accggcttcaggagtgaagctgcagaccttcacggtggagctgcctgccagtcccgcgcc tgtgtgcccacacttgtcagcccttgggtggtcgatgggactgggcgccgtagagcaggg cgcggcgctcgtcggggaggctggggcagtgcaggagcccacggcgtgggggaggctcag gcatggcgggctgcagggcccgagccctgcctcgcggggagacagctaaagcccggcgag aagttgagcacagcagctgctggcccaggtgctaagcccctcagtgcccggccggcggga cctgctggccgctccgagtgcggggtccgccgagcctacgcccgcccggaactcgcgctg gcccgcaagcgccgcgcacagtcccggttcctgcccgcgcctctccctctacaacccccc gcaagccgagggagccggctctggtctcggctggcccagagaaggggcccccacggtgca gcggcgggctga >gi568815592f:3749812_3950786|GENSCAN_predicted_peptide_3|534_aa MELESLVEASSIRLELCRDHRNEQRLGRKNDDAGVAYGDVEVSGDTGVKADTVCWMQERL VLLIQVPPLEGKGTSVSQCGKPWKKKFADLELVSIGASVTSSAAGMGSPAPEWALMHRLG LLRPHAQALCVTHNSCLSSPLRPTGGVTCSVVLPRKTDPKESLHAGPEMLALTSIYSQAK DLCQSFWLEKCAKETTWTSGKEQCRMVVFGKILMNKNVAKKAQEREWLFPAGGTGEGFCR QRREGRAQQQVLCMQRPGRVWRIQGRRPWLMQLDLFGGEKKATAGEKEEKETQFKSQNIC DELRMQLLTTVLVFAKFSAPELVPVKLVRSADEFSVWAPGGSSCAKLGNSDLIGLGVTRG GISKCSLADSHVQPGLRAPAVNDESWLFGKDPSHAASSHHICKDCARPLSLQATFVGLSA KYLAQGLSCIVERVKVVTPSRPESNWVPLTPAEYIWVRVLLLQPTQCMLEPPQESDSEKD ECEGFCGLVPPLNIATLGTKFPTRELLEDFKTIALSADKNAEELELSYISENDN >gi568815592f:3749812_3950786|GENSCAN_predicted_CDS_3|1605_bp atggaactcgaatctttggtggaagcttcgagcatccggcttgagctttgcagagaccac aggaatgagcagagactggggagaaaaaatgatgatgccggtgtggcatatggggatgtt gaagtctctggggacactggggtgaaggccgacacagtttgctggatgcaggagaggctg gtgctgctgatccaggtgccacccttggagggcaagggcacaagtgtatcacagtgtggc aaaccatggaagaaaaagtttgctgatttagaacttgttagtataggtgcatctgtcaca agctcagcggccggcatgggcagcccagctcctgagtgggctctcatgcataggctcggc ctgcttagaccacatgcccaagctctgtgcgtgactcacaacagctgcctgtcttcaccc ctcaggcccacgggtggtgtcacctgcagtgtggtgctccccaggaagacagatcctaag gaatcccttcatgcaggacctgaaatgctggcactcaccagcatttattcacaggccaag gatttgtgccaatcattttggctggaaaagtgtgccaaagagacaacctggacatcagga aaagaacaatgcaggatggtggtgtttggaaaaatcctgatgaataagaatgtggccaag aaagcccaggagagggagtggttatttccagctgggggcactggagagggtttctgcaga cagaggcgggaagggcgcgcccagcagcaggtactgtgcatgcaaaggccaggcagggta tggcggattcagggaagaaggccatggctgatgcaattggatctatttgggggagagaag aaggccacggctggagagaaagaggaaaaagaaacacaattcaaaagccagaatatctgt gatgaacttcgaatgcagttactcactacagtgcttgtttttgccaagttctcagctccc gagctggtgcctgtgaagctggtgcggtctgcagatgagttttctgtttgggctcctgga ggctccagctgtgccaagctagggaattctgacttaattggtctgggtgtgaccaggggt gggatttctaaatgttcccttgctgattcccacgtgcagccggggctgagggcccctgct gtgaacgatgagtcctggctctttggaaaggaccccagccatgcagccagctctcaccac atctgcaaagactgtgccagacccctaagtcttcaggctacctttgtggggttgtccgcc aagtacctggctcagggactgagttgcattgtggaaagagtaaaggttgtgacgccaagt aggcctgagtccaactgggtgccccttacaccagctgaatacatctgggttcgtgtctta ttgctccaacccacacagtgcatgcttgaaccaccccaggaatcagacagtgagaaagat gagtgtgagggcttctgtgggctggttccacctctcaacattgccacactggggaccaag tttccaacacgtgaacttttagaggactttaaaaccatagcactaagtgctgacaagaat gcagaggaactggaactctcatatatttctgagaatgacaactga >gi568815592f:3749812_3950786|GENSCAN_predicted_peptide_4|325_aa MAQYKGTMREAGRAMHLLKKRERQREQMEVLKQRIAEETILKSQVDKRFSAHYDAVEAEL KSSTVGLVTLNDMKARQEALVRERERQLAKRQHLEEQRLQQERQREQEQRRERKRKISCL SFALDDLDDQADAAEARRAGNLGKNPDVDTSFLPDRDREEEENRLREELRQEWEAQREKV KDEEMEVTFSYWDGSGHRRTVRVRKGNTVQQFLKKALQGLRKDFLELRSAGVEQLMFIKE DLILPHYHTFYDFIIARARGKSGPLFSFDVHDDVRLLSDATMEKDESHAGKVVLRSWYEK NKHIFPASRWEAYDPEKKWDKYTIR >gi568815592f:3749812_3950786|GENSCAN_predicted_CDS_4|978_bp atggcgcagtacaagggcaccatgcgcgaggcaggccgtgccatgcacctcctcaagaag cgcgaaaggcagcgggagcagatggaggtgctgaagcagcgcatcgccgaggagaccatc ctcaagtcgcaggtggacaagaggttctcggcgcattacgacgccgtggaggccgagctg aagtccagcacggtgggcctggtgaccctgaacgacatgaaggcccggcaggaggccctg gtcagggagcgcgagcggcagctggccaagcgccagcacctggaggagcagcggctgcag caggagcggcagcgggagcaggagcagcggcgcgagcgcaagcgtaagatctcctgcctg tcctttgcactagacgacctcgatgaccaggccgacgcggccgaggccaggcgcgccgga aacctgggcaagaaccccgacgtggacaccagcttcctgccagaccgcgaccgcgaggag gaggagaaccggctccgagaggagctgcgccaagagtgggaggcgcagcgcgagaaagtg aaggacgaggagatggaggtcaccttcagctactgggacggctcgggccaccggcgcacg gtgcgggtgcgcaagggcaacacggtgcagcagttcctgaagaaggcgctgcaggggctg cgcaaggacttcctggagctgcgctccgccggcgtggagcagctcatgttcatcaaggag gacctcatcctgccgcactaccacaccttctacgacttcatcatcgccagggcgaggggc aagagcgggccgctcttcagcttcgatgtgcacgatgacgtgcgcctgctcagcgacgcc accatggagaaggacgagtcgcacgcgggcaaggtggtgctgcgcagctggtacgagaag aacaagcacatcttccccgccagccgctgggaggcctatgaccccgagaagaagtgggac aagtacaccatccgctaa >gi568815592f:3749812_3950786|GENSCAN_predicted_peptide_5|44_aa MEYYAAIKNDEFMSFDVDVMVKDAAAILDYEVTMDMEDTYDETR >gi568815592f:3749812_3950786|GENSCAN_predicted_CDS_5|135_bp atggaatactatgcagccataaaaaatgatgagttcatgtcctttgatgtagatgtgatg gttaaagatgcagctgccatcttggattatgaagtaaccatggatatggaagacacatat gatgaaacaaggtag >gi568815592f:3749812_3950786|GENSCAN_predicted_peptide_6|170_aa MPCGTSSTVKKSTWRGAEVSRQRLAPAGRHREGVACRVDPFASPSLQVTASFMRHPKTLP HSLPAKTFPQLCEQAIIISRQDAGSDDCLIPNVASQFRKRKTQNFPPCAFCASWAHQTLV PNAATLTVAQVPREIKHLWHEGYSPDAGVNIRVALTLLHENASWICRKTD >gi568815592f:3749812_3950786|GENSCAN_predicted_CDS_6|513_bp atgccatgtggcacaagcagcacggtgaagaagagcacatggagaggagctgaggtctcc agacagcggctggcacccgccggccgtcatcgtgagggtgtggcctgcagagtggaccct tttgcctcgccaagcctccaggtaaccgccagcttcatgagacaccccaagacactccca cattccttgcctgcgaaaacatttccacagctttgtgagcaagccataattatttccaga caagatgctggctcagatgactgcctcatccctaatgtggcttctcagtttcgcaagagg aaaacccagaactttcctccctgtgccttctgtgcctcctgggctcaccagactttagta cctaatgcagctaccctcaccgtggcccaggttcccagggaaataaaacatctgtggcat gagggctacagccccgacgcaggtgtaaacatccgtgtggctctgacactcctacacgag aatgcttcttggatttgccgtaaaacagactaa >gi568815592f:3749812_3950786|GENSCAN_predicted_peptide_7|124_aa MRVTIARVSAFGRRPGTVSGNVRNLVFQMLTSLHSPCECPEPLQGITVLVMLYRGSWLLG LSFASTWLHVAGTLVPRERTRAVRPADGHRGATARLALVPRERKSEAADPEGKAESAMQL QMSG >gi568815592f:3749812_3950786|GENSCAN_predicted_CDS_7|375_bp atgagggtgaccatcgccagggtcagtgcatttggtaggagaccagggaccgtttctggg aatgtcaggaatcttgtctttcaaatgctcacttcacttcacagcccctgcgagtgccct gaacctttgcaaggtataactgtcctggtgatgctgtacagaggctcttggctcttgggg ctcagctttgcctcaacgtggcttcacgtggcgggcaccctggtgcccagagagagaacc agagctgtccgtcctgcagatggacacaggggagccacagcacggctcgcgctggtgccc agagagagaaagagtgaagctgctgaccctgaaggcaaggcagagtctgccatgcagctg cagatgtcggggtga >gi568815592f:3749812_3950786|GENSCAN_predicted_peptide_8|132_aa MAPSRRPTGCALLQAWGLILTSPRQQQPFIFPTKGRENKRWKSRTFSTLEPSAQIHGWQE LDFSGQWLKDLQATLEKGHEVASDTSSISCSWGSLLSQMVQMLQSLTSRHKRNVLEAIQH CPQCRRMSRSVC >gi568815592f:3749812_3950786|GENSCAN_predicted_CDS_8|399_bp atggccccgtcgaggcggccaaccggatgtgccctcctgcaggcctggggcctcatcctc accagccctcggcaacagcagccgttcatttttccaacgaaggggagggaaaacaagcgc tggaagtcaagaactttctcaacactggagcccagtgcccaaatccatggctggcaggag ctggacttcagtggccagtggctgaaagacctgcaggccactcttgagaaagggcatgaa gtggcctcagacacctccagtatctcctgttcatgggggtccctgctctcccaaatggtt cagatgcttcaaagtttaaccagtagacataagaggaatgttctggaagccattcagcac tgtccacaatgtcgcaggatgagcagatctgtctgctag >gi568815592f:3749812_3950786|GENSCAN_predicted_peptide_9|251_aa MADGCLAIPQIHGRAAPSGSLIPMQAGSSARCFSNPFCFSESKGLLTAHEPEHLKQIYSQ LEQCSNKASKYKAKTDRIQGEIHEFHVIAGDFNTSISVIDKSNRQNISEVIVVLSRTINQ LDVIDIVDNLSNNSRPGTPGAVLNASPSPLTAAPLPGPETHRGLRKSLNLPVTWKPFSFP LSCLSGLNQCTCYMYLIDVSCLIKCVKSVLRCSLSFTRAPKDNKKKKDAKKLGKKDKDPV NKSWGKAKMKK >gi568815592f:3749812_3950786|GENSCAN_predicted_CDS_9|756_bp atggctgatggttgtttggctattccacagattcacggcagagcagctccttctggatcc ctcatccccatgcaagctgggagctctgctcggtgcttctcaaatccgttttgtttttct gagtcaaaggggctgctgacagcccatgagccggagcacctgaagcaaatctattcccag cttgaacagtgcagcaacaaagcatcaaaatataaagcaaaaacagatagaatacaagga gaaatacatgaattccatgttatagctggagacttcaacaccagtatatcagtaattgac aaatctaacaggcagaatatcagtgaggtcatagttgtactgagcagaaccatcaatcaa ctggatgtaattgacattgtagacaatttatctaacaacagcagaccggggaccccgggc gctgtcctcaatgcctccccctcaccactcaccgccgccccattgccaggacccgagact cacaggggtctcagaaaatctttgaatctacctgtgacctggaagcccttcagcttcccg ttgtcctgtctttctggactgaatcaatgtacgtgttatatgtatttgattgatgtctct tgtctcataaaatgtgtaaaatcggttctccgctgctctctgagcttcacaagggcaccc aaggacaacaagaagaagaaagatgccaaaaagttgggcaagaaagacaaagacccagtg aacaaatcctggggcaaagccaaaatgaagaagtga >gi568815592f:3749812_3950786|GENSCAN_predicted_peptide_10|131_aa MSSNSTKNDDTRTGETTQLEDAEQASEPDSVMPEILELSDQKFKVPVNNRLRALMEKVDH MKEQMGNVSREKKFAEFEQDEPIALLRKLKKRSKGNATNQKHSNRNLKMPLTGSSVARKR LRKISDVTLFC >gi568815592f:3749812_3950786|GENSCAN_predicted_CDS_10|396_bp atgtctagcaattcaacaaaaaatgatgacacacgaacaggcgaaacaacacagcttgaa gacgcagaacaagcatcagagccagactcggttatgcctgagattctggaattatcagac cagaaatttaaagtacctgtgaataataggctaagggctctaatggaaaaagtggatcat atgaaagaacagatgggcaatgtaagcagagaaaagaaatttgcagaatttgagcaagat gaacccattgcgctgttacggaaactcaagaaaagatcaaaaggaaatgctacaaatcaa aaacactctaacagaaatttaaaaatgcctttgacgggctcatcagtagctcgaaaacgg ctgaggaaaatcagtgacgtcacactgttctgctga