GENSCAN 1.0	Date run: 4-Nov-116	Time: 04:58:59

Sequence gi568815596f:119580854_119781786 : 200933 bp : 43.72% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   2999   3085   87  1  0   77   46    77 0.348   3.04
 1.02 Intr +   6291   6369   79  2  1   76  110    25 0.200   2.62
 1.03 Intr +  20365  20524  160  2  1   96   22    80 0.175   1.25
 1.04 Intr +  23819  23939  121  2  1   77  102    41 0.655   4.90
 1.05 Intr +  24023  24134  112  1  1   87   86   114 0.994  11.05
 1.06 Intr +  24328  24436  109  2  1   79   44    78 0.868   1.84
 1.07 Intr +  27649  27736   88  1  1  106   77    17 0.833   2.37
 1.08 Intr +  30800  30889   90  1  0   22   90    89 0.710   2.69
 1.09 Term +  34165  34176   12  0  0  145   46     1 0.410  -0.20
 1.10 PlyA +  34619  34624    6                               1.05

 2.00 Prom +  34958  34997   40                              -4.86
 2.01 Init +  38772  38917  146  2  2   78   98    92 0.743   8.79
 2.02 Term +  39017  39767  751  2  1  -51   42   378 0.957  12.22
 2.03 PlyA +  40026  40031    6                               1.05

 3.00 Prom +  43695  43734   40                              -2.46
 3.01 Init +  43819  43951  133  0  1   78   -9    86 0.379  -1.80
 3.02 Intr +  46800  46933  134  1  2   83   74   123 0.990  10.76
 3.03 Intr +  47040  47138   99  2  0   89   51    36 0.513   0.31
 3.04 Intr +  49022  49102   81  1  0   86   71    43 0.822   2.23
 3.05 Intr +  49717  49824  108  0  0   85   77    32 0.825   2.28
 3.06 Intr +  49914  50048  135  2  0   92   78    67 0.912   6.86
 3.07 Intr +  58928  59019   92  1  2   98   99    57 0.868   6.59
 3.08 Intr +  71121  71216   96  0  0   57  100    74 0.514   4.62
 3.09 Term +  75509  75617  109  2  1   52   48   159 0.966   6.28
 3.10 PlyA +  75785  75790    6                               1.05

 4.00 Prom +  86263  86302   40                              -1.46
 4.01 Init +  86547  86717  171  2  0   71   20   145 0.089   5.44
 4.02 Intr +  98216  98423  208  1  1   29   54   133 0.108   2.65
 4.03 Term + 100077 100936  860  1  2   59   45   683 0.465  54.02
 4.04 PlyA + 101245 101250    6                               1.05

 5.00 Prom + 103720 103759   40                              -6.06
 5.01 Init + 104455 104534   80  0  2   90   15   117 0.847   3.24
 5.02 Intr + 105345 105394   50  0  2  104   81    42 0.751   3.42
 5.03 Intr + 108289 108384   96  2  0  130   61     5 0.594   1.98
 5.04 Intr + 109140 109166   27  1  0   74  115     9 0.409   0.29
 5.05 Term + 109648 109859  212  2  2   96   53    98 0.665   4.56
 5.06 PlyA + 113944 113949    6                               1.05

 6.04 PlyA - 115591 115586    6                               1.05
 6.03 Term - 120161 120061  101  0  2  125   55    44 0.800   3.09
 6.02 Intr - 120790 120688  103  1  1   44  110    84 0.925   5.95
 6.01 Init - 121561 121487   75  1  0   52   44    80 0.680   1.11
 6.00 Prom - 125145 125106   40                              -3.76

 7.08 PlyA - 125836 125831    6                               1.05
 7.07 Term - 131150 131045  106  1  1  121   42   101 0.353   6.68
 7.06 Intr - 133251 133055  197  0  2   88   52    69 0.010   1.51
 7.05 Intr - 141697 141668   30  1  0   75  119    -1 0.011   0.03
 7.04 Intr - 158140 157979  162  1  0    5   41   147 0.093   1.87
 7.03 Intr - 178961 178779  183  2  0  102   80    57 0.565   6.28
 7.02 Intr - 179175 179089   87  0  0   93   43    49 0.438   1.07
 7.01 Intr - 179516 179324  193  1  1   47   23   165 0.124   5.39

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:119580854_119781786|GENSCAN_predicted_peptide_1|285_aa
MVIYEPGSGPAPDTKSASTLILDFPASRTCSCPVDFEFYITLIQSHQAFAIEPTSGIIPA
NGKMTVTIKFTPFQYGTAQIKMQLWISQFNSQPYECVFTGTCYPNMALPLEEFERLNTLS
KKVNVPPEKAMMHINFHRPPAKPKPQKVKEIEYQNLRFPVDLSNPFAVATVLNQEPGKLK
IKELREVLDQGTEISKTRQMKEALFEQKVRQDIHEEMENHLKWQVHLGKDPMSFKLKKEL
TEEWQKACAKYKLDRGDPILDEEFQRLKTEVSHKRVVRNQEELMA

>gi568815596f:119580854_119781786|GENSCAN_predicted_CDS_1|858_bp
atggtcatctatgaaccaggaagtgggcctgcaccagacaccaaatctgccagcaccttg
atcttggacttcccagcctccagaacttgcagctgccctgtagattttgagttttatatc
accttgattcagtctcatcaagcctttgctattgagccaacatcaggaataattccggct
aatgggaagatgactgtgactattaagtttacaccctttcagtatgggactgcacaaata
aaaatgcagttatggatttcgcagttcaactctcaaccatacgaatgtgtcttcaccgga
acatgctatcccaacatggccttaccattagaagagtttgaaaggttgaataccctttct
aagaaagtaaacgttcctccagaaaaagcaatgatgcatataaattttcaccgaccgcca
gcgaagccgaagcctcagaaggtgaaggaaattgagtaccagaacctcagatttccagta
gatttatcgaatccatttgctgtggcaactgttttaaaccaagaaccaggaaaattgaag
attaaagaattaagagaagttttggaccagggcactgaaatttcaaaaacgagacagatg
aaggaggcactctttgaacagaaagtcagacaggacattcacgaagagatggaaaatcat
cttaagtggcaggtgcaccttggtaaagatcctatgtcttttaaacttaaaaaagagctt
actgaagagtggcaaaaagcatgtgccaaatataagctagacagaggagatcctattttg
gatgaggaatttcagcgacttaaaacagaagttagccataaacgggttgttcgcaatcaa
gaagagctcatggcataa

>gi568815596f:119580854_119781786|GENSCAN_predicted_peptide_2|298_aa
MRKNQCKKAENSKNQNTSSPPKDHNSSTAREQNWMENEFDKLTELGLRRITSLEKNINNL
IELKNTARELHEEYTSIKSQIDQVEERISEIEDQLDEIKREDKIREKRMKRNEQSPQEMW
DYMKRPNLRLTGVPKSDGENGTKLENTLQDIIQENFPNLARQANIQIQEIQRTPQRYSSR
RATPRHIIIRFTKVEMKEKMLRVAREKGHVTHKGKPIRPTADLSAETLQDRREWGPIFNI
LTEKHFQPTISYPAKLSFISEGEIKSFTDKQMLRDFVTTRPALQELLKEAINLERKIR

>gi568815596f:119580854_119781786|GENSCAN_predicted_CDS_2|897_bp
atgaggaaaaaccagtgcaaaaaggctgaaaattccaaaaaccagaatacctcctctcct
ccaaaggatcacaattcctcgacagcaagggaacaaaactggatggagaatgagtttgac
aaattgacagaattaggcctcagaagaataaccagtttagagaagaacataaataacctg
atagagctgaaaaacacagcacgagaacttcatgaagaatacacaagtatcaagagccaa
atcgatcaagtggaagaaaggatatcagagattgaagatcaacttgatgaaataaagcgt
gaagacaagattagagaaaaaagaatgaaaaggaatgaacaaagcccccaagaaatgtgg
gactatatgaaaagaccaaacctacgtttgactggtgtacctaaaagtgatggggagaat
ggaaccaagttggaaaacactcttcaggatattatccaggagaacttccccaacctagca
agacaggccaatattcaaattcaggaaatacagagaacaccacaaagatactcctcgaga
agagcaaccccaagacacataatcatcagattcaccaaggttgaaatgaaggaaaaaatg
ttaagggtagccagagagaaaggtcatgttacccacaaagggaagcccatcagaccaaca
gcggacctctctgcagaaaccctacaagaccgaagagagtgggggccaatattcaacatt
ctcacagaaaagcattttcaacccacaatttcatatccagccaaactaagcttcataagt
gaaggagaaataaaatcctttacagacaagcaaatgctgagagattttgtcaccaccagg
ccagccttacaagagctcctgaaggaagcaataaatttggaaaggaaaatccggtaa

>gi568815596f:119580854_119781786|GENSCAN_predicted_peptide_3|328_aa
MEYYAAIRKDEFMSFAGTWMKLEIIILSKLTQEQKTKHRMFSLIKANFFKFFLRRISQDD
YTSRFSVSPKEVLPFAFPDCSPPQDSNELRLQRKMRRVLPVENQGSSPAICLAPDTPAQS
SLAPDGLGLVPIKSSEVQIKQSYSFFNLQVPQLYKIKRYQPFSVHKSSTSYRPQKLARAL
KQGAEDEVTTITALPKQDSTTQLSGKTSVLSMKPPEALAMSLDYDPLYVFDIIPGIMHWK
SFQSLVLSSLPDPSKMETTKSELCEQNVEVMLTPEMIKVEFPMLNYKDIRKEKEVKDQAQ
PAEKAGEKLLEEMRNLRGKALNTYLILE

>gi568815596f:119580854_119781786|GENSCAN_predicted_CDS_3|987_bp
atggaatactatgcagccataagaaaggatgagttcatgtcctttgcagggacatggatg
aagctagaaatcatcattcttagcaaactaacacaggaacagaaaaccaaacaccgcatg
ttctccctcataaaggcgaatttcttcaaattcttcctgaggcggatcagtcaggatgat
tataccagccggttctctgtgtcgcccaaggaggtgctgcccttcgctttcccagactgc
agcccaccccaggactccaacgagttgaggctccaaaggaagatgaggagagtattacca
gtggaaaaccagggatcatctcctgccatctgccttgctcctgatacacctgcacagagc
tccctggctcctgatggccttggactggtcccaattaagtcttcagaagttcaaatcaag
cagagttattccttcttcaatctgcaggttcctcaactgtacaaaattaagagatatcag
ccattctctgtccacaagtcttcaacaagttacagacctcaaaagcttgcccgagcccta
aagcaaggagctgaggatgaagtcaccaccatcacagcccttccgaaacaggactccaca
actcagctctctggcaaaacatcagtcttgagcatgaaaccacctgaggccttagccatg
tctctagattatgatcctctgtatgtttttgacattattcccggaataatgcactggaaa
agcttccagtccctggttctctcctccctgccggacccctccaagatggagaccacaaag
agtgagctctgtgagcagaatgtagaagttatgttgactccagaaatgatcaaagtggaa
ttccctatgttgaactacaaggacatcaggaaggagaaagaagtgaaagatcaagcacaa
ccagcagagaaggccggagagaagctgctcgaggagatgaggaacctgcggggcaaagca
ctcaacacatacctgattctagaatga

>gi568815596f:119580854_119781786|GENSCAN_predicted_peptide_4|412_aa
MGKDFMTKTPKAMATKAKIDKWDLIKLKSFYTAKETIVSVNNLQNGRKCLQSTHLTKISG
RRAPPPEVPIPRSGAGELRLPKSPPPDPRPACFAAALPRAHVGVPSGGGGERTDEREGPF
SGLMRPGLFGVPISYHLFPDPVVQWLYQYWPQGQPAPLPPQLQSLFQEVLQDIGVPSGHC
YKPFTTFTFQPVSAGFPRLPAGAVVGIPASFLGDLVINTNHPVVIHGHTVDWRSPAGARL
RASLTLSREAQKFALAREVVYLESSTTAVHALLAPACLAGTWALGVGAKYTLGLHAGPMN
LRAAFSLVAAVAGFVAYAFSQDSLTHAVESWLDRRTASLSAAYACGGVEFYEKLLSGNLA
LRSLLGKDGEKLYTPSGNIVPRHLFRIKHLPYTTRRDSVLQMWRGMLNPGRS

>gi568815596f:119580854_119781786|GENSCAN_predicted_CDS_4|1239_bp
atgggcaaagacttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgac
aaatgggatctaattaaactaaagagcttctacacagcaaaagaaactatcgtcagtgtg
aacaacctacagaatgggagaaaatgtttacaatctacccatctgacaaagatctccggc
cggcgagctccgccccctgaagtccccatcccaagatccggggccggcgagctccgcctc
ccgaagtccccgcccccagatccccggccggcgtgctttgcggccgccctgccgcgcgca
catgtaggcgttccgagcggcggcggaggtgagcgcacggacgagcgggagggacccttc
tccggcctgatgcgacccggcctgtttggagttccaatctcgtaccacctcttcccggat
cccgtggtccaatggctctaccagtactggcctcagggccagccagctccgctccctcca
cagctgcagagcctcttccaagaggtgctacaggacataggtgttccttcaggccattgc
tacaagcccttcaccaccttcaccttccaacctgtgagtgcaggcttcccaagactccct
gctggggctgtggtgggcatccctgccagtttcttgggagacctagtgatcaacactaac
catcccgtggtcatacatgggcatacagtggactggcggagcccagcaggcgcccggctg
agagcttccctgaccttgtcccgtgaagcccagaagttcgccttggccagggaagtggtg
tacctggaaagcagtaccactgccgtgcacgccctgctggccccagcttgcctggcaggg
acctgggcactgggcgtgggtgccaagtacaccctggggctccatgcaggccccatgaat
ttacgggctgccttcagcttggtggcagcagtggcaggctttgtggcctacgccttctcc
caggattctctcactcatgccgtggagtcctggctggaccgccgcacggcctccctctct
gcagcctatgcctgtggtggagtggagttctatgagaagcttctgtcgggcaacctggcc
ctgcgcagtctcttgggcaaagacggggagaagctgtatacacccagcgggaacatcgtc
cccagacacttgttccgaatcaaacatttaccctacaccacccgccgggactctgtgctg
cagatgtggagggggatgctcaatccgggccgctcctga

>gi568815596f:119580854_119781786|GENSCAN_predicted_peptide_5|154_aa
MALMLALLFTDPPSPAASVPTSVHNARIVLTMKVKESLQNVEPGGSPNMALTHAILSLCL
VLRGPPLTRLLLTQSEPCSPLEVKGCEDRVLMPNPESITKAAGPKLGPGPEEQTEEQSPR
GEREGAGVHMPHGRQCHCSIITSCVFIDFIVIIK

>gi568815596f:119580854_119781786|GENSCAN_predicted_CDS_5|465_bp
atggcgctgatgctggctctgctcttcactgaccctcctagccctgctgcctcagtcccc
accagtgtccacaatgcaaggattgttttgacgatgaaagtgaaagaaagtttgcaaaat
gtggagcctggcggttcccccaatatggctcttactcatgcaatcctgtctttgtgtctg
gttctcagaggacctccactaacacggttactcctcactcagtcagagccttgctccccg
ctagaggtcaaaggctgtgaggacagggtgctcatgccaaaccctgaaagcattacaaag
gctgcagggcctaagctggggcctgggcctgaggagcaaactgaggagcagagcccacgt
ggagagagggagggagccggcgtccacatgccacacggacggcagtgtcactgctctatc
atcacatcctgcgtgtttatagacttcattgtcatcatcaaatga

>gi568815596f:119580854_119781786|GENSCAN_predicted_peptide_6|92_aa
MFLGSSQQPAASSQGVPMRGPTFLMAASKVNPLGGYTGIHWNPSVALSLRWDTGLSERTG
SVRPVANQRREGCEPFRDSVLPVIAVASTGLA

>gi568815596f:119580854_119781786|GENSCAN_predicted_CDS_6|279_bp
atgttcctgggcagcagccagcagccagcagccagcagccaaggagtccccatgagaggc
cccacattcctgatggcagccagcaaggtgaacccattgggcggttacactgggattcac
tggaacccctctgtggccttgtcactgcgctgggacaccggactttctgaaagaacagga
tcagtcagacctgtggcgaatcagagaagagaaggctgtgagccttttagggacagcgtc
ctccctgtgatcgcggttgccagcacaggcctggcctga

>gi568815596f:119580854_119781786|GENSCAN_predicted_peptide_7|319_aa
XRVLSAGEKKPRGPGAQRRSASSVPLRGATSAPPEQPPPQRSRHDPEPPAPPHAAGQPRA
GAAGEAQFRPLGPRSPERTMLVVGREERKSALRGHTPSPRPQHSNRTWRLRRRRQSVARS
GGGSNRRRSRERRGQNPASLAQGEVAPVGAQRDTQLMIWKAFISPSERDLADASSFSTQY
MEHADTSPAAENNVCQKHAFPERTTIISGMRKQSLFPQLPKFSNMAHKALCALATTHLFS
HIPGLVPLLGSVILNYLLFPEAAGRVMAPGQEGRQSIENGDGAWWTMHVEILLPELPAHN
FWDKLSHQVSSDSCQWVPG

>gi568815596f:119580854_119781786|GENSCAN_predicted_CDS_7|960_bp
nnccgcgtcctctcggctggagaaaaaaagcctcgaggccccggcgcgcagaggagaagc
gcgtcctccgtgccgctccggggggctacctcagcgccccccgagcagccgccgccgcag
cggtcgcgccacgacccggagccgccagcaccacctcatgctgctgggcagccgcgggcc
ggcgccgctggggaggcccaattccggcccctgggacccaggagtcctgagagaacaatg
ttggtggtggggcgcgaggagcggaaaagcgcactgcgcggccacaccccctctcctagg
ccgcagcatagcaaccgaacatggcggctgcgcaggcgccgccagtcagtggccaggagc
ggcggcggcagcaacaggaggcgatctcgcgagaggcgaggccaaaaccctgcctcgcta
gcgcagggagaagtcgcaccagtcggagcgcagagggacacgcagctgatgatctggaag
gccttcatcagcccttctgaaagagatcttgcggatgccagctccttttcaactcagtac
atggaacatgcagacaccagccctgctgctgagaacaatgtctgtcagaaacatgctttc
cctgagaggacaaccattattagtgggatgagaaaacagagtctcttcccgcagctgccc
aagttctccaacatggcacacaaggccctctgtgccttggccaccactcatctgttcagc
cacatccctggtctcgtgccacttctgggctcagtcatcctgaactacctgttgttccct
gaagcggcaggccgggtcatggctccaggccaggagggaagacagtccatagaaaatgga
gatggagcctggtggaccatgcacgtggaaattctgctgccggagctgccagcccataac
ttttgggataaactctcccatcaggtgtccagtgacagctgccagtgggtgccaggctaa