GENSCAN 1.0 Date run: 7-Nov-116 Time: 01:02:23 Sequence gi568815583f:69315994_69546942 : 230949 bp : 45.02% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 6682 6886 205 0 1 -85 46 577 0.001 34.97 1.02 Intr + 9378 9417 40 1 1 96 79 26 0.015 -0.22 1.03 Intr + 33317 33441 125 2 2 95 37 144 0.578 10.23 1.04 Intr + 35401 35578 178 2 1 37 28 120 0.222 -0.12 1.05 Intr + 41117 41214 98 2 2 49 48 88 0.133 0.55 1.06 Intr + 42007 42182 176 2 2 55 40 108 0.179 2.36 1.07 Intr + 63890 64017 128 1 2 140 42 86 0.022 8.68 1.08 Intr + 67662 68024 363 0 0 -29 96 322 0.321 15.60 1.09 Intr + 68684 68889 206 2 2 72 71 194 0.927 14.94 1.10 Intr + 73661 73787 127 0 1 95 84 175 0.978 17.54 1.11 Intr + 75171 75184 14 0 2 121 94 4 0.306 -1.28 1.12 Intr + 78926 79067 142 0 1 143 86 -14 0.310 3.31 1.13 Intr + 81475 81571 97 1 1 81 86 88 0.521 7.91 1.14 Intr + 82465 82512 48 0 0 39 117 33 0.502 0.18 1.15 Intr + 83981 84122 142 1 1 73 109 113 0.915 11.83 1.16 Term + 87588 87829 242 1 2 104 41 122 0.984 5.29 1.17 PlyA + 89625 89630 6 -0.45 2.00 Prom + 89638 89677 40 -4.46 2.01 Init + 94987 95040 54 0 0 80 33 34 0.255 -1.62 2.02 Intr + 101390 101518 129 1 0 87 76 86 0.937 8.19 2.03 Intr + 105654 105759 106 2 1 107 60 72 0.984 6.09 2.04 Intr + 105999 106135 137 1 2 86 92 20 0.982 2.49 2.05 Intr + 106333 106442 110 0 2 84 108 53 0.999 5.98 2.06 Intr + 107166 107336 171 0 0 91 108 88 0.999 10.16 2.07 Intr + 110077 110189 113 1 2 104 50 63 0.999 4.12 2.08 Intr + 110343 110464 122 1 2 108 103 55 0.996 9.21 2.09 Intr + 113118 113220 103 2 1 88 105 47 0.596 6.15 2.10 Intr + 119490 119569 80 1 2 78 121 23 0.002 3.87 2.11 Intr + 119659 119778 120 0 0 64 19 117 0.752 3.09 2.12 Intr + 120145 120268 124 0 1 92 60 16 0.763 -0.64 2.13 Intr + 120571 120729 159 2 0 49 73 166 0.869 11.16 2.14 Intr + 122255 122412 158 0 2 96 76 130 0.967 12.33 2.15 Intr + 123911 124084 174 1 0 48 56 168 0.960 9.74 2.16 Intr + 124315 124494 180 0 0 39 95 150 0.999 10.86 2.17 Intr + 124775 125086 312 1 0 96 44 80 0.576 0.78 2.18 Intr + 128797 129048 252 0 0 79 93 181 0.981 15.33 2.19 Intr + 130016 130098 83 1 2 91 106 4 0.894 0.94 2.20 Intr + 130290 130371 82 0 1 68 49 86 0.837 2.34 2.21 Intr + 130878 130948 71 2 2 90 89 28 0.166 1.08 2.22 Intr + 136916 137027 112 2 1 22 113 147 0.083 10.98 2.23 Intr + 137654 137728 75 1 0 43 116 30 0.663 1.01 2.24 Intr + 139177 139294 118 0 1 62 91 77 0.974 5.44 2.25 Intr + 141609 141684 76 1 1 107 77 75 0.158 6.87 2.26 Intr + 146012 146039 28 2 1 74 92 -3 0.100 -3.18 2.27 Intr + 152291 152400 110 1 2 67 116 71 0.602 6.88 2.28 Term + 163740 163881 142 0 1 79 43 122 0.651 4.20 2.29 PlyA + 163912 163917 6 1.05 3.00 Prom + 164414 164453 40 -4.66 3.01 Init + 170672 170743 72 1 0 65 101 59 0.119 5.97 3.02 Intr + 209989 210156 168 0 0 12 35 165 0.062 3.74 3.03 Term + 223526 223663 138 1 0 104 48 150 0.305 10.56 3.04 PlyA + 225314 225319 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 136956 137027 72 2 0 107 113 114 0.845 16.87 S.002 Term + 139435 139514 80 2 2 83 40 153 0.974 7.93 S.003 Init - 178075 177928 148 1 1 64 82 103 0.851 7.55 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:69315994_69546942|GENSCAN_predicted_peptide_1|776_aa KKKEKRKKKKKKKKKKKKKKKKKKKKKKKKKKREKKKTRKKKKRKRKKKKKKKKRKKKRN YFIKKRRWKGTSTHLGLQPGAQPATRRESQQAPEGEAQNVTHEELYYPPQEPNPENMCGD GYQAAVPDVVTLLEQNNTSSGTWYAATDQANAFFSNPVYKAHQKQLALSWQGQQYTFTVL PQGYSFAFRYDCKLPAASLEAKQNPVPCFLSSLQNCTYKQGLWVFRGRERTPLTDEDYEE GSQEVVDFGPVMIGVKEEIYYGKFPTLLFFAKSKVFHEQGILFGYRHPQSSATACILSLF QMTNETLNIWTHLLPFWVSGPSVFMVEGEWASVLMVEGEWASVLMVEGEWASVLMVEGEW AFVFMVEGEWAFVFMVEGEWAFVFMVEGEWAFVFMVEGEWAFVFMVEGEWAFVFMVEGEW PSVFIVEGERALCIHGGGFFAWRFVTALYMTDIKNDSYSWPMLVYMCTSCVYPLVSSCAH TFSSMSKNARHICYFLDYGAVNLFSLGSAIAYSAYTFPDALMCTTFHDYYVALAVLNTIL STGLSCYSRKKQADPALQLERLVRLPLGKHFIRPSEMLSQVTENPCFGGWPSPATARFLR VFLEIQKPRLCKVIRVLAFAYPYTWDSLPIFYRGKTMDLIGSAVPVQYPLFLFPGESAQN EATSYHQKHMIMTLLASFLYSAHLPERLAPGRFDYIGHSHQLFHVCVILATHMQMEAILL DKTLRKEWLLATSKPFSFSQIAGAILLCIIFSLSNIIYFSAALYRIPKPELHKKET >gi568815583f:69315994_69546942|GENSCAN_predicted_CDS_1|2331_bp aagaagaaggagaagaggaagaagaagaagaagaagaagaagaagaagaagaagaagaag aagaagaagaagaagaagaagaagaagaagaagaagagggagaagaagaagacgaggaag aagaagaagaggaagaggaagaagaagaagaagaagaagaagaggaagaagaagaggaat tatttcatcaagaagagaagatggaagggaacgagcacccacctgggcctgcagccaggt gcacaacctgcaaccaggcgtgagtcccagcaggcccctgaaggtgaggcccaaaatgtg acccacgaggagctgtactaccctccacaagaaccaaatccagagaacatgtgtggggat ggctatcaggctgctgtaccagatgtggttacattgcttgagcaaaataacacatcttct ggtacctggtatgctgccacggatcaggcaaatgcctttttctccaaccctgtctataaa gcccaccagaagcaacttgctctcagctggcaaggccaacaatacaccttcaccgtccta cctcagggctactccttcgccttccgctatgattgtaagcttcctgcagcctccctagaa gccaagcagaacccagtaccatgtttcctgtccagcctgcagaactgtacatacaagcaa ggtctctgggtcttcagaggcagggaaagaacacctctgactgatgaggactatgaggag ggctctcaggaggtggtagactttggcccagtgatgataggggtaaaggaggaaatttac tatggtaaattcccaactctgctcttctttgctaaatccaaggtgttccatgagcaaggc atcctgttcggctaccgccatccacagagttctgccactgcctgcatcctcagccttttc caaatgaccaatgagactctcaacatttggactcacttgctgcccttctgggtgagcggg ccctctgtattcatggtggagggtgagtgggcctctgtgctcatggtggagggtgagtgg gcctctgtgctcatggtggagggtgagtgggcctctgtgctcatggtggagggtgagtgg gcctttgtgttcatggtggagggtgagtgggcctttgtgttcatggtggagggtgagtgg gcctttgtgttcatggtggagggtgagtgggcctttgtgttcatggtggagggtgagtgg gcctttgtgttcatggtggagggtgagtgggcctttgtgttcatggtggagggtgagtgg ccctctgtgttcatcgtggagggtgagcgggccctctgtattcatggtggagggttcttt gcatggaggtttgtgactgcactgtatatgacagacatcaagaatgacagctactcctgg cccatgcttgtgtacatgtgcaccagctgcgtgtacccacttgtgtccagctgtgcgcac accttcagctctatgtccaagaatgcccggcacatttgctacttcctggactatggtgcc gtcaacctcttcagcctgggctcagccattgcctactctgcatacacgttcccggatgcg ctcatgtgcaccactttccatgactactacgtggccctggctgtactgaacaccatcctc agcacaggcctctcctgctactccaggaaaaaacaggcagacccggctttgcagctggag aggctggtccgtttgcccctggggaaacactttatcaggccgagcgaaatgttgtctcaa gtcacagaaaatccatgctttgggggatggccttcgccagccacagccagatttcttcgc gtgtttcttgaaatccagaagcccagactctgtaaggtgattcgtgtcctcgcctttgct tatccgtacacctgggactccctccccatcttctacagggggaaaacaatggatctgata ggctctgcggtgcctgtccagtacccgctattcctgttcccaggggagagtgcacaaaat gaagccacctcgtaccaccagaagcacatgatcatgaccctcctggcctctttcttgtac tctgcacatctgccagaacgcctagcccctggacgctttgactacatcggtcacagtcac cagctgtttcacgtgtgtgtgatcctggccacgcacatgcagatggaagccatacttctg gacaagactctgaggaaggaatggctcctggccacctccaagcccttctctttctctcag atagctggagccatacttctgtgcatcatcttcagcctcagcaacataatttatttctca gctgctctgtatcggattcccaagccagaattacataaaaaagaaacatga >gi568815583f:69315994_69546942|GENSCAN_predicted_peptide_2|1166_aa MVAEETEKSGAVWRFGYLVYCRVRPLGFPDQECCIEVINNTTVQLHTPEGYRLNRNGDYK ETQYSFKQVFGTHTTQKELFDVVANPLVNDLIHGKNGLLFTYGVTGSGKTHTMTGSPGEG GLLPRCLDMIFNSIGSFQAKRYVFKSNDRNSMDIQCEVDALLERQKREAMPNPKTSSSKR QVDPEFADMITVQEFCKAEEVDEDSVYGVFVSYIEIYNNYIYDLLEEVPFDPIKPKPPQS KLLREDKNHNMYVAGCTEVEVKSTEEAFEVFWRGQKKRRIANTHLNRESSRSHSVFNIKL VQAPLDADGDNVLQEKEQITISQLSLVDLAGSERTNRTRAEGNRLREAGNINQSLMTLRT CMDVLRENQMYGTNKMVPYRDSKLTHLFKNYFDGEGKVRMIVCVNPKAEDYEENLQVMRF AEVTQEVEVARPVDKAICGLTPGRRYRNQPRGPVGNEPLVTDVVLQSFPPLPSCEILDIN DEQTLPRLIEALEKRHNLRQMMIDEFNKQSNAFKALLQEFDNAVLSKENHMQGKLNEKEK MISGQKLEIERLEKKNKTLEYKIEILEKTTTIYEEDKRNLQQELETQNQKLQRQFSDKRR LEARLQGMVTETTMKWEKECERRVAAKQLEMQNKLWVKDEKLKQLKAIVTEPKTEKPERP SRERDREKVTQRSVSPSPVPLSSNYIAQISNGQQLMSQPQLHRRSNSCSSISVASCISEW EQKIPTYNTPLKVTSIARRRQQEPGQSKTCIVSDRRRGMYWTEGREVVPTFRNEIEIEED HCGRLLFQPDQNAPPIRLRHRRSRSAGDRWVDHKPASNMQTETVMQPHVPHAITVSVANE KALAKCEKYMLTHQELASDGEIETKLIKGDIYKTRGGGQSVQFTDIETLKQESPNGSRKR RSSTVAPAQPDGAESEWTDVETRCSVAVEMRAGSQLGPGYQHHAQPNTASGSASPTLARA MASVSELACIYSALILHDDEVTVTEDKINALIKAAGVNVEPFWPGLFAKALANVNIGSLI CNVGAGGPAPAAGAAPAGGPAPSTAAAPAWSMRLVLSPTCISFLNYNDRRVLESTHLPGS SAQACCMLETCGTAAKCPVWFCDLGTVTSSLWLSVFSSLRCGCQGNWFKNLWHTERRRMK TLDVIRITNFCASKDTIKKVKDYPTE >gi568815583f:69315994_69546942|GENSCAN_predicted_CDS_2|3501_bp atggtggctgaggagacagagaaaagtggagcagtttggaggtttggctacttagtatac tgtagggtgcgcccactgggctttcctgatcaagagtgttgcatagaagtgatcaataat acaactgttcagcttcatactcctgagggctacagactcaaccgaaatggagactataag gagactcagtattcatttaaacaagtatttggcactcacaccacccagaaggaactcttt gatgttgtggctaatcccttggtcaatgacctcattcatggcaaaaatggtcttcttttt acatatggtgtgacgggaagtggaaaaactcacacaatgactggttctccaggggaagga gggctgcttcctcgttgtttggacatgatctttaacagtatagggtcatttcaagctaaa cgatatgttttcaaatctaatgataggaatagtatggatatacagtgtgaggttgatgcc ttattagaacgtcagaaaagagaagctatgcccaatccaaagacttcttctagcaaacga caagtagatccagagtttgcagatatgataactgtacaagaattctgcaaagcagaagag gttgatgaagatagtgtctatggtgtatttgtctcttatattgaaatatataataattac atatatgatctattggaagaggtgccgtttgatcccataaaacccaaacctccacaatct aaattgcttcgtgaagataagaaccataacatgtatgttgcaggatgtacagaagttgaa gtgaaatctactgaggaggcttttgaagttttctggagaggccagaaaaagagacgtatt gctaatacccatttgaatcgtgagtccagccgttcccatagcgtgttcaacattaaatta gttcaggctcccttggatgcagatggagacaatgtcttacaggaaaaagaacaaatcact ataagtcagttgtccttggtagatcttgctggaagtgaaagaactaaccggaccagagca gaagggaacagattacgtgaagctggtaatattaatcagtcactaatgacgctaagaaca tgtatggatgtcctaagagagaaccaaatgtatggaactaacaagatggttccatatcga gattcaaagttaacccatctgttcaagaactactttgatggggaaggaaaagtgcggatg atcgtgtgtgtgaaccccaaggctgaagattatgaagaaaacttgcaagtcatgagattt gcggaagtgactcaagaagttgaagtagcaagacctgtagacaaggcaatatgtggttta acgcctgggaggagatacagaaaccagcctcgaggtccagttggaaatgaaccattggtt actgacgtggttttgcagagttttccacctttgccatcatgcgaaattttggatatcaac gatgagcagacacttccaaggctgattgaagccttagagaaacgacataacttacgacaa atgatgattgatgagtttaacaaacaatctaatgcttttaaagctttgttacaagaattt gacaatgctgttttaagtaaagaaaaccacatgcaagggaaactaaatgaaaaggagaag atgatctcaggacagaaattggaaatagaacgactggaaaagaaaaacaaaactttagaa tataagattgagattttagagaaaacaactactatctatgaggaagataaacgcaatttg caacaggaacttgaaactcagaaccagaaacttcagcgacagttttctgacaaacgcaga ttagaagccaggttgcaaggcatggtgacagaaacgacaatgaagtgggagaaagaatgt gagcgtagagtggcagccaaacagctggagatgcagaataaactctgggttaaagatgaa aagctgaaacaactgaaggctattgttaccgaacctaaaactgagaagccagagagaccc tctcgggagcgagatcgagaaaaagttactcaaagatctgtttctccatcacctgtgcct ctttctagtaactatattgctcagatttccaacggccagcaactcatgagccagccacag ctacataggcgctctaactcttgcagcagcatttctgtagcttcctgtatttcggaatgg gagcagaaaattcctacgtacaacacacctctcaaagtcacatctattgcaaggcgtagg cagcaggagccaggacaaagcaaaacttgtatcgtgtcagacagaaggcgagggatgtac tggactgaaggcagggaggtggttcctacattcagaaatgagatagaaatagaagaggat cattgcggcaggttactctttcaacctgatcagaacgcaccaccaattcgtctccgacac agacgatcacgctctgcaggagacagatgggtagatcataagcccgcctctaacatgcaa actgaaacagtcatgcagccacatgtccctcatgccatcacagtatctgttgcaaatgaa aaggcactagctaagtgtgagaagtacatgctgacccaccaggaactagcctccgatggg gagattgaaactaaactaattaagggtgatatttataaaacaaggggtggtggacaatct gttcagtttactgatattgagactttaaagcaagaatcaccaaatggtagtcgaaaacga agatcttccacagtagcacctgcccaaccagatggtgcagagtctgaatggaccgatgta gaaacaaggtgttctgtggctgtggagatgagagcaggatcccagctgggacctggatat cagcatcacgcacaacccaacaccgcgtccggcagcgccagccctacactcgcccgcgcc atggcctctgtctccgagctcgcctgcatctactcggccctcattctgcacgacgatgag gtgacagtcacggaggataagatcaatgccctcattaaagcagccggtgtaaatgttgag cctttttggcctggcttgtttgcaaaggccctggccaacgtcaacattgggagcctcatc tgcaatgtaggggccggtggacctgctccagcagctggtgctgcaccagcaggaggtcct gccccctccactgctgctgctccagcatggtccatgcgtctggtcctttctcccacctgc atcagcttcctcaactataatgaccgtagggttctggagagcacgcaccttccaggctcc tccgcacaggcctgctgcatgctggagacctgcggcacagctgccaagtgccccgtgtgg ttctgtgatcttgggacagtcacttcctctctgtggttgtcagtcttctcatccctaagg tgtggctgccagggcaactggttcaaaaacctgtggcacacagaacgaagaagaatgaag acactggacgtcatcagaatcacaaacttttgtgcttcaaaggataccatcaagaaagtg aaagattatcccacagaatag >gi568815583f:69315994_69546942|GENSCAN_predicted_peptide_3|125_aa MLYEGEGSLVHCYTPGPQNSTWYMRLAFNEENVTEVKVCDIRGKAIKDIGASALLSRLSD SGGSQPPRHEDTQTAYGEAHSISNQQPVKRESASSATAFISLRVNAEVLAKVCKELMNSQ ISWGP >gi568815583f:69315994_69546942|GENSCAN_predicted_CDS_3|378_bp atgctctatgagggtgaggggtcattggttcactgctacacccctggcccccagaatagt acctggtacatgcgactggctttcaatgaagagaatgtaacagaggtgaaagtgtgtgac attcgaggaaaggccataaaagacattggtgcttcagctttgctctcgcggctgtctgac tctgggggaagccagccaccacgtcatgaggacacccaaacagcctacggagaggcccac tctatttccaaccagcagcctgttaaacgtgagagtgcatcatctgcgaccgccttcatc tcactcagagtaaatgccgaagttctcgccaaggtctgcaaggagctaatgaacagccag atctcctggggaccttga