GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:05:23 Sequence gi568815583f:69352949_69555504 : 202556 bp : 45.14% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1003 1064 62 0 2 64 36 97 0.372 2.82 1.02 Intr + 4162 4259 98 1 2 49 48 88 0.357 0.55 1.03 Intr + 5052 5227 176 1 2 55 40 108 0.298 2.36 1.04 Intr + 26935 27062 128 0 2 140 42 86 0.022 8.68 1.05 Intr + 30707 31069 363 2 0 -29 96 322 0.321 15.60 1.06 Intr + 31729 31934 206 1 2 72 71 194 0.927 14.94 1.07 Intr + 36706 36832 127 2 1 95 84 175 0.978 17.54 1.08 Intr + 38216 38229 14 2 2 121 94 4 0.306 -1.28 1.09 Intr + 41971 42112 142 2 1 143 86 -14 0.310 3.31 1.10 Intr + 44520 44616 97 0 1 81 86 88 0.521 7.91 1.11 Intr + 45510 45557 48 2 0 39 117 33 0.502 0.18 1.12 Intr + 47026 47167 142 0 1 73 109 113 0.915 11.83 1.13 Term + 50633 50874 242 0 2 104 41 122 0.984 5.29 1.14 PlyA + 52670 52675 6 -0.45 2.00 Prom + 52683 52722 40 -4.46 2.01 Init + 58032 58085 54 2 0 80 33 34 0.255 -1.62 2.02 Intr + 64435 64563 129 0 0 87 76 86 0.937 8.19 2.03 Intr + 68699 68804 106 1 1 107 60 72 0.984 6.09 2.04 Intr + 69044 69180 137 0 2 86 92 20 0.982 2.49 2.05 Intr + 69378 69487 110 2 2 84 108 53 0.999 5.98 2.06 Intr + 70211 70381 171 2 0 91 108 88 0.999 10.16 2.07 Intr + 73122 73234 113 0 2 104 50 63 0.999 4.12 2.08 Intr + 73388 73509 122 0 2 108 103 55 0.996 9.21 2.09 Intr + 76163 76265 103 1 1 88 105 47 0.596 6.15 2.10 Intr + 82535 82614 80 0 2 78 121 23 0.002 3.87 2.11 Intr + 82704 82823 120 2 0 64 19 117 0.752 3.09 2.12 Intr + 83190 83313 124 2 1 92 60 16 0.763 -0.64 2.13 Intr + 83616 83774 159 1 0 49 73 166 0.869 11.16 2.14 Intr + 85300 85457 158 2 2 96 76 130 0.967 12.33 2.15 Intr + 86956 87129 174 0 0 48 56 168 0.960 9.74 2.16 Intr + 87360 87539 180 2 0 39 95 150 0.999 10.86 2.17 Intr + 87820 88131 312 0 0 96 44 80 0.576 0.78 2.18 Intr + 91842 92093 252 2 0 79 93 181 0.981 15.33 2.19 Intr + 93061 93143 83 0 2 91 106 4 0.894 0.94 2.20 Intr + 93335 93416 82 2 1 68 49 86 0.837 2.34 2.21 Intr + 93923 93993 71 1 2 90 89 28 0.166 1.08 2.22 Intr + 99961 100072 112 1 1 22 113 147 0.083 10.98 2.23 Intr + 100699 100773 75 0 0 43 116 30 0.663 1.01 2.24 Intr + 102222 102339 118 2 1 62 91 77 0.974 5.44 2.25 Intr + 104654 104729 76 0 1 107 77 75 0.158 6.87 2.26 Intr + 109057 109084 28 1 1 74 92 -3 0.100 -3.18 2.27 Intr + 115336 115445 110 0 2 67 116 71 0.602 6.88 2.28 Term + 126785 126926 142 2 1 79 43 122 0.651 4.20 2.29 PlyA + 126957 126962 6 1.05 3.00 Prom + 127459 127498 40 -4.66 3.01 Init + 133717 133788 72 0 0 65 101 59 0.119 5.97 3.02 Intr + 173034 173201 168 2 0 12 35 165 0.062 3.74 3.03 Term + 186571 186708 138 0 0 104 48 150 0.269 10.56 3.04 PlyA + 188359 188364 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 100001 100072 72 1 0 107 113 114 0.845 16.87 S.002 Term + 102480 102559 80 1 2 83 40 153 0.974 7.93 S.003 Init - 141120 140973 148 0 1 64 82 103 0.851 7.55 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:69352949_69555504|GENSCAN_predicted_peptide_1|614_aa MTDVDDQEEIALLPDNGSEEDYSFAFRYDCKLPAASLEAKQNPVPCFLSSLQNCTYKQGL WVFRGRERTPLTDEDYEEGSQEVVDFGPVMIGVKEEIYYGKFPTLLFFAKSKVFHEQGIL FGYRHPQSSATACILSLFQMTNETLNIWTHLLPFWVSGPSVFMVEGEWASVLMVEGEWAS VLMVEGEWASVLMVEGEWAFVFMVEGEWAFVFMVEGEWAFVFMVEGEWAFVFMVEGEWAF VFMVEGEWAFVFMVEGEWPSVFIVEGERALCIHGGGFFAWRFVTALYMTDIKNDSYSWPM LVYMCTSCVYPLVSSCAHTFSSMSKNARHICYFLDYGAVNLFSLGSAIAYSAYTFPDALM CTTFHDYYVALAVLNTILSTGLSCYSRKKQADPALQLERLVRLPLGKHFIRPSEMLSQVT ENPCFGGWPSPATARFLRVFLEIQKPRLCKVIRVLAFAYPYTWDSLPIFYRGKTMDLIGS AVPVQYPLFLFPGESAQNEATSYHQKHMIMTLLASFLYSAHLPERLAPGRFDYIGHSHQL FHVCVILATHMQMEAILLDKTLRKEWLLATSKPFSFSQIAGAILLCIIFSLSNIIYFSAA LYRIPKPELHKKET >gi568815583f:69352949_69555504|GENSCAN_predicted_CDS_1|1845_bp atgactgatgtggatgaccaggaggaaattgcactcctacccgacaacgggagtgaggaa gactactccttcgccttccgctatgattgtaagcttcctgcagcctccctagaagccaag cagaacccagtaccatgtttcctgtccagcctgcagaactgtacatacaagcaaggtctc tgggtcttcagaggcagggaaagaacacctctgactgatgaggactatgaggagggctct caggaggtggtagactttggcccagtgatgataggggtaaaggaggaaatttactatggt aaattcccaactctgctcttctttgctaaatccaaggtgttccatgagcaaggcatcctg ttcggctaccgccatccacagagttctgccactgcctgcatcctcagccttttccaaatg accaatgagactctcaacatttggactcacttgctgcccttctgggtgagcgggccctct gtattcatggtggagggtgagtgggcctctgtgctcatggtggagggtgagtgggcctct gtgctcatggtggagggtgagtgggcctctgtgctcatggtggagggtgagtgggccttt gtgttcatggtggagggtgagtgggcctttgtgttcatggtggagggtgagtgggccttt gtgttcatggtggagggtgagtgggcctttgtgttcatggtggagggtgagtgggccttt gtgttcatggtggagggtgagtgggcctttgtgttcatggtggagggtgagtggccctct gtgttcatcgtggagggtgagcgggccctctgtattcatggtggagggttctttgcatgg aggtttgtgactgcactgtatatgacagacatcaagaatgacagctactcctggcccatg cttgtgtacatgtgcaccagctgcgtgtacccacttgtgtccagctgtgcgcacaccttc agctctatgtccaagaatgcccggcacatttgctacttcctggactatggtgccgtcaac ctcttcagcctgggctcagccattgcctactctgcatacacgttcccggatgcgctcatg tgcaccactttccatgactactacgtggccctggctgtactgaacaccatcctcagcaca ggcctctcctgctactccaggaaaaaacaggcagacccggctttgcagctggagaggctg gtccgtttgcccctggggaaacactttatcaggccgagcgaaatgttgtctcaagtcaca gaaaatccatgctttgggggatggccttcgccagccacagccagatttcttcgcgtgttt cttgaaatccagaagcccagactctgtaaggtgattcgtgtcctcgcctttgcttatccg tacacctgggactccctccccatcttctacagggggaaaacaatggatctgataggctct gcggtgcctgtccagtacccgctattcctgttcccaggggagagtgcacaaaatgaagcc acctcgtaccaccagaagcacatgatcatgaccctcctggcctctttcttgtactctgca catctgccagaacgcctagcccctggacgctttgactacatcggtcacagtcaccagctg tttcacgtgtgtgtgatcctggccacgcacatgcagatggaagccatacttctggacaag actctgaggaaggaatggctcctggccacctccaagcccttctctttctctcagatagct ggagccatacttctgtgcatcatcttcagcctcagcaacataatttatttctcagctgct ctgtatcggattcccaagccagaattacataaaaaagaaacatga >gi568815583f:69352949_69555504|GENSCAN_predicted_peptide_2|1166_aa MVAEETEKSGAVWRFGYLVYCRVRPLGFPDQECCIEVINNTTVQLHTPEGYRLNRNGDYK ETQYSFKQVFGTHTTQKELFDVVANPLVNDLIHGKNGLLFTYGVTGSGKTHTMTGSPGEG GLLPRCLDMIFNSIGSFQAKRYVFKSNDRNSMDIQCEVDALLERQKREAMPNPKTSSSKR QVDPEFADMITVQEFCKAEEVDEDSVYGVFVSYIEIYNNYIYDLLEEVPFDPIKPKPPQS KLLREDKNHNMYVAGCTEVEVKSTEEAFEVFWRGQKKRRIANTHLNRESSRSHSVFNIKL VQAPLDADGDNVLQEKEQITISQLSLVDLAGSERTNRTRAEGNRLREAGNINQSLMTLRT CMDVLRENQMYGTNKMVPYRDSKLTHLFKNYFDGEGKVRMIVCVNPKAEDYEENLQVMRF AEVTQEVEVARPVDKAICGLTPGRRYRNQPRGPVGNEPLVTDVVLQSFPPLPSCEILDIN DEQTLPRLIEALEKRHNLRQMMIDEFNKQSNAFKALLQEFDNAVLSKENHMQGKLNEKEK MISGQKLEIERLEKKNKTLEYKIEILEKTTTIYEEDKRNLQQELETQNQKLQRQFSDKRR LEARLQGMVTETTMKWEKECERRVAAKQLEMQNKLWVKDEKLKQLKAIVTEPKTEKPERP SRERDREKVTQRSVSPSPVPLSSNYIAQISNGQQLMSQPQLHRRSNSCSSISVASCISEW EQKIPTYNTPLKVTSIARRRQQEPGQSKTCIVSDRRRGMYWTEGREVVPTFRNEIEIEED HCGRLLFQPDQNAPPIRLRHRRSRSAGDRWVDHKPASNMQTETVMQPHVPHAITVSVANE KALAKCEKYMLTHQELASDGEIETKLIKGDIYKTRGGGQSVQFTDIETLKQESPNGSRKR RSSTVAPAQPDGAESEWTDVETRCSVAVEMRAGSQLGPGYQHHAQPNTASGSASPTLARA MASVSELACIYSALILHDDEVTVTEDKINALIKAAGVNVEPFWPGLFAKALANVNIGSLI CNVGAGGPAPAAGAAPAGGPAPSTAAAPAWSMRLVLSPTCISFLNYNDRRVLESTHLPGS SAQACCMLETCGTAAKCPVWFCDLGTVTSSLWLSVFSSLRCGCQGNWFKNLWHTERRRMK TLDVIRITNFCASKDTIKKVKDYPTE >gi568815583f:69352949_69555504|GENSCAN_predicted_CDS_2|3501_bp atggtggctgaggagacagagaaaagtggagcagtttggaggtttggctacttagtatac tgtagggtgcgcccactgggctttcctgatcaagagtgttgcatagaagtgatcaataat acaactgttcagcttcatactcctgagggctacagactcaaccgaaatggagactataag gagactcagtattcatttaaacaagtatttggcactcacaccacccagaaggaactcttt gatgttgtggctaatcccttggtcaatgacctcattcatggcaaaaatggtcttcttttt acatatggtgtgacgggaagtggaaaaactcacacaatgactggttctccaggggaagga gggctgcttcctcgttgtttggacatgatctttaacagtatagggtcatttcaagctaaa cgatatgttttcaaatctaatgataggaatagtatggatatacagtgtgaggttgatgcc ttattagaacgtcagaaaagagaagctatgcccaatccaaagacttcttctagcaaacga caagtagatccagagtttgcagatatgataactgtacaagaattctgcaaagcagaagag gttgatgaagatagtgtctatggtgtatttgtctcttatattgaaatatataataattac atatatgatctattggaagaggtgccgtttgatcccataaaacccaaacctccacaatct aaattgcttcgtgaagataagaaccataacatgtatgttgcaggatgtacagaagttgaa gtgaaatctactgaggaggcttttgaagttttctggagaggccagaaaaagagacgtatt gctaatacccatttgaatcgtgagtccagccgttcccatagcgtgttcaacattaaatta gttcaggctcccttggatgcagatggagacaatgtcttacaggaaaaagaacaaatcact ataagtcagttgtccttggtagatcttgctggaagtgaaagaactaaccggaccagagca gaagggaacagattacgtgaagctggtaatattaatcagtcactaatgacgctaagaaca tgtatggatgtcctaagagagaaccaaatgtatggaactaacaagatggttccatatcga gattcaaagttaacccatctgttcaagaactactttgatggggaaggaaaagtgcggatg atcgtgtgtgtgaaccccaaggctgaagattatgaagaaaacttgcaagtcatgagattt gcggaagtgactcaagaagttgaagtagcaagacctgtagacaaggcaatatgtggttta acgcctgggaggagatacagaaaccagcctcgaggtccagttggaaatgaaccattggtt actgacgtggttttgcagagttttccacctttgccatcatgcgaaattttggatatcaac gatgagcagacacttccaaggctgattgaagccttagagaaacgacataacttacgacaa atgatgattgatgagtttaacaaacaatctaatgcttttaaagctttgttacaagaattt gacaatgctgttttaagtaaagaaaaccacatgcaagggaaactaaatgaaaaggagaag atgatctcaggacagaaattggaaatagaacgactggaaaagaaaaacaaaactttagaa tataagattgagattttagagaaaacaactactatctatgaggaagataaacgcaatttg caacaggaacttgaaactcagaaccagaaacttcagcgacagttttctgacaaacgcaga ttagaagccaggttgcaaggcatggtgacagaaacgacaatgaagtgggagaaagaatgt gagcgtagagtggcagccaaacagctggagatgcagaataaactctgggttaaagatgaa aagctgaaacaactgaaggctattgttaccgaacctaaaactgagaagccagagagaccc tctcgggagcgagatcgagaaaaagttactcaaagatctgtttctccatcacctgtgcct ctttctagtaactatattgctcagatttccaacggccagcaactcatgagccagccacag ctacataggcgctctaactcttgcagcagcatttctgtagcttcctgtatttcggaatgg gagcagaaaattcctacgtacaacacacctctcaaagtcacatctattgcaaggcgtagg cagcaggagccaggacaaagcaaaacttgtatcgtgtcagacagaaggcgagggatgtac tggactgaaggcagggaggtggttcctacattcagaaatgagatagaaatagaagaggat cattgcggcaggttactctttcaacctgatcagaacgcaccaccaattcgtctccgacac agacgatcacgctctgcaggagacagatgggtagatcataagcccgcctctaacatgcaa actgaaacagtcatgcagccacatgtccctcatgccatcacagtatctgttgcaaatgaa aaggcactagctaagtgtgagaagtacatgctgacccaccaggaactagcctccgatggg gagattgaaactaaactaattaagggtgatatttataaaacaaggggtggtggacaatct gttcagtttactgatattgagactttaaagcaagaatcaccaaatggtagtcgaaaacga agatcttccacagtagcacctgcccaaccagatggtgcagagtctgaatggaccgatgta gaaacaaggtgttctgtggctgtggagatgagagcaggatcccagctgggacctggatat cagcatcacgcacaacccaacaccgcgtccggcagcgccagccctacactcgcccgcgcc atggcctctgtctccgagctcgcctgcatctactcggccctcattctgcacgacgatgag gtgacagtcacggaggataagatcaatgccctcattaaagcagccggtgtaaatgttgag cctttttggcctggcttgtttgcaaaggccctggccaacgtcaacattgggagcctcatc tgcaatgtaggggccggtggacctgctccagcagctggtgctgcaccagcaggaggtcct gccccctccactgctgctgctccagcatggtccatgcgtctggtcctttctcccacctgc atcagcttcctcaactataatgaccgtagggttctggagagcacgcaccttccaggctcc tccgcacaggcctgctgcatgctggagacctgcggcacagctgccaagtgccccgtgtgg ttctgtgatcttgggacagtcacttcctctctgtggttgtcagtcttctcatccctaagg tgtggctgccagggcaactggttcaaaaacctgtggcacacagaacgaagaagaatgaag acactggacgtcatcagaatcacaaacttttgtgcttcaaaggataccatcaagaaagtg aaagattatcccacagaatag >gi568815583f:69352949_69555504|GENSCAN_predicted_peptide_3|125_aa MLYEGEGSLVHCYTPGPQNSTWYMRLAFNEENVTEVKVCDIRGKAIKDIGASALLSRLSD SGGSQPPRHEDTQTAYGEAHSISNQQPVKRESASSATAFISLRVNAEVLAKVCKELMNSQ ISWGP >gi568815583f:69352949_69555504|GENSCAN_predicted_CDS_3|378_bp atgctctatgagggtgaggggtcattggttcactgctacacccctggcccccagaatagt acctggtacatgcgactggctttcaatgaagagaatgtaacagaggtgaaagtgtgtgac attcgaggaaaggccataaaagacattggtgcttcagctttgctctcgcggctgtctgac tctgggggaagccagccaccacgtcatgaggacacccaaacagcctacggagaggcccac tctatttccaaccagcagcctgttaaacgtgagagtgcatcatctgcgaccgccttcatc tcactcagagtaaatgccgaagttctcgccaaggtctgcaaggagctaatgaacagccag atctcctggggaccttga