GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:06:36 Sequence gi568815584f:94011683_94216524 : 204842 bp : 44.20% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4783 4851 69 0 0 66 87 63 0.098 3.07 1.02 Intr + 12121 12272 152 0 2 51 87 133 0.009 8.46 1.03 Intr + 16680 16788 109 0 1 15 97 21 0.012 -4.01 1.04 Intr + 16948 17118 171 0 0 79 42 78 0.079 2.44 1.05 Intr + 18936 19046 111 2 0 53 75 51 0.298 0.88 1.06 Intr + 25063 25167 105 0 0 52 20 107 0.464 0.71 1.07 Intr + 25698 25793 96 2 0 100 78 73 0.993 7.71 1.08 Intr + 27281 27399 119 1 2 116 89 163 0.996 18.46 1.09 Intr + 32289 32373 85 0 1 113 68 125 0.996 12.82 1.10 Intr + 32883 33098 216 2 0 39 63 379 0.853 29.20 1.11 Term + 34034 34240 207 1 0 101 38 356 0.998 29.24 1.12 PlyA + 34814 34819 6 -0.45 2.09 PlyA - 36629 36624 6 1.05 2.08 Term - 39780 39509 272 1 2 21 36 256 0.638 9.35 2.07 Intr - 41445 41316 130 0 1 90 98 244 0.999 25.87 2.06 Intr - 43502 43314 189 2 0 93 70 238 0.995 22.28 2.05 Intr - 46215 46140 76 2 1 92 91 51 0.999 5.32 2.04 Intr - 48931 48416 516 0 0 88 89 554 0.999 47.57 2.03 Intr - 49384 49231 154 2 1 101 90 155 0.999 16.13 2.02 Intr - 50939 50415 525 0 0 81 94 400 0.583 32.82 2.01 Init - 68060 67343 718 2 1 106 110 570 0.995 56.03 2.00 Prom - 72918 72879 40 -3.16 3.00 Prom + 73820 73859 40 -4.46 3.01 Init + 76613 76621 9 1 0 74 115 6 0.200 2.12 3.02 Intr + 78186 78275 90 2 0 77 83 44 0.194 2.99 3.03 Intr + 83154 83217 64 2 1 12 106 96 0.191 2.09 3.04 Intr + 85124 85283 160 0 1 63 94 69 0.489 4.05 3.05 Intr + 88718 88830 113 2 2 63 62 15 0.548 -3.58 3.06 Intr + 89057 89089 33 0 0 101 105 40 0.926 5.19 3.07 Intr + 90021 90293 273 1 0 47 92 301 0.647 23.91 3.08 Intr + 92043 92071 29 1 2 94 75 26 0.156 -0.27 3.09 Intr + 93323 93352 30 1 0 68 115 24 0.113 1.43 3.10 Intr + 99977 100091 115 1 1 75 92 35 0.224 2.62 3.11 Intr + 101024 101130 107 0 2 107 78 30 0.620 3.83 3.12 Intr + 103096 103198 103 0 1 91 105 -23 0.647 -0.55 3.13 Intr + 104099 104260 162 0 0 112 106 227 0.971 26.85 3.14 Term + 104760 104845 86 1 2 112 36 134 0.985 8.42 3.15 PlyA + 104988 104993 6 1.05 4.07 PlyA - 105949 105944 6 1.05 4.06 Term - 112853 112689 165 2 0 100 49 109 0.761 6.12 4.05 Intr - 113615 113533 83 0 2 45 92 12 0.271 -3.34 4.04 Intr - 116310 116164 147 1 0 133 61 50 0.756 7.01 4.03 Intr - 116993 116832 162 0 0 113 106 201 0.994 24.35 4.02 Intr - 117609 117580 30 1 0 108 107 6 0.856 2.60 4.01 Init - 128059 128002 58 1 1 77 91 27 0.321 3.37 4.00 Prom - 135501 135462 40 -2.56 5.03 PlyA - 136258 136253 6 1.05 5.02 Term - 138550 137885 666 1 0 37 42 211 0.346 5.23 5.01 Init - 142399 142358 42 1 0 34 99 57 0.408 1.82 5.00 Prom - 142906 142867 40 -4.26 6.00 Prom + 143044 143083 40 -6.06 6.01 Init + 162811 162900 90 0 0 102 91 154 0.888 17.59 6.02 Intr + 164372 164445 74 1 2 60 68 83 0.472 1.70 6.03 Intr + 186059 186111 53 2 2 115 84 -26 0.018 -1.75 6.04 Term + 190108 190232 125 2 2 85 37 101 0.248 3.25 6.05 PlyA + 190621 190626 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 14798 14696 103 2 1 76 72 108 0.850 6.40 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:94011683_94216524|GENSCAN_predicted_peptide_1|479_aa MPGPALLLAQEQPGHLGGLCSQAQGISTKGGQTWWGNSKGFMTIWAIREGLMDNDPGFER QVVSDKWAFQKEQRRNILGKYSIFFTFLFLHHWAKLEEGRPVEDTKTLGKAPSGHLCQAC AGHCDTDEKDKVPARCRSSLPAGQDSWTQAVSELQVMPKRPKDIGNTWSSREGTCSRLNN GPNDVHVLIPTTCDYVTVARGTWQVVHEDALLPCAPRAQISDTFSGISMYQCHTLRPLLS ETSFNLISEKCDILSILRDHPENRIYRRKIEELSKRFTAIRKTKGDGNCFYRALGYSYLE SLLGKSREIFKFKERVLQTPNDLLAAGFEEHKFRNFFNALRAGRAQFYSVVELVEKDGSV SSLLKVFNDQSASDHIVQFLRLLTSAFIRNRADFFRHFIDEEMDIKDFCTHEVEPMATEC DHIQITALSQALSIALQVEYVDEMDTALNHHVFPEAATPSVYLLYKTSHYNILYAADKH >gi568815584f:94011683_94216524|GENSCAN_predicted_CDS_1|1440_bp atgcccggcccagccctgctcttggctcaggagcagccaggccacttgggaggcctgtgc tcccaggctcaagggatcagcaccaaaggaggacagacatggtggggaaattcaaaggga ttcatgaccatctgggcaatccgggaaggcctcatggacaatgacccaggatttgaaaga caggtggtatctgacaagtgggcattccagaaggagcaaagaaggaatatccttggaaaa tacagcatcttctttactttcctgtttctacatcactgggctaagcttgaggaagggcgc cccgtggaagacacaaagacacttgggaaggcccccagtgggcacctttgtcaggcctgt gctgggcactgtgacactgatgagaaagacaaggtgcctgcccgctgtaggagctcactg cctgctggccaagacagctggacacaagcagtgtctgagctacaggtcatgcctaagagg cctaaggacattgggaacacatggagcagcagagaagggacatgtagcaggctgaacaat ggccccaatgatgtccacgtcctaatccccacaacctgtgactatgttaccgtggcgaga gggacttggcaggtggtgcatgaagatgccctcctgccctgtgctccccgagctcagatc tctgacaccttctctggcatctccatgtaccagtgccacacactgcggcctcttctgagt gaaacatctttcaacctaatatcagaaaaatgtgacattctatccattcttcgggaccat cctgaaaacaggatttaccggaggaaaatcgaggaactcagcaaaaggttcaccgccatc cgcaagaccaaaggggatgggaactgcttctacagggccttgggctattcctacctggag tccctgctggggaagagcagggagatcttcaagttcaaagaacgcgtactgcagacccca aatgaccttctggctgctggctttgaggagcacaagttcagaaacttcttcaatgctctg agggccgggcgggcgcagttttacagtgtggtggaactggtagagaaggatggctcagtg tccagcctgctgaaggtgttcaacgaccagagtgcctcggaccacatcgtgcagttcctg cgcctgctcacgtcggccttcatcaggaaccgagcagacttcttccggcacttcattgat gaggagatggacatcaaagacttctgcactcacgaagtagagcccatggccacggagtgt gaccacatccagatcacggcgttgtcgcaggccctgagcattgccctgcaagtggagtac gtggacgagatggataccgccctgaaccaccacgtgttccctgaggccgccaccccttcc gtttacctgctctataaaacatcccactacaacatcctttatgcagccgataaacattga >gi568815584f:94011683_94216524|GENSCAN_predicted_peptide_2|859_aa MKLKDTKSRPKQSSCGKFQTKGIKVVGKWKEVKIDPNMFADGQMDDLVCFEELTDYQLVS PAKNPSSLFSKEAPKRKAQAVSEEEEEEEGKSSSPKKKIKLKKSKNVATEGTSTQKEFEV KDPELEAQGDDMVCDDPEAGEMTSENLVQTAPKKKKNKGKKGLEPSQSTAAKVPKKAKTW IPEVHDQKADVSAWKDLFVPRPVLRALSFLGFSAPTPIQALTLAPAIRDKLDILGAAETG SGKTLAFAIPMIHAVLQWQKRNAAPPPSNTEAPPGETRTEAGAETRSPGKAEAESDALPD DTVIESEALPSDIAAEARAKTGGTVSDQALLFGDDDAGEGPSSLIREKPVPKQNENEEEN LDKEQTGNLKQELDDKSATCKAYPKRPLLGLVLTPTRELAVQVKQHIDAVARFTGIKTAI LVGGMSTQKQQRMLNRRPEIVVATPGRLWELIKEKHYHLRNLRQLRCLVVDEADRMVEKG HFAELSQLLEMLNDSQYNPKRQTLVFSATLTLVHQAPARILHKKHTKKMDKTAKLDLLMQ KIGMRGKPKVIDLTRNEATVETLTETKIHCETDEKDFYLYYFLMQYPGRSLVFANSISCI KRLSGLLKVLDIMPLTLHACMHQKQRLRNLEQFARLEDCVLLATDVAARGLDIPKVQHVI HYQVPRTSEIYVHRSGRTARATNEGLSLMLIGPEDVINFKKIYKTLKKDEDIPLFPVQTK YMDVVKERIRLARQIEKSEYRNFQACLHNSWIEQAAAALEIELEEDMYKGGKADQQEERR RQKQMKVLKKELRHLLSQPLFTESQKTKYPTQSGKPPLLVSAPSKSESALSCLSKQKKKK TKKPKEPQPEQPQPSTSAN >gi568815584f:94011683_94216524|GENSCAN_predicted_CDS_2|2580_bp atgaagttgaaggacacaaaatcaaggccaaagcagtcaagctgtggcaaatttcagaca aagggaatcaaagttgtgggaaaatggaaggaagtgaagattgacccaaatatgtttgca gatggacagatggatgacttggtgtgctttgaggaattgacagattaccagttggtctcc cctgccaagaatccctccagtctcttctcaaaggaagcacccaagagaaaggcacaagct gtttcagaagaagaggaggaggaggagggaaagtctagctcaccaaagaaaaagatcaag ttgaagaaaagtaaaaatgtagcaactgaaggaaccagtacccagaaagaatttgaagtg aaagatcctgagctggaggcccagggagatgacatggtttgtgatgatccggaggctggg gagatgacatcagaaaacctggtccaaactgctccaaaaaagaagaaaaataaagggaaa aaagggttggagccttctcagagcactgctgccaaggtgcccaaaaaagcgaagacatgg attcctgaagttcatgatcagaaagcagatgtgtcagcttggaaggacctgtttgttccc aggccggttctccgagcactcagctttctaggcttctctgcacccacaccaatccaagcc ctgaccttggcacctgccatccgtgacaaactggacatccttggggctgctgagacagga agtgggaaaactcttgcctttgccatcccaatgattcatgcggtgttgcagtggcagaag aggaatgctgcccctcctccaagtaacaccgaagcaccacctggagagaccagaactgag gccggagctgagactagatcaccaggcaaggctgaagctgagtctgatgcattgcctgac gatactgtaattgagagtgaagcactgcccagtgatattgcagccgaggccagagccaag actggaggcactgtctcagaccaggcgttgctctttggtgacgatgatgctggtgaaggg ccttcttccctgatcagggagaaacctgttcccaaacagaatgagaatgaggaggaaaat cttgataaagagcagactggaaatctaaaacaggagttggatgacaaaagcgccacctgt aaggcatatccaaagcgtcctctgcttggactggttctgactcccactcgagagctggcc gtccaggtcaaacagcacattgatgctgtggccaggtttacaggaattaaaactgctatt ttggttggtggaatgtccacgcagaaacagcagaggatgctgaaccgtcgtcctgagatt gtggttgctactccaggccggctgtgggaattaattaaagaaaagcattatcatttgagg aaccttcggcagctcaggtgcctggtagtggatgaggctgaccggatggttgagaaaggc cattttgctgagctctcacagctgctagagatgctcaatgactcccaatacaacccaaag agacaaacgcttgttttttctgccacactcaccctggtgcatcaggctcctgctcgaatc cttcataagaagcacaccaagaaaatggataaaacagccaaacttgacctccttatgcag aaaattggcatgaggggcaagcccaaggtcattgacctcacaaggaatgaggccacggtg gagacgctaacagagaccaagatccattgtgagactgatgagaaagacttctacttgtac tacttcctgatgcagtatccaggccgcagcttagtgtttgccaacagtatctcctgcatc aaacgcctctctgggctcctcaaagtccttgatatcatgcccttgaccctgcatgcctgt atgcaccagaagcagaggctcagaaacctggagcagtttgcccgtctggaagactgtgtt ctcttggcaacagatgtggcagctcggggtctggatattcctaaagtccagcatgtcatc cattaccaggtcccacgtacctcggagatttatgtccaccgaagtggtcgaactgctcga gctaccaatgaaggcctcagtctgatgctcattgggcctgaggatgtgatcaactttaag aagatttacaaaacgctcaagaaagatgaggatatcccactgttccccgtgcagacaaaa tacatggatgtggtcaaggagcgaatccgtttagctcgacagattgagaaatctgagtat cggaacttccaggcttgcctgcacaactcttggattgagcaggcagcagctgccctggag attgagctggaagaagacatgtataagggaggaaaagctgaccagcaagaagaacgtcgg agacaaaagcagatgaaggttctgaagaaggagctgcgccacctgctgtcccagccactg tttacggagagccagaaaaccaagtatcccactcagtctggcaagccgcccctgcttgtg tctgccccaagtaagagcgagtctgctttgagctgtctctccaagcagaagaagaagaag acaaagaagccgaaggagccacagccggaacagccacagccaagtacaagtgcaaattaa >gi568815584f:94011683_94216524|GENSCAN_predicted_peptide_3|457_aa MQKGRVELSQKVSVWQGPFITLEFRQKVISGRLDDIEETPELKEIDRNTNWLTLVPGGMQ SSLLMQLYSNQILNKVKTQPTGGSPRIRVEAHRGEGANHGKGEWMGLSHCQMFLPALRAQ HNSGGPGMLCPVLQGAQPQGWAEETGRAAVAAVVGGVFPESTADPSSSSPQEDVREVWGT PSHGANPKSPLPAVVAVGTVLVALSAMGFTSVGIAASSIAAKMMSTAAIANGGGVAAGSL VAILQSVAWLYSSSHQEPLRKSTPDPKATELTRAGMEASALTSSAVTSVAKVVRVASGSA VVLPLAALSPNISLLRPLLGALEASSFMLGSLTGTLFCNLEMGNRLRKWRGSQCGSTHRM FFWFPARIATVVIGGVVAMAAVPMVLSAMGFTAAGIASSSIAAKMMSAAAIANGGGVASG SLVATLQSLGATGLSGLTKFILGSIGSAIAAVIARFY >gi568815584f:94011683_94216524|GENSCAN_predicted_CDS_3|1374_bp atgcagaagggtagagtggagctcagtcagaaagtgtcggtatggcaaggaccattcata actcttgagttccgacaaaaggtgatatctggaagattagacgatattgaggagacccct gaactaaaggaaatagaccgcaacaccaattggctgactttggtcccaggagggatgcag agttctttattaatgcagctttattcaaaccagatcctgaataaagtcaaaactcaacca acaggtggaagtccaagaatccgagtggaggctcaccgaggcgaaggggccaaccatggg aaaggagagtggatgggactcagccattgtcagatgttcctgccagccctgagagctcaa cacaactcaggaggcccaggcatgctctgcccagtgctgcagggggcccagccacaaggc tgggcagaggagacaggcagggctgctgtagcagctgtggtcggaggagtctttcctgag agcacagcagacccctcctcctcatcgccccaggaggatgtcagggaagtctgggggact ccgtcccatggggccaaccccaaatctccacttcccgcagttgtggctgtggggactgtg ctcgtggcgctcagtgccatgggcttcacctcagtaggaatcgccgcatcctccatagca gccaagatgatgtctacagcagccattgccaacgggggcggagttgctgctggcagtctg gtggctattctgcagtcagtggcatggctgtacagctcctcccatcaagagccattaagg aagagcacaccggacccgaaggccacggaattaacccgagcaggcatggaggcctctgct ctcacctcatcagcagtgaccagtgtggccaaagtggtcagggtggcctctggctctgcc gtagttttgcccctggctgccctgagccccaacatctctctcctcagacccttgctgggg gcactggaggctagctccttcatgctggggtccctcactggcaccctgttttgcaacttg gagatgggcaatcggcttagaaagtggaggggaagccagtgtggatctactcacagaatg ttcttttggtttccagccaggattgctacagttgtgattggaggagttgtggccatggcg gctgtgcccatggtgctcagtgccatgggcttcactgcggcgggaatcgcctcgtcctcc atagcagccaagatgatgtccgcggcggccattgccaatgggggtggagttgcctcgggc agccttgtggctactctgcagtcactgggagcaactggactctccggattgaccaagttc atcctgggctccattgggtctgccattgcggctgtcattgcgaggttctactag >gi568815584f:94011683_94216524|GENSCAN_predicted_peptide_4|214_aa MGPQNNRTVTTISRLLSEAERAAAAAVGGALAVGAVPVVLSAMGFTGAGIAASSIAAKMM SAAAIANGGGVSAGSLVATLQSVGAAGLSTSSNILLASVGSVLGACLGNSPSSSLPAEPE AKEDEARENVPQDGLKAAFWVHTRHLIDHLVTYQEGRAGQKGTKTLSCGGKTADPVASKA HAKTIAEGEEKVKPSALGEEQETILSPDLETCYP >gi568815584f:94011683_94216524|GENSCAN_predicted_CDS_4|645_bp atgggacctcagaataataggacagtcaccacaatttcaaggctgctctcagaggcagaa cgggcagctgctgctgcagtgggaggagccctggcagtgggggctgtgcccgtggtgctc agtgccatgggcttcactggggcaggaatcgccgcgtcctccatagcagccaagatgatg tccgcagcagccattgccaacgggggtggtgtttctgcggggagcctggtggctactctg cagtccgtgggggcagctggactctccacatcatccaacatcctcctggcctctgttggg tcagtgttgggggcctgcttggggaattcaccttcttcttctctcccagctgaacccgag gctaaagaagatgaggcaagagaaaatgtaccccaagatggtcttaaggcagccttctgg gtgcacacaaggcatcttatagaccacttggtaacctatcaagaaggaagagctggccag aagggcacaaaaaccttaagctgcgggggaaagacagccgaccctgtagcttccaaagct catgcaaagacaattgcagaaggagaagaaaaagtaaaaccctccgcccttggggaagag caagaaaccatcctgagcccagacctggagacctgctacccttag >gi568815584f:94011683_94216524|GENSCAN_predicted_peptide_5|235_aa MTIQCSKQRAQYQQKINGQKVNKDIQDLNSAMGQADLIGIYRTLYPKSTEYTFFSAPHHT YSKIDHIIGSKTLLSKCKRMEIVTNSLSDHSAITLEVRIRKCTQNLTTTWKLNNLLLNDY WVHIEIKAEIIKFFETNENKDTMYQNLWDTAKAVFRRKFIALNAHRRKQKRSKIKTLISQ LKEVEKQEQTNPKASGRQKISEIRAELKEIETRKTLQKINESGSWFFEKINKIDC >gi568815584f:94011683_94216524|GENSCAN_predicted_CDS_5|708_bp atgaccatccagtgcagcaagcagcgtgctcagtatcaacagaagatcaacggacagaaa gttaacaaggatattcaggacttgaactcagctatgggccaagcggacctaataggcatc tacagaactctctaccccaaatcaacagaatatacattcttctcggcaccacatcacact tattctaaaattgaccacatcattggaagtaaaacactcctcagcaaatgcaaaagaatg gaaattgtaacaaacagtctctcagaccacagtgcaatcacattagaagtcaggattagg aaatgcactcaaaacctcacaactacatggaaactgaacaacctgctcttgaatgactac tgggtacatattgaaattaaggcagaaataattaagttctttgaaaccaatgagaataaa gacacaatgtaccagaatctctgggacacagctaaagcagtgtttagaaggaaatttata gcactaaatgcccacagaagaaagcagaaaagatctaaaatcaaaaccctaatatcgcaa ttaaaagaagtagagaagcaagagcaaacaaatccaaaagctagcggaagacaaaaaata agtgagattagagcagaactaaaggagatagagacacgaaaaacccttcaaaaaatcaat gaatctggaagctggttttttgaaaagattaacaaaatagactgctag >gi568815584f:94011683_94216524|GENSCAN_predicted_peptide_6|113_aa MDFSQNSLFGYMEDLQELTIIERPVRRSLKTPEEIERLTVDEDLSDIERAVYLLRQKLLS ATIVQFVFSVIFCVYPEEKKSLYEKDACKCMFVAVQFAVAKIWNQPKCPSVNE >gi568815584f:94011683_94216524|GENSCAN_predicted_CDS_6|342_bp atggatttcagtcagaacagcctgttcggttacatggaggacctgcaggagctcaccatc atcgagaggccggtccgccggagcctcaagacaccggaagaaatagaaagattgacagtc gatgaagacctcagtgatattgaaagggctgtttatctgctcaggcaaaaactgctttct gccactatagttcagtttgtattttctgtgattttctgtgtctacccagaggaaaagaag tcattatatgaaaaagatgcttgcaaatgcatgtttgtagcagtacaatttgcagttgct aaaatatggaaccagcccaaatgcccatcagtcaatgagtag