GENSCAN 1.0 Date run: 3-Nov-116 Time: 09:52:11 Sequence gi568815588r:123910152_124146212 : 236061 bp : 48.78% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4257 4394 138 1 0 136 61 55 0.819 8.04 1.02 Term + 25568 25962 395 0 2 -83 40 1051 0.903 78.40 1.03 PlyA + 27577 27582 6 -0.45 2.06 PlyA - 27601 27596 6 1.05 2.05 Term - 28603 28504 100 0 1 98 43 78 0.406 1.90 2.04 Intr - 28907 28862 46 0 1 58 84 39 0.235 -2.03 2.03 Intr - 31696 31540 157 1 1 73 75 74 0.061 4.18 2.02 Intr - 35816 35757 60 2 0 106 87 -5 0.128 0.13 2.01 Init - 42169 42125 45 1 0 92 78 69 0.695 6.98 2.00 Prom - 43265 43226 40 -5.36 3.06 PlyA - 45243 45238 6 1.05 3.05 Term - 57806 57642 165 2 0 75 42 123 0.355 4.32 3.04 Intr - 63138 62957 182 1 2 52 61 106 0.161 3.89 3.03 Intr - 63300 63222 79 0 1 79 68 37 0.291 -0.08 3.02 Intr - 63619 63498 122 2 2 88 14 85 0.266 1.31 3.01 Init - 65269 65248 22 1 1 76 81 -4 0.361 -2.87 3.00 Prom - 66959 66920 40 -4.26 4.00 Prom + 72850 72889 40 -1.76 4.01 Init + 81452 81683 232 1 1 50 31 286 0.398 15.52 4.02 Intr + 81845 82141 297 0 0 -59 -7 346 0.503 7.25 4.03 Intr + 82173 82439 267 1 0 61 82 178 0.409 12.00 4.04 Intr + 87676 87801 126 2 0 106 60 26 0.017 2.25 4.05 Term + 98280 98488 209 1 2 40 47 149 0.071 3.50 4.06 PlyA + 98944 98949 6 1.05 5.10 PlyA - 99317 99312 6 -0.45 5.09 Term - 100188 99998 191 1 2 103 54 247 0.865 20.41 5.08 Intr - 100547 100419 129 0 0 59 47 64 0.478 0.07 5.07 Intr - 102329 102182 148 2 1 110 52 202 0.977 18.61 5.06 Intr - 111261 111105 157 2 1 81 23 209 0.849 13.71 5.05 Intr - 120306 120223 84 2 0 100 68 50 0.353 3.14 5.04 Intr - 128520 128364 157 1 1 108 83 241 0.836 24.67 5.03 Intr - 132296 132150 147 0 0 87 99 190 0.988 20.21 5.02 Intr - 134768 134429 340 2 1 149 97 704 0.994 72.45 5.01 Init - 136061 135516 546 2 0 80 84 202 0.931 13.91 5.00 Prom - 142034 141995 40 -4.36 6.00 Prom + 144888 144927 40 -7.36 6.01 Init + 145683 145845 163 2 1 44 49 118 0.261 3.39 6.02 Term + 146480 146790 311 0 2 57 54 172 0.420 5.92 6.03 PlyA + 147379 147384 6 1.05 7.00 Prom + 161439 161478 40 -4.66 7.01 Init + 173736 173772 37 2 1 113 91 -3 0.820 2.68 7.02 Intr + 174213 174265 53 1 2 116 55 40 0.413 2.13 7.03 Intr + 176834 176970 137 1 2 -20 90 183 0.577 7.07 7.04 Intr + 181593 181908 316 0 1 52 44 189 0.654 6.97 7.05 Intr + 184215 184399 185 2 2 47 22 148 0.013 2.69 7.06 Intr + 198032 198148 117 2 0 40 32 216 0.157 10.78 7.07 Intr + 209317 209360 44 1 2 82 51 12 0.023 -5.22 7.08 Term + 213786 213940 155 1 2 95 54 139 0.825 9.28 7.09 PlyA + 216453 216458 6 -0.45 8.05 PlyA - 216525 216520 6 -1.75 8.04 Term - 217731 217611 121 2 1 79 53 93 0.851 2.85 8.03 Intr - 219248 219089 160 0 1 61 77 73 0.032 2.55 8.02 Intr - 230016 229913 104 2 2 66 105 62 0.336 5.52 8.01 Init - 233120 232780 341 2 2 30 58 140 0.185 0.74 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 25350 25443 94 2 1 95 111 15 0.925 5.05 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:123910152_124146212|GENSCAN_predicted_peptide_1|177_aa XKPVHSTHCCGFKFLQRSPVFQVCVTDSLLDISTQMLPPGPKLHSKDPARNTSIITIIIT ITIITSIFTIIITTTTSSIFIAIITSVLTIITTTTITTILTIITTTNTSIFTIITTFTII ITIITSIFTIITIITTTTITSILTIITTTNTSIFTIITITIITTIFTITTTITIIIY >gi568815588r:123910152_124146212|GENSCAN_predicted_CDS_1|534_bp nggaagcctgttcattctacccattgctgtggctttaaattcttgcagagaagtcccgtg ttccaggtctgtgtgactgacagcctccttgacatctccactcagatgctgcctccaggt cccaaacttcactcgaaagatccagccagaaatacaagcatcatcaccatcatcatcacc atcaccatcataaccagcatctttaccatcatcatcaccaccaccaccagcagcatcttc atcgccatcatcaccagcgtcttgaccatcatcaccaccaccaccatcaccaccatcctt accatcatcaccaccaccaacaccagcatcttcaccatcatcaccacatttaccatcatc atcaccatcatcaccagcatctttaccatcatcaccatcatcactaccaccaccatcacc agcatccttaccatcatcaccaccaccaacaccagcatcttcaccatcatcaccatcacc atcatcaccactatctttaccatcaccaccaccatcaccatcatcatctattga >gi568815588r:123910152_124146212|GENSCAN_predicted_peptide_2|135_aa MAGVNHDVITKSEKEGTATLPSVSVDFPLLDISLKMRFREGRWPAQGERENQFFWPHTLA FELFFKAPPKAEEGESPLGSTGQPSAAGYQEHNVSAPPEEGEGPERIRVALSICKMAAVD AGQPGSQRAFTILAC >gi568815588r:123910152_124146212|GENSCAN_predicted_CDS_2|408_bp atggccggtgtaaatcatgatgtcatcaccaaatcggaaaaggagggaactgctacttta ccttctgtctctgtggattttcctcttctggacatttcattgaaaatgcggttcagagaa ggtaggtggcctgctcaaggtgaacgagagaaccagttcttctggcctcatactctagct tttgagctcttcttcaaagcacccccaaaggcagaggagggggaaagtcctctggggtcc acagggcagccgtccgctgcaggataccaggaacataatgtgtctgctcctccagaggag ggagagggaccagagcgtatacgggtggccctgagtatctgcaagatggctgcggtagac gctggccagccgggcagccagcgggcattcaccattcttgcttgctaa >gi568815588r:123910152_124146212|GENSCAN_predicted_peptide_3|189_aa MAPAPASATLHLKELDGAGTHSYIGAAPKSGIHKQFWAYKNQKRIDCTNFCDFSFESFAF QKAHSVPLTLAGLVDTHGWVRPQSLLHKPEPCKSQGDSAPVGLSASYFPSPHLEENLLGP ECIQGNKNFREGVHQFISVGFTGVSENSEVHLCTMRLTPVFGSPTQPAGGRLDLGMHRSG SPETCSLPR >gi568815588r:123910152_124146212|GENSCAN_predicted_CDS_3|570_bp atggcaccagcacctgcttctgcgaccttgcacctgaaggagctggatggggctggtacc cacagctacattggagctgctcccaaatcaggaattcataagcaattctgggcttacaag aatcaaaagcggattgattgtactaacttttgcgatttcagctttgaaagttttgcattc cagaaagcccactcagtcccactcaccctggccggcctggtggacacccatggctgggtt cgcccgcaaagtctgctgcataagcccgagccctgcaaatctcagggagacagcgctccc gtcgggctgtccgccagctatttcccatccccacacctggaggaaaatctgcttggccct gagtgtattcaagggaacaagaacttcagggaaggcgtccaccagtttattagcgtgggc ttcacgggggtatctgagaactcagaagtccacctttgcacgatgaggctgacgcctgtg tttggttccccgacacagccagcagggggcaggctggacctgggaatgcaccggagcggc tccccggagacctgcagtttgccgagatga >gi568815588r:123910152_124146212|GENSCAN_predicted_peptide_4|376_aa MKTLGVRARLGARGSRSAAGSPAGGRGRGGGGGGGGGGGGGGRYWTRRDERGGGGSGGDS GPRGDGARDGGRGGSGGVRRTSRCWQPESCLGTAKRFNVRSGYGFTNRNDAKEDVFVHWT AVKRNNPRKFLRSVRDGETVEFDVVEGEKGAEATNVTGPRAAGVPMKGTVMPPTDVVAPP PMVAEIPSRGTEPGGEGERAEDSGQRPRRWRPPPFYLRRFVRGPRPPNQQQPIEGTDGVE PNRQPHWRGTNSREMSGSPRPDSGPGWTPAPSLILVFDHLGLDFAFDLASIACHILQLVT STRWVRFGQISLLAAAPSAGYREGGDCPCKWEFRTHAPSVAKTALAILALRALCTPSNGL CVPSLMAERQHQQKDG >gi568815588r:123910152_124146212|GENSCAN_predicted_CDS_4|1131_bp atgaagacattgggcgtccgggcgaggctcggggcgcgaggatcgcgcagcgcagcgggt tcgccagccggggggaggggccggggcggtggcggcggcggcggaggaggcggtggcggc gggggccggtactggacccggcgggatgagcgaggtggaggcggcagcggtggcgacagc ggtccccgcggcgacggtgcccgcgacggtggcaggggtggtagcggtggtgtcaggcgg acaagccggtgctggcaacccgagtcctgcctgggcactgccaaacggttcaacgtccgg agtggttacggattcaccaacaggaatgacgccaaggaagatgtcttcgttcactggaca gctgttaaaagaaacaaccccaggaagtttctgcgcagcgttagagatggggagactgtg gaatttgatgtcgtggaaggagagaagggcgcagaagccactaatgtaactgggcccagg gcggcgggggtgcccatgaagggcaccgttatgcccccaaccgacgtagttgccccacca cccatggtggcagagatcccctctcgggggacagaacctggcggcgaaggggagcgggcc gaagactctgggcagcggcccagacgatggcgccccccacccttctacctacggcggttt gtgcgaggcccccggccccccaaccagcagcagcctatagagggcactgatggggtagaa ccaaacagacagccccattggaggggcaccaacagcagggagatgagtgggtccccccgc ccagattccggcccaggctggacaccagctccaagtctaatccttgtctttgaccatttg gggcttgactttgcctttgaccttgcctccattgcctgccacatccttcagttagtgaca agcacaagatgggtcagatttggccagatttccctgctggcggctgctcccagtgccggc tatagagaaggtggggactgtccctgcaagtgggagttcaggacacacgctccttcagta gcaaaaacagccctggccatcctggcgctcagagccctctgcacaccttcgaacgggctg tgtgtcccttctctgatggcagaaagacaacaccagcagaaagacggctga >gi568815588r:123910152_124146212|GENSCAN_predicted_peptide_5|632_aa MRHCINCCIQLLPDGAHKQQVNCQGGPHHGHQACPTCKGENKILFRVDSKQMNLLAVLEV RTEGNENWGGFLRFKKGKRCSLVFGLIIMTLVMASYILSGAHQELLISSPFHYGGFPSNP SLMDSENPSDTKEHHHQSSVNNISYMKDYPSIKLIINSITTRIEFTTRQLPDLEDLKKQE LHMFSVIPNKFLPNSKSPCWYEEFSGQNTTDPYLTNSYVLYSKRFRSTFDALRKAFWGHL AHAHGKHFRLRCLPHFYIIGQPKCGTTDLYDRLRLHPEVKFSAIKEPHWWTRKRFGIVRL RDGLRDRYPVEDYLDLFDLAAHQIHQGLQASSAKEQSKMNTIIIGEASASTMWDNNAWTF FYDNSTDGEPPFLTQDFIHAFQPNARLIVMLRDPVERFWKIKNYKGPCEDYCSAVSLLKL PIDSWLYSDYLYFASSNKSADDFHEKVTEALQLFENCMLDYSLRACVYNNTLNNAMPVRL QVGLYAVYLLDWLSVFDKQQFLILRLEDHASNVKYTMHKVFQFLNLGRTFLQGRADTHGE QPTKGNWKGILEVSLRRCIIIKCQLGSPEGPLSEKQEALMTKSPASNARRPEDRNLGPMW PITQKILRDFYRPFNARLAQVLADEAFAWKTT >gi568815588r:123910152_124146212|GENSCAN_predicted_CDS_5|1899_bp atgaggcactgcattaattgctgcatacagctgttacccgacggcgcacacaagcagcag gtcaactgccaagggggcccccatcacggtcaccaggcgtgccccacgtgcaaaggagaa aacaaaattctgtttcgtgtggacagtaagcagatgaacttgcttgctgttctcgaagtg aggactgaagggaacgaaaactggggtgggtttttgcgcttcaaaaaggggaagcgatgt agcctcgtttttggactgataataatgaccttggtaatggcttcttacatcctttctggg gcccaccaagagcttctgatctcatcacctttccattacggaggcttccccagcaacccc agcttgatggacagcgaaaacccaagtgacacaaaggagcatcaccaccaatcctctgta aataatatttcatacatgaaggactatccaagcattaaattaattatcaacagcatcaca actaggattgagttcacgaccagacagctcccagacttagaagaccttaagaagcaggag ttgcatatgttttcagtcatccccaacaaattccttccaaacagtaagagcccctgttgg tacgaggagttctcggggcagaacaccaccgacccctacctcaccaactcctacgtgctc tactccaagcgcttccgctccaccttcgacgccctgcgcaaggccttctggggccacctg gcgcacgcgcacgggaagcacttccgcctgcgctgcctgccgcacttctacatcataggg cagcccaagtgcgggaccacagacctctatgaccgcctgcggctgcaccctgaggtcaag ttctccgccatcaaggagccacactggtggacccggaagcgctttggaatcgtccgccta agagatgggctgcgagaccgctatcccgtggaagattatctggacctctttgacctggcc gcacaccagatccatcaaggactgcaggccagctctgcaaaggagcagagcaagatgaat acaatcattatcggggaggccagtgcctccacgatgtgggataataatgcctggacgttc ttctacgacaacagcacggatggcgagccaccgtttctgacgcaggacttcatccacgcc tttcagccaaatgccagactgattgtcatgctcagggaccctgtggagagattttggaag attaagaattacaagggtccttgtgaggactactgtagtgcggtgtctttattgaagctg cctattgattcctggttgtactcagactatctctactttgcaagttcgaataaatccgcg gacgacttccatgagaaagtgacagaagcactgcagctgtttgaaaattgcatgcttgat tattcactgcgcgcctgcgtctacaacaacaccctcaacaacgccatgcctgtgaggctc caggttgggctctatgctgtgtaccttctggactggctcagcgtttttgacaagcaacag tttctcattcttcgcctggaagatcatgcatccaacgtcaagtacaccatgcacaaggtc ttccagtttctgaacctaggccgcacgttcctccaaggacgggctgacacccatggagag cagcccaccaagggcaattggaagggaatccttgaggtcagccttagacgctgtatcatc attaaatgccagctgggctcccccgaggggcccttaagtgagaagcaggaggctttgatg accaagagccccgcatccaatgcacggcgtcccgaggaccggaacctggggcccatgtgg cccatcacacagaagattctgcgggatttctacaggcccttcaacgctaggctggcgcag gtcctcgcggatgaggcgtttgcgtggaagacgacgtga >gi568815588r:123910152_124146212|GENSCAN_predicted_peptide_6|157_aa MQDSEAPGIPEDSTAVIHSKKHPEPLNNDGGYTQTQGKTQVGSSKSSACQYKPAAETMDR NGTGPMHRRQAPVRSPQLLFCPRTSKKPGLCAVGHAAAQARTTGHSAAQARTTGHSTAQA YTTGHTLTQPCLYDRTLCGPGLYDRTHCSLGPYDRTL >gi568815588r:123910152_124146212|GENSCAN_predicted_CDS_6|474_bp atgcaagacagtgaagctcctgggatccctgaggattctactgcagtcattcacagtaag aaacacccggaacccctcaacaatgatggtgggtacactcagacccagggcaaaacccag gtgggctccagcaaatcatcagcttgccagtacaaacctgcggctgagacaatggacagg aatggaactggccccatgcacaggaggcaggccccggtgcgcagcccccaactcctattt tgtccccgcacatcaaagaagcctgggctgtgtgcagttggacacgctgcggcccaggcc cgtacgaccggacactccgcggcccaggcccgtacaaccggacactccacggcccaggcc tatacgaccggacacactttgacccagccctgcctgtacgaccggacactctgcggccca ggcctgtacgaccggacacattgcagcctgggcccgtacgaccggacactctga >gi568815588r:123910152_124146212|GENSCAN_predicted_peptide_7|347_aa MGQAGDDHRRGSGQLTSMAQPSPLEPLGQKSFRVTGAKYLADKKREGTGELTAISMSELP GVTKGNTAAQTTFVEGQTKKGPRPQELVPASSLAASFEDFPTLGADLQPPAAGNPAPAFL LYTRRPSRAGRPRAQGGRERGHPDSDGEVKQLCPGPGRLRSLNTHTLARTPAHRLPGGVG KWPGVGCPREPPGQPEGKKAAPVADSTDSGLKMPSVKEKTHEILADSLRGRLEHQAQPLR KLRMRPGTSKGKAATVPESLQRHLDAKYAYVDCYAYVDCYAWTSSAKEPALHPGCWAHQH GDDPTETMEVAEDDTEDDRHKTEEAGSSDHHLEAGWLLVHLHGLTHE >gi568815588r:123910152_124146212|GENSCAN_predicted_CDS_7|1044_bp atgggtcaggctggagatgaccacaggcgaggaagtggccagctgacctccatggcccaa ccttctccgttagagcctttgggccagaagagcttccgtgtcacgggcgccaaatatttg gctgacaagaaacgtgaaggtacaggtgaacttaccgccatcagcatgagtgagctgcca ggggttacaaaagggaacacagctgcccagaccactttcgtggaagggcagacgaagaag ggcccccgcccccaggagctggtcccagcctccagcctggcagcctccttcgaagacttc cctacactgggcgcggacctccagccgccagcagccggcaacccagccccagccttcctc ctctatacccgccgtcccagccgcgccggccggccccgggcgcagggcgggcgggaacgt gggcacccggacagcgacggggaggtcaagcagctgtgcccaggacccgggcggctccga agcctcaacacgcacacactcgcgcgcacgccggcccatcggttgcctggaggagtcggg aagtggcctggagtcggctgccccagggaaccacctggacagcccgagggcaagaaagcg gcaccagtggccgactccactgactctgggctgaagatgccatcagttaaagaaaagaca catgaaattctggcagattcgctcagagggcgcttggagcaccaagcccagcctctcagg aaactgaggatgagaccagggacctccaagggcaaagcagccacggtccctgagtcactg caaagacacctggatgccaagtacgcatacgtggactgctatgcctacgtggactgctat gcctggacctcctctgccaaggagcctgctcttcaccctggctgctgggctcaccagcat ggagatgaccccactgagaccatggaggtagctgaagatgacactgaagatgacagacac aagacagaggaggcagggtcatctgatcatcacctggaagctggctggctgctggtgcat ttgcacggattgacacatgaatga >gi568815588r:123910152_124146212|GENSCAN_predicted_peptide_8|241_aa MCGDRAEGGGLSPTVGVGSLLLAVSKQDGQLVDRDAGAGPRDLGGGLDETHIPDSQVAAV LGFSLGESMERRSERGSNDSFISLSLDPGLWWPYLSISLVSVAGNKDWPSKKFRSLQFSE GTRGKLERSWKIWEKRITRRLNEVASAGGPGSVQPPRVCGVGKVAVPVATWSRGSGEGGR PSACLVMGATGPIERMEGGFGSLLVYKPETETHSRSDLVRLEACTPLVGPTAPVVILSMM S >gi568815588r:123910152_124146212|GENSCAN_predicted_CDS_8|726_bp atgtgtggggacagagctgagggcggtggcctgagccccactgtgggagtgggctccctg ttactggctgtatccaagcaggatgggcagctggtagacagggatgctggggcagggcct cgagacctgggtggaggcttggatgagacacacatccctgactcacaagttgctgctgtt ttaggtttctcactgggtgagtctatggaaaggagatcagaaagaggaagtaacgactcc ttcatctccctatccctcgaccccggcctctggtggccttacctgagcatatctctagtg tcagtggctgggaataaagactggccgagcaagaagttcaggtccctgcagttttctgaa ggcacaagaggcaagcttgagaggagctggaaaatatgggaaaagagaatcacaaggagg ctcaacgaggtggccagtgccggagggccaggatctgtgcagcccccaagagtgtgtgga gtggggaaggtggccgtcccagtggctacctggtcacgggggagtggggagggtggccgt cccagtgcctgcctggtcatgggggccacggggcccatcgagaggatggaaggaggcttt ggaagtctcctggtctacaagccagaaaccgagactcactcacgttctgatctggtgcgt ctggaagcctgcacccccttggttgggccaactgcccctgttgtcatcctctcgatgatg tcctga