GENSCAN 1.0 Date run: 3-Nov-116 Time: 12:29:29

Sequence gi568815591f:17199265_17443061 : 243797 bp : 36.80% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Sngl +  17069  17260  192  1  0   48   54   213 0.679   8.79
 1.02 PlyA +  18212  18217    6                                 1.05

 2.00 Prom +  18383  18422   40                                -6.15
 2.01 Sngl +  18476  19357  882  1  0   49   41   420 0.659  29.17
 2.02 PlyA +  19451  19456    6                                -0.45

 3.00 Prom +  19675  19714   40                               -11.84
 3.01 Init +  19759  21437 1679  0  2   39   53   458 0.218  28.53
 3.02 Intr +  27747  27811   65  0  2   62   92    38 0.038  -0.96
 3.03 Intr +  57925  58074  150  2  0   36   72   141 0.227   6.51
 3.04 Intr +  71854  72024  171  2  0   42   24   153 0.381   3.29
 3.05 Term +  75068  75165   98  0  2   78   49   111 0.484   3.35
 3.06 PlyA +  76738  76743    6                                 1.05

 4.00 Prom +  82256  82295   40                                -6.15
 4.01 Init + 100001 100065   65  1  2  105   92   138 0.948  16.57
 4.02 Intr + 110672 110859  188  2  2   96   95   154 0.999  15.31
 4.03 Intr + 123237 123343  107  1  2   95   99    57 0.997   6.51
 4.04 Intr + 128495 128584   90  1  0  101  111     1 0.884   2.97
 4.05 Intr + 130688 130811  124  1  1   76  116   152 0.993  16.04
 4.06 Intr + 131492 131627  136  0  1   69   82    21 0.512  -1.69
 4.07 Intr + 133070 133205  136  2  1   53   29   115 0.562   1.75
 4.08 Intr + 134648 134850  203  1  2   88  108   124 0.975  11.56
 4.09 Intr + 135623 135732  110  2  2   70  100    54 0.999   3.81
 4.10 Intr + 136381 136522  142  2  1   69   85   116 0.730   7.89
 4.11 Intr + 139722 140964 1243  0  1   34  116   739 0.588  57.66
 4.12 Term + 143657 143800  144  1  0   63   31    89 0.437  -2.37
 4.13 PlyA + 143862 143867    6                                 1.05

 5.03 PlyA - 144559 144554    6                                 1.05
 5.02 Term - 147340 147312   29  2  2   58   38    15 0.161  -9.24
 5.01 Init - 147967 147826  142  1  1   83  107   127 0.775  14.44
 5.00 Prom - 150198 150159   40                                -4.95

 6.02 PlyA - 151330 151325    6                                 1.05
 6.01 Sngl - 154312 154010  303  1  0   77   54   204 0.647  11.48
 6.00 Prom - 173727 173688   40                                -4.35

 7.05 PlyA - 173861 173856    6                                 1.05
 7.04 Term - 191647 191537  111  1  0  111   48   141 0.980   9.98
 7.03 Intr - 195125 194921  205  1  1   43   39   149 0.327   3.78
 7.02 Intr - 200279 200215   65  2  2  117   54   104 0.303   6.50
 7.01 Init - 211784 211770   15  2  0   68  110    10 0.104   1.48
 7.00 Prom - 212578 212539   40                                -6.95

 8.00 Prom + 216454 216493   40                                -2.55
 8.01 Sngl + 219612 220085  474  2  0   49   43   208 0.636   8.48
 8.02 PlyA + 220530 220535    6                                 1.05

 9.03 PlyA - 221893 221888    6                                 1.05
 9.02 Term - 235236 235090  147  0  0   40   41   144 0.637   1.82
 9.01 Init - 241408 241034  375  1  0   75   75   164 0.432  10.78

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591f:17199265_17443061|GENSCAN_predicted_peptide_1|63_aa
MENDFDKLREEGFRGSNSSELKEEVRTHRKEVKNLEKKLDEWLTRITNAEMSLKDLMELK
TTA

>gi568815591f:17199265_17443061|GENSCAN_predicted_CDS_1|192_bp
atggagaatgactttgacaagttgagagaagaaggcttcagaggatcaaactcctccgag
ctaaaggaggaagttcgaacccatcgcaaagaagttaaaaaccttgaaaaaaaattagac
gaatggctaactagaataaccaatgcagagatgtccttaaaggacctgatggagctgaaa
accaccgcatga

>gi568815591f:17199265_17443061|GENSCAN_predicted_peptide_2|293_aa
MGDFNTPLSTLDRSMRQKVNKDIQELNSALHQADLTDTYRTLYPKSTEYTFFSAPHHTYP
KIDHIVGSKALLSKCKRTEIITNYLSDHSAIKLELRIKKLTQNCSTAWKLNNLLLNDCWV
HNEMKAEIKMFFETNKNKDTTYQNLWDTFKAVCRGKFIALNIHKRKQERSKIDTLTSQLK
ELEKQEQTHSKVSRRQEITKIRAELKEIETQNTLQKINEARSWFFETINKIDRPLARLIK
KKREKNQIDEIRNDKGDITTNPTEIETTIREYYKHLYANKLENLEEMDKFLDT

>gi568815591f:17199265_17443061|GENSCAN_predicted_CDS_2|882_bp
atgggagactttaacaccccactgtcaacattagacagatcaatgagacagaaagttaac
aaggatatccaggaattgaactcagctctgcaccaagcagacctaacagacacctacaga
actctctaccccaaatcaacagaatatacattcttttcagcaccacaccacacctatccc
aaaattgaccacatagttggaagtaaagcactcctcagcaaatgtaaaagaacagaaatt
ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaaactc
actcaaaactgctcaactgcatggaaactgaacaacctgctcctgaatgactgttgggta
cataatgaaatgaaggcagaaataaagatgttctttgaaaccaacaagaacaaagacaca
acataccagaatctctgggacacattcaaagcagtgtgtagagggaaatttatagcacta
aatatccataaaagaaagcaggaaagatctaaaattgacaccctaacatcacaattaaaa
gaactagagaagcaagagcaaacacattcaaaagttagcagaaggcaagaaataactaag
atcagagcagaactgaaggaaatagagacacaaaacacccttcaaaaaatcaatgaagcc
aggagctggttttttgaaacgatcaacaaaattgatagaccactagcaagactaataaag
aagaagagagagaagaatcaaatagatgaaataagaaatgacaaaggggatatcaccacc
aatcccacagaaatagaaactaccatcagagaatactataaacacctctatgcaaataaa
ctagaaaatctagaagaaatggataaattcctcgacacatag

>gi568815591f:17199265_17443061|GENSCAN_predicted_peptide_3|720_aa
MQGWLNIFKSINVIHHINRTNDKNHTIISIDAEKAFDKIQHPFMLKTLNKLGIDGTYLKI
TRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNVVLEVLARAIRQEKEIKGIQ
LGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQASLYTNNRQT
ESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGR
INIVKMAILPKVIYRFNAIPIKLPMTSFTELEKTTLKFIWNQKRAHITKSILSQKNKAGG
ITLPEFKLHYKAAVTKTAWYWYQNRDIDQWNRTEPSEIMLHIYNHLIFDKPEKNKKWGNN
SLFSRWCWEDWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGSTIQDIG
MGKDFMSKTPKTMATKAKIDKWDLIKLKSFCTAEETTIRVNRQPTEWEKIFAIYSSDKGL
ISRIYNELQQIYKKRTNNPIKKWAKHMNRHFSKEDIYAANRHMKKCSSSLAIREMQIKTT
MRYHLTPVRMAIIKKSGNDREERQTWFVVKETSSLQERQESGRKDDVGKEKSSSDSVFEC
EVFVGLSRRDVQEALNLCMSWFRREIWAKDAGIGAQCGHGAGIWSMGICKQTADCCCKVW
NNMPMHIPALAMALVSMMQELREKPLSQGSPSPSTMSGRNLRSVPDADASAMLLIQFVEL

>gi568815591f:17199265_17443061|GENSCAN_predicted_CDS_3|2163_bp
atgcaaggctggctcaacatattcaaatcaataaacgtaatccatcatataaacagaacc
aacgacaaaaaccacacaattatctcaatagatgcagaaaaggcctttgacaaaattcaa
catcccttcatgctaaaaactctcaataaattaggtattgatgggacgtatctcaaaata
acaagagctatctatgacaaacccacagccaatatcatactgaatggacaaaaactggaa
gcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattcaat
gtagtgttggaagttctggccagggcaattaggcaggagaaggaaataaagggtattcaa
ttaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatctagaa
aaccccattgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctca
ggatataaaatcaatgtgcaaaaatcacaagcatccttatacaccaataacagacaaaca
gagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaataccta
ggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctc
aatgaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaaga
atcaatatcgtgaaaatggccatactgcccaaggtaatttatagattcaatgccatcccc
atcaagttaccaatgacttccttcacagaattggaaaaaactactttaaagttcatatgg
aatcaaaaaagagcccacatcaccaagtcaatcctaagccaaaagaacaaagctggaggc
atcacgctacctgagttcaaactacactacaaggctgcagtaaccaaaacagcatggtac
tggtatcaaaacagagatatagaccaatggaacagaacagagccctcagaaataatgctg
catatctacaaccatctgatctttgacaaacctgagaaaaacaagaaatggggaaacaat
tccctgtttagtagatggtgctgggaagactggctagccatatgtagaaagctgaaactg
gatcccttccttacaccttatacgaaaattaattcaagatggattaaagacttaaatgtt
agacctaaaaccataaaaaccctagaagaaaacctaggcagtaccattcaggacataggc
atgggcaaggacttcatgagtaaaactccaaaaacaatggcaacaaaagccaaaattgac
aaatgggatctaattaaactaaagagcttctgcacagcagaagaaactaccatcagagtg
aacaggcaacctacagaatgggagaaaatttttgcaatctactcatctgacaaagggcta
atatccagaatctataatgaactccaacaaatttacaagaaaagaacaaacaaccccatc
aaaaagtgggcaaaacatatgaacagacatttctcaaaagaagacatttatgcagccaac
agacacatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccaca
atgagataccatctcacaccagttagaatggcaatcattaaaaagtcaggaaacgacagg
gaggaaaggcagacgtggtttgttgtgaaggagacaagcagcctgcaggagagacaggaa
tcaggtagaaaagatgacgtgggaaaagagaagagtagttcagattcagtgtttgaatgt
gaggtgtttgtgggactttcaaggagagatgtccaggaagcactgaacttgtgcatgtca
tggtttaggagagaaatctgggctaaagatgcaggaataggtgctcagtgtggccatgga
gctgggatctggagcatgggcatatgcaagcagactgcagactgttgttgcaaggtatgg
aacaatatgcctatgcacatacctgctctagcaatggctctggtgtctatgatgcaggaa
ctcagagagaagccattgagccagggctccccttcaccttccacaatgagtggacggaat
ctgaggtctgtaccagatgcagatgccagtgccatgcttctaatacagtttgtagaacta
taa

>gi568815591f:17199265_17443061|GENSCAN_predicted_peptide_4|895_aa
MNSSSANITYASRKRRKPVQKTVKPIPAEGIKSNPSKRHRDRLNTELDRLASLLPFPQDV
INKLDKLSVLRLSVSYLRAKSFFDVALKSSPTERNGGQDNCRAANFREGLNLQEGEFLLQ
ALNGFVLVVTTDALVFYASSTIQDYLGFQQSDVIHQSVYELIHTEDRAEFQRQLHWALNP
SQCTESGQGIEEATGLPQTVVCYNPDQIPPENSPLMERCFICRLRCLLDNSSGFLVRIRL
TVKQAQAGPSGNIPKEGIVITGDDSSMHVIAPADLPVGQDVEAMNFQGKLKYLHGQKKKG
KDGSILPPQLALFAIATPLQPPSILEIRTKNFIFRTKHKLDFTPIGCDAKGRIVLGYTEA
ELCTRGSGYQFIHAADMLYCAESHIRMIKTGESGMIVFRLLTKNNRWTWVQSNARLLYKN
GRPDYIIVTQRPLTDEEGTEHLRKRNTKLPFMFTTGEAVLYEATNPFPAIMDPLPLRTKN
GTSGKDSATTSTLSKDSLNPSSLLAAMMQQDESIYLYPASSTSSTAPFENNFFNESMNEC
RNWQDNTAPMGNDTILKHEQIDQPQDVNSFAGGHPGLFQDSKNSDLYSIMKNLGIDFEDI
RHMQNEKFFRNDFSGEVDFRDIDLTDEILTYVQDSLSKSPFIPSDYQQQQSLALNSSCMV
QEHLHLEQQQQHHQKQVVVEPQQQLCQKMKHMQVNGMFENWNSNQFVPFNCPQQDPQQYN
VFTDLHGISQEFPYKSEMDSMPYTQNFISCNQPVLPQHSKCTELDYPMGSFEPSPYPTTS
SLEDFVTCLQLPENQKHGLNPQSAIITPQTCYAGAVSMYQCQPEPQHTHVGQMQYNPVLP
GQQAFLNKFQNGVLNETYPAELNNINNTQTTTHLQPLHHPSEARPFPDLTSSGFL

>gi568815591f:17199265_17443061|GENSCAN_predicted_CDS_4|2688_bp
atgaacagcagcagcgccaacatcacctacgccagtcgcaagcggcggaagccggtgcag
aaaacagtaaagccaatcccagctgaaggaatcaagtcaaatccttccaagcggcataga
gaccgacttaatacagagttggaccgtttggctagcctgctgcctttcccacaagatgtt
attaataagttggacaaactttcagttcttaggctcagcgtcagttacctgagagccaag
agcttctttgatgttgcattaaaatcctcccctactgaaagaaacggaggccaggataac
tgtagagcagcaaatttcagagaaggcctgaacttacaagaaggagaattcttattacag
gctctgaatggctttgtattagttgtcactacagatgctttggtcttttatgcttcttct
actatacaagattatctagggtttcagcagtctgatgtcatacatcagagtgtatatgaa
cttatccataccgaagaccgagctgaatttcagcgtcagctacactgggcattaaatcct
tctcagtgtacagagtctggacaaggaattgaagaagccactggtctcccccagacagta
gtctgttataacccagaccagattcctccagaaaactctcctttaatggagaggtgcttc
atatgtcgtctaaggtgtctgctggataattcatctggttttctggtaagaataaggtta
actgtaaaacaggctcaggcaggtccttcaggaaatattccaaaagaaggcattgttatc
acaggagatgacagctccatgcatgttattgcccccgcagaccttccagtgggacaagat
gtagaggcaatgaatttccaagggaagttaaagtatcttcatggacagaaaaagaaaggg
aaagatggatcaatacttccacctcagttggctttgtttgcgatagctactccacttcag
ccaccatccatacttgaaatccggaccaaaaattttatctttagaaccaaacacaaacta
gacttcacacctattggttgtgatgccaaaggaagaattgttttaggatatactgaagca
gagctgtgcacgagaggctcaggttatcagtttattcatgcagctgatatgctttattgt
gccgagtcccatatccgaatgattaagactggagaaagtggcatgatagttttccggctt
cttacaaaaaacaaccgatggacttgggtccagtctaatgcacgcctgctttataaaaat
ggaagaccagattatatcattgtaactcagagaccactaacagatgaggaaggaacagag
catttacgaaaacgaaatacgaagttgccttttatgtttaccactggagaagctgtgttg
tatgaggcaaccaacccttttcctgccataatggatcccttaccactaaggactaaaaat
ggcactagtggaaaagactctgctaccacatccactctaagcaaggactctctcaatcct
agttccctcctggctgccatgatgcaacaagatgagtctatttatctctatcctgcttca
agtacttcaagtactgcaccttttgaaaacaactttttcaacgaatctatgaatgaatgc
agaaattggcaagataatactgcaccgatgggaaatgatactatcctgaaacatgagcaa
attgaccagcctcaggatgtgaactcatttgctggaggtcacccagggctctttcaagat
agtaaaaacagtgacttgtacagcataatgaaaaacctaggcattgattttgaagacatc
agacacatgcagaatgaaaaatttttcagaaatgatttttctggtgaggttgacttcaga
gacattgacttaacggatgaaatcctgacgtatgtccaagattctttaagtaagtctccc
ttcataccttcagattatcaacagcaacagtccttggctctgaactcaagctgtatggta
caggaacacctacatctagaacagcaacagcaacatcaccaaaagcaagtagtagtggag
ccacagcaacagctgtgtcagaagatgaagcacatgcaagttaatggcatgtttgaaaat
tggaactctaaccaattcgtgcctttcaattgtccacagcaagacccacaacaatataat
gtctttacagacttacatgggatcagtcaagagttcccctacaaatctgaaatggattct
atgccttatacacagaactttatttcctgtaatcagcctgtattaccacaacattccaaa
tgtacagagctggactaccctatggggagttttgaaccatccccataccccactacttct
agtttagaagattttgtcacttgtttacaacttcctgaaaaccaaaagcatggattaaat
ccacagtcagccataataactcctcagacatgttatgctggggccgtgtcgatgtatcag
tgccagccagaacctcagcacacccacgtgggtcagatgcagtacaatccagtactgcca
ggccaacaggcatttttaaacaagtttcagaatggagttttaaatgaaacatatccagct
gaattaaataacataaataacactcagactaccacacatcttcagccacttcatcatccg
tcagaagccagaccttttcctgatttgacatccagtggattcctgtaa

>gi568815591f:17199265_17443061|GENSCAN_predicted_peptide_5|56_aa
MEEDLFTGNPTEKESLSESRCRSCVHENERGRGLTLEFRKLSSSHTGGFSMNVQQL

>gi568815591f:17199265_17443061|GENSCAN_predicted_CDS_5|171_bp
atggaggaggacctattcacaggaaatcctacagagaaggagagcctctcagaaagcagg
tgtcgaagctgtgtgcatgagaacgagagaggacgggggctaaccctggagttcaggaaa
ctctcctcctcacatactggagggttttccatgaatgttcaacagttatga

>gi568815591f:17199265_17443061|GENSCAN_predicted_peptide_6|100_aa
MGRNQCKKAENSKNQNTSSTPKDHNSSPEREQNWMENEFDELTEVSFRRWVITNSSELKE
HVLTQCKEAKNLEKRLDKLLTRITSLEKNINDLMELKNTA

>gi568815591f:17199265_17443061|GENSCAN_predicted_CDS_6|303_bp
atggggagaaaccagtgcaaaaaggctgaaaattccaaaaaccagaatacctcttctact
ccaaaggatcacaactcctcgccagaaagggaacaaaactggatggagaatgagtttgat
gaattgacagaagtaagcttcagaaggtgggtaataacaaactcctctgagctaaaggag
catgttctaacccaatgcaaggaagctaagaaccttgaaaaaaggttagacaaattgcta
actagaataaccagtttagagaagaacataaatgacctgatggagctgaaaaacacagca
tga

>gi568815591f:17199265_17443061|GENSCAN_predicted_peptide_7|131_aa
MISARPTQHEDEEHEDLYDDPLPFSECGAVRRGPLSFRPQNGGSMDSLSCAPGKATDTQC
QPMKAARRETVPCKATGVELPKTMGTYFLHQHDLDLHFLVDKGGQEEAVNEGESYLIGTV
NRALVPASHGR

>gi568815591f:17199265_17443061|GENSCAN_predicted_CDS_7|396_bp
atgatatctgctaggcctactcaacatgaagatgaagagcatgaagacctttatgatgat
ccacttccatttagtgaatgtggagctgtgagaagagggccactatccttcagaccccag
aatggtggatccatggacagcttgtcctgtgcacctggaaaagccacagacactcaatgc
cagcccatgaaagcagccaggagggagactgtaccctgcaaagccacaggggtagagctg
cccaagaccatgggaacctacttcttgcatcagcatgacctagatctgcattttctggtg
gacaaaggaggacaggaagaagcagtaaatgaaggggagtcttatttgattggcacagtt
aacagggcactggtacccgctagtcatggcagatga

>gi568815591f:17199265_17443061|GENSCAN_predicted_peptide_8|157_aa
MPRVRKLKYLLVWVDTFTGWVEAFPTGSEKSTAIISSLLSDIIPWFGLPTSIQSNNRPAF
ISQITQAVSQALGIQWNRHNPYRSQSSGKVKWTIGLLKTHLTKLSLQLKKDWTILLPLAL
LRIQACPQDATGYSPFELLYGRSILLGPSIIPDTSPT

>gi568815591f:17199265_17443061|GENSCAN_predicted_CDS_8|474_bp
atgccccgagtcaggaaactaaaatacctcttggtctgggtagacactttcactggatgg
gtagaggcctttcccacagggtctgagaagtccactgcaatcatttcttcccttctgtca
gacataattccttggtttggccttcccacgtctatacagtccaataacagaccagctttt
attagtcaaatcacccaagcagtttctcaggctcttggtattcagtggaaccgtcataac
ccttaccgttctcaatcttcaggaaaggtaaaatggactattggtcttttaaaaacacac
ctcaccaagctcagcctccaacttaaaaaggactggacaatacttttaccacttgccctt
ctcagaattcaggcctgtcctcaggatgctacagggtacagcccatttgagctcctgtat
ggacgttccattttattaggccccagtatcattccagacaccagcccaacttga

>gi568815591f:17199265_17443061|GENSCAN_predicted_peptide_9|173_aa
MLVVIWTIKFRLKCSQMEMRNLLGTRAKVILPYPKRLEAFCPCPGDLWNFELERDDLGYL
AQEISKWQSIQAETEHKSLENLQPDNAIEKKNPFSGEKFKPAAELCISNKELNVNHQDNG
ENVSRNLGTLISKSKMDTLGLKNAAKTHKCQVLATEQAVQLEETRGIPKPFVL

>gi568815591f:17199265_17443061|GENSCAN_predicted_CDS_9|522_bp
atgctggtagtgatatggacaataaagttcaggctgaagtgctctcagatggaaatgagg
aacttgttgggaactagagcaaaggtcattcttccctatccaaagagattggaggcattt
tgcccctgccctggagatctgtggaactttgaacttgagagagatgatttagggtatctg
gcacaagaaatttctaagtggcaaagcattcaagcggaaacagagcataaaagcttggaa
aatttgcagcctgacaatgcgatagaaaagaaaaacccattttctggggagaaattcaag
cctgctgcagaactttgcataagtaacaaggagctgaatgttaatcaccaagacaatggg
gaaaatgtttccaggaatctgggaactcttatatccaagagcaagatggatacacttgga
ttgaaaaatgcagccaagacccacaaatgccaagtccttgccacagaacaagctgtccag
ctggaagaaactagaggaatccccaaaccatttgtcctctga