GENSCAN 1.0    Date run: 8-Nov-116    Time: 00:32:59

Sequence gi568815594r:67372608_67644196 : 271589 bp : 37.33% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4863   4932   70  2  1   27  100    96 0.854   5.96
 1.02 Term +   9255   9373  119  1  2   30   43   139 0.675   1.32
 1.03 PlyA +   9753   9758    6                                1.05

 2.02 PlyA -  11449  11444    6                                1.05
 2.01 Sngl -  37362  37105  258  0  0   73   36   168 0.632   4.98
 2.00 Prom -  38125  38086   40                               -4.75

 3.03 PlyA -  38364  38359    6                                1.05
 3.02 Term -  39336  38900  437  1  2  -30   41   365 0.775  14.46
 3.01 Init -  40071  39969  103  0  1   83   65    84 0.636   6.05
 3.00 Prom -  58331  58292   40                               -4.95

 4.00 Prom +  61126  61165   40                               -7.45
 4.01 Sngl +  67605  68123  519  2  0   10   41   243 0.592   7.59
 4.02 PlyA +  68197  68202    6                                1.05

 5.00 Prom +  68960  68999   40                               -3.65
 5.01 Init +  73332  73564  233  2  2   75   27   268 0.731  17.18
 5.02 Intr +  77136  77267  132  0  0   26  109    63 0.017   1.04
 5.03 Intr +  93113  93374  262  2  1   92   80    91 0.461   5.27
 5.04 Term +  93768  93959  192  2  0   70   39   110 0.407   0.74
 5.05 PlyA +  94216  94221    6                                1.05

 6.16 PlyA -  96182  96177    6                                1.05
 6.15 Term - 100068  99998   71  1  2   96   48    39 0.047  -2.18
 6.14 Intr - 102371 102281   91  2  1   94  110    -7 0.059   0.75
 6.13 Intr - 104617 104543   75  1  0   55  106    34 0.056   0.69
 6.12 Intr - 117514 117360  155  2  2   75   97    22 0.189   0.67
 6.11 Intr - 120390 120262  129  1  0   85   10   105 0.411   2.15
 6.10 Intr - 121381 121277  105  2  0   33   98   122 0.911   6.97
 6.09 Intr - 132677 132598   80  1  2   64   97    63 0.821   3.08
 6.08 Intr - 134327 134181  147  1  0   64   91    93 0.965   5.63
 6.07 Intr - 136498 136207  292  2  1   68   63   120 0.896   2.77
 6.06 Intr - 139962 139795  168  1  0   62   89   148 0.907  11.30
 6.05 Intr - 142080 141467  614  2  2   39  106   387 0.560  26.70
 6.04 Intr - 145761 145549  213  2  0   96  109   241 0.999  23.91
 6.03 Intr - 146895 146610  286  1  1   65   87   220 0.506  15.08
 6.02 Intr - 158307 158208  100  0  1   83   77    64 0.008   3.56
 6.01 Init - 172748 172731   18  2  0   93  119    11 0.005   4.35
 6.00 Prom - 173314 173275   40                               -5.45

 7.03 PlyA - 173700 173695    6                                1.05
 7.02 Term - 179772 179610  163  2  1  115   42   127 0.863   7.23
 7.01 Init - 184228 184212   17  1  2   76   78     3 0.414  -2.02
 7.00 Prom - 185057 185018   40                               -5.45

 8.00 Prom + 186914 186953   40                               -6.95
 8.01 Init + 188568 188678  111  2  0   84  105    40 0.847   5.56
 8.02 Intr + 198477 198548   72  2  0  109   92    24 0.870   3.58
 8.03 Intr + 202778 202891  114  1  0  106  105   129 0.906  16.12
 8.04 Intr + 204596 204652   57  1  0   85   75    37 0.479   0.26
 8.05 Term + 208698 208868  171  2  0   81   42   136 0.648   5.14
 8.06 PlyA + 209133 209138    6                                1.05

 9.04 PlyA - 209380 209375    6                                1.05
 9.03 Term - 209533 209388  146  2  2   73   34   121 0.069   2.29
 9.02 Intr - 211046 210959   88  2  1   33   78    35 0.047  -4.38
 9.01 Init - 215884 215693  192  1  0   88   65   196 0.394  14.32
 9.00 Prom - 220656 220617   40                               -5.35

10.11 PlyA - 222276 222271    6                                1.05
10.10 Term - 224660 224577   84  2  0   88   42    84 0.447   0.47
10.09 Intr - 246525 246423  103  2  1   53   63   144 0.279   7.66
10.08 Intr - 250318 250224   95  1  2   60  111    23 0.725  -0.36
10.07 Intr - 252580 252387  194  2  2   34  111   220 0.933  17.19
10.06 Intr - 253870 253753  118  1  1   37   78   100 0.698   3.02
10.05 Intr - 256535 256464   72  2  0   52   95    57 0.673   1.48
10.04 Intr - 260866 260738  129  1  0   89  115    -6 0.808   2.17
10.03 Intr - 261715 261635   81  1  0  106   98     2 0.784   1.92
10.02 Intr - 262951 262846  106  0  1   62  103    31 0.119   1.30
10.01 Intr - 266517 266336  182  0  2   54   91    83 0.086   2.94

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  34830  34501  330  0  0   81   44   120 0.831   2.67

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594r:67372608_67644196|GENSCAN_predicted_peptide_1|62_aa
MKAVFDEVTSRLNTAKERFNELEAPKRKRPSRMGHDDNGGFVEKKRGNVGKRQRDQIVTV
SG

>gi568815594r:67372608_67644196|GENSCAN_predicted_CDS_1|189_bp
atgaaggctgtctttgatgaggtcaccagcagactgaacacagccaaagaaagattcaat
gagcttgaagctccgaagagaaagcgaccatcgagaatgggccatgatgacaatggcggt
tttgtcgaaaagaaaagggggaatgtggggaaaagacagagagatcagattgttactgtg
tctgggtag

>gi568815594r:67372608_67644196|GENSCAN_predicted_peptide_2|85_aa
MLLNDYWVNNNIKGEINKFLETNEIKDTTYQSLWDTAKATFRAVFIALNAHKRKRERSKI
DTLTSQLKELEKQEQTNSKLAEENK

>gi568815594r:67372608_67644196|GENSCAN_predicted_CDS_2|258_bp
atgctcctgaatgactactgggtaaataacaacattaagggagaaataaataagttcttg
gaaaccaatgagatcaaagacacaacgtaccagagtctctgggacacagctaaagcaacg
tttagagctgtgtttatagcactaaatgcccacaagagaaagcgggaaagatctaaaata
gataccctaacttcacaattaaaagaactagagaagcaagaacaaacaaattcaaaacta
gcagaagaaaataaataa

>gi568815594r:67372608_67644196|GENSCAN_predicted_peptide_3|179_aa
MEPNKLRSTGLKFSLPAQQSEVNLGHSSLLWMWAERNSIKINKKDDHVTTPFEGHQHQRP
KADKSMKMRKNQCKKAENSKTQNASSPPKAGNSSPAREHNWMENKFDELTEVGMRRWVIT
NSSELKEHVLTRCKEAKNLDKRLQELLTRIISLEKSINDLMELKNTAREFHEAHTSINS

>gi568815594r:67372608_67644196|GENSCAN_predicted_CDS_3|540_bp
atggagcccaacaagctaagatccactggcttgaaattctcgctgccagcacagcagtct
gaagtcaacctgggacactcgagcttgttgtggatgtgggcggaaaggaatagcatcaag
atcaacaaaaaggatgaccacgtgacaaccccattcgaaggccaccaacatcaaagacca
aaggcagacaaatccatgaagatgaggaaaaaccagtgcaaaaaggctgaaaattccaaa
acccagaatgcctcttctcctccaaaagctggcaactcctcaccagcaagggaacacaac
tggatggagaataagtttgatgaattgacagaagttggtatgagaaggtgggtaataaca
aactcctctgagctaaaggagcatgttctaacccgatgcaaggaagctaagaaccttgat
aaaaggttacaggaactgctaactagaataatcagtttagagaaaagcataaatgacctg
atggagctgaaaaacacagcacgagaatttcatgaagcacacacaagtatcaatagctga

>gi568815594r:67372608_67644196|GENSCAN_predicted_peptide_4|172_aa
MCKNNKHSYAPKTDKQQIMSKLPFTIASKRIKYLGIQFTRDVKDLLKENYKPLLNEIKED
TNKWKNIPCSWIGRTSIVNMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRAR
IAKTILSKKIKAGGIMLPDFKLYYKATANKTAWYWYQNRDIDQWNTEQRPQK

>gi568815594r:67372608_67644196|GENSCAN_predicted_CDS_4|519_bp
atgtgcaaaaataacaagcattcttatgcaccaaaaaccgacaaacagcaaatcatgagt
aaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaatttacaagg
gatgtgaaggacctcctcaaggagaactacaaaccactgctcaatgaaataaaagaggac
acaaacaaatggaagaacattccatgctcatggataggaagaaccagtattgtgaacatg
gccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatgact
ttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgc
attgccaagacaatcctaagcaaaaagattaaagctggaggcatcatgctacctgacttc
aaactatactacaaggctacagcaaacaaaacagcttggtactggtaccaaaacagagat
atagaccaatggaacacagaacagaggcctcagaaataa

>gi568815594r:67372608_67644196|GENSCAN_predicted_peptide_5|272_aa
MDERFKCKTSNYKNPRRGEKRRRSTVQRSSCSPFPSLGERCLRSTRCRRRYRYRHRHRRR
LLSLWRGGGGGARAQRHNNHTPLYLSKELKTYVLWKILHMDVYVSFIHNCQNLEATKVSF
SSWPWEPLLENLRTAHITGFLADIPLHQPGVWQPHWVASPSRAFTVVWLSGTPTTEGKGS
IPYQGSIPYDKRIQMAGLEFQNLHLWEAQTELGLLPWELHVGAPLDSLSGTRQGDCIPTG
RVPSRFRLACGVESHSLSTWNTNVPADEEVPV

>gi568815594r:67372608_67644196|GENSCAN_predicted_CDS_5|819_bp
atggatgaaagatttaaatgtaagacctcaaactataagaatcctagaagaggtgaaaag
cgacggcgcagcacggtgcagcgcagctcctgctcgcctttcccctcgctgggcgagagg
tgtctacggagcacccgctgccgccgccgctaccgctaccgccaccgccaccgccgccgg
ctgctgtctctatggcgaggaggaggaggaggagcgcgagctcagcgacacaacaatcat
actcctttgtatttatcaaaggaattgaaaacttatgttctatggaaaatccttcatatg
gatgtttatgtcagctttattcataattgccaaaacttggaagcaaccaaggtgtccttc
agcagctggccctgggagccactgctggaaaatttgaggacagctcatatcactggattc
cttgcagacattcccctgcaccagcctggagtgtggcagccccactgggtggctagcccc
agcagagcattcacagtagtctggctctcagggactcccactactgaaggaaaggggagt
ataccgtatcaagggagcatcccgtatgacaaaagaatccagatggcaggccttgagttc
cagaaccttcatttgtgggaagcacagacagagctggggcttctcccatgggagcttcat
gtgggtgcacctttagacagcctttctggaacacgtcagggtgactgcatccccacagga
agagtgccctccaggttcaggcttgcatgtggggtagagtcacactctctctctacttgg
aacaccaacgttcctgcagatgaagaggtacctgtctga

>gi568815594r:67372608_67644196|GENSCAN_predicted_peptide_6|847_aa
MAASGLCQKSHPKSVPVSSKKKEASLQFVVEPSEATNRSVQAHEVHQKILATDVSSKNTP
DSKKISSRNINDHHSEADEEFYLSVGSPSVLLDAKTSVSQNVIPSSAQKRETYTFENSVN
MLPSSTEVSVKTKKRLNFDDKVMLKKIEIDNKVSDEEDKTSEGQERKPSGSSQNRIRDSE
YEIQRQAKKSFSTLFLETVKRKSESSPIVRHAATAPPHSCPPDDTKLIEDEFIIDESDQS
FASRSWITIPRKAGSLKQRTISPAESTALLQGRKSREKHHNILPKTLANDKHSHKPHPVE
TSQPSDKTVLDTSYALIGETVNNYRSTKYEMYSKNAEKPSRSKRTIKQKQRRKFMAKPAE
EQLDVGQSKDENIHTSHITQDEFQRNSDRNMEEHEEMGNDCVSKKQMPPVGSKKSSTRKD
KEESKKKRFSSESKNKLVPEEVTSTVTKSRRISRRPSDWWVVKSEESPVYSNSSVRNELP
MHHNSSRKSTKKTNQSSKNIRKKTIPLKRQKTATKGNQRVQKFLNAEGSGGIVGHDEISR
CSLSEPLESDEADLAKKKNLDCSRSTRSSKNEDNIMTAQNVPLKPQTSGYTCNIPTESNL
DSGEHKTSVLEESGPSRLNNNYLMSGKNDVDDEEVHGSSVLPSNTPNVRRTKRTRLKPLE
YWRGERIDYQGRPSGGFVISGVLSPDTISSKRKAKENIGKVNKKSNKKRICLDNDERNLV
RPQDTYQFFVKHGELKVYKTLDTPFFSTGKLILGPQEEKGKQHVGQDILVAGHQPGDGTF
KRASADTRLPQVAWVFYVNFGDLLCTLHETPYILSTGDSFYVPSGNYYNIKNLRNEESVL
LFTQIKR

>gi568815594r:67372608_67644196|GENSCAN_predicted_CDS_6|2544_bp
atggctgcgtccggtctgtgccagaaatcacatccaaagtcagttccagtttcttcaaag
aagaaagaagcctctctacagtttgttgtagaaccaagtgaagccacaaacagatcagtt
caggcccatgaagttcatcagaaaattctggcaactgatgttagttccaaaaatacacct
gactcgaaaaaaatatcaagtagaaacataaatgatcatcacagtgaagctgatgaagaa
ttttacttatccgttggctcaccttctgttcttttggatgcaaaaacatctgtatcacaa
aatgttattccatctagtgcccaaaagagagagacttacacttttgaaaattcagtaaat
atgctgccttcaagtacagaggtttcagttaaaaccaaaaaaaggttaaactttgatgat
aaagttatgttaaagaaaatagaaatagataataaagtatcagatgaagaggataaaaca
tcggaaggacaagaaagaaaaccatcaggatcatctcagaatagaatacgagattcagaa
tatgaaattcaacgacaagctaaaaaaagtttttcaacattgtttttagaaacagtaaaa
cgaaaaagtgaatccagtcccattgttaggcatgcggcaactgctccacctcattcgtgt
cctcccgatgatacgaagttgatagaggatgaatttataattgatgagtcggatcaaagt
tttgccagtagatcttggattacaataccaagaaaggcagggtctctgaaacaacgcaca
atatccccggctgagagcactgcactccttcaaggtagaaagtcaagagaaaagcatcat
aatatattacctaagactttggcaaatgacaaacattcccataaacctcacccagtagag
acatctcagccctctgataaaacagtactggatacaagttatgctttgataggtgaaaca
gtaaataattatagatctacaaaatatgaaatgtattccaagaatgcagaaaaaccatct
agaagcaaaaggactataaaacaaaaacagagaagaaaattcatggctaaaccagctgaa
gaacagcttgatgtgggacagtctaaagatgaaaacatacatacatcacatattacccaa
gacgaatttcaaagaaattcagacagaaatatggaagagcatgaagagatgggaaatgat
tgtgtttccaaaaaacagatgccacctgtgggaagcaagaaaagtagcactagaaaagat
aaggaagaatctaaaaagaagcgcttttccagtgagtccaagaacaaacttgtacctgaa
gaagtgacttcaactgtcacgaaaagtcgaagaatttccaggcgtccatctgattggtgg
gtggtaaaatcagaggagagtcctgtttatagcaattcttcagtaagaaatgaattacca
atgcatcacaatagtagccgaaaatctactaagaaaacaaatcagtcatctaagaatatt
aggaaaaaaactattccacttaaaaggcagaagacagcaactaaaggcaaccaaagagta
cagaagtttttaaatgctgaaggttctggaggtatcgttggtcatgatgaaatttccaga
tgttcgctgagtgagccattggaaagtgatgaggcagacttggctaagaagaaaaatctt
gattgttctagatctacaagaagctcaaagaatgaagataacattatgactgcacagaat
gttcccctaaagcctcagaccagtggatatacatgtaatataccaacagagtcaaacttg
gattctggagagcataagacttcagttttagaggaaagtggaccttccaggctcaataat
aattatttaatgtctggaaagaatgatgtggatgatgaggaagttcatggaagttcagta
ttgccctccaacacaccaaatgttcgcaggaccaagagaacacgtttgaaacctttggag
tactggcgaggagagcgaatagattatcaaggaaggccatcaggaggattcgtgattagt
ggagtactatctccagacacaatatcgtctaaaaggaaggcaaaagaaaatattggaaaa
gtcaacaaaaaatctaataagaaaaggatctgtcttgataacgatgaaagaaatcttgta
aggccacaagatacatatcaattttttgttaagcatggtgagttgaaggtatacaagaca
ttggatacaccctttttttctactgggaaattgatattaggaccacaagaagaaaaggga
aagcagcatgttggccaggatatattggtggctggccaccagccaggagatggtactttc
aagagagcatctgctgatacaaggttgccacaggttgcctgggttttttatgttaacttt
ggtgaccttttgtgtactttacatgaaacaccttatatattaagtactggggattcgttc
tatgttccttcaggtaactattataacatcaaaaatctccggaatgaggaaagtgttctt
ctttttactcagataaaaagatga

>gi568815594r:67372608_67644196|GENSCAN_predicted_peptide_7|59_aa
MTALLLCTKIGKLKADSGEYACSAERCMGTDTQVFLPDKHSKETQKQYKPLINSPTLNP

>gi568815594r:67372608_67644196|GENSCAN_predicted_CDS_7|180_bp
atgactgctttgcttttgtgcactaagatagggaagctaaaagcagactcgggggagtat
gcctgcagcgcagaaagatgtatgggaacagacacacaagtcttcctcccagataagcac
agcaaagagacacagaagcagtacaagcctctgataaactctcccaccctgaatccttaa

>gi568815594r:67372608_67644196|GENSCAN_predicted_peptide_8|174_aa
MVIIRVLENRSYEKNLSKISGISKQLAAKCQGIFEQKEYEHYWTELRGTTLFFYTDKKSI
IYVDKLDIVDLTCLTEQNSTEKNCAKFTLVLPKEEVQLKTENTESGEEWRGFILTVTELS
VPQNVSLLPGQVIKLHEVLEREKKRRIETEQSTSVEKEKEPTEDYVDVLNPMPA

>gi568815594r:67372608_67644196|GENSCAN_predicted_CDS_8|525_bp
atggtgataattagagtacttgaaaatagaagctatgagaaaaatctaagcaaaataagt
ggaatttccaagcaattggcagcaaagtgccagggaatctttgaacagaaggagtatgag
cattactggacagagttgagaggaactactcttttcttttataccgacaaaaagagtata
atatatgttgacaaattagacatagtagacctcacatgccttactgagcagaattcaact
gaaaagaactgtgcgaaattcacccttgttttgccgaaagaggaagtacaactgaagaca
gagaacacagaaagtggggaagaatggagaggcttcattcttacagtaacagagctgtca
gttccccaaaacgtgtcactcctacctgggcaagtaattaaactgcatgaagtcctagag
agagaaaagaaaaggaggattgagacagagcagagtacgtccgtggaaaaagagaaggaa
ccaactgaagattatgtggatgtactgaaccctatgccagcgtaa

>gi568815594r:67372608_67644196|GENSCAN_predicted_peptide_9|141_aa
MAVSRARTTVLQPVRQSEAPSQKKQKQKTTTTTKKRHHIIIIIIIIVVVVYGKAVGGKVA
KVNKASGSYFPKKGSSGASQLPLSGTLYKNIYRGKNDGWRVSVIVVQIEYATEPDVKQGT
RTSFIDNTTKIPGGKSSESFQ

>gi568815594r:67372608_67644196|GENSCAN_predicted_CDS_9|426_bp
atggcagtgagccgagctcgcaccactgtactccagcctgtgagacagagcgaggccccg
tctcaaaaaaaacaaaaacaaaaaacaacaacaacaacaaaaaaaagacatcatatcatc
atcatcatcatcatcatcgtcgtcgtcgtctatggcaaagctgtgggaggaaaagtggca
aaggtcaacaaggcctcaggatcatatttcccaaagaagggttcttctggagcatctcag
ttgcctctttccgggacactgtataaaaacatctacagaggtaaaaatgatggttggcgg
gtttcagtgattgtagtacagatagagtatgcaacagaacctgatgtcaaacagggtacc
aggacaagttttattgacaacaccaccaagattccaggaggaaagagctcagaatctttt
caataa

>gi568815594r:67372608_67644196|GENSCAN_predicted_peptide_10|387_aa
KPKSYTAADATLKINSQIKIDAHLNKVCPTTETIYNDEFYTKQDVIITALDNVEARRYVD
SRCLANLRPLLDSGTMGTKGHTEVIVPHLTESYNSHFESSFSHKPSLFNKFWQTYSSAEE
VLQKIQSGHSLEGCFQVIKLLSRRPRNWSQCVELARLKFEKYFNHKDLSADALLNILSEV
KIQEFKPSNKVVQTDETARKPDHVPISSEDERNAIFQLEKAILSNEATKSDLQMAVLSFE
KDDDHNGHIDFITAASNLRAKMYSIEPADRFKTKRIAGKIIPAIATTTATVSGLEKYGIE
PTMVVQGVKMLYVPVMPGHAKRLKLTMHKLVKPTTEKKYVDLTVSFAPDIDGDEDLPGPP
AQHLMEAIKAWGFYLLKHQPELYLNPF

>gi568815594r:67372608_67644196|GENSCAN_predicted_CDS_10|1164_bp
aaacctaaaagctacactgctgctgatgctactctgaaaataaattctcaaataaagata
gatgcacacctgaacaaagtatgtccaaccactgagaccatttacaatgatgagttctat
actaaacaagatgtaattattacagcattagataatgtggaagccaggagatacgtagac
agtcgttgcttagcaaatctaaggcctcttttagattctggaacaatgggcactaaggga
cacactgaagttattgtaccgcatttgactgagtcttacaatagtcattttgaaagttcc
ttttcccacaaaccttcattgtttaacaaattttggcaaacctattcatctgcagaagaa
gtcttacagaagatacagagtggacacagtttagaaggctgttttcaagttataaagtta
cttagcagaagacctagaaattggtcccagtgtgtagaattagcaagattaaagtttgaa
aaatattttaaccataaggacttatcagcagatgccctcttgaatattctttcagaagta
aagattcaggaattcaagccttccaataaggttgttcaaacagatgaaactgcaaggaaa
ccagaccatgttcctattagcagtgaagatgagaggaatgcaattttccaactagaaaag
gctattttatctaatgaagccaccaaaagtgaccttcagatggcagtgctttcatttgaa
aaagatgatgatcataatggacacatagatttcatcacagctgcatcaaatcttcgtgcc
aaaatgtacagcattgaaccagctgaccgtttcaaaacaaagcgcatagctggtaaaatt
atacctgctatagcaacaaccactgctacagtttctggcttggagaagtatggaattgag
ccaacaatggtggtacagggagtcaaaatgctttatgttcctgtaatgcctggtcatgca
aaaagattgaagttaacaatgcataaacttgtaaaacctactactgaaaagaaatatgtg
gatcttactgtgtcatttgctccagacattgatggagatgaagatttgccgggacctcca
gctcaacacctcatggaagccatcaaggcttggggcttctaccttctgaagcaccaacct
gagctgtacctcaaccccttttag