GENSCAN 1.0 Date run: 8-Nov-116 Time: 00:31:31 Sequence gi568815597f:84379282_84597467 : 218186 bp : 38.31% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.10 PlyA - 1604 1599 6 1.05 1.09 Term - 24237 24169 69 0 0 92 42 127 0.620 5.46 1.08 Intr - 25157 24883 275 0 2 63 26 285 0.411 16.13 1.07 Intr - 27327 27285 43 0 1 105 116 -2 0.418 1.19 1.06 Intr - 32774 32698 77 0 2 87 91 88 0.908 7.32 1.05 Intr - 33318 33201 118 0 1 30 65 82 0.867 -0.78 1.04 Intr - 34706 34525 182 0 2 51 73 178 0.900 11.27 1.03 Intr - 37864 37813 52 1 1 69 127 42 0.107 3.76 1.02 Intr - 48611 48447 165 2 0 89 85 106 0.016 9.64 1.01 Init - 54219 54097 123 0 0 96 -39 116 0.021 -0.08 1.00 Prom - 61757 61718 40 -3.65 2.00 Prom + 63062 63101 40 -4.05 2.01 Sngl + 64829 65164 336 1 0 52 41 261 0.966 13.58 2.02 PlyA + 65215 65220 6 1.05 3.00 Prom + 69483 69522 40 -4.35 3.01 Init + 76389 76462 74 2 2 87 92 21 0.509 2.99 3.02 Intr + 77930 77995 66 2 0 88 56 85 0.345 2.40 3.03 Intr + 84046 84216 171 1 0 107 -6 106 0.511 1.24 3.04 Term + 84421 84688 268 1 1 18 49 226 0.386 5.68 3.05 PlyA + 85737 85742 6 1.05 4.00 Prom + 92462 92501 40 -5.35 4.01 Init + 100001 100432 432 1 0 95 39 416 0.567 33.66 4.02 Intr + 101675 101731 57 1 0 93 94 53 0.858 4.56 4.03 Intr + 103634 103714 81 1 0 90 80 147 0.971 13.02 4.04 Intr + 110352 110447 96 2 0 133 49 84 0.992 8.29 4.05 Intr + 111038 111191 154 1 1 94 7 134 0.636 4.62 4.06 Intr + 116092 116174 83 2 2 58 115 59 0.988 4.04 4.07 Intr + 116601 116782 182 2 2 90 94 40 0.987 2.54 4.08 Intr + 116963 117089 127 2 1 80 95 77 0.950 7.36 4.09 Intr + 126757 126844 88 0 1 22 72 98 0.014 0.22 4.10 Intr + 127119 127137 19 1 1 107 94 0 0.044 -2.75 4.11 Intr + 143109 143226 118 0 1 27 103 111 0.864 6.05 4.12 Term + 146564 146845 282 1 0 87 43 159 0.926 5.74 4.13 PlyA + 146993 146998 6 1.05 5.00 Prom + 153500 153539 40 -2.05 5.01 Init + 164825 165023 199 1 1 41 106 86 0.891 4.81 5.02 Intr + 166353 166478 126 1 0 53 58 72 0.563 0.43 5.03 Intr + 169505 169683 179 0 2 101 62 154 0.968 12.82 5.04 Intr + 171151 171249 99 0 0 72 86 43 0.750 1.89 5.05 Intr + 176629 176698 70 0 1 72 90 67 0.030 3.14 5.06 Term + 182387 182490 104 0 2 53 44 102 0.010 -0.24 5.07 PlyA + 182525 182530 6 1.05 6.13 PlyA - 182659 182654 6 1.05 6.12 Term - 183115 183101 15 1 0 99 43 1 0.352 -6.04 6.11 Intr - 184137 183964 174 0 0 93 88 116 0.643 11.31 6.10 Intr - 184551 184454 98 1 2 59 75 79 0.999 2.51 6.09 Intr - 186731 186560 172 2 1 61 99 113 0.694 8.29 6.08 Intr - 190858 190650 209 2 2 65 98 121 0.999 8.67 6.07 Intr - 191439 191301 139 0 1 94 95 134 0.999 13.82 6.06 Intr - 195172 194958 215 2 2 41 60 341 0.091 24.11 6.05 Intr - 195473 195407 67 2 1 54 66 80 0.051 -0.04 6.04 Intr - 203077 202997 81 1 0 83 74 81 0.120 5.12 6.03 Intr - 204599 204446 154 1 1 9 86 143 0.553 5.35 6.02 Intr - 210116 210080 37 0 1 32 115 38 0.053 -2.70 6.01 Intr - 217254 217150 105 1 0 82 103 100 0.121 10.17 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 44755 44999 245 0 2 51 80 225 0.848 15.25 S.002 Term + 48437 48671 235 2 1 43 49 142 0.885 0.61 S.003 Term + 176629 176736 108 0 0 72 42 106 0.864 1.93 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:84379282_84597467|GENSCAN_predicted_peptide_1|367_aa MVSFSDSAGAKPIKRGWQHRQQGASGNRALIWLLLCDLGQNSPDGKMVGLACLRSYWAEE WTEVGCKSDFLDIGKQLACSMYQSLLGHQTLLHLTEVLQQGVTITKIYQHPYHGVTHGSG KEFGEDTIHVSVIKDLELTEFDGLIEGSEKEEMGSDWVTMQNTRKSRFVGDDDASVQQQN YTACCLMMVFHTVKKRIRLCKMEEFLSLGRLKCDELTENLHHTPKIMKFVIDEIDIRTQN QSLKTLHVAGFTKGKAMKAIKPKTIHSCWKNLCSDVEHDFTGFMTESIKKVMKEIVDMAR KVGGDGFQDMIPGEIQKLIDTTLEKLTEHNSMEPEPDDEQEDIEELTQQEDDKDEDTYDD PFPLNEE >gi568815597f:84379282_84597467|GENSCAN_predicted_CDS_1|1104_bp atggtgtccttctcagactcagctggagcaaagcctatcaagcgagggtggcagcatcgt caacagggagcatcgggcaaccgggctctcatctggcttctgctctgtgatctcggacag aacagccctgatgggaagatggttggactggcctgcctccgttcctactgggccgaggaa tggactgaggttggctgcaagtctgactttttggatatagggaaacaactggcttgcagc atgtaccagtctctccttggtcaccagacactgctccatctcacagaggttctacaacaa ggagttaccatcaccaaaatataccaacacccctaccacggtgtgacccatggcagtggc aaagagtttggagaagatacaatacatgtgagtgttattaaggatctggaactgactgaa tttgatggtttgatagaagggtcagaaaaagaagagatggggagtgactgggtaactatg cagaatacaagaaaaagtagatttgtaggggatgatgacgcatcagttcaacagcagaat tatacagcatgttgtttgatgatggtctttcataccgtcaagaaaagaatccgactttgc aaaatggaggaatttttgtccctgggccgactgaagtgtgatgaattaactgagaattta catcacacacccaagattatgaagtttgtaattgatgaaattgatattcgaacccagaac caatcactcaagacattgcatgtggcaggttttactaaaggaaaagccatgaaagccatc aagcccaaaacaatacattcctgctggaaaaatctgtgttcagatgttgagcatgacttc acaggatttatgacagagtcaatcaagaaagtcatgaaagagattgtggatatggcaaga aaggttgggggtgatggatttcaagatatgattcctggagaaattcaaaagctaatagac accacactagagaaattaacagaacataactcaatggaaccagagccagatgatgagcaa gaagacatagaagagctgactcaacaggaagatgacaaggatgaagacacttatgatgat ccatttccacttaatgaagagtga >gi568815597f:84379282_84597467|GENSCAN_predicted_peptide_2|111_aa MRKNQYKKAENSKNQNTSSPKDHNSSSAREQTWTESEFDEFTEVGFRRWVITNSSELKEH VLTQCKEVKTLEKRLEELLTRITSLEKNINDLMELKNTARELSEAYTSINS >gi568815597f:84379282_84597467|GENSCAN_predicted_CDS_2|336_bp atgaggaaaaaccagtacaaaaaggctgaaaattccaaaaaccagaacacctcttctcca aaggatcacaactcctcatcagcaagggaacaaacctggacagagagtgagtttgacgaa ttcacagaagtaggcttcagaaggtgggtaataacaaactcctccgagctaaaggagcat gttctaacccaatgcaaggaagttaagacccttgaaaaaaggttagaggaattgctaact agaataaccagtttagagaagaacataaatgacctgatggagctgaaaaacacagcacga gaacttagtgaagcatacacaagtatcaacagctga >gi568815597f:84379282_84597467|GENSCAN_predicted_peptide_3|192_aa MNKNTKWSYRVQNISYLHHEDSGKRLVAKGEEKESEEGLQVLRAKQIIAIKAPKLKTIYI KEPPSFKENQWATERQQNWRSSSKHQSPILSSSLLRYLHIMCCTRKTSIGEQKEVAPVEM LLGLEQDAWIFQKTLKKKAKLEVNCWYLPKGAVHQVMGPQHQEAREHIAKEGVRRGGLSR IRELCRGPCMAS >gi568815597f:84379282_84597467|GENSCAN_predicted_CDS_3|579_bp atgaacaagaatacaaagtggtcctatagggtccagaatataagttacctccatcatgag gatagtggtaagaggttggtggctaagggagaagagaaggagagtgaagaaggtctacag gttctgagagcaaagcaaataattgccatcaaagctccaaagctgaagacaatatacatc aaggaacccccatcatttaaagagaaccagtgggctactgagaggcaacagaactggcgt tcctcctccaagcatcagtctcccatcctttcctcctccctactgcgttatctacatatt atgtgttgtacgagaaaaacaagtattggagaacaaaaagaagtggcaccagttgagatg ctacttggattggagcaagatgcttggattttccagaagacactgaagaagaaagccaag cttgaagttaattgctggtacctgcccaagggggctgtgcaccaagtgatgggaccccaa caccaagaggctagggagcacattgctaaggaaggtgtcagaagaggtggcctgtcccgc ataagggaactgtgcagaggcccttgtatggcttcctga >gi568815597f:84379282_84597467|GENSCAN_predicted_peptide_4|572_aa MAKAGDKSSSSGKKSLKRKAAAEELQEAAGAGDGATENGVQPPKAAAFPPGFSISEIKNK QRRHLMFTRWKQQQRKVRERRGLPGACALFLTLRAVAGRTSVVVCWSQKCSLWLRGKGGV AAHVFWSPRPSLETLGSAFCLSVKEKLAAKKKLKKEREALGDKAPPKPVPKTIDNQRVYD ETTVDPNDEEVAYDEATDEFASYFNKQTSPKILITTSDRPHGRTVRLCEQLSTVIPNSHV YYRRGLALKKIIPQCIARDFTDLIVINEDRKTPNGLILSHLPNGPTAHFKMSSVRLRKEI KRRGKDPTEHIPEIILNNFTTRLGHSIGRMFASLFPHNPQFIGRQVATFHNQRDYIFFRF HRYIFRSEKKVGIQELGPRFTLKLRSLQKGTFDSKYGEYEWVHKPELLNHFLHSGDAGGA RHGARDGRADSWVEGPIRERVSPQLTLRALRERLGEFLGEDAIAEKFLFLKCIGNNLAVA LQPELYLLPVMDHLGNVYSPSTVILDERQTNNGVNEADGTIHRPISVTLFKEELGRDPSL LENTLKELPNKNQEEGDFKLEWEKERGYYSTF >gi568815597f:84379282_84597467|GENSCAN_predicted_CDS_4|1719_bp atggcgaaagccggggataagagcagcagcagcgggaagaaaagtctaaaacggaaagcc gctgccgaagaacttcaggaggctgcaggcgctggggatggggcgacggaaaacggggtc caacccccgaaagcggctgcctttccgccaggctttagcatttcggagattaaaaacaaa cagcggcgacacttaatgttcacgcggtggaaacagcagcagcggaaggtacgcgagagg cgggggctgccgggcgcttgcgcgttgttcctgacgcttagggcggtcgcggggcgcaca tctgtggttgtctgctggtctcaaaagtgttctctgtggctccgtgggaagggtggcgtc gcggcccacgtgttctggtcccctcgccctagtttggagactctaggttcagccttttgc ctgagcgtcaaggaaaagttggcagctaagaaaaaacttaaaaaagaaagagaggctctt ggcgataaggctccaccaaagcctgtacccaagaccattgacaaccagcgagtgtatgat gaaaccacagtagaccctaatgatgaagaggtcgcttatgatgaagctacagatgaattt gcttcttacttcaacaaacagacttctcccaagattctcatcacaacatcagatagacct catgggagaacagtacgactctgtgaacagctctccacagttataccaaactcacatgtt tattacagaagaggactggctctgaaaaaaattattccacagtgcatcgcaagagatttc acagacctgattgttattaatgaagatcgtaaaaccccaaatggacttattttgagtcac ttgccaaatggcccaactgctcattttaaaatgagcagtgttcgtcttcgtaaagaaatt aagagaagaggcaaggaccccacagaacacatacctgaaataattctgaataattttaca acacggctgggtcattcaattggacgtatgtttgcatctctctttcctcataatcctcaa tttatcggaaggcaggttgccacattccacaatcaacgggattacatattcttcagattt cacagatacatattcaggagtgaaaagaaagtgggaattcaggaacttggaccacgtttt accttaaaattaaggtctcttcagaaaggaacctttgattctaaatatggagagtatgaa tgggtccataagccggagctgttgaaccactttcttcatagcggcgacgctggaggagcc agacatggtgcacgcgacggccgggccgattcgtgggtcgaggggccgattagggagaga gtatctcctcaacttactttacgagccctgagggagcgtcttggtgagttcctgggtgaa gatgctattgcagaaaaatttttatttctgaaatgcattggaaataatttagctgtggct cttcaaccagaattatatttgcttcctgtaatggaccatttaggaaatgtttattcacca tcaacagttattttagatgagcggcagactaataatggtgttaatgaggctgatggaaca atccacagaccaattagtgtaactttgttcaaggaggaacttggaagagatcccagtttg ttagaaaacactttgaaagagcttcctaacaagaatcaggaagaaggtgattttaaactg gaatgggaaaaagaaagaggctattatagtacattttag >gi568815597f:84379282_84597467|GENSCAN_predicted_peptide_5|258_aa MHFWFALSNTYGQVKKECDSVADLTHFHVCLQTAEKEYITLPDHPSLPCQPVLSSGITDI SLLQTEREKIIKQMKQVKEERRYLERNREELVKTVEKLFEQSKLKRYHAYNGWKKKYLET KKVTASMEEVLTKLREDLELYYKKLLMQLEAREIKMRPKNLANITDSKNYLIIQITEVQH AIDQLKRKLDTDKMKLIVEVKLLEVCLARRSTEDFVYDESAKSKEIAIATPTFSNHHPDQ SAAINDKARHSSSKQVIT >gi568815597f:84379282_84597467|GENSCAN_predicted_CDS_5|777_bp atgcatttctggtttgcattatcaaatacctacgggcaagtgaaaaaagaatgtgacagt gtagccgatcttactcattttcacgtttgcttacagacagctgaaaaagagtacatcacc ctaccagatcacccttcacttccttgtcaacctgttctttcttcaggaataactgatata tctttattacaaactgaaagagagaagattatcaaacaaatgaaacaagtaaaggaagaa agaaggtatctggaaagaaatagagaagaactagtaaaaacggttgaaaagctatttgaa caaagcaaattaaaacgatatcatgcctacaatggttggaagaaaaaatacttggaaaca aagaaagtcacagcatcaatggaggaggttttaacaaaacttcgagaagatttggaactc tactataaaaaactgctcatgcaacttgaagccagggagatcaagatgagaccaaagaat ctggcaaacatcacagactccaagaattacctaataatccagatcactgaggtacagcat gcaattgaccagcttaagagaaaactagatactgacaaaatgaaactcatagtagaagtt aagttgctagaagtgtgcctggctaggaggtctacagaagattttgtttatgatgaatca gccaagtcaaaagaaattgccatagccaccccaaccttcagcaaccaccaccctgatcag tcagcagccatcaatgacaaggcaagacactcttccagcaagcaggttataacttga >gi568815597f:84379282_84597467|GENSCAN_predicted_peptide_6|488_aa XFCHVRMWHKGAIWEQKLDPHQTPDLPEFKSIRGHGLFDKVQLFILAREGPAARKYRLVH IRPDHREMVKVRQRNSDFTQEFLLVLKQENEKWPQGIRKDPLVLMNGVIHKPEDLVETSV LTHHIPTPQGVHDQPRQNEMQTIGRTINGGRNPLRPARPAAMSRPQLRRWRLVSSPPSGV PGLALLALLALLALRLAAGTDCPCPEPELCRPIRHHPDFEVFVFDVGQKTWKSYDWSQIT TVATFGKYDSELMCYAHSKGARVVLKGDVSLKDIIDPAFRASWIAQKLNLAKTQYMDGIN IDIEQEVNCLSPEYDALTALVKETTDSFHREIEGSQVTFDVAWSPKNIDRRCYNYTGIAD ACDFLFVMSYDEQSQIWSECIAAANAPYNQTLTGYNDYIKMSINPKKLVMGVPWYGYDYT CLNLSEDHVCTIAKVPFRGAPCSDAAGRQVPYKTIMKQINSSISGNLWDKDQRAPYYNYK VRLFRALV >gi568815597f:84379282_84597467|GENSCAN_predicted_CDS_6|1467_bp nccttctgccatgtgaggatgtggcacaaaggtgccatttgggagcagaaactggaccct caccagacaccagacctgccagagtttaagagcattagaggccacggtttatttgataaa gtccagttgttcatccttgcaagagaagggcctgcagcgcggaaatacagacttgtccac attaggccagatcacagagaaatggtaaaagtccggcagaggaattcggacttcactcag gagttcctgctggttctgaagcaggaaaatgagaagtggcctcagggaattagaaaggat ccactggtgttgatgaatggagtaatccacaagccggaggatctggtggaaacatcggtg ctcacacatcacattccgactccacaaggagtacacgaccagccacgacaaaacgagatg cagactattgggaggactataaacggcggtaggaacccactccggcccgctagacctgct gctatgtcccggccgcagcttcgacgctggcgcctcgtctctagcccgccgagcggcgtc ccgggtctagcgctgctggcgctgctggcgctgctggcgctgcggctcgcggccgggacc gactgcccatgcccggagcctgagctctgccgcccgattcgccaccatccagatttcgag gtctttgtgtttgatgttggacagaaaacttggaaatcttatgattggtcacagattaca actgtggcaacatttggaaaatatgactcagaacttatgtgctacgctcattcaaaagga gccagagtagtacttaaaggagatgtatccttaaaggatatcattgatcctgctttcaga gcatcctggatagctcaaaaacttaatttggccaaaacacaatatatggatggaattaat atagatatagagcaagaagttaattgtttatcacctgaatatgatgcattaactgcttta gtcaaagaaactacagactctttccatcgtgaaattgagggatcacaggtaacctttgat gtagcttggtctccaaagaacatagacagaagatgctataattatactggaatcgcagat gcttgtgacttcctctttgtgatgtcttatgatgaacaaagtcagatctggtcagaatgt attgcagcagccaatgctccctataatcagacattaactggatataatgactacatcaag atgagcattaatcctaagaaacttgtaatgggtgttccttggtatggttatgattatacc tgcctgaatctgtctgaggatcatgtttgtaccattgcaaaagtccctttccggggggct ccttgtagtgacgctgcaggacgtcaggtgccctacaaaacgatcatgaagcaaataaat agttctatttctggaaacctatgggataaagatcagcgggctccttattataactataaa gtaagacttttcagagctttagtgtag