GENSCAN 1.0 Date run: 6-Nov-116 Time: 19:15:23 Sequence gi568815597r:84401848_84606091 : 204244 bp : 38.48% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.10 PlyA - 250 245 6 -1.75 1.09 Term - 1671 1603 69 0 0 92 42 127 0.349 5.46 1.08 Intr - 2591 2317 275 0 2 63 26 285 0.230 16.13 1.07 Intr - 4761 4719 43 0 1 105 116 -2 0.416 1.19 1.06 Intr - 10208 10132 77 0 2 87 91 88 0.906 7.32 1.05 Intr - 10752 10635 118 0 1 30 65 82 0.865 -0.78 1.04 Intr - 12140 11959 182 0 2 51 73 178 0.899 11.27 1.03 Intr - 15298 15247 52 1 1 69 127 42 0.107 3.76 1.02 Intr - 26045 25881 165 2 0 89 85 106 0.016 9.64 1.01 Init - 31653 31531 123 0 0 96 -39 116 0.021 -0.08 1.00 Prom - 39191 39152 40 -3.65 2.00 Prom + 40496 40535 40 -4.05 2.01 Sngl + 42263 42598 336 1 0 52 41 261 0.966 13.58 2.02 PlyA + 42649 42654 6 1.05 3.00 Prom + 46917 46956 40 -4.35 3.01 Init + 53823 53896 74 2 2 87 92 21 0.509 2.99 3.02 Intr + 55364 55429 66 2 0 88 56 85 0.345 2.40 3.03 Intr + 61480 61650 171 1 0 107 -6 106 0.511 1.24 3.04 Term + 61855 62122 268 1 1 18 49 226 0.386 5.68 3.05 PlyA + 63171 63176 6 1.05 4.00 Prom + 69896 69935 40 -5.35 4.01 Init + 77435 77866 432 1 0 95 39 416 0.567 33.66 4.02 Intr + 79109 79165 57 1 0 93 94 53 0.858 4.56 4.03 Intr + 81068 81148 81 1 0 90 80 147 0.971 13.02 4.04 Intr + 87786 87881 96 2 0 133 49 84 0.992 8.29 4.05 Intr + 88472 88625 154 1 1 94 7 134 0.636 4.62 4.06 Intr + 93526 93608 83 2 2 58 115 59 0.988 4.04 4.07 Intr + 94035 94216 182 2 2 90 94 40 0.987 2.54 4.08 Intr + 94397 94523 127 2 1 80 95 77 0.950 7.36 4.09 Intr + 104191 104278 88 0 1 22 72 98 0.014 0.22 4.10 Intr + 104553 104571 19 1 1 107 94 0 0.044 -2.75 4.11 Intr + 120543 120660 118 0 1 27 103 111 0.864 6.05 4.12 Term + 123998 124279 282 1 0 87 43 159 0.926 5.74 4.13 PlyA + 124427 124432 6 1.05 5.00 Prom + 130934 130973 40 -2.05 5.01 Init + 142259 142457 199 1 1 41 106 86 0.891 4.81 5.02 Intr + 143787 143912 126 1 0 53 58 72 0.563 0.43 5.03 Intr + 146939 147117 179 0 2 101 62 154 0.968 12.82 5.04 Intr + 148585 148683 99 0 0 72 86 43 0.750 1.89 5.05 Intr + 154063 154132 70 0 1 72 90 67 0.030 3.14 5.06 Term + 159821 159924 104 0 2 53 44 102 0.010 -0.24 5.07 PlyA + 159959 159964 6 1.05 6.16 PlyA - 160093 160088 6 1.05 6.15 Term - 160549 160535 15 1 0 99 43 1 0.352 -6.04 6.14 Intr - 161571 161398 174 0 0 93 88 116 0.643 11.31 6.13 Intr - 161985 161888 98 1 2 59 75 79 0.999 2.51 6.12 Intr - 164165 163994 172 2 1 61 99 113 0.694 8.29 6.11 Intr - 168292 168084 209 2 2 65 98 121 0.999 8.67 6.10 Intr - 168873 168735 139 0 1 94 95 134 0.999 13.82 6.09 Intr - 172606 172392 215 2 2 41 60 341 0.092 24.11 6.08 Intr - 172907 172841 67 2 1 54 66 80 0.051 -0.04 6.07 Intr - 180511 180431 81 1 0 83 74 81 0.119 5.12 6.06 Intr - 182033 181880 154 1 1 9 86 143 0.630 5.35 6.05 Intr - 187550 187514 37 0 1 32 115 38 0.088 -2.70 6.04 Intr - 194688 194584 105 1 0 82 103 100 0.260 10.17 6.03 Intr - 196249 196102 148 1 1 94 94 121 0.981 12.19 6.02 Intr - 197435 197336 100 1 1 81 48 81 0.817 2.59 6.01 Intr - 204148 203915 234 0 0 73 47 163 0.913 6.58 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 22189 22433 245 0 2 51 80 225 0.848 15.25 S.002 Term + 25871 26105 235 2 1 43 49 142 0.885 0.61 S.003 Term + 154063 154170 108 0 0 72 42 106 0.864 1.93 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:84401848_84606091|GENSCAN_predicted_peptide_1|367_aa MVSFSDSAGAKPIKRGWQHRQQGASGNRALIWLLLCDLGQNSPDGKMVGLACLRSYWAEE WTEVGCKSDFLDIGKQLACSMYQSLLGHQTLLHLTEVLQQGVTITKIYQHPYHGVTHGSG KEFGEDTIHVSVIKDLELTEFDGLIEGSEKEEMGSDWVTMQNTRKSRFVGDDDASVQQQN YTACCLMMVFHTVKKRIRLCKMEEFLSLGRLKCDELTENLHHTPKIMKFVIDEIDIRTQN QSLKTLHVAGFTKGKAMKAIKPKTIHSCWKNLCSDVEHDFTGFMTESIKKVMKEIVDMAR KVGGDGFQDMIPGEIQKLIDTTLEKLTEHNSMEPEPDDEQEDIEELTQQEDDKDEDTYDD PFPLNEE >gi568815597r:84401848_84606091|GENSCAN_predicted_CDS_1|1104_bp atggtgtccttctcagactcagctggagcaaagcctatcaagcgagggtggcagcatcgt caacagggagcatcgggcaaccgggctctcatctggcttctgctctgtgatctcggacag aacagccctgatgggaagatggttggactggcctgcctccgttcctactgggccgaggaa tggactgaggttggctgcaagtctgactttttggatatagggaaacaactggcttgcagc atgtaccagtctctccttggtcaccagacactgctccatctcacagaggttctacaacaa ggagttaccatcaccaaaatataccaacacccctaccacggtgtgacccatggcagtggc aaagagtttggagaagatacaatacatgtgagtgttattaaggatctggaactgactgaa tttgatggtttgatagaagggtcagaaaaagaagagatggggagtgactgggtaactatg cagaatacaagaaaaagtagatttgtaggggatgatgacgcatcagttcaacagcagaat tatacagcatgttgtttgatgatggtctttcataccgtcaagaaaagaatccgactttgc aaaatggaggaatttttgtccctgggccgactgaagtgtgatgaattaactgagaattta catcacacacccaagattatgaagtttgtaattgatgaaattgatattcgaacccagaac caatcactcaagacattgcatgtggcaggttttactaaaggaaaagccatgaaagccatc aagcccaaaacaatacattcctgctggaaaaatctgtgttcagatgttgagcatgacttc acaggatttatgacagagtcaatcaagaaagtcatgaaagagattgtggatatggcaaga aaggttgggggtgatggatttcaagatatgattcctggagaaattcaaaagctaatagac accacactagagaaattaacagaacataactcaatggaaccagagccagatgatgagcaa gaagacatagaagagctgactcaacaggaagatgacaaggatgaagacacttatgatgat ccatttccacttaatgaagagtga >gi568815597r:84401848_84606091|GENSCAN_predicted_peptide_2|111_aa MRKNQYKKAENSKNQNTSSPKDHNSSSAREQTWTESEFDEFTEVGFRRWVITNSSELKEH VLTQCKEVKTLEKRLEELLTRITSLEKNINDLMELKNTARELSEAYTSINS >gi568815597r:84401848_84606091|GENSCAN_predicted_CDS_2|336_bp atgaggaaaaaccagtacaaaaaggctgaaaattccaaaaaccagaacacctcttctcca aaggatcacaactcctcatcagcaagggaacaaacctggacagagagtgagtttgacgaa ttcacagaagtaggcttcagaaggtgggtaataacaaactcctccgagctaaaggagcat gttctaacccaatgcaaggaagttaagacccttgaaaaaaggttagaggaattgctaact agaataaccagtttagagaagaacataaatgacctgatggagctgaaaaacacagcacga gaacttagtgaagcatacacaagtatcaacagctga >gi568815597r:84401848_84606091|GENSCAN_predicted_peptide_3|192_aa MNKNTKWSYRVQNISYLHHEDSGKRLVAKGEEKESEEGLQVLRAKQIIAIKAPKLKTIYI KEPPSFKENQWATERQQNWRSSSKHQSPILSSSLLRYLHIMCCTRKTSIGEQKEVAPVEM LLGLEQDAWIFQKTLKKKAKLEVNCWYLPKGAVHQVMGPQHQEAREHIAKEGVRRGGLSR IRELCRGPCMAS >gi568815597r:84401848_84606091|GENSCAN_predicted_CDS_3|579_bp atgaacaagaatacaaagtggtcctatagggtccagaatataagttacctccatcatgag gatagtggtaagaggttggtggctaagggagaagagaaggagagtgaagaaggtctacag gttctgagagcaaagcaaataattgccatcaaagctccaaagctgaagacaatatacatc aaggaacccccatcatttaaagagaaccagtgggctactgagaggcaacagaactggcgt tcctcctccaagcatcagtctcccatcctttcctcctccctactgcgttatctacatatt atgtgttgtacgagaaaaacaagtattggagaacaaaaagaagtggcaccagttgagatg ctacttggattggagcaagatgcttggattttccagaagacactgaagaagaaagccaag cttgaagttaattgctggtacctgcccaagggggctgtgcaccaagtgatgggaccccaa caccaagaggctagggagcacattgctaaggaaggtgtcagaagaggtggcctgtcccgc ataagggaactgtgcagaggcccttgtatggcttcctga >gi568815597r:84401848_84606091|GENSCAN_predicted_peptide_4|572_aa MAKAGDKSSSSGKKSLKRKAAAEELQEAAGAGDGATENGVQPPKAAAFPPGFSISEIKNK QRRHLMFTRWKQQQRKVRERRGLPGACALFLTLRAVAGRTSVVVCWSQKCSLWLRGKGGV AAHVFWSPRPSLETLGSAFCLSVKEKLAAKKKLKKEREALGDKAPPKPVPKTIDNQRVYD ETTVDPNDEEVAYDEATDEFASYFNKQTSPKILITTSDRPHGRTVRLCEQLSTVIPNSHV YYRRGLALKKIIPQCIARDFTDLIVINEDRKTPNGLILSHLPNGPTAHFKMSSVRLRKEI KRRGKDPTEHIPEIILNNFTTRLGHSIGRMFASLFPHNPQFIGRQVATFHNQRDYIFFRF HRYIFRSEKKVGIQELGPRFTLKLRSLQKGTFDSKYGEYEWVHKPELLNHFLHSGDAGGA RHGARDGRADSWVEGPIRERVSPQLTLRALRERLGEFLGEDAIAEKFLFLKCIGNNLAVA LQPELYLLPVMDHLGNVYSPSTVILDERQTNNGVNEADGTIHRPISVTLFKEELGRDPSL LENTLKELPNKNQEEGDFKLEWEKERGYYSTF >gi568815597r:84401848_84606091|GENSCAN_predicted_CDS_4|1719_bp atggcgaaagccggggataagagcagcagcagcgggaagaaaagtctaaaacggaaagcc gctgccgaagaacttcaggaggctgcaggcgctggggatggggcgacggaaaacggggtc caacccccgaaagcggctgcctttccgccaggctttagcatttcggagattaaaaacaaa cagcggcgacacttaatgttcacgcggtggaaacagcagcagcggaaggtacgcgagagg cgggggctgccgggcgcttgcgcgttgttcctgacgcttagggcggtcgcggggcgcaca tctgtggttgtctgctggtctcaaaagtgttctctgtggctccgtgggaagggtggcgtc gcggcccacgtgttctggtcccctcgccctagtttggagactctaggttcagccttttgc ctgagcgtcaaggaaaagttggcagctaagaaaaaacttaaaaaagaaagagaggctctt ggcgataaggctccaccaaagcctgtacccaagaccattgacaaccagcgagtgtatgat gaaaccacagtagaccctaatgatgaagaggtcgcttatgatgaagctacagatgaattt gcttcttacttcaacaaacagacttctcccaagattctcatcacaacatcagatagacct catgggagaacagtacgactctgtgaacagctctccacagttataccaaactcacatgtt tattacagaagaggactggctctgaaaaaaattattccacagtgcatcgcaagagatttc acagacctgattgttattaatgaagatcgtaaaaccccaaatggacttattttgagtcac ttgccaaatggcccaactgctcattttaaaatgagcagtgttcgtcttcgtaaagaaatt aagagaagaggcaaggaccccacagaacacatacctgaaataattctgaataattttaca acacggctgggtcattcaattggacgtatgtttgcatctctctttcctcataatcctcaa tttatcggaaggcaggttgccacattccacaatcaacgggattacatattcttcagattt cacagatacatattcaggagtgaaaagaaagtgggaattcaggaacttggaccacgtttt accttaaaattaaggtctcttcagaaaggaacctttgattctaaatatggagagtatgaa tgggtccataagccggagctgttgaaccactttcttcatagcggcgacgctggaggagcc agacatggtgcacgcgacggccgggccgattcgtgggtcgaggggccgattagggagaga gtatctcctcaacttactttacgagccctgagggagcgtcttggtgagttcctgggtgaa gatgctattgcagaaaaatttttatttctgaaatgcattggaaataatttagctgtggct cttcaaccagaattatatttgcttcctgtaatggaccatttaggaaatgtttattcacca tcaacagttattttagatgagcggcagactaataatggtgttaatgaggctgatggaaca atccacagaccaattagtgtaactttgttcaaggaggaacttggaagagatcccagtttg ttagaaaacactttgaaagagcttcctaacaagaatcaggaagaaggtgattttaaactg gaatgggaaaaagaaagaggctattatagtacattttag >gi568815597r:84401848_84606091|GENSCAN_predicted_peptide_5|258_aa MHFWFALSNTYGQVKKECDSVADLTHFHVCLQTAEKEYITLPDHPSLPCQPVLSSGITDI SLLQTEREKIIKQMKQVKEERRYLERNREELVKTVEKLFEQSKLKRYHAYNGWKKKYLET KKVTASMEEVLTKLREDLELYYKKLLMQLEAREIKMRPKNLANITDSKNYLIIQITEVQH AIDQLKRKLDTDKMKLIVEVKLLEVCLARRSTEDFVYDESAKSKEIAIATPTFSNHHPDQ SAAINDKARHSSSKQVIT >gi568815597r:84401848_84606091|GENSCAN_predicted_CDS_5|777_bp atgcatttctggtttgcattatcaaatacctacgggcaagtgaaaaaagaatgtgacagt gtagccgatcttactcattttcacgtttgcttacagacagctgaaaaagagtacatcacc ctaccagatcacccttcacttccttgtcaacctgttctttcttcaggaataactgatata tctttattacaaactgaaagagagaagattatcaaacaaatgaaacaagtaaaggaagaa agaaggtatctggaaagaaatagagaagaactagtaaaaacggttgaaaagctatttgaa caaagcaaattaaaacgatatcatgcctacaatggttggaagaaaaaatacttggaaaca aagaaagtcacagcatcaatggaggaggttttaacaaaacttcgagaagatttggaactc tactataaaaaactgctcatgcaacttgaagccagggagatcaagatgagaccaaagaat ctggcaaacatcacagactccaagaattacctaataatccagatcactgaggtacagcat gcaattgaccagcttaagagaaaactagatactgacaaaatgaaactcatagtagaagtt aagttgctagaagtgtgcctggctaggaggtctacagaagattttgtttatgatgaatca gccaagtcaaaagaaattgccatagccaccccaaccttcagcaaccaccaccctgatcag tcagcagccatcaatgacaaggcaagacactcttccagcaagcaggttataacttga >gi568815597r:84401848_84606091|GENSCAN_predicted_peptide_6|649_aa XGRSEVWAVGSFGDGGAGGADGAAGGGPGQRGLEGLGHLEAASNGSSKTCSSDLGELLST DLSDMIAKRAELVLQCLQRVNTGGDSPPSLQGELSLYSIPKGSPERSCKVELLAVLEFAH QICHPRRPKVQQDDCEQDQGPQPLQKKVEDYVHERHSRLSSAFCHVRMWHKGAIWEQKLD PHQTPDLPEFKSIRGHGLFDKVQLFILAREGPAARKYRLVHIRPDHREMVKVRQRNSDFT QEFLLVLKQENEKWPQGIRKDPLVLMNGVIHKPEDLVETSVLTHHIPTPQGVHDQPRQNE MQTIGRTINGGRNPLRPARPAAMSRPQLRRWRLVSSPPSGVPGLALLALLALLALRLAAG TDCPCPEPELCRPIRHHPDFEVFVFDVGQKTWKSYDWSQITTVATFGKYDSELMCYAHSK GARVVLKGDVSLKDIIDPAFRASWIAQKLNLAKTQYMDGINIDIEQEVNCLSPEYDALTA LVKETTDSFHREIEGSQVTFDVAWSPKNIDRRCYNYTGIADACDFLFVMSYDEQSQIWSE CIAAANAPYNQTLTGYNDYIKMSINPKKLVMGVPWYGYDYTCLNLSEDHVCTIAKVPFRG APCSDAAGRQVPYKTIMKQINSSISGNLWDKDQRAPYYNYKVRLFRALV >gi568815597r:84401848_84606091|GENSCAN_predicted_CDS_6|1950_bp nngggtaggagtgaagtctgggcagtggggagctttggagatggtggagctggaggagct gatggtgcagctggagggggccctggacaaaggggactggaaggactggggcacttagaa gctgcctctaatggttcctccaaaacttgctcttctgacttaggggaactactctccaca gacctgtctgatatgattgctaagagggctgagttggttttgcaatgcttgcaaagagtt aacactggaggagactcgcccccaagtttgcagggagagctaagtctatatagcattcct aaagggagcccagagaggtcctgtaaggtggaactgttggcagttttggagtttgcccac cagatatgccatccaagaagaccaaaagtacagcaggatgattgtgaacaagaccaggga cctcagcccttgcaaaaaaaggttgaagactatgtacatgaacgacattctaggctatcc tcagccttctgccatgtgaggatgtggcacaaaggtgccatttgggagcagaaactggac cctcaccagacaccagacctgccagagtttaagagcattagaggccacggtttatttgat aaagtccagttgttcatccttgcaagagaagggcctgcagcgcggaaatacagacttgtc cacattaggccagatcacagagaaatggtaaaagtccggcagaggaattcggacttcact caggagttcctgctggttctgaagcaggaaaatgagaagtggcctcagggaattagaaag gatccactggtgttgatgaatggagtaatccacaagccggaggatctggtggaaacatcg gtgctcacacatcacattccgactccacaaggagtacacgaccagccacgacaaaacgag atgcagactattgggaggactataaacggcggtaggaacccactccggcccgctagacct gctgctatgtcccggccgcagcttcgacgctggcgcctcgtctctagcccgccgagcggc gtcccgggtctagcgctgctggcgctgctggcgctgctggcgctgcggctcgcggccggg accgactgcccatgcccggagcctgagctctgccgcccgattcgccaccatccagatttc gaggtctttgtgtttgatgttggacagaaaacttggaaatcttatgattggtcacagatt acaactgtggcaacatttggaaaatatgactcagaacttatgtgctacgctcattcaaaa ggagccagagtagtacttaaaggagatgtatccttaaaggatatcattgatcctgctttc agagcatcctggatagctcaaaaacttaatttggccaaaacacaatatatggatggaatt aatatagatatagagcaagaagttaattgtttatcacctgaatatgatgcattaactgct ttagtcaaagaaactacagactctttccatcgtgaaattgagggatcacaggtaaccttt gatgtagcttggtctccaaagaacatagacagaagatgctataattatactggaatcgca gatgcttgtgacttcctctttgtgatgtcttatgatgaacaaagtcagatctggtcagaa tgtattgcagcagccaatgctccctataatcagacattaactggatataatgactacatc aagatgagcattaatcctaagaaacttgtaatgggtgttccttggtatggttatgattat acctgcctgaatctgtctgaggatcatgtttgtaccattgcaaaagtccctttccggggg gctccttgtagtgacgctgcaggacgtcaggtgccctacaaaacgatcatgaagcaaata aatagttctatttctggaaacctatgggataaagatcagcgggctccttattataactat aaagtaagacttttcagagctttagtgtag