GENSCAN 1.0 Date run: 7-Nov-116 Time: 02:48:44 Sequence gi568815580f:9002744_9234276 : 231533 bp : 39.80% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 4943 4982 40 -3.55 1.01 Init + 8720 8948 229 1 1 49 4 185 0.206 4.78 1.02 Term + 14677 15017 341 2 2 75 47 161 0.223 4.41 1.03 PlyA + 16136 16141 6 1.05 2.00 Prom + 17334 17373 40 -8.55 2.01 Sngl + 17542 18024 483 0 0 79 44 492 0.619 39.72 2.02 PlyA + 22053 22058 6 1.05 3.00 Prom + 25272 25311 40 -6.25 3.01 Init + 30083 30753 671 1 2 68 43 259 0.025 14.24 3.02 Intr + 40401 40496 96 0 0 119 72 42 0.400 3.91 3.03 Term + 41747 41816 70 2 1 50 37 98 0.297 -2.67 3.04 PlyA + 43553 43558 6 1.05 4.00 Prom + 47233 47272 40 -4.75 4.01 Init + 47486 47833 348 1 0 60 49 161 0.224 6.63 4.02 Intr + 56878 56949 72 0 0 119 48 67 0.651 4.48 4.03 Intr + 61889 61918 30 1 0 128 69 23 0.539 1.81 4.04 Term + 67271 67393 123 1 0 59 50 159 0.693 6.60 4.05 PlyA + 67401 67406 6 1.05 5.00 Prom + 68873 68912 40 -3.75 5.01 Init + 70864 70929 66 0 0 69 96 19 0.354 1.92 5.02 Intr + 84399 84531 133 2 1 67 42 121 0.900 4.70 5.03 Term + 85348 85448 101 2 2 81 39 129 0.673 4.61 5.04 PlyA + 86964 86969 6 1.05 6.00 Prom + 90414 90453 40 -3.65 6.01 Init + 100001 100054 54 1 0 77 82 142 0.989 11.83 6.02 Intr + 116583 116645 63 2 0 48 86 63 0.418 0.10 6.03 Intr + 116731 116847 117 0 0 69 79 85 0.937 5.44 6.04 Intr + 119770 119938 169 0 1 48 50 163 0.973 7.10 6.05 Intr + 122131 122240 110 2 2 53 73 81 0.758 2.18 6.06 Intr + 122416 122589 174 0 0 48 67 69 0.408 0.01 6.07 Intr + 124088 124164 77 1 2 97 91 94 0.962 7.99 6.08 Intr + 133784 133887 104 2 2 88 84 -12 0.152 -2.70 6.09 Intr + 133996 134353 358 2 1 51 97 434 0.855 33.99 6.10 Term + 134785 134902 118 1 1 52 39 88 0.533 -2.67 6.11 PlyA + 135110 135115 6 1.05 7.04 PlyA - 136516 136511 6 1.05 7.03 Term - 165461 164096 1366 1 1 -11 42 476 0.847 22.96 7.02 Intr - 166427 165571 857 2 2 48 72 435 0.958 27.12 7.01 Init - 168082 167735 348 1 0 73 37 324 0.508 23.03 7.00 Prom - 176129 176090 40 -5.55 8.00 Prom + 183577 183616 40 -7.25 8.01 Init + 186415 186720 306 0 0 68 -56 254 0.097 6.34 8.02 Intr + 192808 192955 148 0 1 67 75 188 0.857 14.29 8.03 Intr + 205914 206060 147 1 0 30 59 127 0.805 3.29 8.04 Intr + 208841 209041 201 0 0 112 110 193 0.998 22.24 8.05 Intr + 213014 213151 138 0 0 32 30 160 0.517 4.11 8.06 Intr + 214015 214157 143 2 2 26 95 133 0.850 6.95 8.07 Intr + 219109 219256 148 0 1 112 72 152 0.421 14.89 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 186415 186726 312 0 0 68 37 253 0.888 13.88 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815580f:9002744_9234276|GENSCAN_predicted_peptide_1|189_aa MGTEGELATCEELPIECDKCINTHIQLGRQGCGGSGCWSFQRVSQEQETLHVGRNDTCQM EGMKKAVMGEQEMQWRMQFPSGTTKNKHKPGSCQRRGSPWGAHRTHVRARRPPANHESVK EAATKAEKPLPSQVALRCTGALTVELALCLFPQLPPLRFRPNCWTHCRVGLSTFPWHIVL GGSFFFSIW >gi568815580f:9002744_9234276|GENSCAN_predicted_CDS_1|570_bp atggggaccgaaggagagttagctacctgtgaggaactccccatcgagtgtgacaaatgc ataaacacgcacatacagctgggaaggcagggctgtggtgggagtgggtgttggagcttc cagagggtgtcacaggagcaagaaacattgcatgttgggagaaatgacacgtgccagatg gagggaatgaagaaggccgtgatgggggagcaagagatgcaatggagaatgcagtttccc tctggcacaaccaaaaataaacacaaaccaggctcctgccaaaggaggggcagtccctgg ggcgcccaccgcacgcatgtgagagctcgcaggccacctgccaatcatgaatctgttaag gaagcggccacgaaggcagagaaacctctcccatcgcaggttgcgctgcgatgcacaggg gcgctgactgttgagttggcgctctgtttatttccccaactgcctcctctcaggttccgt cccaactgctggactcactgccgcgtgggcctgagcacatttccttggcacattgtcctc ggtggaagttttttcttctcgatttggtga >gi568815580f:9002744_9234276|GENSCAN_predicted_peptide_2|160_aa MSTDKTGENFHLICDTKAPFAVHRIAPEEAKYRLCKVRKIFVDTKGIPHLLTHDAHTICY PDPLIKVNDTIQVDLENGKITGFIKFNTENLCMVTGGANLGRIGVITNRERHPGSFDVVH GKDANSNINKPWISLPQGKVMRLTIAEETDKRLVAKQTSG >gi568815580f:9002744_9234276|GENSCAN_predicted_CDS_2|483_bp atgagcactgacaagacgggagagaatttccatctgatctgtgacaccaaggctcccttt gctgtacatcgtattgcacctgaggaggccaagtacaggttgtgcaaagtgagaaaaata tttgtggacacaaaaggaatccctcatctgctgactcatgatgctcataccatctgctac cctgatcccctcatcaaggtgaatgacaccattcaggttgatttggagaatggcaagatt actggtttcatcaagttcaacactgagaacctgtgtatggtgactggaggtgctaacctg ggaagaattggtgtgatcaccaacagagagaggcaccctggttcttttgatgtggttcat gggaaagatgccaacagcaacatcaacaaaccctggatttctcttccccaaggaaaggta atgcgcctcaccattgctgaagagacagacaagagactggtggccaaacagaccagtgga tga >gi568815580f:9002744_9234276|GENSCAN_predicted_peptide_3|278_aa MTPKALQRSLRLPLSSQARSARALREERFKGDPRAPMGLWGSLPRGTPSFGSLNSGAVLL DHPSCGSGGQDATWVNNPEGTVCKPWWHTCGANSPGMQNARTVEAWLPPPRFQRVPHNLS LKALNCHKAGCGRESPLRPCPAELQGQDHCREPPIGQCLVRPCKTATPKSPKPVEPAVPA SGQCQPGRAISIQLQFLRAAVWASSSKAMVVGSPGGMGTQSPLHTGVPNPRPRTSSGPWP VRNWAAQQEMSRGQNEMTVAHECELKEQYPCGSGKELS >gi568815580f:9002744_9234276|GENSCAN_predicted_CDS_3|837_bp atgactccaaaggcacttcagagatctttgaggctgccactctcgtcacaggcccggagt gctagggccttgagggaagaacgtttcaaaggagatcccagggcacccatgggactttgg ggctcactgcccaggggcaccccaagcttcggttccctgaattctggtgcagtactcctt gaccaccccagctgtggctcagggggccaagatgcaacttgggtcaacaatccggaaggc acagtatgtaaaccttggtggcatacatgtggggctaactctccaggcatgcagaatgca agaacagtggaagcatggctacctccacctagatttcaaagggtgcctcacaacctcagt ctcaaagctctgaactgccacaaggcagggtgtggcagagagtccccactacggccatgc ccagcagagctacagggtcaagaccactgcagggagcccccaatagggcaatgcttagtg aggccatgcaagacagccacccctaaaagccccaaacctgtagagcctgcagtgccagcc agcgggcagtgccagccaggaagagccataagtatccaactccaattcctgagagctgca gtgtgggcttcatccagcaaagccatggtagtggggtctcctgggggaatggggacccaa tccccactccacacaggggtccctaaccccaggccacgaactagtagtggtccgtggcct gttaggaactgggctgcacagcaggagatgagcagagggcaaaacgaaatgaccgtagca catgagtgtgaattaaaagaacagtatccttgtggctctggcaaagagctctcttaa >gi568815580f:9002744_9234276|GENSCAN_predicted_peptide_4|190_aa MWKQFWNWITDRSWNSLKGSEEDKKMWESLELPRDWLNGFDQNSDNDMDNEIQAEVVSDR DEELVGNWSKADSCYVLAKRLAAFCPCSRDLWNFELEGDDLWYLAEEISKQQSIQEEQTW QARGLKKLEGPQDKPLMKSLGFGFAMLPRQNTVLIFGQQEGALLREGKLPGTRVLCEICK ALHKEGKQFH >gi568815580f:9002744_9234276|GENSCAN_predicted_CDS_4|573_bp atgtggaagcaattttggaactggataacagacagaagttggaacagtttgaagggctca gaagaagacaaaaaaatgtgggaaagtttggaacttcctagagactggctgaatggcttt gaccaaaattctgataatgatatggacaatgaaatccaggctgaggtggtctcagataga gatgaggaacttgttgggaactggagtaaagctgactcttgctatgttttagcaaagaga ctggcagcattttgcccctgctctagagatttgtggaactttgaacttgagggagatgat ttatggtatctagcagaggaaatttctaagcagcaaagcattcaagaggaacagacttgg caagccagagggctgaagaagctagaggggccacaggataaaccactgatgaaatcactg ggattcggttttgccatgttgcccaggcagaacactgtcctgatatttgggcagcaggag ggtgccctgcttcgagaaggaaaattaccagggacccgtgtgctctgtgaaatatgtaag gctctacacaaggaaggaaagcagttccactga >gi568815580f:9002744_9234276|GENSCAN_predicted_peptide_5|99_aa MMKCHIRFSKQTSQSPKETNSKKLANICTDDIYRKLETIYKSINREMKNKVKIPKNIESA VDAGCGDPDKPPLCPLTTELQSSGAAGGGPAGPVGATVH >gi568815580f:9002744_9234276|GENSCAN_predicted_CDS_5|300_bp atgatgaaatgccacattcgcttcagtaaacagacatctcaaagtccaaaggaaacaaac tcaaagaaacttgcaaatatttgcacagatgacatctatagaaagttggaaacaatctac aagtctatcaatagggaaatgaaaaataaagtgaaaattcccaagaacattgaatcagca gttgatgctggatgtggtgatcctgacaagcctcctctctgccctctgaccaccgagctg cagagctccggagctgcaggaggaggacctgcagggcctgtgggtgcaactgtacactag >gi568815580f:9002744_9234276|GENSCAN_predicted_peptide_6|447_aa MFFSAALRARAAGLTAHWHRDTPENNPDTPFDFTPENYKRIEAIVKNYPEGHKAAAVLPV LDLAQRQNGWLPISAMNKVAEVLQVPPMRVYEVATFYTMYNRKPVGKYHIQVCTTTPCML RNSDSILEAIQKKLGIKVGETTPDKLFTLIEVECLGACVNAPMVQINDNYYKTNFKTLDY EKFNNLDINTENNPSVANYQDLTESNILPHFTSDTIIIIIICRNKLQIQEDLTAKDIEEI IDELKAGKIPKPGPSVCGGEAVTLVRVGRGDRPARRAEFRPAPLRRLPLVTCPRVSRARV AANRGATLSWERGGCGERRTETGDVFKPGSGGFTGGCSGDEDNDSDGYAEALVPGVKPPA PALPRVRTGEGGVATGKEEGGARFRACPRRQGGDPLSSALLPFPLPRASRRACADAGESA AAARAGSVGGTRVLGEVDEIKKGKLLL >gi568815580f:9002744_9234276|GENSCAN_predicted_CDS_6|1344_bp atgttcttctccgcggcgctccgggcccgggcggctggcctcaccgcccactggcacaga gatactcctgagaataaccctgatactccatttgatttcacaccagaaaactataagagg atagaggcaattgtaaaaaactatccagaaggccataaagcagcagctgttcttccagtc ctggatttagcccaaaggcagaatgggtggttgcccatctctgctatgaacaaggttgca gaagttttacaagtacctccaatgagagtatatgaagtagcaactttttatacaatgtat aatcgaaagccagttggaaagtatcacattcaggtctgcactactacaccctgcatgctt cgaaactctgacagcatactggaggccattcagaaaaagcttggaataaaggttggggag actacacctgacaaacttttcactcttatagaagtggaatgtttaggggcctgtgtgaac gcaccaatggttcaaataaatgacaattactataagacaaattttaaaactttagattat gaaaaatttaataatttggatattaatacagaaaataatcctagtgtagccaactatcag gacttaactgagagtaatattttgccacattttacttcagatactatcatcatcatcatc atttgtagaaataagctacagatacaggaggatttgacagctaaggatattgaagaaatt attgatgagctcaaggctggcaaaatcccaaaaccagggccaagtgtctgcggcggggag gcggtgacactagtccgagtggggcgtggagaccgaccagcacgaagggcggagttccgg cccgcacccctccgccgcctgccgcttgtcacgtgtccgcgggtgtcacgtgctcgcgtc gcagccaatcgcggggcgacgctgtcctgggagcgaggaggctgtggtgagagacggacc gagaccggagatgttttcaagcccggctccggcggctttacaggcggctgcagcggcgac gaagacaacgacagcgacggctacgccgaagcactcgttccgggggtgaagcctcctgcg ccggccttgcctcgggtccgtacgggtgaggggggcgtggcgacaggaaaggaggaagga ggggcgcggtttcgagcttgtccccggcgccagggaggagatcccctcagttctgcgctg ctcccgttccccctccccagagcaagccgccgagcgtgcgccgacgccggggagtctgcc gcggccgctcgggccgggagcgtcggaggtacccgggtcctgggagaggtggatgaaatc aagaaggggaaattgctgctttag >gi568815580f:9002744_9234276|GENSCAN_predicted_peptide_7|856_aa MGKKQSRKTGNSKNQSASPPPKERSSSPAMEQSWRENDFEKLREGFRRSNYSELQEEIQT NGKEVKSFEKKIDEWITRITNAEKSLKDLMELKTKAPELHDECRSLSSRCDQLEERHHTY SKIDHIVGSKALLSKCKRTEIITNCLSDHSAIKLELRIKKLTQNHSATWKLNNLLLNDYW VNNETKAEITMFFETNENKDTTYQNLWDTFKAVCRGKSIALNAHKRKQERSKIDTLTSQL KELEKQEQTHSKGSRRQEITKIRAELKEIETQKTLQKINESRSWFSEKINKIDRPLARLI KKKREKNQIDAIKNDKGDTTTDPTEIQTTIREYYKHHYTNKLENQEEMDEFLDTYILPRL NQEEVESLNRPITGSEIEAIINSLPMKKSPGPDGFTAEFYQSLAETQQKKENFRPISLMN TDAKILDKLLANRIQQHIKKLIHHDQVGFIPGMQGWFNICKSINVSQHINRTKDKNHMII SIEAEKAFDKIQQPFMLKTLNKLGIDGTYLKTIRAMYDKPTANIILNGQKLEAFPLETGT RQRCPPSPLLFNTVLEVVARTIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQN LLKLISNFSKVSGYKINVQKSQAFLNTNNKQTESQIMSELPFTIASKRIKYLGIQLTRDV KDFFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIMTTAILPKVIYRFNAIPIKLPMTFF TELEKTTLKFIWNQKRARIAKSILSLKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDID QWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPIQ KLIQDGLKTYMLDLKP >gi568815580f:9002744_9234276|GENSCAN_predicted_CDS_7|2571_bp atgggaaaaaaacagagcagaaaaactggaaactctaaaaatcagagcgcctctcctcct ccaaaggaacgcagctcctcaccagcaatggaacaaagctggagggagaatgactttgaa aagttgagagaaggcttcagaagatcaaactactccgagctacaagaggaaattcaaacc aatggcaaagaagttaaaagctttgaaaaaaaaattgacgaatggataactagaataacc aatgcagagaagtccttaaaggacctgatggagctgaaaaccaaggcaccagagctacat gacgaatgcagaagtctcagcagccgatgcgatcaactggaagaaaggcaccacacctac tccaaaattgaccacatagttggaagtaaagcactcctcagcaaatgtaaaagaacagaa attataacaaactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaaa ctcactcaaaaccactcagctacatggaaactgaacaacctgctcctgaatgactactgg gtaaataatgaaacgaaggcagaaataacgatgttctttgaaaccaacgagaacaaagac acaacataccagaatctctgggacacattcaaagcagtgtgtagagggaaatctatagca ctaaatgcccacaagagaaagcaggaaagatctaaaattgacaccctaacatcacaatta aaagaactagagaagcaagagcaaacacattcaaaaggtagcagaaggcaagaaataact aagatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaa tccaggagctggttttctgaaaagatcaacaaaattgatagaccactagcaagactaata aagaagaaaagagaaaagaatcaaatagacgcaataaaaaatgacaaaggggataccacc accgatcccacagaaatacaaactactatcagagaatactataaacaccactatacaaat aaactagaaaatcaagaagaaatggatgaattcctcgacacatacatcctcccaagacta aatcaggaagaagttgaatctctgaatagaccaataacaggctctgaaattgaggcaata atcaatagcttaccaatgaaaaaaagtccaggaccagatggattcacagccgaattctac cagagcctggcagagacacaacaaaaaaaagagaattttagaccaatatctttgatgaac actgatgcaaaaatcctcgataaattactggcaaaccgaatccagcagcacatcaagaag cttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaacatatgc aaatcaataaatgtaagccagcatataaacagaaccaaagacaaaaaccacatgattatc tcaatagaggcagaaaaggcctttgacaaaattcaacaacccttcatgctaaaaactctc aataaattaggtattgatgggacgtatctcaaaacaataagagctatgtatgacaaaccc acagccaatatcatactgaatgggcaaaaactggaagcattccctttggaaactggcaca agacagagatgccctccctcaccactcctattcaacacagtgttggaagttgtggccagg acaatcaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattg tccctgtttgcagatgacatgattgtgtatctagaaaaccccatcgtctcagcccaaaat ctcctcaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaa tcacaagcattcttaaacaccaataacaaacaaacagagagccaaatcatgagtgaactc ccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagggatgtg aaggacttcttcaaggagaactacaaaccactgctcaatgaaataaaagaggatacaaac aaatggaagaacattccatgctcatgggtaggaagaatcaatatcatgacaacggccata ctgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatgactttcttc acagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcattgcc aagtcaatcctaagcctaaagaacaaagctggaggtatcacactgcctgacttcaaacta tactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagac caatggaacagaacagagccctcagaaataatgccgcatatttacaactatctgatcttt gacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgg gaaaactggctagccatatgtaggaagctgaaactggatcccttccttacacctatacaa aaattaattcaagatggattaaagacttacatgttagacctaaaaccataa >gi568815580f:9002744_9234276|GENSCAN_predicted_peptide_8|411_aa MEEVWSGLDRRSNQLQHPLKPKFNPDQGPLLNSVKAERGLEAAEEKSEASISWFMRFKER SCLHNIKVQGEAPSANVEATASYPEDLAMIFDEDGYTKLQISSKDKIASYSKTPKIERSD VSKEMKEKSSMKRKLPFTISPSRNEERDSDTEKEGPEKKKTKKEAGNKKSTPVSILFGYP LSERKQMALLMQMTARDNSPDSTPNHPSQTTPAQKKTPSSSSRQKDKVNKRNERGETPLH MAAIRGDVKQVKELISLGANVNVKDFAGFTTEPVKEIMKEIVDMSKKVAGEKFQNMDLGE IEELVDTTSEVLIGWTPLHEACNVGYYDVAKILIAAGADVNTQGLDDDTPLHDSASSGHR DIVKLLLRHGGNPFQANKHGERPVDVAETEELELLLKREVPLSDDDESYTX >gi568815580f:9002744_9234276|GENSCAN_predicted_CDS_8|1233_bp atggaggaagtttggagtggtctggatagaagatcaaaccagctacaacatccccttaaa ccaaagttcaatccagatcaaggccctctcctcaattctgtgaaggctgagagaggtttg gaagctgcagaagaaaagtctgaagctagcataagttggttcatgaggtttaaggaaaga agctgtctccataacataaaagtgcaaggtgaagcaccaagtgctaatgtggaagctaca gcaagttatccagaagatctagctatgatctttgatgaagatggctacactaaactacag atttcaagtaaagacaagattgcatcctacagcaaaactccaaaaattgaacgaagtgat gtgagcaaggagatgaaagagaaatcatccatgaaacgtaaacttccttttactattagc ccatcaagaaatgaagaacgagattcagacacagagaaagaaggtccagaaaagaagaag acaaaaaaggaagctggaaataagaaatccacaccagttagcattctttttggttatcca ctctctgagcgaaaacagatggcacttcttatgcagatgacagcaagagacaacagtcca gattccacaccaaatcatccatcacaaacaacgcctgcccaaaagaaaactcccagttct tcatctcgacagaaagataaagttaataaaagaaatgaacgtggtgaaactcctttacac atggctgctattcgaggagatgtgaaacaagttaaagaattaataagtttaggggcaaat gtgaatgtgaaagattttgcaggatttaccacagaaccagtcaaggaaatcatgaaagag attgtggatatgtcaaaaaaggtagcgggtgaaaagtttcaaaatatggatcttggagaa attgaggagctagtagataccacatcagaggtattaataggttggacaccactgcatgaa gcttgcaatgttggatattacgatgttgctaagatacttatagcagctggagcagatgtt aacacacaaggattagatgatgacactccactccatgattctgctagtagtgggcacaga gatatagtaaagctgttacttcgtcacggtggaaatccatttcaagctaataaacatggg gagcgtccagtggatgtagcagaaacagaggagttggagttgctactaaaaagagaggtg cctttatctgatgatgatgaaagttacacagnn