GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:08:53

Sequence gi568815586f:10209385_10414790 : 205406 bp : 38.14% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   3746   3835   90  1  0   81   86   197 0.981  19.34
 1.02 Intr +   8679   8757   79  2  1  120   55    82 0.984   6.31
 1.03 Intr +  11056  11315  260  2  2   66   56   187 0.718   9.56
 1.04 Term +  12403  12468   66  0  0  121   48    81 0.734   4.36
 1.05 PlyA +  13719  13724    6                               1.05

 2.00 Prom +  15074  15113   40                              -8.35
 2.01 Init +  15175  15277  103  0  1   60   68   109 0.421   4.57
 2.02 Term +  16144  16847  704  2  2   47   49   302 0.396  14.90
 2.03 PlyA +  18626  18631    6                               1.05

 3.00 Prom +  21294  21333   40                              -2.35
 3.01 Init +  24844  25025  182  0  2   75  -16   193 0.144   4.38
 3.02 Intr +  27986  28166  181  2  1   92   56   103 0.167   6.45
 3.03 Intr +  42558  42776  219  2  0  100   80    98 0.772   7.78
 3.04 Intr +  67756  67779   24  0  0  117   90    20 0.013   2.40
 3.05 Intr +  71268  71396  129  2  0   65   78    69 0.014   3.57
 3.06 Intr +  91937  91985   49  1  1   62   97    38 0.001  -0.57
 3.07 Intr +  98610  98700   91  1  1   90   91    34 0.020   2.03
 3.08 Intr + 105289 105389  101  1  2   97   58   103 0.162   7.03
 3.09 Intr + 108433 108563  131  2  2    9   38   100 0.004  -3.41
 3.10 Intr + 123405 123632  228  2  0   86   30   106 0.013   1.74
 3.11 Intr + 126775 126893  119  0  2   -4  109   163 0.188   7.54
 3.12 Intr + 128226 128373  148  0  1   43   34    61 0.097  -4.48
 3.13 Term + 128549 128623   75  1  0  111   54    63 0.131   2.06
 3.14 PlyA + 130605 130610    6                               1.05

 4.02 PlyA - 130685 130680    6                               1.05
 4.01 Sngl - 137316 135874 1443  0  0   42   47   410 0.889  27.78
 4.00 Prom - 137475 137436   40                             -11.44

 5.02 PlyA - 137838 137833    6                               1.05
 5.01 Sngl - 138673 138017  657  1  0   44   41   239 0.585  10.72
 5.00 Prom - 138766 138727   40                              -6.15

 6.02 PlyA - 138935 138930    6                               1.05
 6.01 Sngl - 139818 139189  630  0  0   59   42   422 0.806  30.63
 6.00 Prom - 143076 143037   40                              -3.55

 7.10 PlyA - 143696 143691    6                               1.05
 7.09 Term - 154826 154411  416  0  2   80   49   194 0.841   9.14
 7.08 Intr - 155280 154983  298  0  1  116  100   132 0.453  12.72
 7.07 Intr - 169321 169170  152  2  2   75   19    60 0.093  -3.24
 7.06 Intr - 170098 170063   36  2  0   60  131    29 0.705   1.72
 7.05 Intr - 170408 170316   93  0  0  103   93    67 0.908   7.72
 7.04 Intr - 177626 177519  108  0  0  102  115    24 0.279   5.84
 7.03 Intr - 198998 198945   54  0  0   81   83    40 0.437   0.83
 7.02 Intr - 199626 199528   99  1  0  149   63    52 0.702   7.96
 7.01 Init - 200191 200005  187  1  1   81   93    85 0.991   7.57

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 158373 158498  126  2  0   65   40   188 0.831   9.00

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:10209385_10414790|GENSCAN_predicted_peptide_1|164_aa
MKFQYKEDHPFEYRKKEGEKIRKKYPDRVPVIVEKAPKARVPDLDKRKYLVPSDLTVGQF
YFLIRKRIHLRPEDALFFFVNNTIPPTSATMGQLYEVMVLVAQYWMPSSAVWHPLALVLD
ALITHLRSGAEGVIYPDPLTYGSDNHEEDYFLYVAYSDESVYGK

>gi568815586f:10209385_10414790|GENSCAN_predicted_CDS_1|495_bp
atgaagttccagtacaaggaggaccatccctttgagtatcggaaaaaggaaggagaaaag
atccggaagaaatatccggacagggtccccgtgattgtagagaaggctccaaaagccagg
gtgcctgatctggacaagaggaagtacctagtgccctctgaccttactgttggccagttc
tacttcttaatccggaagagaatccacctgagacctgaggacgccttattcttctttgtc
aacaacaccatccctcccaccagtgctaccatgggccaactgtatgaggtaatggttctg
gttgcacaatactggatgccgtccagtgcagtctggcatcctctagcccttgttctagat
gcgttgataacacatctgagaagtggggcagaaggtgttatttatccggatcctcttaca
tatggcagtgacaatcatgaggaagactattttctgtatgtggcctacagtgatgagagt
gtctatgggaaatga

>gi568815586f:10209385_10414790|GENSCAN_predicted_peptide_2|268_aa
MKPRTLAVSVTALKVACLESVPSDVQMCSEFLPSDSGAQLASPSGSHTGAAGGAACQSRA
VRSHSSALGLFVPSRSMGLGAVEQGVVLVGEARAAQVPMEWVGGSGMAGCRSRALPRGKA
AKARREIERSAGGPALLGDPVHPPQPLARVLSPPLPRASRAGWLAAPSAGPAKSTPTRNS
SWRASAPRSPGSARASPSTPPSKLREWAPALASPERGSHSAVGGLKGSSNATKVGAQAGE
VPRASEGSEDCQHAVTSQQAGLDLALLY

>gi568815586f:10209385_10414790|GENSCAN_predicted_CDS_2|807_bp
atgaagccgcggaccctcgcggtgagtgttacagctcttaaggtggcgtgtctggagtct
gtcccttctgatgttcagatgtgttcggagtttcttccttctgactcgggagcccagctg
gcttcacccagtggatcccacaccggggctgcaggtggagctgcctgccagtcccgcgcc
gtgcgctcgcattcctcagcccttgggttgtttgttccttcccggtcgatgggactgggc
gccgtggagcagggggtggtgctcgtcggggaggctcgggccgcacaggtgcccatggag
tgggtgggaggctcaggcatggcgggctgcaggtcccgagccctgccccgcgggaaggca
gctaaggcccggcgagaaatcgagcgcagcgccggtgggccagcactgctgggggaccca
gtacaccctccgcagccactggcccgggtgctaagtcccccattgccccgggccagcagg
gctggctggctggctgctccgagtgcggggcccgccaagtccacgcccacccggaactcc
agctggcgcgcaagcgccccacgcagccccggttccgctcgtgcctctccctccacacct
ccctccaagctgagggagtgggctccagccttggccagcccagaaaggggctcccacagt
gcagtgggggggctgaagggctcctcaaatgccaccaaagtgggagcccaggcaggggag
gtgccgagagcaagcgagggctctgaggactgccagcatgctgtcacttctcagcaggct
ggattggatttggcacttctttactga

>gi568815586f:10209385_10414790|GENSCAN_predicted_peptide_3|558_aa
MALGISAPVALQGTAPLLAVLSGCSFPKHMLQTVNGSPFWGLENGGPLLRARLGSAPVET
LELFSSLNKILHSYHSSVVKCDLILLGRWTKAWDPLSAGGGCHTGPLPLQVEGNHPTGSY
RVPNRPQYRSVAWGLGTSGLVNYTFLLNSGETTYQFLRGNKDFLKNHIKLNYCFLLIEVD
NLTLVFVIEKTLGQIFDIPKVELLFSYQCFPMVENRQKPEGEEDCVIQLSELSCTECSKK
AWRMEVLHTNKTTNATQCGGPAQLQQFNAVLSEKVHIVPSLLRSWNIISHGRFPSFETFN
TKNCIAYNPNGNALDESCEDKNRYIWLEKPQETYSNDRRESKHIPLRMAAERRRAEQKEK
YPLIKSSDLFYISIKMIKTNKTKLVYKYTKSHCGGKSSLIYFSRYRSNRTAKWNSSQKPD
SGRCTRVLAPNFTRVRVKRPPNRLCGASEAIRQRQSSAAKLRKSGKESVREPWARVPGAL
GVAARKAGLAAKGEGEGVEGYLPLSQKSREGVETRREGVGVLAPSPEKRDLLLRKMKGIE
IKKRERLKSGKEKVVEGQ

>gi568815586f:10209385_10414790|GENSCAN_predicted_CDS_3|1677_bp
atggccttgggaatctctgcccctgtggctttgcagggtacagccccactcctggctgtg
cttagtggctgcagctttcccaagcacatgttgcaaactgtcaatggatcaccattctgg
ggtttggagaatggtggccctcttctcagagctcgactaggcagtgctccagtggagact
ctagaactgttttcatcgctcaataaaattctccactcttaccattcttcagttgtcaag
tgtgacctcattcttcttggacgctggacaaaagcttgggacccactaagtgcaggtgga
ggctgtcacactggccctttgcccttgcaagtggagggcaaccaccccactggcagctac
agggttcctaacaggccacagtaccggtctgtggcctggggcttggggacctctggtctg
gtcaattacacatttctcctgaattctggagagacaacataccagttcctcagaggaaac
aaagattttcttaaaaatcacatcaaattaaattactgctttttgcttattgaagtggat
aatcttactcttgtttttgtcattgaaaagacactaggccagatatttgatattccaaag
gtagagcttctcttctcctaccaatgctttccaatggttgaaaacagacagaagccagag
ggtgaggaagactgtgtgatacagttgtcagagctcagctgcacagaatgcagcaaaaaa
gcatggagaatggaggttctgcataccaacaaaaccaccaatgccacccagtgtggaggg
cccgctcagcttcaacaattcaacgctgttctttctgaaaaagtacacatcgtgccttct
ctacttcgctcttggaacataatttctcatggcagatttccatcatttgaaacttttaat
acaaagaactgcatagcgtataatccaaatggaaatgctttagatgaatcctgtgaagat
aaaaatcgttatatctggttggagaagcctcaggaaacttacagtaatgacagaagggaa
agcaaacacatccctctacgcatggcagcagaaaggagacgtgctgagcaaaaggaaaaa
taccctcttattaaatcatcagatctattctacatttcaatcaaaatgattaaaacaaat
aaaaccaaacttgtttacaaatatacgaagagtcactgtggagggaaaagctctttaatt
tatttctcaagatatagatccaacaggactgcaaagtggaacagttctcagaagcctgat
tcaggaagatgtacccgggttttggcaccaaatttcacacgcgtccgtgtgaagagacca
ccaaacaggctttgtggggcttccgaggcgatccggcagcgtcagtcttcagctgctaag
ctgagaaaatctgggaaggagtcagtcagagagccttgggccagagttccaggggctctg
ggagtggctgccagaaaagcgggacttgccgctaagggtgaaggagaaggggttgagggg
tacttgcccctgtcccagaaaagcagagaaggggtagagacaaggagagaaggggttggg
gtacttgccccttccccagaaaagcgggacttgctgctaaggaaaatgaaaggaatagaa
attaagaaaagggagagattgaagagtggaaaggagaaagtggttgagggacagtga

>gi568815586f:10209385_10414790|GENSCAN_predicted_peptide_4|480_aa
MIISINAEKAFDKIQQPLMLKTLNKLGIDGTYFKIIRAIYDKPTANIMLNGQKLEAFPLK
TGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVS
AQNLLKLISNFSKVSGYKINVQKSQAFLYTSNRQTESQIMSELPFTIASKRMKYLGIQLT
RDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPM
TFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNR
DIDQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFNKWCWEKWLAICRKLKLDPFLT
PYTEINSRRIKDLNVRPKAIKTLVENLGITIQDIGMGKDFMSKTPKAMATKAKIDKWDLI
KLKSFCTAKETTIRVNRQPTKWEKIFTTCSSDKGLISRIYNELKQIYKKKTTPSKSGQRT

>gi568815586f:10209385_10414790|GENSCAN_predicted_CDS_4|1443_bp
atgattatctcaataaatgcagaaaaggcctttgacaaaattcaacaacccttaatgcta
aaaactctcaataaattaggtattgatgggacgtatttcaaaataataagagctatttat
gacaaacccacagccaatatcatgctgaatggacaaaaactggaagcattccctttgaaa
actggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagtt
ctggccagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaagaggaa
gtcaaattgtccctctttgcagatgacatgattgtatatctagaaaaccccattgtctca
gcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaat
gtacaaaaatcacaagcattcttatacaccagcaacagacaaacagagagccaaatcatg
agtgaactcccattcacaattgcttcaaagagaatgaaatacctaggaatccaacttaca
agggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaagag
gatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaa
atggccatactgcccaaggtaatttacagattcaatgccatccccatcaagctaccaatg
actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc
cgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacgctacctgac
ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga
gatatagatcaatggaacagaacagagccctcagaaataacaccacatatctacaactat
ctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaa
tggtgctgggaaaagtggctagccatatgtagaaagctgaaactggatcccttccttaca
ccttatacagaaatcaattcaagacggattaaagacttaaacgttagacctaaagccata
aaaaccctagtagaaaacctaggcattaccattcaggacataggcatgggcaaggacttc
atgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt
aaactcaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctaca
aaatgggagaaaattttcacaacctgctcatctgacaaagggctaatatccagaatctac
aatgaactcaaacaaatttacaagaaaaaaacaaccccatcaaaaagtgggcaaaggaca
tga

>gi568815586f:10209385_10414790|GENSCAN_predicted_peptide_5|218_aa
MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS
KIDHILGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQNCSTTWKLNNLLLNDHWV
HNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNVHKRKQERSKINTLTSQLK
ELEKQEQTHSKASGSQEVTKIRAELKDIETDKTIQKNQ

>gi568815586f:10209385_10414790|GENSCAN_predicted_CDS_5|657_bp
atgggagactttaacaccccactctcaacattagacagatcaacgagacagaaagtcaac
aaggatacccaggaattgaactcagctctgcaccaagcagacctaatagacatctacaga
actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc
aaaattgaccacatacttggaagtaaagctctcctcagcaaatgtaaaagaacagaaatt
ataacaaactatctgtcagaccacagtgcaatcaaactagaactcaggattaagaatctc
actcaaaactgctcaactacatggaaactgaacaacctgctcctgaatgaccactgggta
cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca
acataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta
aatgtccacaagagaaagcaggaaagatctaaaatcaacaccctgacatcacaattaaaa
gaactagagaagcaagagcaaacacattcaaaagctagcggaagtcaagaagtaactaag
atcagagcagaactgaaggacatagagacagacaaaaccattcaaaaaaatcaatga

>gi568815586f:10209385_10414790|GENSCAN_predicted_peptide_6|209_aa
MEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD
IIQENFPNLARQANVQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRV
TLKGKPIRLTVDLSAETLQARREWGPIFDILKEKNFQPRNSYPAKLSLISEGEIKYFTEK
QMLRDFVTTRPALKELLKEVLNMERNNRY

>gi568815586f:10209385_10414790|GENSCAN_predicted_CDS_6|630_bp
atggaagatgagatgaatgaaatgaagcgagaagggaagtttagagaaaaacgaataaaa
agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg
attggtgtacctgaaagtgatggggagaatggaaccaagttggaaaacactctgcaggat
attatccaggagaacttccccaatctagcaaggcaggccaacgttcagattcaggaaata
cagagaacgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga
ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggtt
accctcaaagggaagcccatcagactaacagtggatctctcggcagaaactctacaagcc
agaagagagtgggggccaatattcgacattcttaaagaaaagaattttcaacccagaaat
tcatatccagccaaactaagcctcataagtgaaggagaaataaaatactttacagagaag
caaatgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagtg
ctaaacatggaaaggaacaaccggtactag

>gi568815586f:10209385_10414790|GENSCAN_predicted_peptide_7|480_aa
MNKQRGTYSEVSLAQDPKRQQRKLKGNKISISGTKQEIFQVELNLQNASSDHQGNDKTYH
CKGLLPPPEKLTAEVLGIICIVLMATVLKTIVLIPCIGVLEQNNFSLNRRMQKEMSEFHN
YNLDLKKSDFSTRWQKQRCPVVKSKCRENASPFFFCCFIAVAMGIRFIIMVTIWSAVFLN
SLFNQEVQIPLTESYCGPCPKNWICYKNNCYQFFDESKNWYESQASCMSQNASLLKVYSK
EDQVQQTRSYYRSFGHCLARKVSSPVTACYSCKARPISCMQCPRSDFPPHSPQSTQPVLR
DLCLPTSLPALASPETIAFTHFHTPSLPQDSSLDEMSLSHVPDPVNHFHFVSGDISFIQI
GRPLLLPPATLGWISGQSLGGDQAPLCPYGTGFPALGLYSLPQFLKCWYRPAAPSSVPLR
CQAGCWDVGRAGLYIRVKLSCGTPWMPGLSWPRGPSIAGKGDSKSVISNPDEPPEMYRDI

>gi568815586f:10209385_10414790|GENSCAN_predicted_CDS_7|1443_bp
atgaataaacaaagaggaacctactcagaagtgagtctggcccaggacccaaagaggcag
caaaggaaacttaagggcaataaaatctccatttcaggaaccaaacaggaaatattccaa
gtagaattaaaccttcaaaatgcttcttcggatcatcaagggaatgacaagacatatcac
tgcaaaggtttactgccacctccagagaagctcactgctgaggtcctaggaatcatttgc
attgtcctgatggccactgtgttaaaaacaatagttcttattccttgtattggagtactg
gagcagaacaatttttccctgaatagaagaatgcagaaagagatgagtgaatttcataat
tataacttggatctgaagaagagtgatttttcaacacgatggcaaaagcaaagatgtcca
gtagtcaaaagcaaatgtagagaaaatgcatctccattttttttctgctgcttcatcgct
gtagccatgggaatccgtttcattattatggtaacaatatggagtgctgtattcctaaac
tcattattcaaccaagaagttcaaattcccttgaccgaaagttactgtggcccatgtcct
aaaaactggatatgttacaaaaataactgctaccaattttttgatgagagtaaaaactgg
tatgagagccaggcttcttgtatgtctcaaaatgccagccttctgaaagtatacagcaaa
gaggaccaggtccagcagacccggtcctattacaggagtttcggacactgcttagccagg
aaggtgtcttcccctgtcacagcctgctacagctgcaaagctcgtcctatcagctgtatg
caatgtcctaggtctgatttccccccacactcgcctcagagcacacagcccgtgctaagg
gatctgtgcctccccacgtcactccctgcattggcctctcccgagaccatcgctttcaca
cactttcacacaccttccttgccccaggattcctcattggacgaaatgagcctctctcat
gtcccggatccagtgaaccactttcactttgttagtggggacataagcttcatccaaatt
ggcaggccactcctgctgcccccagccactctgggttggattagtggtcagtccctggga
ggtgatcaagctcccctttgtccctatgggacgggctttcctgccttgggcctatactcc
ttaccacagttcctgaagtgctggtatcgtcctgcagccccatcctcggttccattgcgc
tgccaggcagggtgctgggacgtggggagagctggtctatatatccgggtgaagctcagc
tgtggcacaccttggatgccgggtctctcctggccccggggacctagtatcgcaggcaaa
ggggacagtaaatctgtcatctccaatcctgatgagcccccagaaatgtatcgggatata
taa