GENSCAN 1.0 Date run: 6-Nov-116 Time: 06:16:18 Sequence gi568815590r:66534504_66766917 : 232414 bp : 40.47% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 7095 7369 275 1 2 46 55 187 0.341 7.53 1.02 Intr + 7952 8120 169 1 1 -43 80 173 0.175 2.00 1.03 Intr + 8168 8295 128 0 2 60 40 99 0.044 1.78 1.04 Term + 9317 9448 132 1 0 -30 48 166 0.041 -2.09 1.05 PlyA + 9562 9567 6 1.05 2.04 PlyA - 9611 9606 6 1.05 2.03 Term - 9928 9795 134 2 2 86 55 73 0.540 1.07 2.02 Intr - 10072 10019 54 2 0 95 49 81 0.396 2.93 2.01 Init - 13834 13786 49 1 1 95 95 -4 0.518 2.44 2.00 Prom - 19505 19466 40 -2.95 3.19 PlyA - 21545 21540 6 1.05 3.18 Term - 28761 28684 78 0 0 41 48 124 0.776 0.48 3.17 Intr - 31740 31561 180 0 0 88 80 61 0.930 4.44 3.16 Intr - 32285 32181 105 2 0 64 72 68 0.805 2.29 3.15 Intr - 38093 37979 115 1 1 12 68 59 0.166 -4.17 3.14 Intr - 39003 38861 143 0 2 62 110 123 0.917 10.13 3.13 Intr - 41872 41504 369 1 0 78 72 218 0.046 13.58 3.12 Intr - 45863 45630 234 2 0 71 84 221 0.075 16.96 3.11 Intr - 58041 57937 105 0 0 57 52 148 0.136 7.59 3.10 Intr - 63047 62931 117 2 0 124 88 16 0.116 4.94 3.09 Intr - 64639 64547 93 1 0 87 57 65 0.794 2.54 3.08 Intr - 67266 67195 72 0 0 57 108 33 0.648 0.88 3.07 Intr - 68020 67915 106 0 1 75 76 154 0.995 12.20 3.06 Intr - 78057 78011 47 0 2 54 111 32 0.518 -1.61 3.05 Intr - 78266 78108 159 2 0 41 68 102 0.265 2.76 3.04 Intr - 78442 78316 127 0 1 -18 94 141 0.419 3.86 3.03 Intr - 79261 79030 232 2 1 37 62 213 0.519 9.51 3.02 Intr - 79665 79411 255 1 0 79 59 75 0.231 0.19 3.01 Init - 82364 82262 103 2 1 66 92 14 0.348 0.05 3.00 Prom - 91700 91661 40 -6.05 4.05 PlyA - 92409 92404 6 1.05 4.04 Term - 100869 99998 872 1 2 122 34 583 0.857 48.11 4.03 Intr - 127992 127863 130 0 1 20 83 99 0.177 1.95 4.02 Intr - 128141 128034 108 2 0 74 67 87 0.636 4.76 4.01 Init - 132455 129723 2733 2 0 55 31 2158 0.666 194.78 4.00 Prom - 144208 144169 40 -3.15 5.05 PlyA - 144531 144526 6 1.05 5.04 Term - 156255 156032 224 1 2 91 33 166 0.473 7.50 5.03 Intr - 182075 182003 73 2 1 74 51 48 0.005 -2.24 5.02 Intr - 191969 191837 133 1 1 40 76 92 0.065 2.93 5.01 Init - 209489 209326 164 2 2 65 26 139 0.507 4.65 5.00 Prom - 209856 209817 40 -4.65 6.00 Prom + 213850 213889 40 -2.85 6.01 Sngl + 221560 221760 201 0 0 77 53 140 0.722 4.33 6.02 PlyA + 223117 223122 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 42477 43448 972 2 0 42 48 389 0.827 26.78 S.002 Term - 61254 61050 205 2 1 97 42 84 0.825 0.56 S.003 Intr - 63047 62827 221 2 2 124 44 168 0.880 12.18 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:66534504_66766917|GENSCAN_predicted_peptide_1|234_aa XVTRAFWREHLDLKRTSQWSRRLEHRRGLHTASLTSTGPGQWCSRTEHPIQIQTGHFPIC VNGGQAAALDKSRLLSSVLLPDLKKRPPGVAEQQQQVHGEPALGPACHIPNRVPSGVWRA QRPPVPAGCAFMQRSGRPAPGRRDTLLAASPGLGPHKRRRVLRAWAPSRSGLAPGLTADV GGLALSERKGCKLGTQRGGDRKGNQEGSQRTGEELERSQRPNEGKGQVGSAMSP >gi568815590r:66534504_66766917|GENSCAN_predicted_CDS_1|705_bp natgtaaccagggccttctggagagagcacctagacctgaagaggacaagtcagtggagc agacgcttagaacatagaaggggccttcatacagcttccctcacatctactggcccagga cagtggtgttctagaacagaacaccccattcaaatccaaacggggcatttcccaatctgt gtaaacggtggtcaagctgcagccctagataaatcacgcttactatcatcagttctgctg ccagatctgaagaaacggccaccaggggtggcagagcagcagcagcaggtccacggggag cctgccctcggtcctgcttgtcacatccctaatcgcgtcccatctggtgtctggcgtgca caaaggccgcccgttccggcgggttgtgcatttatgcagcgcagcggccgccccgcacca ggccgacgcgacacacttctggcagcctctccagggctcgggccacacaaaaggcgccgg gtgctgcgggcttgggccccctcccgctccgggctggcaccgggactgacggctgatgtg ggtggactggccctcagcgagagaaagggatgcaagttaggcacccaacgaggaggagac aggaaaggaaaccaagagggcagccagagaaccggagaagagctagaaagatcccagagg ccgaatgaaggaaagggtcaagtgggctctgccatgagcccctaa >gi568815590r:66534504_66766917|GENSCAN_predicted_peptide_2|78_aa MTEFKCYMTFLQEHLTDLKQSLSMAARSYPVPVGDKIVTTSQKYLIAPACQHNPPLPAGP TLIKFPAEVPTYTLYSAS >gi568815590r:66534504_66766917|GENSCAN_predicted_CDS_2|237_bp atgacggagtttaagtgctatatgaccttcctccaggaacacttgactgacctaaagcag tccctcagcatggctgcacgatcctaccccgtgcccgtgggcgacaaaatagtgacaacc agccagaaatacctcattgcccctgcctgccagcacaaccccccacttcccgctgggcct acactcatcaagttccctgcagaggtgcccacctatactctctactctgcttcctga >gi568815590r:66534504_66766917|GENSCAN_predicted_peptide_3|879_aa MGQCNGTRWMSCIEWICATVEKHSTWESTAFIVSKGRRSPSTLNYHAHSSGRNAQSTVFF TRYSPGQGSSAAHVISCRAGTKLSTLKTSGRHRLSPVPSTSPARNFFHLSLFCCFLRKTV RRKTPKEPGAVAEARLRRRSLGGPRDHPPRQKLGSLTLRILGQLDFSTARFRTRSGAASG PTPRRAQELALPAGPARERGKRLEDRGEGAFAGGWRRLTRVPAQPAPMRYLKDGEEVAQI LFPIPLPPPAIRPSLARAEVFLPASPRPLGLDDRRAAATCGELGTRVFEARDISDLMLPL LRGRFLASEDEDDDLQYADHDYEVPQQKGLKKLWNRVKWTRDEDDKLKKLVEQHGTDDWT LIASHLQNRSDFQCQHRWQKVLNPELIKGPWTKEEDQRVIELVQKYGPKRWSLIAKHLKG RIGKQCRERWHNHLNPEQPFIDEDPDKEKKIKELEMLLMSAENEVRRKRIPSQPGSFSSW SGSFLMDDNMSNTLNSLDEHTSEFYSMDENQPVSAQQNSPTKFLAVEANAVLSSLQTIPE FAETLELIESDPVAWSDVTSFDISDAAASPIKSTPVKLMRIQHNEGAMECQFNVSLVLEG KKNTCNGGNSEAVPLTSPNIAKFSTPPAILRKKRKMRVGHSPGSELRDGSLNDGGNMALK HTPLKTLPFSPSQFFNTCPGNEQLNIENPSFTSTPICGQKALITTPLHKETTPKDQKENV GFRTPTIRRSILGTTPRTPTPFKNALAAQEKKYGPLKIVNTASGKKVRKSLVLDNWEKEE SGTQLLTEDISDMQSENRFTTSLLMIPLLEIHDNRCNLIPEKQDINSTNKTYTLTKKKPN PNTSKVVKLEKNLQYAAIENSRIGLLLREKERATTTCTV >gi568815590r:66534504_66766917|GENSCAN_predicted_CDS_3|2640_bp atggggcaatgtaatggtaccagatggatgtcatgcatagagtggatttgtgctacagtt gagaaacactcaacttgggaatccactgcttttatagtaagcaagggaagaaggtcacct tctacacttaactaccatgctcattcatctggacgaaatgcccagtccactgtcttcttc acgcgttattctccaggacagggctcctccgccgcgcacgttatttcctgtagagcgggc acgaaattaagcaccttgaagaccagcggtagacacagattatctccggttccaagcacg tctcccgctaggaacttttttcacttgtcactcttctgctgctttttgaggaaaacagtg aggcggaagacacctaaggagccaggagctgtcgccgaggcgaggctgcggagacgttcc cttggtgggccgcgggaccacccaccccgccagaaactcggttcgctcaccctaagaatt ttggggcaactggacttctccaccgcccgcttccggacccgctccggcgccgcttccgga cccactcctcggcgagcgcaggaactcgctctcccagcaggccccgcgcgggagcgaggg aagcggctagaggatcggggagaaggagcattcgccggaggctggaggaggctgacccgc gtccccgcccagcctgctcctatgcggtacttgaaggatggcgaagaggtcgcgcagatt ctctttcctatcccgctgccccctcccgccataaggccctccctggcgagggcagaggtt tttcttcccgcgtccccgcggccgcttggcctcgatgaccggagggcagctgcgacgtgt ggagagttggggaccagggtgtttgaggcgcgggacatttctgacctcatgctgccactt cttcgaggcaggtttcttgccagtgaggatgaggatgatgaccttcagtatgccgatcat gattatgaagtaccacaacaaaaaggactgaagaaactctggaacagagtaaaatggaca agggacgaggatgataaattaaagaagttggttgaacaacatggaactgatgattggact ctaattgctagtcatcttcaaaatcgctctgattttcagtgccagcatcgatggcagaaa gttttaaatcctgaattgataaagggtccttggactaaagaagaagatcagagggttatt gaattagttcagaaatatgggccaaaaagatggtctttaattgcaaaacatttaaaagga agaataggcaagcagtgtagagaaagatggcataatcatctgaatcctgagcaacccttc attgatgaagatcctgataaggaaaagaaaataaaggaacttgagatgcttcttatgtca gctgagaatgaagttagaagaaagcgaattccatcacagcctggaagtttttctagctgg tctggtagtttcctcatggatgataacatgtctaatactctaaatagccttgacgagcac actagtgagttttacagtatggatgaaaatcagcctgtgtctgctcagcagaattcaccc acaaagttcctggccgtggaggcaaacgctgtgttatcctctttgcagaccatcccagaa tttgcagagactctagaacttattgaatctgatcctgtagcatggagtgacgttaccagt tttgatatttctgatgctgctgcttctcctatcaaatccaccccagttaaattaatgaga attcagcacaatgaaggagccatggaatgccaatttaacgtcagtcttgtacttgaaggg aaaaaaaacacttgtaatggtggcaacagtgaagctgttcctttaacatccccaaatata gccaagtttagcactccaccagccatcctcagaaagaagagaaaaatgcgagtgggtcat tccccaggcagcgaacttagggatggctcattgaacgatggtggtaatatggcgctaaaa catacaccactgaaaacactaccattttctccttcacagtttttcaacacatgtcctggt aatgaacaacttaatatagaaaatccttcatttacatcaacccctatttgtgggcagaaa gctctcattacaactcctcttcataaggaaacaactcccaaagatcaaaaggaaaatgta gggtttagaacacctactattagaagatctatactgggtaccacaccaagaactcctact ccttttaagaatgcgcttgctgctcaggagaaaaaatatggacctcttaaaattgtgaat accgcttctgggaagaaagtcagaaaatcactagtcttagataattgggaaaaagaagaa tcaggcactcaactgttgactgaagacatttcagacatgcagtcagaaaatagatttact acatccttattaatgataccattattggaaatacatgacaataggtgcaacttgattcct gaaaaacaagatataaattcaaccaacaaaacatatacacttactaaaaagaaaccaaac cctaacacttccaaagttgtcaaattggaaaagaatcttcagtatgctgctatagaaaat agcagaatcggcttgctgctacgagagaaggaaagagcgaccaccacttgcactgtgtga >gi568815590r:66534504_66766917|GENSCAN_predicted_peptide_4|1280_aa MSQPPPPPPPLPPPPPPPEAPQTPSSLASAAASGGLLKRRDRRILSGSCPDPKCQARLFF PASGSVSIECTECGQRHEQQQLLGVEEVTDPDVVLHNLLRNALLGVTGAPKKNTELVKVM GLSNYHCKLLSPILARYGMDKQTGRAKLLRDMNQGELFDCALLGDRAFLIEPEHVNTVGY GKDRSGSLLYLHDTLEDIKRANKSQECLIPVHVDGDGHCLVHAVSRALVGRELFWHALRE NLKQHFQQHLARYQALFHDFIDAAEWEDIINECDPLFVPPEGVPLGLRNIHIFGLANVLH RPIILLDSLSGMRSSGDYSATFLPGLIPAEKCTGKDGHLNKPICIAWSSSGRNHYIPLVG IKGAALPKLPMNLLPKAWGVPQDLIKKYIKLEEDGGCVIGGDRSLQDKYLLRLVAAMEEV FMDKHGIHPSLVADVHQYFYRRTGVIGVQPEEVTAAAKKAVMDNRLHKCLLCGALSELHV PPEWLAPGGKLYNLAKSTHGQLRTDKNYSFPLNNLVCSYDSVKDVLVPDYGMSNLTACNW CHGTSVRKVRGDGSIVYLDGDRTNSRSTGGKCGCGFKHFWDGKEYDNLPEAFPITLEWGG RVVRETVYWFQYESDSSLNSNVYDVAMKLVTKHFPGEFGSEILVQKVVHTILHQTAKKNP DDYTPVNIDGAHAQRVGDVQGQESESQLPTKIILTGQKTKTLHKEELNMSKTERTIQQNI TEQASVMQKRKTEKLKQEQKGQPRTVSPSTIRDGPSSAPATPTKAPYSPTTSKEKKIRIT TNDGRQSMVTLKSSTTFFELQESIAREFNIPPYLQCIRYGFPPKELMPPQAGMEKEPVPL QHGDRITIEILKSKAEGGQSAAAHSAHTVKQEDIAVTGKLSSKELQEQAEKEMYSLCLLA TLMGKMNLILMVKNRKEKIFLNRRWTLENVALTFILVCTEPDNSAQQMTLTQLKKGCICR NGMDGQSALNIEKSEKQNSAMFIYCCTKEKGMADGKHCTFPHLPGKTFVYNASEDRLELC VDAAGHFPIGPDVEDLVKEAVSQVRAEATTRSRESSPSHGLLKLGSGGVVKKKSEQLHNV TAFQGKGHSLGTASGNPHLDPRARETSVVRKHNTGTDFSNSSTKTEPSVFTASSSNSELI RIAPGVVTMRDGRQLDPDLVEAQRKKLQEMVSSIQASMDRHLRDQSTEQSPSDLPQRKTE VVSSSAKSGSLQTGLPESFPLTGGTENLNTETTDGCVADALGAAFATRSKAQRGNSVEEL EEMDSQDAEMTNTTEPMDHS >gi568815590r:66534504_66766917|GENSCAN_predicted_CDS_4|3843_bp atgtctcagccgccgccgccgccgcctccgttgccgccgccacctcctccccctgaggct ccacagactccgtcgtccttggcgtcggcggctgcttcgggggggcttttgaagcggaga gaccggagaatcctttccgggagctgcccggatccgaagtgtcaggcgcgtctatttttc ccggcctccggttctgtcagcatcgagtgtaccgagtgcggccagcggcacgagcagcaa cagctgctgggggttgaggaggtgaccgacccggacgtagtgctacacaacctgctgcgg aacgcgctgctcggggttacgggggcacccaagaagaacacggaactggtaaaggtgatg ggcctttccaactatcactgcaaattgttgtcgcccatattagctcgctatggaatggac aaacagacaggccgggccaagcttctccgggacatgaaccagggcgaactgttcgattgc gccttactgggtgaccgcgccttcctcatagaaccagagcatgttaacactgtgggctat ggcaaggaccgctccggaagcctcctgtatttgcatgacactctggaggacattaagcgg gccaataaaagccaggaatgtctcattccagtgcatgtggacggggatggacactgcttg gtgcatgctgtgtctcgggctctagtaggccgagagctcttctggcatgccttaagagag aatcttaaacagcactttcagcagcacctggcccgatatcaagctctgttccatgacttc attgatgctgctgagtgggaggacattatcaatgagtgtgaccctctgtttgtaccacct gagggtgttcccttgggcctgaggaatatccacatatttggtcttgccaatgtgctacat cgtcctattattctgttagattccctcagtggcatgagaagctctggtgattattcagcc acctttctacctgggctcatccctgcagagaagtgcactgggaaagatggtcatttgaac aaaccaatctgtattgcatggagcagctccggtagaaaccattatatccccttggtaggc ataaaaggggctgctttgcccaaactgcctatgaatttgcttcctaaagcatggggtgtg cctcaggaccttattaaaaagtacataaaacttgaagaggatggtggttgtgttattgga ggtgacagaagtttgcaagataaatacttacttaggcttgttgctgctatggaagaagtc tttatggacaaacatggtatccatcctagtttggttgctgatgtccatcagtatttctac agaaggactggagtgataggagttcagcctgaggaagtcacagcagctgctaaaaaagca gtaatggataatcgccttcacaaatgtttgctctgtggtgccctttctgaacttcatgtt cctccagagtggttggctcctggagggaaattgtataacctggcaaaaagtactcatgga cagctgaggactgacaaaaattacagctttcccttgaacaatttggtttgctcatatgat tcagtgaaagatgttctggtaccagactatggaatgagtaacctaacagcttgtaattgg tgccatggcacatctgtgcgaaaggtcagaggagatgggtctattgtgtatttggatgga gacagaactaattctaggtccactggtggcaaatgtggttgtggattcaaacacttttgg gatggtaaggagtatgacaatctaccagaagctttccctattactttagaatggggtgga agagtggtcagagaaacagtatattggttccagtatgaaagtgattcatctttgaatagt aatgtttacgatgttgcaatgaaacttgttaccaagcactttccaggtgaatttgggagt gaaatcctagttcagaaagttgtccacactatattgcatcagactgccaaaaagaatccc gatgattatactcctgtaaatatagatggtgctcacgcccaaagagttggagatgttcaa ggacaagaatcagagtctcagctcccaactaaaattattcttactggacagaaaacaaaa actttgcacaaggaggagttaaacatgagtaaaactgaaagaactattcaacagaatatt acggaacaggcttctgtaatgcagaaacggaaaacagagaagttaaaacaagaacaaaaa gggcaacccaggactgtttctcccagtaccattcgtgatggtccatcctctgcacctgct acacctaccaaggctccctattcaccgacaacttctaaggagaagaagatccgaatcaca actaatgatggacgacagtccatggttacccttaagtcttcaacaaccttttttgaactt caggaaagtatagccagagaattcaacattcctccatatttacagtgtattcgatacggg tttcctcctaaagagttaatgccaccacaggcaggaatggaaaaggaaccagttccttta cagcatggcgacagaattacaatagaaattctaaaaagtaaagctgaaggtggtcagtct gctgcagcacactcagcccacactgtgaaacaagaagatattgctgttactggtaaactg tcatctaaggaacttcaggagcaagctgaaaaagaaatgtactccttgtgtcttttagca acattaatgggtaagatgaatttaattctgatggtcaaaaacaggaaagagaagatcttc ctcaatagaagatggactcttgagaatgtggctcttacctttatccttgtatgcacagaa cctgataatagtgctcagcagatgacattaacacagctaaagaaaggctgtatctgcaga aatggaatggatggtcagtcagcactgaacattgaaaaaagtgaaaagcagaattcggca atgttcatatactgctgtacaaaggaaaaaggtatggctgatggcaagcattgtactttt ccacatctgcctggcaaaacctttgtctataatgcttctgaagatagactggaattgtgt gtggatgctgcaggacatttccccattggtcctgatgttgaagatttagttaaagaggct gtaagtcaggttcgagcagaggctactacaagaagtagggaatcaagtccctcacatggg ctattaaaactaggtagtggtggagtagtgaaaaagaaatctgagcaacttcataacgta actgcctttcagggaaaagggcattctttaggaactgcatctggtaacccacaccttgat ccaagagctagggaaacttcagttgtaagaaagcataatacagggacagactttagtaat agttccactaaaacagagccttctgtattcacagcttcttctagtaatagtgagcttatt cgaatagctcctggagtagtaacaatgagagacggcaggcagcttgatcctgatttggtt gaggcccagcgaaaaaaattgcaggaaatggtttcttctattcaggcttcaatggacagg caccttcgggatcaaagtacagagcagtcaccatctgatcttcctcaaaggaaaacagaa gttgtgagttcttctgcaaagtctgggagtcttcagactggtttgcctgaatcttttcct ttaactggtggtactgaaaatttgaatacagaaacaactgatggctgtgtagcagatgca ctgggagcagcctttgccacaaggtcaaaagcacaaaggggaaattccgtggaggagctt gaagagatggatagtcaagatgctgagatgactaacacaactgagccaatggatcactct tga >gi568815590r:66534504_66766917|GENSCAN_predicted_peptide_5|197_aa MTTVQQHPRLEQKDRGLLEEGSSREMELKDSSLLDLTVENCMARLLQGLGRIRDKWYYGL FVNKFCLFFRMRGFERSQLLPPYQLTPGNLMVKPERPIQSPPTPPPKESRFCFSVIEEQW PPGGQKASRSWFVNAGGRQETSTPEKKKFIIHSTGKKKKPCPSECLYQFLLLPSPQFHRS DMHGPREWVCIAADEQQ >gi568815590r:66534504_66766917|GENSCAN_predicted_CDS_5|594_bp atgacaactgtgcagcagcatcccagattggaacaaaaggacagaggactcctggaggaa ggttccagcagggaaatggagctgaaagattccagtctgctagatttgactgtggaaaat tgtatggcaaggctgctacagggcttaggaagaattagagacaaatggtactatgggtta tttgttaataaattctgcctgttctttcggatgaggggctttgagaggagtcagctttta cccccttatcagttaacaccggggaacctcatggtaaagccagagagacctattcagtca cccccaacccccccgcccaaagaaagcagattttgctttagcgtaatagaagaacagtgg ccaccaggaggtcaaaaagctagccgttcctggtttgtgaatgctggaggaagacaggag acatctacgccagagaaaaagaagtttattattcacagtacaggaaaaaaaaaaaagcca tgcccatcagaatgtttgtatcagtttctcttgctcccaagtccccagttccacaggagc gacatgcatgggcctagagagtgggtttgcattgcagctgacgaacaacaataa >gi568815590r:66534504_66766917|GENSCAN_predicted_peptide_6|66_aa MRSYGWALIQYDHCSYKKRKFGLSYVHKEDAVRTQGEGSRLQAKERRLRRDQLWWHIDLR LVPPEL >gi568815590r:66534504_66766917|GENSCAN_predicted_CDS_6|201_bp atgaggtcctatgggtgggccctaattcagtatgaccattgttcttataagaagaggaaa tttggacttagctacgtacacaaggaagatgctgtgaggacacagggagaaggcagccgt ctacaagccaaggagaggaggctcaggagagaccagctctggtggcacattgatctcaga cttgtgcctccagaattgtga