GENSCAN 1.0 Date run: 4-Nov-116 Time: 14:00:09 Sequence gi568815585f:36332622_36542662 : 210041 bp : 38.99% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init - 3209 2400 810 2 0 92 115 536 0.766 51.45 1.00 Prom - 17721 17682 40 -4.65 2.00 Prom + 24054 24093 40 -4.55 2.01 Init + 27698 27748 51 1 0 74 94 35 0.197 3.91 2.02 Intr + 32879 32983 105 1 0 18 50 116 0.106 0.29 2.03 Intr + 33109 33222 114 0 0 102 94 11 0.169 2.82 2.04 Intr + 50467 50610 144 0 0 100 61 31 0.022 1.16 2.05 Intr + 72473 72540 68 1 2 54 92 57 0.149 -0.52 2.06 Term + 75532 75838 307 1 1 25 43 747 0.200 57.50 2.07 PlyA + 76527 76532 6 1.05 3.00 Prom + 93385 93424 40 -5.05 3.01 Init + 94029 94106 78 2 0 62 101 39 0.732 3.71 3.02 Intr + 97989 98083 95 2 2 30 75 109 0.550 1.64 3.03 Term + 98216 98450 235 2 1 77 39 227 0.812 11.51 3.04 PlyA + 98468 98473 6 -1.95 4.00 Prom + 98603 98642 40 -9.45 4.01 Init + 99560 99661 102 1 0 116 51 73 0.769 6.34 4.02 Intr + 100412 100600 189 1 0 77 95 244 0.963 22.86 4.03 Intr + 105008 105254 247 1 1 53 93 277 0.947 20.81 4.04 Intr + 105446 105570 125 0 2 52 100 71 0.580 4.08 4.05 Intr + 106023 106246 224 2 2 58 88 283 0.995 21.30 4.06 Intr + 107358 107562 205 0 1 86 93 244 0.997 22.98 4.07 Intr + 108497 108610 114 1 0 63 86 55 0.846 2.52 4.08 Term + 110472 110582 111 2 0 129 47 62 0.739 3.78 4.09 PlyA + 111892 111897 6 -0.45 5.05 PlyA - 112039 112034 6 1.05 5.04 Term - 114318 114206 113 1 2 117 39 41 0.535 -0.16 5.03 Intr - 115350 115155 196 0 1 101 97 87 0.631 8.97 5.02 Intr - 122550 122379 172 2 1 70 80 53 0.240 1.72 5.01 Init - 134422 134304 119 1 2 64 39 122 0.546 4.62 5.00 Prom - 136168 136129 40 -3.65 6.03 PlyA - 136925 136920 6 1.05 6.02 Term - 138746 137981 766 1 1 7 41 321 0.281 11.19 6.01 Init - 140524 140409 116 1 2 58 44 127 0.157 5.03 6.00 Prom - 154060 154021 40 -3.65 7.02 PlyA - 154825 154820 6 1.05 7.01 Sngl - 155839 154958 882 1 0 42 43 292 0.947 15.87 7.00 Prom - 155998 155959 40 -11.44 8.04 PlyA - 156046 156041 6 -0.45 8.03 Term - 157357 156218 1140 1 0 0 42 500 0.468 27.55 8.02 Intr - 157660 157444 217 0 1 8 98 165 0.353 7.08 8.01 Init - 158860 158781 80 1 2 47 75 69 0.282 2.08 8.00 Prom - 161012 160973 40 -4.55 9.04 PlyA - 161300 161295 6 1.05 9.03 Term - 185487 185390 98 1 2 76 47 146 0.698 6.45 9.02 Intr - 186685 186547 139 1 1 48 50 69 0.009 -1.78 9.01 Init - 207298 207245 54 1 0 103 70 70 0.553 8.04 9.00 Prom - 207845 207806 40 -2.15 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 24750 25020 271 0 1 55 38 198 0.813 5.47 S.002 Sngl + 90763 90927 165 0 0 46 41 172 0.822 2.73 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:36332622_36542662|GENSCAN_predicted_peptide_1|270_aa MEQEPQNGEPAEIKIIREAYKKAFLFVNKGLNTDELGQKEEAKNYYKQGIGHLLRGISIS SKESEHTGPGWESARQMQQKMKETLQNVRTRLEILEKGLATSLQNDLQEVPKLYPEFPPK DMCEKLPEPQSFSSAPQHAEVNGNTSTPSAGAVAAPASLSLPSQSCPAEAPPAYTPQAAE GHYTVSYGTDSGEFSSVGEEFYRNHSQPPPLETLGLDADELILIPNGVQIFFVNPAGEVS APSYPGYLRIVRFLDNSLDTVLNRPPGFLQ >gi568815585f:36332622_36542662|GENSCAN_predicted_CDS_1|810_bp atggagcaagagccacaaaatggagaacctgctgaaattaagatcatcagagaagcatat aagaaggcctttttatttgttaacaaaggtctgaatacagatgaattaggtcagaaggaa gaagcaaagaactactataagcaaggaataggacacctgctcagagggatcagcatttca tcaaaagagtctgaacacacaggtcctgggtgggaatctgctagacagatgcaacagaaa atgaaagaaactctacagaatgtacgcaccaggctggaaattctagagaagggtcttgcc acttctctgcagaatgatcttcaggaggtgcccaagttatatccagaatttccacctaaa gacatgtgtgaaaaattaccagagcctcagtcttttagttcagctcctcagcatgctgaa gtaaatggaaacacctcaactccaagtgcaggggcagttgctgcacctgcttctctgtct ttaccatcacaaagttgtccagcagaagctcctcctgcttatactcctcaagctgctgaa ggtcactacactgtatcctatggaacagattctggggagttttcatcagttggagaggag ttttataggaatcattctcagccaccgcctcttgagaccttagggctggatgcagatgaa ttgattttgataccaaatggagtacagattttttttgtaaatcctgcaggggaggttagt gcaccttcgtatcctgggtaccttcgaattgtgaggtttttggataattctctcgatacg gttctaaaccgtcctcccgggtttcttcag >gi568815585f:36332622_36542662|GENSCAN_predicted_peptide_2|262_aa MSNEISSSRMMNVRATLITRKAKSLFLDFFSYKPPIDTYPLLYGAQMKDIGHLQHLAQRL QWNRRAVNICGMHGWTKDLSPHSRMPSMLEKQCCSKGRLSSIDSIYLDIIPSSSRIFSDD TELLARDWCSHSWTTEFQACCNDSLATAYVHWRLWGSTISRVLSDKSVQKKKKEEKEKEG KGKKKRKKKKRKKKKRKKKKRNKKEEEKKKKEEEEEVEEEEGEGEGEEEEEEEEEEEEEE EEEEEEEEEEEEELKKQSSSFC >gi568815585f:36332622_36542662|GENSCAN_predicted_CDS_2|789_bp atgtcaaatgaaatcagcagttctagaatgatgaatgttagagctaccctaattactagg aaagccaagtctctcttcctggatttcttctcctataaaccgcctattgatacctaccca cttctctatggagcacagatgaaggatattggtcatctccagcatctagcacaacgtctg caatggaacaggcgagctgtgaatatttgtggaatgcatgggtggactaaagacctatca cctcactctagaatgcccagcatgttggagaaacagtgttgttcaaaagggagactttct agtatagatagcatctatttagatatcattcctagcagcagcaggattttctctgatgat acagagctgttagctagagattggtgtagtcactcatggaccactgagtttcaggcctgc tgtaatgactccctggctactgcctatgttcactggaggctctggggctctacaatcagc agggtgcttagtgataaatcagttcagaagaagaagaaggaggagaaggagaaggagggg aaggggaagaagaagaggaagaagaagaagaggaagaaaaagaagaggaagaagaagaag aggaataagaaggaggaggagaagaagaagaaagaagaggaagaagaggtagaagaagaa gaaggagaaggagaaggagaggaagaagaagaagaagaagaagaagaagaagaagaagaa gaagaagaagaagaagaagaggaagaagaagaagaagaattaaagaaacaaagctctagc ttttgctga >gi568815585f:36332622_36542662|GENSCAN_predicted_peptide_3|135_aa MEYYSAIKKNEILSFATGWMELEIIMRLRLLAGLAVESQTELATESFSDPGNNNTVGRCY QNSRVSFIFKGTGNLGCKRTAVSQGIEVCDKRIEMSFMLNKIRFCVPPSSSLDSPDAYQL LNIYLNCKAFNRRFC >gi568815585f:36332622_36542662|GENSCAN_predicted_CDS_3|408_bp atggaatactattcagctataaaaaagaatgagatcctgtcatttgcaacaggatggatg gaactggagatcattatgcgactcaggctcttggcaggacttgctgtagagtcccagacg gagttagccactgagtccttcagcgacccaggaaataacaacactgttggcagatgttac cagaattcccgtgtttccttcatattcaaaggcacaggaaatctggggtgcaagcggact gctgtctctcaaggaatagaagtctgtgacaaacgtatagaaatgagtttcatgttgaac aagatccgtttctgcgttcctccctcatccagccttgattctccagatgcttaccaactt cttaatatttacctcaactgcaaggctttcaaccggcgcttttgttag >gi568815585f:36332622_36542662|GENSCAN_predicted_peptide_4|438_aa MAMRPRRAHACRGRHGNAPARSGGAADWPIQQTRQQPVESEAMHCSNPKSGVVLATVARG PDACQILTRAPLGQDPPQRTVLGLLTANGQYRRTCGQGITRIRCYSGSENAFPPAGKKAL PDCGVQEPPKQGFDIYMDELEQGDRDSCSVREGMAFEDVYEVDTGTLKSDLHFLLDFNTV SPMLVDSSLLSQSEDISSLGTDVINVTEYAEEIYQYLREAEIRHRPKAHYMKKQPDITEG MRTILVDWLVEVGEEYKLRAETLYLAVNFLDRFLSCMSVLRGKLQLVGTAAMLLASKYEE IYPPEVDEFVYITDDTYTKRQLLKMEHLLLKVLAFDLTVPTTNQFLLQYLRRQGVCVRTE NLAKYVAELSLLEADPFLKYLPSLIAAAAFCLANYTVNKHFWVARHRLVKEGRDSVDRWV KRFEYALPALASLYHPSQ >gi568815585f:36332622_36542662|GENSCAN_predicted_CDS_4|1317_bp atggcgatgcggccccggagagcgcacgcctgccgcggtcggcatggaaacgctcccgct aggtccgggggcgccgctgattggccgattcaacagacgcggcagcagcccgtggagtct gaagcaatgcactgcagcaaccccaagagtggagttgtgctggctacagtggcccgaggt cccgatgcttgtcagatactcaccagagccccgctgggccaggatcccccgcagaggaca gtgctagggctgctaactgcaaatgggcagtacaggaggacctgtggccaggggatcaca agaatcaggtgttattctggatcagaaaatgccttccctccagctggaaagaaagcactc cctgactgtggggtccaagagccccccaagcaagggtttgacatctacatggatgaacta gagcagggggacagagacagctgctcggtcagagaggggatggcatttgaggatgtgtat gaagtagacaccggcacactcaagtcagacctgcacttcctgctggatttcaacacagtt tcccctatgctggtagattcatctctcctctcccagtctgaagatatatccagtcttggc acagatgtgataaatgtgactgaatatgctgaagaaatttatcagtaccttagggaagct gaaataaggcacagacccaaagcacactacatgaagaagcagccagacatcacggaaggc atgcgcacgattctggtggactggctggtggaggttggggaagaatataaacttcgagca gagaccctgtatctggctgtcaacttcctggacaggttcctttcatgtatgtctgttctg agagggaaactgcagctcgtaggaacagcagctatgcttttggcttcgaaatatgaagag atatatcctcctgaagtagacgagtttgtctatatcaccgatgatacatacacaaaacga caactgttaaaaatggaacacttgcttctgaaagttctagcttttgatctgacagtacca accaccaaccagtttctccttcagtacttgaggcgacaaggagtgtgcgtcaggactgag aacctggctaagtacgtagcagagctgagtctacttgaagcagatccattcttgaaatat cttccttcactgatagctgcagcagctttttgcctggcaaactatactgtgaacaagcac ttttgggtagctcgccatagattagtaaaggaaggaagggacagcgtagacagatgggtg aagagatttgaatatgcgctgcctgctctagcttcactctaccatcctagtcagtaa >gi568815585f:36332622_36542662|GENSCAN_predicted_peptide_5|199_aa MEPQKTQDRQSLLEEKLRAEDITLHDFKLYYKAVVAKTACSKQELHVGLSKATQLELNCD FIQTLQENWAIIGTDNKYTSRRKDKRQKGVKQNKEAQHLTQTSIQCQFEARAAEHRKHPG RTFLSCLGLTPSRAARRSKATHRHSPAACRCRAQASTLSFNVSLLPHYSGSLSTDHLVFQ NSRSSPGITDEGHIKQFAA >gi568815585f:36332622_36542662|GENSCAN_predicted_CDS_5|600_bp atggaaccacaaaagactcaggacaggcaaagccttcttgaggaaaagctccgagctgaa gacatcacattacatgacttcaaattatactacaaagctgtagtagccaaaacagcatgc agcaagcaggagcttcatgtggggctgtcgaaagctacacagctggagctaaactgtgat tttattcagactctgcaagagaattgggcaattataggcacagataataaatacacctcc aggagaaaggataaaagacaaaagggggtcaaacaaaataaagaagcacagcacctgacc cagacgtctatccaatgtcagtttgaggctagggcagcggagcacagaaagcatcctggt cgtactttcctgtcatgcttgggtctaactccatcccgagctgctcgcagaagcaaggcg acacataggcacagcccagcagcttgtaggtgcagagcccaagcttccactctgagcttc aatgtaagccttctccctcattactctggctcactgagcactgaccatttagtttttcaa aactcacgatcaagtccaggaatcactgatgaaggacatattaaacaatttgctgcctaa >gi568815585f:36332622_36542662|GENSCAN_predicted_peptide_6|293_aa MELKTMSQELADECTNFSSQFNQLEERVPVIEDQMNEMKEARAHSKASRRQEITKIREEL KEIETQKTLQKINESRSWFFEKINKIDRLLARLIKKKREKNQIDAMKNNKGDITTNPTEI QTTIKEYYKHLYANELENLEEMDKFLDAYTLPRLNQEEVEFLNRPITVSEIEVIINSLPT KKSPGPDGFLAEFYQRYKEELVPFFLKPFQSREKEGILPNSFYEASIILIPKPGRDTTKK KENFRPISLMNIDAKILSKIRQTESSSTSKSLSTMIKWASSLGCKAGSKYANQ >gi568815585f:36332622_36542662|GENSCAN_predicted_CDS_6|882_bp atggagctgaaaaccatgtcacaagaactagctgatgaatgcacaaacttcagtagccaa ttcaatcaactggaagaaagggtaccagtgattgaagatcaaatgaatgaaatgaaagaa gcaagagcacattcaaaagctagcagaaggcaagaaataactaagatcagagaagaactg aaagagatagagacacaaaaaacccttcaaaaaatcaatgaatccaggagctggtttttt gaaaagatcaacaaaattgatagactgctagcaagactaataaagaagaaaagagagaag aatcaaatagatgcaatgaaaaataataagggggatatcaccaccaatcccacagaaata caaactaccatcaaagaatactataaacacctctatgcaaatgaactagaaaatctagaa gaaatggataaattcctggacgcatacactctcccaagactaaaccaggaagaagttgaa ttcctgaatagaccaataacagtctctgaaattgaggtaataattaatagcctaccaacc aaaaaaagtccaggaccagatggattcttagctgaattctaccagaggtacaaggaggaa ctggtaccattctttctgaaaccattccaatctagagaaaaagagggaatcctccctaac tcattttatgaggccagcatcatcctgataccaaagcctggcagagacacaacaaaaaaa aaagagaatttcagaccaatatccctgatgaacatcgatgcaaaaatcctcagtaaaata cggcaaactgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttca tccctgggatgcaaggctggttcaaaatatgcaaatcaataa >gi568815585f:36332622_36542662|GENSCAN_predicted_peptide_7|293_aa MIISIDAEKAFDKIQQRFMLKTLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLK TGTRQGCPLSPLLFNVVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVS AQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQLMSKLPFTIASKRIKYLGIQLT RDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIMKMAILPKVIYRFNAIPIKLPM TFFTELEKTTLKFIWNQKRAHIAKSILSQKNKAGGITLLDFKLYCKPTVTKTA >gi568815585f:36332622_36542662|GENSCAN_predicted_CDS_7|882_bp atgattatctcaatagatgcagaaaaggcctttgacaaaattcaacaacgcttcatgcta aaaactctcaataaattaggtattgatgggacatatctcaaaataataagagctatctat gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa actggcacaagacagggatgccctctctcaccactcctattcaacgtagtgttggaagtt ctggccagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaagaggaa gtcaaattgtccctgtttgcagatgacatgattgtatatctagaaaaccccattgtctca gcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaat gtacaaaaatcacaagcattcctatacaccaataacagacaaacagagagccaactcatg agtaaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttaca agggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaagag gatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcatgaaa atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagttaccaatg actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc cacatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacgctacttgac ttcaaactatactgcaagcctacagtaaccaaaacagcatga >gi568815585f:36332622_36542662|GENSCAN_predicted_peptide_8|478_aa MNDQHVSPPRASTSVSYKGNGGDPLIYGVGVLITQILELAGNGTCDNKKTCIILHHLQLA SCNNEDVNKLLGKVIITQVTVLPNIQAKMLPKKTESHLKIKGDKEGHYIMVKGSIQQEEL TILNIYAPNTGAPRFIKQVLSDLQRDLDSHTIIMGDFNTPLSTLDRSTRQKVNKDTQELN SALHQADLIDIYRTLHPKSTEYTFFSAPHHTYSKIDHIVGSKALLSKCKRTEIITNCLSD HSAIKLELRIKKLTQNRSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNENKDTTYQNLWD TFKAVCRGKFIALNAHKRKQERSEIDTLTSQLKELEKQEQTHSKASRGQEITKIRVELKE IETQKTLQKINESRSWFFERINKIDRPLARLIKKKREKNKVDTIKNDKGDITTNPTETQT TIREYYKHLYANKLENLQEMDKFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPT >gi568815585f:36332622_36542662|GENSCAN_predicted_CDS_8|1437_bp atgaatgaccagcatgtcagtcctccaagagcatcaactagtgtgagttacaaggggaat ggaggggaccctttaatctatggtgttggagtgctgattactcagattctggagttggca ggcaatgggacctgtgacaacaagaagacatgtatcatcctgcaccacctgcagctggcc agttgcaataatgaagatgtcaacaagttgctgggtaaagtcattatcacacaggtgact gtcttgcccaacatccaggccaagatgctgcccaagaagactgagagtcacctcaagatc aaaggagacaaagaaggccattacataatggtaaagggatcaattcaacaagaagagcta actatcctaaatatatatgcacccaatacaggagcacccagattcataaagcaagtcctg agtgacctacaaagagacttagactcccacacaataataatgggagattttaacacccca ctgtcaacattagacagatcaacgagacagaaagttaacaaggatacccaggaattgaac tcagctctgcatcaagcggatctaatagacatctacaggactctccaccccaaatcaaca gaatatacatttttttcagcaccacaccacacctattccaaaattgaccacatagttgga agtaaagctctcctcagcaaatgtaaaagaacagaaattataacaaactgtctctcggac cacagtgcaatcaaactagaactcaggattaagaaactcactcaaaaccgctcaactaca tggaaactgaacaacctgctcctgaatgactactgggtacataacgaaatgaaggcagaa ataaagatgttctttgaaaccaacgagaacaaagacacaacataccagaatctctgggac acattcaaagcagtgtgtagagggaaatttatagcactaaatgcccacaagagaaagcag gaaagatccgaaattgacaccctaacatcacaattaaaagaactagaaaagcaagagcaa acacattcaaaagctagcagagggcaagaaataactaaaatcagagtagaactgaaggaa atagagacacaaaaaacccttcaaaaaattaatgaatccaggagctggttttttgaaagg atcaataaaattgatagaccgctagcaagactaataaagaagaaaagagagaagaataaa gtagacacaataaaaaatgataaaggggatatcaccaccaatcccacagaaacacaaact accatcagagaatactacaaacacctctacgcaaataaactagaaaatctacaagaaatg gataaattcctcgacacatacactctcccaagactaaaccaggaagaagttgaatctctg aatagaccaataacaggctctgaaattgtggcaataatcaatagcttaccaacttaa >gi568815585f:36332622_36542662|GENSCAN_predicted_peptide_9|96_aa MEEGKGEAGTFLTGWQNRWVHKGISDLEESKAMSTHANSLHSSKVRKCKLKMLGDSLTHK EENGGPLLGTEDPVKNTIVPLIRGLVGRVPKPLVDA >gi568815585f:36332622_36542662|GENSCAN_predicted_CDS_9|291_bp atggaggaaggcaaaggagaagcaggcaccttcctcacagggtggcagaacagatgggtc cacaagggaatttctgatctagaggaatccaaagcaatgtccacacatgcaaattctcta cacagcagcaaagtcaggaaatgcaaactcaaaatgctgggagacagtttgactcacaaa gaggaaaatgggggacccttgctaggcactgaggacccagtgaagaatacaatagtcccc cttattcgagggctagtgggaagagttccaaaacccctggtggatgcctga