GENSCAN 1.0 Date run: 4-Nov-116 Time: 11:21:41

Sequence gi568815582r:84876346_85090618 : 214273 bp : 50.50% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1093   1165   73  2  1  120  116    84 0.951  12.86
 1.02 Intr +   4164   4239   76  0  1  106   76    57 0.795   5.82
 1.03 Intr +  12885  13018  134  2  2   88   97   292 0.923  29.54
 1.04 Intr +  20544  20667  124  0  1   92   43    50 0.121   1.49
 1.05 Intr +  26883  26917   35  2  2   99   80    18 0.150  -0.88
 1.06 Term +  31022  31082   61  2  1   68   49   103 0.169   1.68
 1.07 PlyA +  31285  31290    6                              -0.45

 2.00 Prom +  31538  31577   40                              -4.66
 2.01 Init +  44132  44262  131  1  2   96   10   144 0.645   7.02
 2.02 Term +  44280  44532  253  0  1   80   36   379 0.997  27.01
 2.03 PlyA +  44697  44702    6                               1.05

 3.00 Prom +  45222  45261   40                              -1.06
 3.01 Init +  53162  53233   72  1  0   84   26   112 0.491   5.67
 3.02 Intr +  56657  56791  135  1  0   60   64   131 0.574   8.66
 3.03 Intr +  61658  61711   54  1  0   76  117    13 0.200   2.18
 3.04 Intr +  64978  65201  224  0  2   92  100    57 0.340   4.23
 3.05 Intr +  69298  69343   46  1  1  135   61    25 0.239   2.91
 3.06 Term +  71474  71620  147  1  0   82   54    66 0.348   0.50
 3.07 PlyA +  71755  71760    6                               1.05

 4.18 PlyA -  74375  74370    6                               1.05
 4.17 Term -  78109  78020   90  1  0   73   37    88 0.078  -0.08
 4.16 Intr -  82894  82779  116  2  2   88   34    74 0.076   2.17
 4.15 Intr -  85689  85521  169  0  1   79   34    90 0.441   2.22
 4.14 Intr -  86701  86598  104  2  2  100   66    38 0.409   2.69
 4.13 Intr -  88218  88096  123  1  0   74   44    61 0.338   0.86
 4.12 Intr -  90492  90417   76  0  1  108   77     6 0.331   0.59
 4.11 Intr -  91668  91544  125  1  2   88   47    54 0.069   1.60
 4.10 Intr - 100174 100021  154  1  1  109   32    63 0.047   2.45
 4.09 Intr - 100793 100750   44  0  2   82  115    59 0.627   5.96
 4.08 Intr - 101660 101579   82  2  1   92  119    11 0.898   3.81
 4.07 Intr - 102940 102844   97  0  1   59   93    31 0.810   0.71
 4.06 Intr - 105649 105480  170  1  2   67   87   189 0.422  15.44
 4.05 Intr - 110173 110101   73  0  1   55   53   176 0.255  10.21
 4.04 Intr - 111374 111298   77  2  2   61   86    64 0.894   1.81
 4.03 Intr - 112528 112418  111  1  0   69   70    74 0.915   4.28
 4.02 Intr - 114290 113959  332  0  2  123   76   471 0.972  44.75
 4.01 Init - 117884 117755  130  2  1  102  103    12 0.940   4.44
 4.00 Prom - 130560 130521   40                              -5.86

 5.00 Prom + 134137 134176   40                              -6.16
 5.01 Init + 135047 135202  156  1  0   75   78   145 0.855  10.11
 5.02 Intr + 135353 135550  198  1  0   -8   78   156 0.434   4.55
 5.03 Intr + 156926 156992   67  1  1   88   71     6 0.015  -2.62
 5.04 Intr + 158892 158991  100  1  1   87   29    78 0.082   0.97
 5.05 Term + 160831 160996  166  1  1   92   48   109 0.771   4.69
 5.06 PlyA + 161133 161138    6                               1.05

 6.05 PlyA - 162071 162066    6                               1.05
 6.04 Term - 168871 168845   27  1  0   94   42    53 0.440  -0.63
 6.03 Intr - 169688 169304  385  1  1   53   41   201 0.366   6.75
 6.02 Intr - 171300 171181  120  2  0   95   29    59 0.168   0.31
 6.01 Init - 178555 178458   98  1  2   79   94    86 0.510   8.09
 6.00 Prom - 182400 182361   40                              -4.76

 7.00 Prom + 182653 182692   40                              -7.56
 7.01 Init + 190727 191055  329  1  2   91   86   499 0.997  46.80
 7.02 Intr + 195438 195537  100  0  1  125  110   150 0.980  21.01
 7.03 Intr + 196580 196653   74  1  2  114   91    47 0.995   5.80
 7.04 Intr + 199499 199569   71  2  2   87  103   103 0.977  10.53
 7.05 Intr + 201080 201287  208  0  1   85   53   421 0.928  36.44
 7.06 Intr + 202070 202110   41  2  2  102  105    40 0.915   4.97
 7.07 Intr + 202580 202658   79  0  1  110   95   178 0.964  19.31
 7.08 Intr + 206285 206454  170  2  2   79   75    58 0.472   3.19
 7.09 Intr + 209935 210379  445  2  1   16   74   375 0.356  21.36
 7.10 Intr + 210727 210821   95  1  2   97  100   208 0.999  22.51
 7.11 Intr + 211197 211233   37  1  1  108   65    42 0.820   1.22
 7.12 Intr + 211357 211400   44  1  2   95   55    60 0.434   1.28

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815582r:84876346_85090618|GENSCAN_predicted_peptide_1|167_aa
XQDLDCYTTVAQLCPFEKPATHCPRIHCPAHCKDEPSYWAPVFGTNIYADTSSICKTAVH
AGVISNESGGDVDVMPVDKKKTYVGSLRNGVQSESAGFRTPQGSFPVCLWNQNRPSTLLN
PVCVLGDAKRQALPPQPGPFLDLPQEPRPFVFMAHLFSAVTFGPIED
>gi568815582r:84876346_85090618|GENSCAN_predicted_CDS_1|504_bp
ntgcaggatttggactgctacacgaccgttgctcagctgtgcccgtttgaaaagccagca
actcactgcccaagaatccattgtccggcacactgcaaagacgaaccttcctactgggct
ccggtgtttggaaccaacatctatgcagatacctcaagcatctgcaagacagccgtgcac
gcgggagtcatcagcaacgagagtgggggtgacgtggacgtgatgcccgtggataaaaag
aagacctacgtgggctcgctcaggaatggagttcagtctgaaagtgctggcttcaggact
ccacagggcagcttccctgtctgcctctggaaccagaaccggccgtccaccctccttaat
cccgtctgtgtgttgggtgatgccaaaaggcaagctcttcctcctcagcctggccccttc
ctggacctcccccaagaacccaggcccttcgtcttcatggcccacctgttttctgccgtg
acctttggtcccattgaggactaa
>gi568815582r:84876346_85090618|GENSCAN_predicted_peptide_2|127_aa
MNYTVQTFFTPANSSRSPNYKMLKEEQEVAVLGAPHNPAPLTSTETSVTDHVIWSLFNTL
FMNSCCLGFIAFAYSVKSRDRKMVDDLNRAQAYASTAKHVNIWALTVGILMTILLIIIPV
LTFQIYR
>gi568815582r:84876346_85090618|GENSCAN_predicted_CDS_2|384_bp
atgaactacaccgtccaaaccttcttcactcccgccaacagcagccgttcccctaactat
aagatgctcaaggaggagcaagaggtggctgtgctgggggcaccccacaaccctgctccc
ctgacgtccaccgagacctctgtgaccgaccatgtcatctggtccctgtttaacaccctc
ttcatgaactcctgctgcctgggcttcatagcatttgcctactccgtgaagtctagggac
aggaagatggttgacgacctgaacagggcccaggcctatgcctccaccgccaagcatgtg
aacatctgggccctgactgtgggcatcctcatgaccattctgctcatcatcatcccggtg
ctgaccttccaaatctatcgatag
>gi568815582r:84876346_85090618|GENSCAN_predicted_peptide_3|225_aa
MPPDAADAVAMSEGGTPSSQQDGEGIPGFSIESPTSQTTPQCQAIWDSWSPYEELMSPEQ
PSSKEGQEQKGHPAHGSLYSGKAIKTPGRMGQLRPVSPTLPSGLSAGRSCRCRPWCCPCA
ACGVPEPTHPCTFEHSPFPERSLNYPNSSVTSGPALPNPLPHHVKKVLASALPSAMIPTQ
VRRECGQFNSADGPKTNQEEAARLGASLIGTIQNDAGDVRAVLAI
>gi568815582r:84876346_85090618|GENSCAN_predicted_CDS_3|678_bp
atgccgccggatgctgcggatgctgtggccatgagcgagggtgggacccccagcagccag
caggacggggagggcattcctggtttcagcattgaaagtcccacgtcccagacaacccct
cagtgccaggcaatctgggacagctggtcaccctacgaggagctgatgtccccagagcag
ccctccagcaaggaaggtcaagaacagaagggccacccggctcacggttctctttattca
ggcaaagccatcaagaccccgggccgcatgggccagctgcgtcccgtgtccccaacctta
ccctcgggcctcagtgctgggcgctcctgccgttgccggccctggtgctgcccgtgcgcc
gcctgcggtgtcccggagcccacgcatccctgcacctttgagcacagtccttttcctgag
cgctccctaaattatcccaactcaagtgtgacttcggggcctgctcttcccaaccccctt
ccccaccatgtaaagaaggtgcttgcttctgctttaccttctgccatgattcccacccag
gtcagaagggaatgtggccaatttaactctgcggacggccccaaaaccaaccaggaagaa
gctgcacgcctcggagcctcactgataggcacgattcagaatgacgctggggatgttcgt
gcagtcctggccatttga
>gi568815582r:84876346_85090618|GENSCAN_predicted_peptide_4|690_aa
MPLLCARQHARHEGDRAGGDPDPPGGADIRPGGRTCKSSQVASGCVREIMQPSGHRLRDV
EHHPLLAENDNYDSSSSSSSEADVADRVWFIRDGCGMICAVMTWLLVAYADFVVTFVMLL
PSKDFWYSVVNGVIFNCLAVLALSSHLRTMLTDPEKSSDCRPSACTVKTGLDPTLVGICG
EGTESVQSLLLPSLVDGHLGEFCFSAVTNHAALNGRVACVFNASPVDAYEPSRQPCECLP
EGAVPKGNATKEYMESLQLKPGEVIYKCPKCCCIKPERAHHCRYVRCCIKPERIRHCGIC
KRCIRKMDHHCPWVNNCVGEKNQRFFVLFTMYIALSSVHALILCGFQFISCVRGQWTVMF
GTQIHSICNDETEIERLKSEKPTWERRLRWEGMKSVFGGPPSLLWMNPFVGFRFRRLPTR
PRKGSAHVVARVQTSCLFMAEHDSITCLRLYRRYPPSIPQQPRGRGKQLRKDGVNRRGAP
AMATTVPHHKDLEQMLVNYMNEEMNGYFSLIVQGLAQNPPLPGSLPGHPWQDRAVWDPSS
YSGYIHKGCLYEEIVRKFSLQGRHEQEERLTVPVASDEHARNFSRLTVLAAPSKCRQATA
HADSPPQGKIKGGETQTPESCQRRALPVIVGVKVPGTASRRNSSSWDLSVLFWNMESEGV
KRPEVTGIPWLLAASSTFRAGNVASLLLLR
>gi568815582r:84876346_85090618|GENSCAN_predicted_CDS_4|2073_bp
atgcccttgctgtgcgccagacagcatgcaaggcatgagggggacagggcaggaggagac
ccagacccccctggaggggctgacatcaggccgggaggtcggacctgcaagagctctcag
gtagcaagtgggtgcgtcagggaaatcatgcagccatcaggacacaggctccgggacgtc
gagcatcatcctctcctggctgaaaatgacaactatgactcttcatcgtcctcctcctcc
gaggctgacgtggctgaccgggtctggttcatccgtgacggctgcggcatgatctgtgct
gtcatgacgtggcttctggtcgcctatgcagacttcgtggtgactttcgtcatgctgctg
ccttccaaagacttctggtactctgtggtcaacggggtcatctttaactgcttggccgtg
cttgccctgtcatcccacctgagaaccatgctcaccgaccctgaaaaatccagtgactgc
cgaccatctgcctgcacagtgaaaactgggctggacccaacccttgtgggcatttgtggt
gagggaaccgagtctgtgcaaagcctcctgcttccatcgttagttgatgggcatctgggc
gagttctgcttttcggctgttacaaatcatgctgctctgaacggccgcgtagcctgtgtc
ttcaatgccagccccgtggatgcctacgagccttctcggcagccgtgcgagtgccttccg
gagggggcagtacccaaaggaaacgctacgaaagaatacatggagagcttgcagctgaag
cccggggaagtcatctacaagtgccccaagtgctgctgtattaaacccgagcgcgcccac
cactgcaggtacgtgcgctgctgtattaaacccgagcgcatccgccactgcggtatttgc
aaaagatgtattcggaaaatggatcatcactgcccgtgggtgaacaattgtgtaggagaa
aagaatcaaagattttttgtgctcttcactatgtatatagctctgtcttcagtccatgct
ctgatcctttgtggatttcagttcatctcctgtgtccgagggcagtggactgttatgttt
ggcacccaaatccactccatatgcaacgacgagacggagatcgagcgattgaaaagtgag
aagcccacatgggagcggaggctgcgatgggaagggatgaagtccgtctttggggggccc
ccctcactcctctggatgaatccctttgtgggcttccgatttaggcgactgcccacgaga
cccagaaaaggttctgcccatgtggtagcgcgtgtccagacttcatgcctttttatggca
gaacacgacagcatcacgtgtttgcgcctgtatcgccgttatccacccagtatcccacag
caacctcgaggcaggggcaagcagctccggaaggacggagtaaacaggcggggagcgcca
gccatggccaccactgtcccccatcacaaagacttggaacagatgcttgttaactacatg
aatgaagaaatgaatggctacttctcactcattgttcaaggcctggctcagaatccacct
cttccagggagcctccctggtcacccctggcaggaccgagctgtctgggaccctagctca
tattcgggttacatccataaaggctgtctctacgaggaaattgttaggaaattttctcta
caaggcagacatgagcaggaggagcgcttgaccgttccagtcgcctcagacgagcatgca
cgcaacttcagtagactcacagtgctggcggccccttctaagtgccgacaggccactgca
catgcagacagcccaccccagggaaaaatcaagggaggagagacgcaaaccccagaatca
tgccaacgacgggccttgcctgtcattgtgggtgtcaaagtaccagggactgccagcagg
aggaattcctcctcctgggacctcagcgtcctcttctggaatatggagtctgagggtgtg
aagcgtccagaggtcactggcatcccatggctcctggccgcttcgtccaccttcagagcc
ggcaatgttgcatctctgctgctgcttcggtaa
>gi568815582r:84876346_85090618|GENSCAN_predicted_peptide_5|228_aa
MAGLERAPRLGSAWSPGSRRAGRKEAAAKQMPQPGALAGERGGASAGSEPDERCRQTEVR
RRKDRGYGLQQGPSHAVRLRNRVGNALSGVCTRARGGVSEAAPVACTICGDVTEGNGKSP
QHLYPSQQGRHSSRDATAASGQKWLCKFCVVLLVLRFPRPSPVLSMEQVTKKRWCVWRHF
PGLYITARRDSQADDSVAKVTNELGENENLGTVEQMENARVSACGRLL
>gi568815582r:84876346_85090618|GENSCAN_predicted_CDS_5|687_bp
atggccgggctggagcgggccccgcggctcggctcggcttggtcccctgggtcccgccgg
gcaggtcggaaggaggctgcagccaagcaaatgcctcagcccggagccttggctggagag
cggggcggtgcgagcgccggaagtgaacccgacgagagatgccgccaaactgaagtccgg
aggagaaaggataggggctacgggctgcagcagggtccaagccacgccgttaggctgcgg
aaccgggttgggaatgcccttagcggagtgtgtacccgcgcccgcggcggtgtctcagag
gcagctccagtggcctgtactatatgcggggatgtgaccgaaggaaatgggaaaagccca
caacatctttacccctcccagcagggacgccacagcagcagggatgccacagctgcttca
ggtcagaagtggctctgcaagttctgtgttgtcctgctcgttcttcggttccccaggcct
agcccagtgctcagcatggagcaggtgacgaagaaacgctggtgtgtttggagacatttt
cctggcctttacatcacagccaggcgtgactcacaagcagatgactcagtggcaaaagtg
accaatgagctaggggaaaatgagaatctgggaacagtggaacaaatggaaaatgccaga
gtcagtgcgtgtggacggctgctgtga
>gi568815582r:84876346_85090618|GENSCAN_predicted_peptide_6|209_aa
MPFNEPSQSEYVHAATTRARTYNIAIPFRPPQRKFHSGSGGNSDSASRGVSLQVSEALRK
PSFPAGGTVVQRGGCGTTHPLRHFLWILPRVVVPLRCTLSAAIICLALQEHSHGTKISTE
SPSQMGRWKWATHWSSTKPCEVPADAGGTRPHSLHRLEQRPTHRAEPGPLPDDNLGPLTL
QNGSRFQEFHVMSLKQGHWGWHRDVQKRI
>gi568815582r:84876346_85090618|GENSCAN_predicted_CDS_6|630_bp
atgccgttcaatgaaccctcacaaagcgaatacgtccatgcagccaccacccgcgccagg
acgtacaacattgccatcccctttaggcctccccaaaggaaattccacagcggcagtggt
ggcaacagtgacagtgccagcagaggggtcagcctgcaggtgtctgaagctctgaggaaa
ccttcctttccagctggaggaacggtggtccaaagagggggctgtggcacgacccacccc
ctgcgacatttcctgtggatccttcctcgggtggtcgtccctttgagatgcacattgtcg
gctgccatcatctgcctggccctgcaggagcacagccatgggactaagataagtacagaa
agtccaagccagatgggcagatggaagtgggctactcactggagctcaacaaagccttgc
gaggtgcccgctgatgcaggtggcactcgtccacacagcctgcacaggctggagcagagg
ccaacgcacagagcagaacctggtcccctgcctgatgacaacttaggaccactgactctt
caaaacggatcccggttccaggaattccatgtcatgtccttaaaacaaggtcactggggc
tggcacagagatgtccagaaacgcatctga
>gi568815582r:84876346_85090618|GENSCAN_predicted_peptide_7|565_aa
METPEVPVGSLIDFGPEAPTSSPLEAPPPVLQDGDGSLGDGASESETTESADSENDMGES
PSHPSWDQDRRSSSNESFSSNQSTESTQDEETLALRDFMRGYVEKIFSGGEDLDQEEKAK
FGEYCSSENGKGREWFARYVSAQRCNSKCVSEATFYRLVQSFAVVLFECHQMDDFGPAKN
LMTMCFTYYHIGKPQLLPPESREKPAGSIDSYLKSANSWLAEKKDIAERLLKNTSARTEN
VKGFFGGLETKLKGPLARRNEEDENKPQEKRPRAVTAYSPEDEKKGEKIYLYTHLKQQPI
CAGSCEGHCRVGAVHCAELLQQAQPRRRCSLVAGGEGGPGEVRPALGQAGNASQNWAGQA
QTPLSNRSKALGPPSLPVECLDSRVVLRAFRAIKSTFISTSLYLMKSEMNVGDPKRGVRI
PAGLVEGPAHGSPVEGTPFCARWGPYMGAASTCGSEQGEELGQGQHHLSPASVTSCACRE
KWCHMTQEERDDSLRFNENITFGQLGTFTHNMLAFGLNKKLCNDFLKKQAVIGNLDEDHT
AIMSIGGQGWRWFFFIEKDLHLGVX
>gi568815582r:84876346_85090618|GENSCAN_predicted_CDS_7|1695_bp
atggagaccccagaggtccccgtgggctcgctaatcgactttgggcctgaggcacccacc
tcttctcccctggaggcaccaccccctgtgctgcaggacggcgatggctccctgggggac
ggtgcatcagagagtgagaccactgagtctgcggacagtgagaatgacatgggcgagtcg
ccctcgcacccgtcctgggaccaagaccgccgttcctcctccaacgagtccttctcctcc
aaccagagcaccgagtctacccaggatgaagagaccctggcactcagggacttcatgcgt
ggctacgtggagaagatcttctctggaggggaggacttggatcaggaggagaaagccaag
tttggagagtactgcagcagtgaaaatggaaaaggccgggagtggtttgctcgatacgtg
agtgcccagcgctgcaactccaagtgtgtctcagaggcaaccttctaccgcctggtgcag
tcttttgcagtggtgctgttcgagtgtcatcagatggatgactttgggcctgccaagaac
ctcatgaccatgtgcttcacctactaccacatcggaaaaccacagctgctgcccccggag
tcccgggagaagcccgcgggcagcatcgactcctacctgaaatccgcaaacagctggctg
gccgaaaagaaggacatcgccgagcggctgctgaagaacacctcggccaggactgagaat
gtcaagggcttcttcggggggctggagaccaagctgaaggggcccctggccaggaggaac
gaggaagacgagaacaaaccccaggagaagcggcccagggctgtgaccgcgtacagcccc
gaggacgaaaagaagggggagaagatctacctgtacacgcacctgaagcaacagcccatc
tgtgcgggctcctgcgaaggccactgcagggtgggggctgtgcactgtgctgagctgctg
cagcaggcacagcccagacgcaggtgcagcctggtggccggcggggagggcggccctggg
gaggtgaggccagcgctagggcaggctggcaatgcgtcccagaactgggcaggacaggcc
cagaccccactctccaaccgtagtaaggctctggggccaccctcgctgcctgtagagtgc
ctggatagccgcgtggtgctccgtgccttcagggccataaagagcactttcatttccacg
tctctttatcttatgaagagtgagatgaatgttggggaccccaagcgcggtgtgaggatc
ccggcaggccttgtggaaggcccagcacacggctctccggtggaaggcacccccttctgt
gcccggtgggggccgtacatgggtgccgccagcacctgcgggtcagaacagggcgaggag
ctggggcaaggacagcaccatctgagccccgcttctgtcacctcctgtgcttgcagggag
aagtggtgccacatgacccaggaggagcgcgacgacagcctccggttcaacgagaacatc
accttcgggcagctgggcacattcacgcacaacatgctggcctttggactgaacaagaag
ctgtgcaatgacttcctgaagaagcaggctgtgattggcaacctggatgaagaccacact
gccatcatgagtatcgggggacaagggtggcgttggttcttcttcatcgaaaaggactta
cacttaggcgtggnn