GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:58:36 Sequence gi568815597f:28159848_28380799 : 220952 bp : 46.35% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.15 PlyA - 451 446 6 1.05 1.14 Term - 4617 4447 171 0 0 69 33 110 0.178 1.43 1.13 Intr - 14276 14250 27 2 0 96 70 41 0.230 1.41 1.12 Intr - 15754 15621 134 2 2 103 54 23 0.145 0.76 1.11 Intr - 17021 16919 103 2 1 110 25 72 0.214 2.85 1.10 Intr - 37887 37802 86 1 2 71 80 86 0.489 5.64 1.09 Intr - 41523 41409 115 0 1 98 92 171 0.987 18.52 1.08 Intr - 43975 43900 76 0 1 92 84 82 0.998 7.72 1.07 Intr - 45502 45411 92 1 2 66 83 50 0.989 1.09 1.06 Intr - 48566 48495 72 2 0 60 105 57 0.945 4.20 1.05 Intr - 50219 50125 95 0 2 5 93 137 0.971 5.58 1.04 Intr - 50790 50724 67 0 1 90 63 52 0.950 1.38 1.03 Intr - 55149 55093 57 0 0 63 92 44 0.452 1.38 1.02 Intr - 69176 69075 102 2 0 71 83 20 0.127 0.17 1.01 Init - 73151 73074 78 2 0 92 51 66 0.268 4.37 1.00 Prom - 73599 73560 40 -5.76 2.00 Prom + 74055 74094 40 -10.05 2.01 Init + 76337 76423 87 1 0 87 101 163 0.914 16.29 2.02 Intr + 76514 76605 92 1 2 93 93 120 0.913 11.79 2.03 Term + 77990 78131 142 2 1 65 42 183 0.999 8.80 2.04 PlyA + 78234 78239 6 1.05 3.00 Prom + 80990 81029 40 -9.16 3.01 Init + 84791 84840 50 1 2 91 95 6 0.401 2.10 3.02 Intr + 93562 93703 142 1 1 62 73 53 0.499 1.56 3.03 Intr + 99637 99888 252 0 0 25 -30 316 0.944 11.13 3.04 Intr + 99896 100090 195 1 0 1 99 281 0.835 20.11 3.05 Intr + 108011 108235 225 1 0 116 75 81 0.865 7.78 3.06 Intr + 109336 109401 66 0 0 94 99 76 0.992 8.40 3.07 Intr + 111827 112024 198 1 0 92 89 248 0.988 24.85 3.08 Intr + 112437 112619 183 2 0 83 61 266 0.789 23.38 3.09 Intr + 112734 112946 213 2 0 149 98 225 0.999 28.51 3.10 Intr + 113511 113661 151 2 1 47 92 344 0.999 30.44 3.11 Intr + 114193 114311 119 2 2 110 91 133 0.996 15.98 3.12 Intr + 114978 115168 191 2 2 68 99 316 0.968 29.08 3.13 Intr + 119178 119394 217 0 1 119 86 188 0.726 20.21 3.14 Term + 120869 120955 87 1 0 109 41 180 0.999 13.06 3.15 PlyA + 122615 122620 6 1.05 4.00 Prom + 165349 165388 40 -3.16 4.01 Init + 169095 169148 54 2 0 79 68 54 0.778 3.80 4.02 Intr + 170750 170888 139 1 1 69 110 137 0.788 14.04 4.03 Term + 174570 175123 554 1 2 122 42 553 0.994 48.58 4.04 PlyA + 175983 175988 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:28159848_28380799|GENSCAN_predicted_peptide_1|424_aa MAASGESGTSGGGGSTEEAFMTFYSEVKQIEKRDSVLTSKNQIERLTRPGSSYFNLNPFE VLQIDPEVTDEEIKKRFRQLSILVHPDKNQDDADRAQKAFEAVDKAYKLLLDQEQKKRAL DVIQAGKEYVEHTVKERKKQLKKEGKPTIVEEDDPELFKQAVYKQTMKLFAELEIKRKER EAKEMHERKRQREEEIEAQEKAKREREWQKNFEESRDGRVDSWRNFQANTKGKKEKKNRT FLRPPKVKMEQLEQIGSPAVEPTAGKRTFVGRGEVEGASQLLASGAVYTPPHPSLFEKPL CSAQVLAAQLGSSCGGPDMGLCTHSLVKYDRTSRLSPHTSGCSPLALTEYPAVFQVFGVH DESGEKPLSKEYLDPRECAHLECGVPPPSPLLVLALLVNAPSGLRGCKELLVQELWMNLD VSYL >gi568815597f:28159848_28380799|GENSCAN_predicted_CDS_1|1275_bp atggcggcttcaggagagagcgggacttcaggcggcggaggcagcaccgaggaagcattt atgaccttctacagtgaggtgaaacaaatagagaagagagactcggttctaacttcgaaa aatcagattgaaagactgacccgtcctggttcctcttacttcaatttgaacccatttgag gttcttcagatagatcctgaagttacagatgaagaaataaaaaagaggtttcggcagtta tccatcttggtgcatcctgacaaaaatcaagatgatgctgacagagcacaaaaggctttt gaagctgtggacaaagcttacaagttgctactggatcaggagcaaaagaagagggccctg gatgtaattcaggcaggaaaagaatacgtggaacacactgtgaaagagcgaaaaaaacaa ttaaagaaggaaggaaaacctacaattgtagaggaggatgatcctgagctgttcaaacaa gctgtatataaacagacaatgaaactctttgcagagctggaaattaaaaggaaagagaga gaagccaaagagatgcatgaaaggaaacgacaaagggaagaagagattgaagctcaagaa aaagccaaacgggaaagagagtggcagaaaaactttgaggaaagtcgagatggtcgtgtg gacagctggcgaaacttccaagccaatacgaaggggaagaaagagaagaaaaatcggacc ttcctgagaccaccgaaagtaaaaatggagcaactggagcagataggctccccagctgtg gagccaactgctggcaagaggacctttgtgggcagaggtgaggtggaaggagccagccag ctcctggcctcgggcgctgtctacaccccaccccatccctcgctctttgagaagcctctg tgcagtgcccaggtcctggcggcccagcttggaagttcctgtggaggtccagacatgggg ctgtgcactcactcacttgtgaaatatgacagaaccagcaggctctctccccacacctct ggctgctctcctttagctctgactgaatacccagctgtgttccaggtgtttggggtgcat gatgagagcggggagaagcctctgagcaaggagtacttggatccaagagagtgtgcccac ttggaatgtggtgttcccccgccttcaccacttctagtccttgccctgttggtgaatgca ccctcaggcctgcggggctgcaaggaactgctggtgcaggagctgtggatgaacttggat gtatcttatttatag >gi568815597f:28159848_28380799|GENSCAN_predicted_peptide_2|106_aa MAVTALAARTWLGVWGVRTMQARGFGSDQSENVDRGAGSIREAGGAFGKREQAEEERYFR AQSREQLAALKKHHEEEIVHHKKEIERLQKEIERHKQKIKMLKHDD >gi568815597f:28159848_28380799|GENSCAN_predicted_CDS_2|321_bp atggcagtgacggcgttggcggcgcggacgtggcttggcgtgtggggcgtgaggaccatg caagcccgaggcttcggctcggatcagtccgagaatgtcgaccggggcgcgggctccatc cgggaagccggtggggccttcggaaagagagagcaggctgaagaggaacgatatttccga gcacagagtagagaacaactggcagctttgaaaaaacaccatgaagaagaaatcgttcat cataagaaggagattgagcgtctgcagaaagaaattgagcgccataagcagaagatcaaa atgctaaaacatgatgattaa >gi568815597f:28159848_28380799|GENSCAN_predicted_peptide_3|762_aa MTLSNSFELKLLQGPGRKAERREEAAVEKLKPSIVCFMRLKERSLLNNRKIQGEAAHADG EAAARAELLVSEPGERWQFRGGDAEERWVREQPWPLRTSEAVKTPALRPFPGPRGRSPFP KPDWGKSPAPKRPFSDSGAFWSPERRPGKLPGGAQSRLHSGVPPKPTRVHGSSASRDRVL ARTMIVADSECRAELKDYLRFAPGGVGDSGPGETSMNDLALLSTKCDLVSGKQNRQFDDS VFRSKSPGNFGVIGEGQKKVPRLLWFSFAKGVTTGTGPVVSGSPWEGEEQRESRARRGPR GPSAFIPVEEVLREGAESLEQHLGLEALMSSGRVDNLAVVMGLHPDYFTSFWRLHYLLLH TDGPLASSWRHYIAIMAAARHQCSYLVGSHMAEFLQTGGDPEWLLGLHRAPEKLRKLSEI NKLLAHRPWLITKEHIQALLKTGEHTWSLAELIQALVLLTHCHSLSSFVFGCGILPEGDA DGSPAPQAPTPPSEQSSPPSRDPLNNSGGFESARDVEALMERMQQLQESLLRDEGTSQEE MESRFELEKSESLLVTPSADILEPSPHPDMLCFVEDPTFGYEDFTRRGAQAPPTFRAQDY TWEDHGYSLIQRLYPEGGQLLDEKFQAAYSLTYNTIAMHSGVDTSVLRRAIWNYIHCVFG IRVGLRGGSCLWEESMSNAAGTFPSRYDDYDYGEVNQLLERNLKVYIKTVACYPEKTTRR MYNLFWRHFRHSEKVHVNLLLLEARMQAALLYALRAITRYMT >gi568815597f:28159848_28380799|GENSCAN_predicted_CDS_3|2289_bp atgaccctgtccaacagttttgaactcaagttgctacaggggccaggcaggaaagctgag agacgtgaggaagctgctgtagaaaagcttaaacctagcatagtttgcttcatgaggttg aaagaaagaagccttctcaataacagaaaaatacaaggtgaagcagcccatgctgatgga gaagctgctgcacgggctgaactgctggtgtcagagcccggcgagcgctggcagttccgc ggcggggatgctgaggagcgctgggtccgggagcagccctggcccctgcggacttccgag gccgtgaaaacccctgcgctgcggcccttcccaggcccccgaggccgttcgccgttcccg aagcccgactgggggaagagtccagcaccaaagcggccgttctcggattccggagcgttc tggagccccgagagacgccccgggaagctccccggcggcgcccagtcccggcttcattcg ggcgtccctccgaaacccactcgggtgcacgggtcgtcggcgagccgcgaccgggtcctg gcgcgcaccatgatcgtggcggactccgagtgccgcgcagagctcaaggactacctgcgg ttcgccccgggcggcgtcggcgactcgggccccggagagactagtatgaatgacctggcg ctgctctccactaaatgtgatctggtgtcagggaaacaaaacagacaatttgatgacagt gtttttagaagcaaaagccctgggaattttggggtcataggtgagggccagaaaaaggtt cctagactattgtggtttagctttgcaaaaggagtgaccactgggactgggccagttgta tctggaagcccatgggagggggaggagcagagggagagccgggctcggcgaggccctcga gggcccagcgccttcatccccgtggaggaggtccttcgggagggggctgagagcctcgag cagcacctggggctggaggcactgatgtcctctgggcgagtagacaacctggcagtggtg atgggcctgcaccctgactactttaccagcttctggcgcctgcactacctgctgctgcac acggatggtcccttggccagctcctggcgccactacattgccatcatggctgccgcccgc catcagtgttcttacctggtaggctcccacatggccgagtttctgcagactggtggtgac cctgagtggctgctgggcctccaccgggcccccgagaagctgcgcaaactcagcgagatc aacaagttgctggcgcatcggccatggctcatcaccaaggaacacatccaggccttgctg aagaccggcgagcacacttggtccctggccgagctcattcaggctctggtcctgctcacc cactgccactcgctctcctccttcgtgtttggctgtggcatcctccctgagggggatgca gatggcagccctgccccccaggcacctacaccccctagtgaacagagcagccccccaagc agggacccgttgaacaactctgggggctttgagtctgcccgcgacgtggaggcgctgatg gagcgcatgcagcagctgcaggagagcctgctgcgggatgaggggacgtcccaggaggag atggagagccgctttgagctggagaagtcagagagcctgctggtgaccccctcagctgac atcctggagccctctccacacccagacatgctgtgctttgtggaagaccctactttcgga tatgaggacttcactcggagaggggctcaggcaccccctaccttccgggcccaggattat acctgggaagaccatggctactcgctgatccagcggctttaccctgagggtgggcagctg ctggatgagaagttccaggcagcctatagcctcacctacaataccatcgccatgcacagt ggtgtggacacctccgtgctccgcagggccatctggaactatatccactgcgtctttggc atcagggtggggttgcggggagggagctgcctttgggaagaaagcatgagtaatgctgct gggacctttccatccagatatgatgactatgattatggggaggtgaaccagctcctggag cggaacctcaaggtctatatcaagacagtggcctgctacccagagaagaccacccgaaga atgtacaacctcttctggaggcacttccgccactcagagaaggtccacgtgaacttgctg ctcctggaggcgcgcatgcaagccgctctgctgtacgccctccgtgccatcacccgctac atgacctga >gi568815597f:28159848_28380799|GENSCAN_predicted_peptide_4|248_aa MKLFGLLGTHTFGNGSFEVYPVPYLTGGSESSCVVFNLDTMEAPPVTMMPVTGGTINMME YLLQGSVLDHSLESLIHRLRGLCDNMEPETFLDHEMVFLLKGQQASPFVLRARRSMDRAG APWHLRYLGQPEMGDKNRHALVRNCVDIATSENLTDFLMEMGFRMDHEFVAKGHLFRKGI MKIMVYKIFRILVPGNTDSTEALSLSYLVELSVVAPAGQDMVSDDMKNFAEQLKPLVHLE KIDPKRLM >gi568815597f:28159848_28380799|GENSCAN_predicted_CDS_4|747_bp atgaagctcttcggccttctcggcacccatacgtttgggaacggcagtttcgaggtatat cccgtgccttacctgactgggggctctgagtccagttgtgttgtcttcaacttagacacc atggaggcacctccagtcaccatgatgcctgtcactgggggcaccattaacatgatggag tacctgttgcagggaagtgttttagatcacagtttggaaagcctcatccaccgccttcgt ggtttgtgtgacaacatggaacctgagactttccttgaccatgagatggtattcctcctt aagggccagcaagccagcccatttgttctcagggcccgacgctctatggacagggcaggg gcaccctggcatctgcgctacctgggacagccagaaatgggagacaagaaccgccatgcc ctggtgcgaaactgcgtggacattgccacatctgagaacctcaccgacttcttgatggaa atgggcttccgcatggaccatgagtttgttgctaagggacatttgttccgtaagggcatc atgaagattatggtgtacaagattttccgcatcctggtgccagggaacacagacagcact gaggccttgtcactctcctatctcgtggaattaagtgtggtagcacccgctgggcaggac atggtctctgatgacatgaagaacttcgcagaacagctaaaacctctggttcacctagag aaaatagaccccaagaggctcatgtga