GENSCAN 1.0 Date run: 5-Nov-116 Time: 22:16:58 Sequence gi568815586r:15521320_15782951 : 261632 bp : 37.00% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 5737 5776 40 -2.65 1.01 Init + 16039 16192 154 0 1 78 99 76 0.397 7.89 1.02 Intr + 25250 25389 140 0 2 68 84 78 0.317 4.66 1.03 Intr + 30195 30352 158 2 2 7 97 127 0.301 3.39 1.04 Intr + 36136 36204 69 1 0 79 105 29 0.539 1.08 1.05 Intr + 41243 41305 63 2 0 85 63 89 0.597 3.01 1.06 Intr + 44274 44309 36 0 0 82 106 25 0.274 0.16 1.07 Intr + 48098 48179 82 2 1 19 105 54 0.405 -1.08 1.08 Intr + 57534 57624 91 2 1 109 66 95 0.982 8.05 1.09 Intr + 58720 58796 77 2 2 106 95 105 0.999 11.32 1.10 Intr + 59378 59512 135 1 0 89 83 58 0.967 5.24 1.11 Intr + 60360 60482 123 2 0 91 92 193 0.994 19.86 1.12 Intr + 65578 65732 155 0 2 75 91 131 0.989 10.05 1.13 Intr + 68136 68271 136 0 1 57 75 155 0.872 10.75 1.14 Intr + 68906 68984 79 1 1 49 57 90 0.085 0.21 1.15 Intr + 82934 83020 87 0 0 66 82 49 0.034 1.12 1.16 Intr + 86544 86689 146 1 2 33 84 125 0.024 5.68 1.17 Term + 98051 98221 171 1 0 69 43 79 0.018 -1.66 1.18 PlyA + 98320 98325 6 1.05 2.25 PlyA - 98844 98839 6 1.05 2.24 Term - 100111 99998 114 1 0 95 28 133 0.900 5.59 2.23 Intr - 101968 101839 130 0 1 138 64 38 0.998 6.18 2.22 Intr - 103088 102908 181 0 1 42 56 171 0.931 7.30 2.21 Intr - 110345 110123 223 2 1 92 103 143 0.909 12.98 2.20 Intr - 119527 119384 144 1 0 49 99 185 0.996 15.26 2.19 Intr - 120511 120403 109 0 1 98 75 114 0.999 10.47 2.18 Intr - 125941 125808 134 1 2 97 60 119 0.989 8.52 2.17 Intr - 129687 129504 184 2 1 91 53 210 0.984 16.67 2.16 Intr - 132974 132826 149 2 2 36 115 124 0.983 8.01 2.15 Intr - 136834 136760 75 1 0 112 81 5 0.665 0.99 2.14 Intr - 137266 137178 89 2 2 61 119 65 0.997 5.67 2.13 Intr - 139421 139295 127 2 1 55 107 79 0.909 5.83 2.12 Intr - 140780 140707 74 0 2 46 103 106 0.983 6.11 2.11 Intr - 144573 144437 137 2 2 78 74 128 0.979 9.69 2.10 Intr - 145203 145121 83 0 2 98 83 87 0.995 6.72 2.09 Intr - 148217 148068 150 2 0 55 93 83 0.901 4.94 2.08 Intr - 148506 148372 135 0 0 16 42 201 0.948 8.14 2.07 Intr - 149604 149537 68 1 2 87 63 66 0.002 1.71 2.06 Intr - 159983 159907 77 1 2 52 78 63 0.032 -0.06 2.05 Intr - 161653 161574 80 1 2 117 100 33 0.106 4.83 2.04 Intr - 167720 167642 79 1 1 73 40 87 0.023 1.03 2.03 Intr - 185717 185633 85 0 1 58 71 55 0.023 -1.24 2.02 Intr - 201914 201867 48 0 0 63 89 76 0.308 2.93 2.01 Init - 208250 208187 64 2 1 70 97 48 0.298 5.26 2.00 Prom - 214291 214252 40 -6.85 3.00 Prom + 218735 218774 40 -5.35 3.01 Init + 221597 222022 426 1 0 68 86 223 0.622 16.45 3.02 Term + 240932 241075 144 1 0 10 42 184 0.199 2.93 3.03 PlyA + 241398 241403 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 155946 156254 309 2 0 23 31 254 0.833 8.85 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:15521320_15782951|GENSCAN_predicted_peptide_1|633_aa MEMVSKNQQKRLRSIGQGSKKEHRKVSCPRSQGKRKPQRVEIVQVKPVNKSEPAPPKSLF AVNKTQTSVTLLWVEEGVADFFEVFCQQVGSSQKTKLQRTYDENAQLISLVTEMNPNVVV ISVLAILSTLLIGLLLVTLIILRKKHLQMARECGAGTFVNFASLERDGKLPYNCVARISS WGGSRISVEHQYPVRSKNGLKKRKLTNPVQLDDFDAYIKDMAKDSDYKFSLQFEELKLIG LDIPHFAADLPLNRCKNRYTNILPYDFSRVRLVSMNEEEGADYINANYIPGYNSPQEYIA TQGPLPETRNDFWKMVLQQKSQIIVMLTQCNEKRRVKCDHYWPFTEEPIAYGDITVEMIS EEEQDDWACRHFRINYADEMQDVMHFNYTAWPDHGVPTANAAESILQFVHMVRQQATKSK GPMIIHCSAGVGRTGTFIALDRLLQHIRDHEFVDILGLVSEMRSYRMSMVQTEPEDALSL ERQPKATPADSTACLRAAVVPRHLQSECAPLISQSHMGALSMSMPPTQEQQAIIQAEQDS PTELKRQRPEFSSPEVPEISKKGHKGGRICAEWPQKSIPVTKPLRCLDPNSVVLLPQDSK FQEPLLFVPTSLADVTAFCNYYLCDTFLWHLIA >gi568815586r:15521320_15782951|GENSCAN_predicted_CDS_1|1902_bp atggagatggtgagtaaaaaccagcaaaagagactgaggagcattggccagggaagcaag aaagagcacagaaaagtgtcctgtcccaggagccaagggaagagaaagcctcagcgagtg gagattgtccaggtcaaacctgtgaacaaatctgaaccagctccacccaaatcactcttc gcagtgaacaaaacccagacttcagtgactttgctgtgggtggaagagggagtagctgat ttctttgaagttttctgtcaacaagttggctccagtcagaaaaccaaacttcagagaacc tatgatgaaaatgcacaacttatctctttagttacagagatgaatcccaatgtggtagtg atctccgtgctggccatccttagcacacttttaattggactgttgcttgttaccctcatt attcttaggaaaaagcatctgcagatggctagggagtgtggagctggtacatttgtcaat tttgcatccttagagagggatggaaagcttccatacaactgtgttgctcgaatctcttcc tggggtggtagcagaatcagtgtggagcaccagtacccagttaggagtaaaaatggttta aagaagaggaaactgacaaacccggttcaactggatgactttgatgcctatattaaggat atggccaaagactctgactataaattttctcttcagtttgaggagttgaaattgattgga ctggatatcccacactttgctgcagatcttccactgaatcgatgtaaaaaccgttacaca aacatcctaccatatgacttcagccgtgtgagattagtctccatgaatgaagaggaaggt gcagactacatcaatgccaactatattcctggatacaactcaccccaggagtatattgcc acccaggggccactgcctgaaaccagaaatgacttctggaagatggtcctgcaacaaaag tctcagattattgtcatgctcactcagtgtaatgagaaaaggagggtgaaatgtgaccat tactggccattcacggaagaacctatagcctatggagacatcactgtggagatgatttca gaggaagagcaggacgactgggcctgtagacacttccggatcaactatgctgacgagatg caggatgtgatgcattttaactacactgcatggcctgatcatggtgtgcccacagcaaat gctgcagaaagtatcctgcagtttgtacacatggtccgacagcaagctaccaagagcaaa ggtcccatgatcattcactgcagtgctggcgtgggacggacaggaacattcattgccctg gacaggctcttgcagcacattcgggatcatgagtttgttgacatcttagggctggtgtca gaaatgaggtcataccggatgtctatggtacagacagagcctgaggatgcacttagtctg gagcgacaacccaaggcaacgccggcagacagcaccgcctgcctcagagcagccgtggta ccaagacatctacagagtgagtgtgcccctttgatttctcagtcccacatgggcgccctt agcatgagcatgccccctacacaagagcagcaagctataattcaagcagaacaagatagt cccactgaactgaagaggcaaagaccagaattcagttcacctgaagtgcctgaaatttct aaaaaggggcacaagggtggacggatttgtgcagagtggccacagaaatctataccagtg accaaacccctcagatgtctggatccaaattctgtggtgctgctccctcaagattctaaa ttccaggaacctcttctctttgtccccacatccttagctgatgtgactgctttctgcaat tactacctctgtgatacattcctttggcatcttatagcataa >gi568815586r:15521320_15782951|GENSCAN_predicted_peptide_2|912_aa MQNAFDSFDAISRLERANERIGSSDSPTAAFRIAEATGVLHTSIVLNLERYYNHFTNEET RLIEAKACNVGVACFFSAPLLRPLGEHADWQANTQVKDTMNGHISNHPSSFGMYPSQMNG YGSSPTFSQTDREHGSKTSAKALYEQRKNYARDSVSSVSDISQYRVEHLTTFVLDRKDAM ITVDDGIRKLKLLDAKGKVWTQDMILQVDDRANELENFPLNTIQHCQAVMHSCSYDSVLA LVCKEPTQNKPDLHLFQCDEVKANLISEDIESAISDSKGGKQKRRPDALRMISNADPSIP PPPRAPAPAPPGTVTQVDVRSRVAAWSAWAADQGDFEKPRQYHEQEETPEMMAARIDRDV QILNHILDDIEFFITKLQKAAEAFSELSKRKKNKKGKRKGPGEGVLTLRAKPPPPDEFLD CFQKFKHGFNLLAKLKSHIQNPSAADLVHFLFTPLNMVVQATGGPELASSVLSPLLNKDT IDFLNYTVNGDERQLWMSLGGTWMKARAEWPKEQFIPPYVPRFRNGWEPPMLNFMGATME QDLYQLAESVANVAEHQRKQEIKRLSTEHSSVSEYHPADGYAFSSNIYTRGSHLDQGEAA VAFKPTSNRHIDRNYEPLKTQPKKYAKSKYDFVARNNSELSVLKDDILEILDDRKQWWKV RNASGDSGFVPNNILDIVRPPESGLGRADPPYTHTIQKQRMEYGPRPADTPPAPSPPPTP APVPVPLPPSTPAPVPVSKVPANITRQNSSSSDSGGSIVRDSQRHKQLPVDRRKSQMEEV QDELIHRLTIGRSAAQKKFHVPRQNVPVINITYDSTPEDVKTWLQSKGFNPVTVNSLGVL NGAQLFSLNKDELRTVCPEGARVYSQITVQKAALEDSSGSSELQEIMRRRQEKISAAASD SGVESFDEGSSH >gi568815586r:15521320_15782951|GENSCAN_predicted_CDS_2|2739_bp atgcagaatgcctttgattcctttgatgccatcagtagattggagagggctaatgaaaga atcggctcaagtgattctcccacagcagccttccggatagctgaagctacaggggttttg catacttcgattgttcttaaccttgagagatactataaccactttacaaatgaagaaact aggctcatagaggctaaggcatgcaatgtcggtgtggcttgtttcttcagtgccccgctg ctcagacctctaggggagcatgcagactggcaggctaacacacaagtgaaagacacaatg aatggtcatatttctaatcatcccagtagttttggaatgtacccatctcagatgaatggc tacggatcatcacctaccttttcccagacggacagagaacatggttcaaaaacaagtgca aaggccctttatgaacaaaggaagaattatgcacgggacagtgtcagcagtgtgtcagat atatctcaataccgtgttgaacacttgactacctttgtcctggatcggaaagatgctatg atcactgttgatgatggaataaggaaattgaaattgcttgatgccaagggcaaagtgtgg actcaagatatgattcttcaagtggatgacagagctaatgaactggagaattttccttta aacacaatccagcactgccaagctgtgatgcattcatgcagctatgattcagttcttgca ctggtgtgcaaagagccaacccagaacaagccagatcttcatctcttccagtgtgatgag gttaaggcaaacctaattagtgaagatattgaaagtgcaatcagtgacagtaaaggaggg aaacagaagaggcggcccgacgccctgaggatgatttccaatgcagaccctagtataccg cctccacccagagctcctgcccctgcgccccctgggaccgtcacccaggtggatgttaga agtcgagtggcagcctggtctgcatgggcagccgaccaaggggactttgagaaaccaagg cagtatcatgagcaggaagaaacacctgagatgatggcagcccgcattgacagagatgtg caaatcttaaaccacattttggatgacattgaattttttatcacaaaactccaaaaagca gcagaagcattttctgagctttctaaaaggaagaaaaacaagaaaggtaaaaggaaagga ccaggagagggtgttttaacgctgcgggcaaaacctccacctcctgatgaatttcttgac tgtttccaaaagtttaaacacggatttaaccttctggccaaactgaagtctcatattcag aatcctagtgctgcagatttggttcactttttgtttactccattaaatatggtggtgcag gcaacaggaggtcctgaactagccagttcagtacttagtcccctattgaataaggacaca attgatttcttaaattatactgtcaatggtgatgaacggcagctgtggatgtcattggga ggaacttggatgaaagccagagcagagtggccaaaagaacagtttattccaccatatgtt ccacgattccgcaatggctgggagcccccaatgctgaactttatgggagccacaatggaa caagatctttatcaactggcagaatctgtggcaaatgtagcagaacatcagcgcaaacag gaaataaaaagattatccacagagcattccagtgtatcagagtatcatccagccgatggc tatgcgttcagtagcaacatttacacaagaggatcccacctggaccaaggggaagctgct gttgcttttaagccaacttctaatcgccatatagatagaaattatgaaccactcaaaaca caacccaagaaatatgccaaatccaagtatgactttgtagcaaggaacaacagtgagctc tcggttctaaaggatgatattttagagatacttgatgatcggaagcaatggtggaaagtt cgaaatgcaagtggagactctggatttgtgccaaataacattttggatattgtgagacct ccagaatctggattggggcgtgctgatccaccttatactcatactatacagaaacaaagg atggagtatggcccaagaccagctgatactccccctgctccatcacctcctccaacacca gctcctgttcctgttccccttcccccttccactccagcacctgttcctgtgtcaaaggtc ccagcaaatataacacgtcaaaacagcagctccagtgacagtggtggcagtatcgtgcga gacagccagagacacaaacaacttccggtggaccgaaggaaatctcagatggaggaagtg caagatgaactcatccacagactgaccattggtcggagtgccgctcagaagaaattccat gtgccacggcagaacgtgccagttatcaatatcacttacgactccacaccagaggatgtg aagacgtggttacagtcaaagggattcaaccctgtgactgtcaatagtcttggagtatta aatggtgcacaacttttctctctcaataaggatgaactgaggacagtctgccctgaaggg gcgagagtctatagccaaatcactgtacaaaaagctgcattggaggatagcagtggcagc tccgagttacaagaaattatgagaagacgacaggaaaaaatcagtgctgccgctagtgat tcaggagtggaatcttttgatgaaggaagcagtcactaa >gi568815586r:15521320_15782951|GENSCAN_predicted_peptide_3|189_aa MRHEKEIKGIQLGKQEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSEVSGYKINVQKS QAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQFTRDVKDLFKENYKPLLNEIKEDTNK WKNIPCSWIGRINIVKMAILPKLRWLLSKRQTITNADKDVEKREPSYTVGGDVNYCNRYA EQFGGASKN >gi568815586r:15521320_15782951|GENSCAN_predicted_CDS_3|570_bp atgaggcacgagaaagaaataaaaggtattcaactaggaaaacaggaagtcaaattgtcc ctgtttgcagatgacatgattgtctatctagaaaaccccatcgtctcagcccaaaatctc cttaagctgataagcaacttcagcgaagtctcaggatacaaaatcaatgtgcaaaaatca caagcattcctatacaccaataacagacaaacagagagtcaaatcatgagtgaactccca ttcacaattgcttcaaagagaataaaatacctaggaatccaatttacaagggatgtgaag gacctcttcaaggagaactacaaaccactactcaatgaaataaaagaggacacaaacaaa tggaagaacattccatgctcatggataggaagaatcaatatcgtgaaaatggccatactg cccaagttaagatggcttttatccaaaagacagacaataacaaatgccgacaaggatgta gagaaaagggaaccctcatacactgttggtggcgatgtaaattactgcaaccgctatgca gaacagtttggcggtgcctcaaaaaactaa