GENSCAN 1.0 Date run: 3-Nov-116 Time: 14:23:25 Sequence gi568815588r:43286640_43487884 : 201245 bp : 48.44% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 4952 4991 40 -2.46 1.01 Init + 7547 7751 205 1 1 93 72 74 0.733 5.31 1.02 Intr + 8287 8474 188 2 2 92 12 111 0.770 3.31 1.03 Intr + 9080 9120 41 1 2 115 31 48 0.019 -1.18 1.04 Intr + 26116 26236 121 1 1 148 86 -13 0.031 5.00 1.05 Intr + 28124 28197 74 1 2 40 44 68 0.025 -4.20 1.06 Intr + 30974 31035 62 2 2 102 65 52 0.316 2.68 1.07 Intr + 31822 31956 135 2 0 63 86 56 0.424 3.44 1.08 Intr + 32964 33018 55 1 1 101 37 96 0.356 3.84 1.09 Intr + 46184 46314 131 2 2 89 37 71 0.273 2.44 1.10 Intr + 46552 46581 30 2 0 108 62 47 0.181 2.20 1.11 Intr + 51473 51505 33 0 0 76 70 50 0.067 0.19 1.12 Intr + 56790 56879 90 1 0 84 62 39 0.078 0.87 1.13 Term + 75980 76044 65 0 2 110 48 32 0.034 -0.55 1.14 PlyA + 77675 77680 6 1.05 2.00 Prom + 83923 83962 40 -4.36 2.01 Init + 84941 85030 90 1 0 87 99 22 0.486 2.85 2.02 Intr + 86088 86133 46 2 1 95 111 23 0.377 3.28 2.03 Intr + 87831 87863 33 1 0 95 97 34 0.900 3.19 2.04 Intr + 88860 88934 75 1 0 72 61 89 0.903 4.09 2.05 Intr + 89056 89095 40 2 1 123 99 49 0.949 6.78 2.06 Intr + 89393 89430 38 2 2 70 94 41 0.695 0.81 2.07 Intr + 97381 97464 84 2 0 45 101 79 0.569 4.69 2.08 Term + 98394 98521 128 1 2 31 45 124 0.609 0.84 2.09 PlyA + 98704 98709 6 1.05 3.02 PlyA - 99229 99224 6 1.05 3.01 Sngl - 101242 99998 1245 1 0 68 29 1283 0.614 116.24 3.00 Prom - 117092 117053 40 -1.76 4.04 PlyA - 119147 119142 6 1.05 4.03 Term - 120984 120517 468 0 0 39 48 195 0.841 5.57 4.02 Intr - 122708 122492 217 1 1 133 81 117 0.624 14.01 4.01 Init - 137007 136979 29 0 2 67 79 -6 0.034 -4.30 4.00 Prom - 143168 143129 40 -2.96 5.00 Prom + 147406 147445 40 -6.16 5.01 Init + 150195 150496 302 2 2 103 81 162 0.075 13.93 5.02 Intr + 184571 184745 175 2 1 91 8 107 0.013 2.94 5.03 Intr + 189082 189208 127 0 1 27 82 143 0.977 7.85 5.04 Intr + 189468 189563 96 1 0 94 95 42 0.953 5.48 5.05 Intr + 194790 195210 421 1 1 77 -11 172 0.479 -0.80 5.06 Intr + 195745 196028 284 1 2 89 56 234 0.496 17.36 5.07 Term + 196242 196543 302 1 2 30 43 236 0.001 8.68 5.08 PlyA + 200389 200394 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 169214 168977 238 1 1 50 44 181 0.881 5.54 S.002 Term + 196095 196543 449 1 2 11 43 349 0.979 18.08 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:43286640_43487884|GENSCAN_predicted_peptide_1|409_aa MPYSPLDPGNTCEQQPLGLDSRSCPSCGRLAHSPKVPDPRGRDRNGEVWVCPATRIGTSF LILETGSGGRAGLQGTLASSPAGLSPEIPPEDCSNPELLLAWPHSSVEPLLSSSQGSEEG PKRRVDSQPWREGVNPYIRRPRETKMDPRGQTLLRRMVAEGQGRDVWETMKSGLGSFNVG LVRIRTLEEAAQIRPGASHDQTGGRRPGNCDGCSLRQCHLLFMQHVAAPSDTKLRKAPRL AAWLYTQTILVLHDSQIPHLTARDGSHMRQALEITGLIIIAKEKPHVFKKSLWWQWSMKA STTNKATTAQCLCNTQIKPPPPFSMSSQCSDIAKSNAGLLDFRQKLPVSRSTLRETPKGQ CCGGVAKSPNQQVGTGNNTSNKNQHRTAGPKAPCAKGPVPSLPYTRESA >gi568815588r:43286640_43487884|GENSCAN_predicted_CDS_1|1230_bp atgccctacagtcctctggatccagggaacacctgtgagcaacagcctctgggcttggac agccgatcctgcccctcatgtggacggctggcccactcccccaaagtccccgatcctaga gggagagacagaaatggggaggtctgggtctgtccagcaaccaggataggcacctccttc ttgatcctggagacaggctcagggggtagagctggactccagggcactctggcttcatcc cctgctgggctcagccctgagatccccccggaggactgcagcaaccctgagttgctgctt gcctggccccacagctctgtggagccgctcctctcctcctcacagggctctgaggaggga cctaaaaggcgagtggacagccagccatggagggaaggtgttaatccatacatccggaga cccagagaaaccaagatggacccgagagggcagacgctcctccgccgcatggtggcagaa gggcagggccgagatgtctgggagactatgaagtctggtttagggtctttcaatgtgggg ctggttaggattaggaccctagaggaggcagcgcagatcagacctggtgcctcccacgat cagactggcggccggcggcctggcaactgtgatggctgctccctccgccaatgtcacctg ctgttcatgcagcacgtggccgcaccttctgacactaaactcaggaaggcaccaagactg gcggcatggctttacacccaaaccatcttggtgcttcatgacagccagatccctcacctc actgcccgagatggcagccacatgagacaggccctggaaataacaggcctgatcatcatc gccaaggagaaacctcatgtttttaagaagtccctgtggtggcagtggtccatgaaggcc agcaccacaaataaagctacaactgctcagtgtttatgtaacacccagataaaaccaccc ccacccttctccatgtcatctcaatgtagtgacatagccaaaagtaacgcaggactgctg gatttccggcaaaaactcccagtctctcgttccaccttacgagaaacacccaaaggacag tgctgtggaggggtcgctaagagtccgaatcagcaggtgggcactggtaataataccagt aataagaaccaacacaggacagcagggcccaaagccccctgcgctaagggccctgtccca tctctgccctacacgcgggagtcagcctaa >gi568815588r:43286640_43487884|GENSCAN_predicted_peptide_2|177_aa MEPWAWLQGLKSRPTCPAASSDPFSALPAQDTGEGAVRNLQSHTVGLTALEANDPFDWKN LQLSGLICGGLLAIAGIAAVLSGKCKCKSSQKQHSPVPEKAIPLITPGRFLTLAKSNKPL SPSTFVLVFGISYTSVFRVPLSASLYPAIPGDAAALTSGHPSMQNISMQNTGTKGCT >gi568815588r:43286640_43487884|GENSCAN_predicted_CDS_2|534_bp atggagccctgggcgtggctgcagggtttaaagagccgacccacgtgcccagcagcctcc tcagatccgttctctgcgctgccagctcaggacactggtgaaggagcagtgaggaacctg cagagtcacacagttggcctgactgccttggaagccaatgacccatttgactggaaaaac ctgcagctgagcggactgatctgcggagggctcctggccattgctgggatcgcggcagtt ctgagtggcaaatgcaaatgcaagagcagccagaagcagcacagtcctgtacctgagaag gccatcccactcatcactccaggcagatttctcaccttggccaaatcaaataaaccttta tctccaagcacctttgtcttggtgtttggcatcagctacacatcagtcttccgagtgcct ctttctgcgtccctgtaccctgccattcctggtgatgctgctgccctcacatcaggccat ccaagcatgcagaacataagcatgcagaacactggaacgaagggctgtacctaa >gi568815588r:43286640_43487884|GENSCAN_predicted_peptide_3|414_aa MLGPEGGEGFVVKLRGLPWSCSVEDVQNFLSDCTIHDGAAGVHFIYTREGRQSGEAFVEL GSEDDVKMALKKDRESMGHRYIEVFKSHRTEMDWVLKHSGPNSADSANDGFVRLRGLPFG CTKEEIVQFFSGLEIVPNGITLPVDPEGKITGEAFVQFASQELAEKALGKHKERIGHRYI EVFKSSQEEVRSYSDPPLKFMSVQRPGPYDRPGTARRYIGIVKQAGLERMRPGAYSTGYG GYEEYSGLSDGYGFTTDLFGRDLSYCLSGMYDHRYGDSEFTVQSTTGHCVHMRGLPYKAT ENDIYNFFSPLNPVRVHIEIGPDGRVTGEADVEFATHEEAVAAMSKDRANMQHRYIELFL NSTTGASNGAYSSQVMQGMGVSAAQATYSGLESQSVSGCYGAGYSGQNSMGGYD >gi568815588r:43286640_43487884|GENSCAN_predicted_CDS_3|1245_bp atgctgggccctgagggaggtgaaggctttgtggtcaagctccgtggcctgccctggtcc tgctctgttgaggacgtgcagaacttcctctctgactgcacgattcatgatggggccgca ggtgtccatttcatctacactagagagggcaggcagagtggtgaggcttttgttgaactt ggatcagaagatgatgtaaaaatggccctgaaaaaagacagggaaagcatgggacaccgg tacattgaggtgttcaagtcccacagaaccgagatggattgggtgttgaagcacagtggt cccaacagtgccgacagcgccaacgatggcttcgtgcggcttcgaggactcccatttgga tgcacaaaggaagaaattgttcagttcttctcagggttggaaattgtgccaaacgggatc acattgcctgtggaccccgaaggcaagattacaggggaagcgttcgtgcagtttgcctcg caggagttagctgagaaggctctagggaaacacaaggagaggatagggcacaggtacatt gaggtgtttaagagcagccaggaggaagttaggtcatactcagatccccctctgaagttc atgtccgtgcagcggccagggccctatgaccggcccgggactgccaggaggtacattggc atcgtgaagcaggcaggcctggaaaggatgaggcctggtgcctacagcacaggctacggg ggctacgaggagtacagtggcctcagtgatggctacggcttcaccaccgacctgttcggg agagacctcagctactgtctctccggaatgtatgaccacagatacggcgacagtgagttc acagtgcagagcaccacaggccactgtgtccacatgaggggcctgccgtacaaagcgacc gagaacgacatttacaacttcttctctcctctcaaccctgtgagagtccatattgagatt ggcccagatggaagagtgacgggtgaagcagatgttgagtttgctactcatgaagaagct gtggcagctatgtccaaagacagggccaatatgcagcacagatatatagaactcttcttg aattcaacaacaggggccagcaatggggcgtatagcagccaggtgatgcaaggcatgggg gtgtctgctgcccaggccacttacagtggcctggagagccagtcagtgagtggctgttac ggggccggctacagtgggcagaacagcatgggtggctatgactag >gi568815588r:43286640_43487884|GENSCAN_predicted_peptide_4|237_aa MKAVRMSKARNVLACPPPPPPPPAVSALLRAQPANRRLSSGASQCAVAAPPPDAGRRNSA LPRSLRPSGSVVEAGLRRVFLKYFRRFRPARRKLVDGARLLLRRLPGSPNRRVVDQGFEE QGTRLALPPRPPRAREPLPLGAWDPRSAPRVSRRRVEGEPSSAIARGRPGDELTRLFSSR NCSRALGARAGHLRGGTRGARWRLRAKGNGRRRRLPGTARGGPCAGVRATASGGREF >gi568815588r:43286640_43487884|GENSCAN_predicted_CDS_4|714_bp atgaaggcagttagaatgtctaaagcaagaaacgtcctcgcttgtccgcctccaccgcct cccccgccagccgtgtcggctctgctccgcgcccagccggccaaccggcggctcagctct ggcgcgtcacaatgcgccgtcgcggccccgcccccggacgccggacgcagaaactccgcc ctgccgcggtccttgcgtccttccggctccgtcgtggaagcaggactgcgccgcgtcttc ctcaagtattttcgtcgattccgccctgctcgtaggaaacttgtggacggggcccggctc ctgctgcgccggctgcccggctcccccaaccgcagagtcgtggaccagggttttgaggag caaggcacccgactcgccctccccccaaggccgccccgggcgcgggaacccctcccccta ggcgcctgggacccccgctcggctccgcgtgtgtcccggcggagggtggaaggagagccg agctcggccattgcccgcgggcggcccggggacgagctgacgcgcctctttagttctcga aactgctcgcgggcgctgggggctcgggccggccatctgcgtggcgggacgcgaggcgcg cgctggcggctgcgcgcgaagggcaacgggcggcggcggcggcttcccggaaccgcgcgg ggcgggccctgcgccggggtcagggccacagcctcaggcggcagggagttctga >gi568815588r:43286640_43487884|GENSCAN_predicted_peptide_5|568_aa MASRPRPRTPSRGPSDLRFRGEAGLRRVFLKKAGVRVRPADKRAAGSRVGCPWHRAEPPL GTREQQGFRKRRERWTGGRPGFAQAPPLGGPAQGALRQFPWMEHVSECGIWLAAPGTGRS KLRVQNGQPDPTLARTLGWRCQGEGAHYPKTLEGVLRCSGSVSFSDVAVGFTQEEWQHLD SAQRTPYRDMMLENYSLLLSVGYCITKPEVVCKLEHGQVLWILEEESPSQSHLDCCIDDD LMEKRQENQDQHLQKVDFVNNKTLTMDRNGVLGKTFSLDTNPILSRKIRGNCDSSGMNLN NISELIISNRSSFVRNPAECNVRGKFLLCMKRENPYARGKPLEYDGNGKAVSQNEDLFRH QYIQTLKQCFEYNHRGYTEERNPMNALNVGKLLDIGHALQYIKEHTRDKTYECNECGKNF CEKSNLHVHQRTHTGEKPYGCNECQKAFGDRSALKVHQRIHTGEKPYELHQRTHTGEKPY ACSECGKTFYQKSSLTTHQRTHTREQPYEYNESFYQNPNFTKCQRDNIEETLVNILKAQK PSPSWTRSIAETTQVGVRTPNKDEKSFA >gi568815588r:43286640_43487884|GENSCAN_predicted_CDS_5|1707_bp atggcgtcgcggccccgcccccggacgcccagccgcggtcccagcgaccttcgctttcgt ggggaagccggactgcgccgtgtcttcctgaagaaggctggggtaagagtccggccagcg gacaagagggcagctggtagcagggtgggatgcccgtggcatagggccgagcccccgcta ggcacgcgggagcagcaaggcttccggaagcgcagggagcgctggactggcgggcgcccg ggattcgcgcaggccccgcccctcggcggccccgcccagggagcgctgcggcagtttcca tggatggagcacgtgagtgagtgcgggatctggctggctgctccgggcactggcaggagc aagctccgtgtccagaatggccagccagatcccacacttgctcgcacacttggatggcgg tgccaaggcgagggtgcccactaccccaaaaccctagagggagtgttacggtgctctgga tcagtgtccttcagtgatgtggctgtgggcttcacccaggaggagtggcagcatctggac tctgctcagaggaccccgtacagagacatgatgctggagaactacagcctcctcctctca gtgggatattgcattaccaaaccagaggtggtttgcaagttggagcatggacaggtgctg tggatattagaggaagagtccccaagtcagagccacctagactgctgcatagatgatgac ctgatggagaagagacaggaaaatcaagaccagcatttgcagaaagttgattttgtcaac aataaaacactgactatggacagaaatggtgtattaggaaaaacattttctcttgacaca aaccccattctatcaagaaaaatacgtggcaactgtgactcatctggaatgaatttgaat aatatttcagaattaattattagtaatagaagctcctttgtaaggaaccctgctgagtgt aatgtacgtgggaaatttctcctctgtatgaagcgtgagaatccttatgccagagggaaa cctttggaatatgatggaaatgggaaagccgtctctcagaatgaggacttatttaggcat cagtatattcaaactcttaagcagtgttttgaatacaatcacagaggatacacagaggag agaaaccctatgaatgcactgaatgtgggaaaacttttggatataggtcatgccttgcag tacatcaaagaacacacacgggacaaaacctatgaatgtaatgaatgtggaaaaaacttc tgtgagaagtcaaatcttcatgtacatcagagaacacacacaggagagaaaccctatgga tgtaatgaatgtcagaaagcctttggtgataggtcagctctaaaagtacatcagagaata catactggcgagaaaccctatgagttacatcagagaacccacacaggagagaaaccctat gcatgtagtgaatgtgggaaaaccttctaccagaagtcatccctcacaacacatcagaga acacacacaagggagcaaccctatgaatataatgaaagcttttaccagaatcccaacttc actaaatgtcagagagacaacatagaggaaacccttgtcaacatcctgaaggctcagaaa ccttcaccttcttggactcgttccatagcagaaacaacccaggttggggtgagaacaccc aataaagatgagaaatcttttgcctag