GENSCAN 1.0 Date run: 4-Nov-116 Time: 22:45:24 Sequence gi568815578r:5015286_5219798 : 204513 bp : 45.21% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 68 63 6 1.05 1.02 Term - 12973 12879 95 2 2 85 49 62 0.863 -0.01 1.01 Init - 14509 14452 58 1 1 86 58 98 0.880 6.07 1.00 Prom - 42710 42671 40 -4.26 2.00 Prom + 45659 45698 40 -3.76 2.01 Init + 53164 53217 54 0 0 86 70 85 0.771 7.78 2.02 Intr + 55766 55849 84 1 0 29 58 106 0.335 1.72 2.03 Intr + 68589 68672 84 2 0 94 95 -10 0.024 0.32 2.04 Intr + 79591 79649 59 0 2 61 101 23 0.050 -1.42 2.05 Term + 81948 82086 139 0 1 69 38 123 0.282 2.84 2.06 PlyA + 83908 83913 6 1.05 3.07 PlyA - 84588 84583 6 1.05 3.06 Term - 85646 85506 141 2 0 91 42 260 0.999 19.63 3.05 Intr - 91025 90903 123 2 0 94 86 105 0.978 11.68 3.04 Intr - 94160 94047 114 2 0 46 99 138 0.651 11.34 3.03 Intr - 97266 97167 100 2 1 101 17 37 0.582 -1.99 3.02 Intr - 97655 97458 198 1 0 41 57 111 0.551 1.77 3.01 Init - 97743 97676 68 0 2 49 107 97 0.986 6.34 3.00 Prom - 98411 98372 40 -8.76 4.07 PlyA - 99695 99690 6 1.05 4.06 Term - 100077 99998 80 1 2 112 48 80 0.910 4.33 4.05 Intr - 100287 100164 124 0 1 110 91 59 0.996 8.56 4.04 Intr - 102379 102185 195 1 0 72 87 145 0.999 12.41 4.03 Intr - 103392 103325 68 1 2 64 81 68 0.990 2.32 4.02 Intr - 103581 103484 98 2 2 93 108 89 0.994 11.05 4.01 Init - 104513 104293 221 2 2 90 99 417 0.987 41.10 4.00 Prom - 117223 117184 40 -3.26 5.00 Prom + 127527 127566 40 -3.46 5.01 Init + 151123 151125 3 0 0 71 101 0 0.132 -0.40 5.02 Intr + 154932 155099 168 2 0 72 43 80 0.185 2.04 5.03 Intr + 158238 158374 137 2 2 78 86 87 0.645 6.87 5.04 Intr + 159898 159994 97 1 1 79 97 16 0.666 1.61 5.05 Intr + 161363 161460 98 1 2 106 107 102 0.988 12.71 5.06 Intr + 163532 163671 140 2 2 111 57 45 0.900 3.81 5.07 Intr + 167102 167160 59 0 2 77 58 55 0.842 0.00 5.08 Intr + 167776 167858 83 0 2 87 85 35 0.899 1.54 5.09 Intr + 169573 169660 88 1 1 99 94 45 0.957 6.17 5.10 Intr + 170473 170541 69 0 0 95 82 76 0.992 7.08 5.11 Intr + 171402 171554 153 2 0 93 71 269 0.997 25.97 5.12 Intr + 173782 173901 120 0 0 87 19 161 0.994 9.79 5.13 Intr + 174450 174553 104 2 2 101 80 166 0.995 16.07 5.14 Term + 174817 174949 133 1 1 78 50 219 0.997 14.66 5.15 PlyA + 175099 175104 6 -0.45 6.03 PlyA - 175577 175572 6 1.05 6.02 Term - 176547 176435 113 1 2 48 49 109 0.650 1.72 6.01 Init - 179296 179215 82 1 1 96 110 13 0.546 5.43 6.00 Prom - 179540 179501 40 -5.56 7.03 PlyA - 179564 179559 6 1.05 7.02 Term - 181346 181241 106 1 1 85 44 107 0.928 3.88 7.01 Init - 184518 184349 170 0 2 59 110 66 0.537 4.91 7.00 Prom - 184603 184564 40 -2.46 8.00 Prom + 184897 184936 40 -6.46 8.01 Sngl + 185265 185918 654 2 0 43 48 243 0.979 11.98 8.02 PlyA + 185950 185955 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:5015286_5219798|GENSCAN_predicted_peptide_1|50_aa MGFLHAGLAGLAGLELLTSDTATIRFLNLFPTFPAFLFHKAAIVILARSQ >gi568815578r:5015286_5219798|GENSCAN_predicted_CDS_1|153_bp atggggtttctccatgctggtctggctggtctggctggtctcgaactcctgacctcagac acggcaaccatccgatttctcaatcttttccccacctttcccgcctttctattccacaaa gccgccattgtcatcctggcccgttctcaatga >gi568815578r:5015286_5219798|GENSCAN_predicted_peptide_2|139_aa MVEGKGEAGTFFTAHQDRVNGNGVVVRGYSILKSQLIMSLDAPEMMVQGIHVQVCYVGKL CIVGVWYTDYFVTQLQGQLVLLPRFLMVTSYFLITSQFLAEQKCLWYDHVKLNQETHQSG NGSEISIICRTAEATGEGF >gi568815578r:5015286_5219798|GENSCAN_predicted_CDS_2|420_bp atggtggaaggcaaaggagaagcaggcaccttcttcacagcgcaccaggacagagtgaat ggcaatggagtagttgtccgagggtactcaatactcaagagtcagctgatcatgtcctta gatgcaccagaaatgatggttcagggaatacatgtgcaagtttgttacgtgggtaaatta tgtatcgtgggggtttggtatactgactatttcgtcacccagctacagggacagctggtc ttactgcccaggttcctcatggtcacctcctatttcctaatcacctcacaattcctggca gaacagaagtgcctctggtatgaccacgtgaagctcaaccaagagacccaccagagtggc aacggatctgaaatttccatcatatgcagaacagctgaagcaactggagagggtttttag >gi568815578r:5015286_5219798|GENSCAN_predicted_peptide_3|247_aa MQPWALPTVGELWVCGRPGAALRAGTEPSSRALGVSETALPAEIKLRVIRVGHSSPLAQL ASFQKPVLFVLRLNSRRFLFLSLARSEDGILFAKSKHSSPLSLTPLRCIVLMRMYEQLMS GDLCQRVMMPSRTNLATGIPSSKVKYSRLSSTDDGYIDLQFKKTPPKIPYKAIALATVLF LIGAFLIIIGSLLLSGYISKGGADRAVPVLIIGILVFLPGFYHLRIAYYASKGYRGYSYD DIPDFDD >gi568815578r:5015286_5219798|GENSCAN_predicted_CDS_3|744_bp atgcaaccttgggcgctgccaaccgtgggcgagctctgggtgtgcgggcggcctggcgcg gcgctccgggcggggacagaaccgtcctctcgggctctgggcgtgtccgagaccgcgctc cccgccgaaatcaagctccgagtcatccgtgtggggcattcgtcccccctggcacagttg gcctctttccagaagcccgttttgtttgttttacgtctaaattcgcgtcggttcttattt ctctccctggcaaggtctgaagacggaatcctctttgccaagagtaaacattcgtctccc ctctccctcacgcccctgcgatgtattgttttgatgcggatgtatgaacagttaatgtca ggggacctgtgtcagcgtgttatgatgccgtcccgtaccaacctggctactggaatcccc agtagtaaagtgaaatattcaaggctctccagcacagacgatggctacattgaccttcag tttaagaaaacccctcctaagatcccttataaggccatcgcacttgccactgtgctgttt ttgattggcgcctttctcattattataggctccctcctgctgtcaggctacatcagcaaa gggggggcagaccgggccgttccagtgctgatcattggcattctggtgttcctacccgga ttttaccacctgcgcatcgcttactatgcatccaaaggctaccgtggttactcctatgat gacattccagactttgatgactag >gi568815578r:5015286_5219798|GENSCAN_predicted_peptide_4|261_aa MFEARLVQGSILKKVLEALKDLINEACWDISSSGVNLQSMDSSHVSLVQLTLRSEGFDTY RCDRNLAMGVNLTSMSKILKCAGNEDIITLRAEDNADTLALVFEAPNQEKVSDYEMKLMD LDVEQLGIPEQEYSCVVKMPSGEFARICRDLSHIGDAVVISCAKDGVKFSASGELGNGNI KLSQTSNVDKEEEAVTIEMNEPVQLTFALRYLNFFTKATPLSSTVTLSMSADVPLVVEYK IADMGHLKYYLAPKIEDEEGS >gi568815578r:5015286_5219798|GENSCAN_predicted_CDS_4|786_bp atgttcgaggcgcgcctggtccagggctccatcctcaagaaggtgttggaggcactcaag gacctcatcaacgaggcctgctgggatattagctccagcggtgtaaacctgcagagcatg gactcgtcccacgtctctttggtgcagctcaccctgcggtctgagggcttcgacacctac cgctgcgaccgcaacctggccatgggcgtgaacctcaccagtatgtccaaaatactaaaa tgcgccggcaatgaagatatcattacactaagggccgaagataacgcggataccttggcg ctagtatttgaagcaccaaaccaggagaaagtttcagactatgaaatgaagttgatggat ttagatgttgaacaacttggaattccagaacaggagtacagctgtgtagtaaagatgcct tctggtgaatttgcacgtatatgccgagatctcagccatattggagatgctgttgtaatt tcctgtgcaaaagacggagtgaaattttctgcaagtggagaacttggaaatggaaacatt aaattgtcacagacaagtaatgtcgataaagaggaggaagctgttaccatagagatgaat gaaccagttcaactaacttttgcactgaggtacctgaacttctttacaaaagccactcca ctctcttcaacggtgacactcagtatgtctgcagatgtaccccttgttgtagagtataaa attgcggatatgggacacttaaaatactacttggctcccaagatcgaggatgaagaagga tcttag >gi568815578r:5015286_5219798|GENSCAN_predicted_peptide_5|483_aa MKVFLDISPPRCAQRSKLMNVFSMYHLSCGKEHGFQVPACFQTLCGSCRWHSASFSGESE SEAKVDGETASDSESRAESAPLPVSADDTPEVLNRALSNLSSRWKNWWVRGILTLAMIAF FFIIIYLGPMVLMIIVMCVQIKCFHEIITIGYNVYHSYDLPWFRTLSWYFLLCVNYFFYG ETVTDYFFTLVQREEPLRILSKYHRFISFTLYLIGFCMFVLSLVKKHYRLQFYMFGWTHV TLLIVVTQSHLVIHNLFEGMIWFIVPISCVICNDIMAYMFGFFFGRTPLIKLSPKKTWEG FIGGFFATVVFGLLLSYVMSGYRCFVCPVEYNNDTNSFTVDCEPSDLFRLQEYNIPGVIQ SVIGWKTVRMYPFQIHSIALSTFASLIGPFGGFFASGFKRAFKIKDFANTIPGHGGIMDR FDCQYLMATFVNVYIASFIRGPNPSKLIQQFLTLRPDQQLHIFNTLRSHLIDKGMLTSTT EDE >gi568815578r:5015286_5219798|GENSCAN_predicted_CDS_5|1452_bp atgaaagtttttcttgacatctccccacctcgctgtgcccagaggagcaaactgatgaat gtgttcagcatgtaccacctttcttgtgggaaggagcatgggttccaggtccctgcctgc ttccagaccctctgtggaagctgccgctggcactcagcttccttttccggggagtcagag tcagaagcaaaggtagatggagagactgcatcggacagtgagagccgggcagaatccgca cccctgccagtctctgcagatgataccccggaggtcctcaatagggccctttccaacttg tcttcaagatggaagaactggtgggtgagaggcatcctgactttggccatgattgcattt ttcttcatcatcatttacctgggaccaatggttttgatgataatcgtgatgtgcgttcag attaagtgtttccatgagataatcactattggctacaacgtctaccactcatatgatctg ccctggttcaggacgctcagctggtactttctcctgtgtgtaaactatttcttctatggt gagacagtgacggattacttcttcaccctggtccagagagaagagcctttgcggattctc agtaaataccaccggttcatttcctttactctctatctaataggattctgcatgtttgta ctgagtctggtcaagaagcattatcgactgcagttctacatgtttggctggacccatgtg acattgctgattgttgtaacacagtcacatcttgttatccacaacctatttgaaggaatg atctggttcattgtccccatatcttgtgtgatctgtaatgacatcatggcctatatgttt ggctttttctttggtcggaccccactcatcaagctgtccccgaagaagacctgggaaggc ttcattgggggcttctttgctactgtggtgtttggccttctgctgtcctatgtgatgtcc gggtacagatgctttgtctgccctgtggagtacaacaatgacaccaacagcttcactgtg gactgtgagccctcggacctgtttcgcctgcaggagtacaacattcctggggtgatccag tcagtcattggctggaaaacggtccggatgtaccccttccagattcacagcatcgctctc tccacctttgcctcgctcattggcccctttggaggattcttcgcaagtggattcaaacga gcctttaaaatcaaagactttgccaataccattcctggccatggaggcatcatggatcgc tttgactgccagtatctgatggccacctttgtcaatgtatacatcgccagttttatcaga ggccctaacccaagcaaactgattcagcagttcctgactttacggccagatcagcagctc cacatcttcaacacgctgcggtctcatctgatcgacaaagggatgctgacatccaccaca gaggacgagtag >gi568815578r:5015286_5219798|GENSCAN_predicted_peptide_6|64_aa MVPIQTGDSLNQETAPDENSLHPPERPGALDLNKRSHEAEMEIKVQRKEAASAPHTYGKS LKDV >gi568815578r:5015286_5219798|GENSCAN_predicted_CDS_6|195_bp atggtccctatacaaacaggtgacagccttaatcaagagacagccccagatgagaacagc ctccatccacctgagaggccaggggccttggacctgaataagagaagccacgaggctgaa atggaaataaaagttcagcggaaagaagccgcttcagccccacacacatatgggaagtca ctcaaggatgtgtga >gi568815578r:5015286_5219798|GENSCAN_predicted_peptide_7|91_aa MIDWIKKMWHIYTVEYHAAIKKDEFMSFVGTWMKLETIILSKLSQEQKTKHRMFSRRKAS IQEPSAGVKKRHPNCEDRGGYVDDYRQTQFH >gi568815578r:5015286_5219798|GENSCAN_predicted_CDS_7|276_bp atgatagactggattaagaaaatgtggcacatatacaccgtggaataccacgcagccata aaaaaggatgagttcatgtcctttgtagggacatggatgaagctggaaaccatcattctc agcaaactatcgcaagaacaaaaaaccaaacaccgcatgttctcacgcagaaaagccagc atacaagaacctagcgcaggtgtcaaaaaaaggcatccaaactgcgaggacagaggtggc tatgtggatgattatcggcaaacacagtttcactag >gi568815578r:5015286_5219798|GENSCAN_predicted_peptide_8|217_aa MNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGSFNIRKSINVIQYINRTNEKNHM IISIDAEKAFDKIQQCFMLKTLNKLGIDGMYLKIIRAIYDKPTASIILNGQKLEAFPLKT GTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLRKEEVKLSLFADDMIVYLENPIVSA QNLLKLIGNFSKVSGYKINVQKSQAFLYTNNRQTAKS >gi568815578r:5015286_5219798|GENSCAN_predicted_CDS_8|654_bp atgaacatcgatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatc aaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctcattcaac atacgcaaatcaataaatgtaatccagtatataaacagaaccaacgaaaaaaaccatatg attatctcaatagatgcagaaaaggcctttgacaaaattcaacaatgcttcatgctaaaa acgctcaataaattaggtattgatgggatgtacctcaaaataataagagctatctatgac aaacccacagccagtatcatactgaatgggcaaaaactggaagcattccctttgaaaact ggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagttctg gccagggcaatcaggcaggagaaagaaataaagggtattcagttgagaaaagaggaagtc aaattgtccctctttgcagatgacatgattgtgtatctagaaaaccccatcgtctcagcc caaaatctccttaagctgataggcaacttcagcaaagtctcaggatacaaaatcaatgtg caaaaatcacaagcattcttatacaccaataacagacaaacagccaaatcatga