GENSCAN 1.0    Date run: 3-Nov-116    Time: 08:23:45

Sequence gi568815596r:25828981_26081917 : 252937 bp : 45.03% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 Intr -  16583  16501   83  2  2   92  109    48 0.559   5.74
 1.01 Init -  27959  27723  237  2  0   58   85   357 0.831  30.41
 1.00 Prom -  46162  46123   40                              -2.96

 2.00 Prom +  56086  56125   40                              -4.86
 2.01 Init +  57216  57384  169  2  1   92  107   134 0.623  15.40
 2.02 Term +  60153  60196   44  1  2  133   36    28 0.597  -0.58
 2.03 PlyA +  60226  60231    6                               1.05

 3.13 PlyA -  60327  60322    6                               1.05
 3.12 Term -  82643  82540  104  0  2   75   37   158 0.315   7.84
 3.11 Intr -  86632  86496  137  0  2   67   15    48 0.222  -4.39
 3.10 Intr -  86949  86697  253  1  1   47   97   138 0.317   7.09
 3.09 Intr -  89020  88912  109  1  1   51   76    64 0.132   1.36
 3.08 Intr - 101083 100975  109  0  1   43   90   146 0.020  10.59
 3.07 Intr - 122925 122809  117  2  0   99   67   231 0.111  21.68
 3.06 Intr - 125405 125287  119  2  2   54   30   273 0.998  17.36
 3.05 Intr - 126683 126561  123  2  0   69  100   249 0.999  24.98
 3.04 Intr - 127464 127363  102  0  0  105   91   168 0.987  19.17
 3.03 Intr - 128697 128645   53  1  2   77   96    40 0.888   2.33
 3.02 Intr - 130841 130814   28  2  1   90   79    35 0.193   0.49
 3.01 Init - 144821 144735   87  2  0   92   42   151 0.448   9.54
 3.00 Prom - 145584 145545   40                              -3.26

 4.02 PlyA - 146353 146348    6                              -0.45
 4.01 Sngl - 152937 151321 1617  0  0   95   54  2718 0.956 263.64
 4.00 Prom - 163323 163284   40                              -8.56

 5.03 PlyA - 165153 165148    6                               1.05
 5.02 Term - 165859 165765   95  2  2  134   36   171 0.956  14.49
 5.01 Init - 180362 180329   34  2  1   99   62    14 0.120  -0.07
 5.00 Prom - 180577 180538   40                              -4.26

 6.02 PlyA - 180720 180715    6                               1.05
 6.01 Sngl - 199632 199225  408  0  0   87   38   415 0.927  30.69
 6.00 Prom - 199823 199784   40                              -9.16

 7.03 PlyA - 199948 199943    6                               1.05
 7.02 Term - 205762 205473  290  2  2   80   55   296 0.907  21.04
 7.01 Init - 225773 225704   70  2  1   70   66    43 0.294   1.51
 7.00 Prom - 231262 231223   40                              -1.96

Suboptimal exons with probability
> 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 122925 122805 121 2 1 99 47 247 0.886 19.65 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:25828981_26081917|GENSCAN_predicted_peptide_1|107_aa MRYRTLSQLPPRLVSIEDPFDQNDWATWTSFLSGVDIQIVGDDLTVTNPKRIAQSFEKKV CSCLLLKVNQIGSVTESIQVLEKYPNTPMSHKEILQVIQREGLKEIS >gi568815596r:25828981_26081917|GENSCAN_predicted_CDS_1|321_bp atgaggtataggacactgtcccagctgccacctagactcgtctccatcgaggaccccttt gaccagaatgactgggccacttggacctcgttcctctcaggggtggacatccagattgtg ggggatgacctgacagtcaccaaccccaagaggattgcccagtcctttgagaagaaggtc tgcagctgtctgctgctgaaggtcaaccagatcggctcggtgactgaatcgatccaggtc ttagaaaaataccccaatacacccatgagtcataaagaaattcttcaagttatccagaga gaaggactaaaagaaatcagn >gi568815596r:25828981_26081917|GENSCAN_predicted_peptide_2|70_aa MVAGMEATNGPNTVGSHSPLNFYLLLLLNVQSASKKDENQPPDLTPSLKYTNQPLGGVDP EGTSNELPTC >gi568815596r:25828981_26081917|GENSCAN_predicted_CDS_2|213_bp atggtggcaggaatggaggctacaaatggacccaacaccgtgggttcccactcaccactg aacttttacctactactgctactgaatgtccaatctgccagcaagaaagatgaaaaccaa ccccctgatttgacaccatccctcaagtacaccaatcagccacttggaggtgttgatcct gaaggcacctctaatgaacttcccacatgctag >gi568815596r:25828981_26081917|GENSCAN_predicted_peptide_3|446_aa MGLPILLLIALFLCPGLLLVTREYELWKQVPQMIGPHDSGLLLPKSDSTPCFEIPQAMES KLLIGGRNIMDHTNEQQKMLELKRQEIAEQKRREREMQQEMMLRDEETMELRGTYTSLQQ EVEVKTKKLKKLYAKLQAVKAEIQDQHDEYIRVRQDLEEAQNEQTRELKLKYLIIENFIP PEEKNKIMNRLFLDCEEEQWKFQPLVPAGVSSSQMKKRPTSAVGYKRPISQYARVAMAMG SHPRYRPQLYALALFLFTVHALYSLGSGLFAKSKRLSQEQSQDPGAQLASPSGSPTGAAG GAACQSRAVCLHSSALGWSMGLGAVEQEAALIGEARAAQEPTEWRGGSGMVGCRSRALPG GKAAKARASRPAALSAGSAGATPTRNSRWPASTAYSPGSRPRLSLHTSPQAEVYVYRYRY LYLYLSHAEPKLDCTAAISAHCNLPA >gi568815596r:25828981_26081917|GENSCAN_predicted_CDS_3|1341_bp atgggcctccctatcctactgctgattgcactgtttttgtgtcctggattgttgctggta accagggagtacgaattatggaagcaggttccccagatgatcgggccccatgacagtggt 
ctcctgctccccaagtctgacagcaccccctgctttgagatccctcaggccatggagagc aagctcctcatcgggggcaggaacatcatggatcacaccaacgaacagcagaagatgttg gaactgaagaggcaggagattgccgagcagaaacgtcgtgagcgggagatgcagcaggag atgatgctccgggacgaggagactatggagctccggggcacctacacatccctgcagcag gaggtggaggtcaaaaccaagaaactcaagaagctctacgccaagctgcaggcggtgaag gcggagatccaggaccagcatgatgagtatatccgcgtgcggcaggacctggaggaggcg cagaacgagcagacccgcgaactcaagctcaagtacctaatcatcgagaacttcatcccg ccggaggagaagaacaagatcatgaaccggcttttcctggactgtgaggaggagcagtgg aagttccagccactggtgccagccggcgtcagtagcagccagatgaagaagcggccaaca tctgcagtgggctacaagaggcctatcagccagtatgctcgggttgccatggcaatgggg tcccaccccaggtacaggccacagctgtatgcactggccctgttcctgttcacagtgcat gctctgtattccttaggctctggtctctttgcaaagtccaagcgtctcagccaggagcag agccaagacccaggagcccagctggcttcacccagtggatcccccacgggggctgcaggt ggagctgcctgccagtcccgcgccgtgtgcctgcactcctcagcccttggatggtcgatg ggactgggcgccgtggagcaggaggcggcgctcatcggggaggctcgggccgcacaggag cccacggagtggcggggaggctcaggcatggtgggctgcaggtcccgagccctgcccggc gggaaggcagctaaggcccgggccagccggccggctgctctgagtgcagggtccgccggg gccacgcccacccggaactcgcgctggcccgcaagcaccgcgtacagccccggttcccgc ccgcgcctctccctccacacctccccgcaagctgaggtctacgtctaccgctaccgctac ctctacctctacctctcccatgccgagccgaagctggactgtactgctgccatctcggct cactgcaacctccctgcctga >gi568815596r:25828981_26081917|GENSCAN_predicted_peptide_4|538_aa MASKTKASEALKVVARCRPLSRKEEAAGHEQILTMDVKLGQVTLRNPRAAPGELPKTFTF DAVYDASSKQADLYDETVRPLIDSVLQGFNGTVFAYGQTGTGKTYTMQGTWVEPELRGVI PNAFEHIFTHISRSQNQQYLVRASYLEIYQEEIRDLLSKEPGKRLELKENPETGVYIKDL SSFVTKNVKEIEHVMNLGNQTRAVGSTHMNEVSSRSHAIFIITVECSERGSDGQDHIRVG KLNLVDLAGSERQNKAGPNTAGGAATPSSGGGGGGGGSGGGAGGERPKEASKINLSLSAL GNVIAALAGNRSTHIPYRDSKLTRLLQDSLGGNAKTIMVATLGPASHSYDESLSTLRFAN RAKNIKNKPRVNEDPKDTLLREFQEEIARLKAQLEKRGMLGKRPRRKSSRRKKAVSAPPG YPEGPVIEAWVAEEEDDNNNNHRPPQPILESALEKNMENYLQEQKERLEEEKAAIQDDRS LVSEEKQKLLEEKEKMLEDLRREQQATELLAAKYKVRAPEELGTRDVPAGDPGKEALL >gi568815596r:25828981_26081917|GENSCAN_predicted_CDS_4|1617_bp atggccagtaagaccaaggccagcgaggccctcaaggtggtggcccggtgccgccccctc 
agcaggaaggaggaggctgctggtcacgagcagatcctgaccatggacgtgaaactgggc caggtgaccctgcggaacccccgcgccgccccgggggagctgcccaagaccttcaccttt gacgccgtgtatgatgccagctccaagcaggccgacctgtatgacgaaaccgtgaggccc ctgatagactccgtgctccagggtttcaatggcacggtgtttgcctatggccagacgggc actggcaagacctataccatgcaggggacctgggtggagcccgagctgcgcggggtcatc ccgaatgcctttgagcacatcttcacccacatctcccgctcccagaaccaacagtacctg gtccgggcctcctatttggagatctaccaggaagagattcgagacctgctctccaaggag ccgggcaagaggctagagctgaaagagaaccccgagactggcgtctacatcaaggacctc tcctccttcgtcaccaagaatgtcaaggagattgagcatgtgatgaacctggggaaccag acccgggctgtgggcagcacccacatgaatgaggtcagctcccgctcccatgccatcttc atcatcactgtggagtgcagcgaacgtggctctgatggccaggaccacatccgagtgggc aagctcaacctcgtggacctggctggcagcgagaggcagaacaaggcaggccccaacaca gcgggaggggcagccacaccatcctcgggtggcggtggtggcggtggaggcagtggtggt ggtgctggtggagagaggcctaaggaagcctccaaaatcaacctctcattatctgccctg ggcaacgtgattgctgccctggcgggcaacaggagcacccacattccctaccgggactcc aagctgacccggctgctccaggactccctgggggggaatgccaagaccatcatggtagcc acactggggccagcttctcacagctacgatgagagcctctccaccttgcgctttgccaac cgagccaagaacatcaagaacaagccccgggtgaacgaggaccccaaggacacactgctg cgggaattccaagaggagattgcccgcctgaaggcccagctggagaagagggggatgctg gggaagcggccccggaggaagagcagccgcaggaagaaggccgtgtccgccccgcctggg taccctgagggcccagtgattgaggcctgggtggcagaagaggaggatgacaacaacaac aaccaccgcccgccccagcccatcctggagtcagccttggagaagaacatggagaattac ctgcaggaacagaaggagcggctggaggaggagaaggcagccatccaggatgaccgcagc ctggtgagcgaggagaagcagaagctgctggaggagaaggagaagatgctggaggacctg cggcgggaacagcaggccacagagctgcttgcggccaagtacaaggtaagggccccagag gagctgggcactcgagatgtccccgcaggggatcccggcaaggaggctcttctttga >gi568815596r:25828981_26081917|GENSCAN_predicted_peptide_5|42_aa MPVMVRDKKNFGSPDSVSIFRSSSSPMSGYGSGFNTSSNCGT >gi568815596r:25828981_26081917|GENSCAN_predicted_CDS_5|129_bp atgcctgtcatggtcagggataaaaagaactttggatctccagactcggtcagcatcttt cgctcgtcctccagtcctatgtctggctacggttctggattcaacacgagcagcaactgc ggcacctaa >gi568815596r:25828981_26081917|GENSCAN_predicted_peptide_6|135_aa 
MKLLTHNLLSSHVRGMGSRGFPLCLQATEVRICPVEFNPQFMWHHTWHVSYVVHMIPKVE WSAFLEAADSLRLIQMPKGPVEGYEENEEFLRTMHHLLLEVEVIEGTLQCPESGRMFPIS RGIPNMLLSEEETES >gi568815596r:25828981_26081917|GENSCAN_predicted_CDS_6|408_bp atgaaactgctcacccacaatctgctgagctcgcatgtgcgggggatggggtcccgtggc ttccccctgtgcctccaggccaccgaggtccgtatctgccctgtggagttcaacccccaa tttatgtggcatcatacgtggcacgtatcatacgtggtacatatgatacctaaggtggag tggtcggcgttcctggaggcggccgatagcttgcgcctgatccagatgcctaaagggcca gttgagggatatgaggagaatgaagagtttctgaggaccatgcaccacctgctgctggag gtggaagtgatagagggcaccctgcagtgcccggaatctggacgtatgttccccatcagc cgcgggatccccaacatgctgctgagtgaagaggaaactgagagttga >gi568815596r:25828981_26081917|GENSCAN_predicted_peptide_7|119_aa MLTTGESGRSKGIHRNECSNFPAGLTYGGNKGSIEGIIRKTKKDAGLPHSGIPDQEQLEK QVVRLLRHWEERLGLSPPAAPRDRSLSLRPTPHSGVLRPEDRGSVGKRLETARATYEQL >gi568815596r:25828981_26081917|GENSCAN_predicted_CDS_7|360_bp atgctgacaactggtgaatctggaagatctaaggggattcatcgtaatgagtgttctaac tttcctgcaggtcttacctatggtggaaataaaggtagtattgaaggcatcatccgaaaa acgaaaaaggacgcaggtcttccccactccggaatccccgatcaggagcagcttgaaaag caggtcgtacgtcttcttcgccattgggaggagcggctcgggctctcgccgccggcggcc ccaagggatcggtccctctcactacggccaactcctcactcgggcgttctcaggccggag gaccggggaagcgtgggaaaaaggctcgagacggcgcgggcgacctacgaacagctttga