GENSCAN 1.0 Date run: 7-Nov-116 Time: 02:22:50 Sequence gi568815596r:162044026_162249178 : 205153 bp : 38.14% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 Intr - 1587 1507 81 0 0 110 93 53 0.024 6.92 1.03 Intr - 13417 13337 81 1 0 81 75 67 0.167 3.62 1.02 Intr - 29531 29330 202 1 1 5 71 284 0.062 16.77 1.01 Init - 29956 29805 152 1 2 85 -32 143 0.065 1.83 1.00 Prom - 32623 32584 40 -8.35 2.02 PlyA - 33295 33290 6 1.05 2.01 Sngl - 33640 33386 255 1 0 87 42 210 0.775 9.19 2.00 Prom - 33912 33873 40 -7.75 3.00 Prom + 34037 34076 40 -5.85 3.01 Init + 49293 49427 135 2 0 73 89 121 0.714 10.89 3.02 Term + 60150 60425 276 2 0 107 41 119 0.044 3.58 3.03 PlyA + 61731 61736 6 1.05 4.00 Prom + 66282 66321 40 -3.25 4.01 Init + 74523 74712 190 2 1 60 66 136 0.459 7.82 4.02 Intr + 76208 76254 47 0 2 102 94 8 0.082 0.01 4.03 Intr + 81893 81984 92 1 2 49 90 56 0.039 -0.23 4.04 Intr + 82828 82966 139 1 1 70 80 77 0.510 4.65 4.05 Term + 96916 97071 156 0 0 14 47 113 0.002 -3.25 4.06 PlyA + 98598 98603 6 1.05 5.21 PlyA - 98875 98870 6 1.05 5.20 Term - 100145 99998 148 1 1 29 42 164 0.292 2.29 5.19 Intr - 101652 101515 138 2 0 88 89 169 0.994 15.66 5.18 Intr - 103489 103328 162 0 0 64 110 106 0.985 8.57 5.17 Intr - 105162 105062 101 0 2 99 67 58 0.472 2.79 5.16 Intr - 125696 125529 168 2 0 63 63 106 0.457 4.82 5.15 Intr - 128859 128786 74 1 2 121 43 61 0.899 3.11 5.14 Intr - 129196 129124 73 1 1 112 77 55 0.969 4.86 5.13 Intr - 140589 140559 31 2 1 84 102 20 0.059 0.21 5.12 Intr - 144338 144144 195 1 0 90 99 189 0.387 17.81 5.11 Intr - 145147 145078 70 2 1 67 31 34 0.155 -7.08 5.10 Intr - 145729 145631 99 2 0 42 46 121 0.140 2.46 5.09 Intr - 150155 149993 163 2 1 96 9 93 0.127 0.83 5.08 Intr - 150723 150676 48 0 0 116 73 34 0.732 2.66 5.07 Intr - 154298 154154 145 1 1 50 97 74 0.521 3.86 5.06 Intr - 154856 154728 129 1 0 92 65 125 0.538 9.49 5.05 Intr - 156594 156541 54 2 0 74 91 58 0.386 1.88 5.04 Intr - 158917 158847 71 1 2 72 93 3 0.335 -3.74 5.03 Intr - 170048 169913 136 1 1 93 97 51 0.843 6.15 5.02 Intr - 171976 171873 104 1 2 84 63 85 0.957 3.65 5.01 Init - 174110 173961 150 2 0 52 95 139 0.907 11.09 5.00 Prom - 176988 176949 40 -6.85 6.00 Prom + 178381 178420 40 -5.35 6.01 Init + 183637 183694 58 0 1 67 46 85 0.735 3.72 6.02 Intr + 184506 184538 33 1 0 95 93 25 0.495 0.98 6.03 Intr + 191235 191356 122 1 2 105 88 91 0.345 10.09 6.04 Term + 196037 196084 48 1 0 56 42 67 0.109 -4.77 6.05 PlyA + 198469 198474 6 1.05 7.02 PlyA - 199613 199608 6 1.05 7.01 Term - 204793 204653 141 1 0 74 33 157 0.834 5.75 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 117343 117121 223 1 1 71 75 121 0.888 7.86 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:162044026_162249178|GENSCAN_predicted_peptide_1|172_aa MKVSAAPRTSPGRAVGTSSPNSPNGPKAGETGPGTPLRRKPQGHPLTGEGTPDPPTETGA TRFPLIGLDLLGLQTPWKVLLGLLGAAALVTIITVPVVLLNKGSKFETFEKDRTVGAQNL ARILQMRLLGNIAVEPFPRITEQGRDEFGHSINDYSISPDGQFILLEYNYVK >gi568815596r:162044026_162249178|GENSCAN_predicted_CDS_1|516_bp atgaaggtgagtgccgcgccacgtacgtccccggggcgagctgtggggacctcttcgcca aactccccaaatggcccaaaagctggcgaaaccggaccagggactccccttaggcggaag ccccaaggccaccctctgacaggtgaagggacccccgaccctcctacggaaactggggcc acacgcttccctctaattggacttgatctgctcggcttgcagacaccgtggaaggttctt ctgggactgctgggtgctgctgcgcttgtcaccatcatcaccgtgcccgtggttctgctg aacaaaggcagtaagtttgaaacatttgaaaaagataggacagttggagctcagaaccta gcaagaatcctgcagatgaggcttcttggaaatatagctgtagagcccttccccagaatt actgagcaaggaagggatgagtttggacattctatcaatgattattcaatatctcctgat gggcagtttattctcttagaatacaactacgtgaag >gi568815596r:162044026_162249178|GENSCAN_predicted_peptide_2|84_aa MVVLGWIPPALPLRRVWVQWTRSCSIFIEVETQKQRFQQLVHQMTELCWEKCMDKPGPKL DSRAEACFVNCIERFIDTSQFILN >gi568815596r:162044026_162249178|GENSCAN_predicted_CDS_2|255_bp atggttgtcctgggatggattcctcctgctcttcctctgcggcgggtttgggtgcagtgg acccgcagttgcagcattttcattgaggtagagactcaaaagcagcgcttccagcagctg gtgcaccagatgactgaactttgttgggagaagtgcatggacaagcctgggccaaagttg gacagtcgggctgaggcctgttttgtgaactgcattgagcgcttcattgatacaagccag ttcatcttgaattga >gi568815596r:162044026_162249178|GENSCAN_predicted_peptide_3|136_aa MTQVLKNMIHATKNTLQLSLKRKRLGVNIPVAQMAVHEWALVRRMVLTPALWTHFPDKLP TCKPLFQGLLSEIQSTLKSMELLVGIHIIGAQGTVLQLKASSQGKERAPGKLGWGSCDLK APVSVFTCIAYLSVSG >gi568815596r:162044026_162249178|GENSCAN_predicted_CDS_3|411_bp atgactcaagtgttgaaaaatatgatccacgcaactaaaaatacattgcaactgagtttg aagaggaaaagactgggtgtcaacattccagtggcccaaatggctgtgcatgaatgggct ttggttcgaaggatggtcctaacccctgcactctggactcatttcccagataaactaccc acatgcaagcctttatttcagggtcttctttctgagatccagagtacgctaaagtccatg gagctgctggtgggaattcatattataggtgcccaaggaacagtgctccaactcaaggcc agcagtcaggggaaagagagggcaccaggaaagctaggctggggctcctgtgaccttaag gccccagtctctgtgttcacctgtattgcatatctgtctgtttctggctga >gi568815596r:162044026_162249178|GENSCAN_predicted_peptide_4|207_aa MPVKVPMKERLICKHLPGLESPKPVQEGHFLALYHVSLLALALAQKLAGKLRGGHQKGGG KALAVTQRKPKTATYSGSQSEMHMNRYKDYLHFVDMGGPDTHKEINKCLRLSNAPPPQRC PCPGPQDCEYVTLPGKRDFADMSKDLEMGRVSRIIQVRAEDGAATLQSGVTMGGKPFVSD GRTERKRPRAIPDMFTFDILSSGLLIL >gi568815596r:162044026_162249178|GENSCAN_predicted_CDS_4|624_bp atgcccgtgaaagtgcccatgaaagaaagactcatctgcaaacatttgccaggcctggaa agcccgaagccagtacaagaaggacatttcctggccctttaccatgtctctcttctagct ctggccttagcccagaaactggctggcaagctgcggggagggcatcagaagggaggaggg aaagcactagctgtcacacagaggaaacccaagacagcaacatattctggaagccagagc gaaatgcacatgaatcgatataaagattatctgcattttgtggacatgggtggtccagac acccacaaggagataaataaatgtttgagactgagtaatgctcccccaccccaaagatgt ccgtgtcctggtccccaggactgtgaatatgttaccttgcctggcaaaagggactttgca gatatgagtaaggaccttgagatggggagagtatccagaattattcaggtaagagctgaa gatggagcagccacacttcaatctggagttaccatgggaggaaagccatttgttagtgat ggcagaacagaaagaaaaagaccacgagctatccctgacatgtttacctttgatattctt tcctctggacttctgattttgtga >gi568815596r:162044026_162249178|GENSCAN_predicted_peptide_5|752_aa MLATKYALWWSPNGKFLAYAEFNDTDIPVIAYSYYGDEQYPRTINIPYPKAGAKNPVVRI FIIDTTYPAYVGPQEVPVPAMIASSDYYFSWLTWVTDERVCLQWLKRVQNVSVLSICDFR EDWQTWDCPKENAIQITSGKWEAINIFRVTQDSLFYSSNEFEEYPGRRNIYRISIGSYPP SKKCVTCHLRKERCQYYTASFSDYAKYYALVCYGREISNSHIQGNEQRLGNPRCPRRRSI IPQTVVSMGIEDEPLSFLNLENIAQASPFPPFMMDALIKLHHIVATYERPEIWRLFPFST AIKTVLLYPVFVILVQNSSHLKDDQFCSQQSCPKIKILEENKELENALKNIQLPKEEIKK LEVDEITLWYKMILPPQFDRSKKYPLLIQVYGGPCSQSVRSVFAVNWISYLASKEGMVIA LVDGRGTAFQGDKLLYAVYRKLGVYEVEDQITAVRLLSVLQNQCRNSTVMARAEYFRNVD YLLIHGTADDNVHFQNSAQIAKALVNAQVDFQAMLSETIAPNSGHIAPPHTSGRDVLWWS GILDNTFPDSGLATAPLFFLHAMAVALRTRTAEMKSIYFVAGLFVMLVQGSWQRSLQDTE EKSRSFSASQADPLSDPDQMNEDKRHSQGTFTSDYSKYLDSRRAQDFVQWLMNTKRNRNN IAKRHDEFERHAEGTFTSDVSSYLEGQAAKEFIAWLVKGRGRRDFPEEVAIVEELGRRHA DGSFSDEMNTILDNLAARDFINWLIQTKITDR >gi568815596r:162044026_162249178|GENSCAN_predicted_CDS_5|2259_bp atgcttgctacaaaatatgctctctggtggtctcctaatggaaaatttttggcatatgcg gaatttaatgatacggatataccagttattgcctattcctattatggcgatgaacaatat cctagaacaataaatattccatacccaaaggctggagctaagaatcccgttgttcggata tttattatcgataccacttaccctgcgtatgtaggtccccaggaagtgcctgttccagca atgatagcctcaagtgattattatttcagttggctcacgtgggttactgatgaacgagta tgtttgcagtggctaaaaagagtccagaatgtttcggtcctgtctatatgtgacttcagg gaagactggcagacatgggattgtccaaaggaaaatgctattcaaattacaagtggcaag tgggaggccataaatatattcagagtaacacaggattcactgttttattctagcaatgaa tttgaagaataccctggaagaagaaacatctacagaattagcattggaagctatcctcca agcaagaagtgtgttacttgccatctaaggaaagaaaggtgccaatattacacagcaagt ttcagcgactacgccaagtactatgcacttgtctgctacggtagagaaatatcaaactct cacatccaggggaatgaacaacggcttggaaacccgagatgcccacgtcggaggtcaatc attcctcaaactgtagtttcaatggggatagaagatgaacctttgtcctttctaaatttg gaaaacatcgcccaggcatccccatttccacccttcatgatggacgcactgatcaagcta catcacattgttgccacgtatgaacgccctgaaatttggaggctctttccattttcaact gctatcaagacagtcctcctctatcctgtatttgtgatattggttcaaaatagttcacat ctcaaagacgaccagttttgctcgcagcagagttgtccaaaaattaaaatcctggaagaa aacaaggaattggaaaatgctttgaaaaatatccagctgcctaaagaggaaattaagaaa cttgaagtagatgaaattactttatggtacaagatgattcttcctcctcaatttgacaga tcaaagaagtatcccttgctaattcaagtgtatggtggtccctgcagtcagagtgtaagg tctgtatttgctgttaattggatatcttatcttgcaagtaaggaagggatggtcattgcc ttggtggatggtcgaggaacagctttccaaggtgacaaactcctctatgcagtgtatcga aagctgggtgtttatgaagttgaagaccagattacagctgtcagactcctcagtgtcctt cagaatcagtgtcggaattcaactgtgatggcaagagcagaatatttcagaaatgtagac tatcttctcatccacggaacagcagatgataatgtgcactttcaaaactcagcacagatt gctaaagctctggttaatgcacaagtggatttccaggcaatgctttctgaaaccattgcc cctaactcaggccacatagctccacctcacacatcaggaagggatgttctgtggtggtca ggaattctggacaacaccttcccagattcaggactggccacagcccctcttttcttcctc catgcaatggctgttgccctgaggaccagaacagcagaaatgaaaagcatttactttgtg gctggattatttgtaatgctggtacaaggcagctggcaacgttcccttcaagacacagag gagaaatccagatcattctcagcttcccaggcagacccactcagtgatcctgatcagatg aacgaggacaagcgccattcacagggcacattcaccagtgactacagcaagtatctggac tccaggcgtgcccaagattttgtgcagtggttgatgaataccaagaggaacaggaataac attgccaaacgtcacgatgaatttgagagacatgctgaagggacctttaccagtgatgta agttcttatttggaaggccaagctgccaaggaattcattgcttggctggtgaaaggccga ggaaggcgagatttcccagaagaggtcgccattgttgaagaacttggccgcagacatgct gatggttctttctctgatgagatgaacaccattcttgataatcttgccgccagggacttt ataaactggttgattcagaccaaaatcactgacaggtga >gi568815596r:162044026_162249178|GENSCAN_predicted_peptide_6|86_aa MTLPVKTTDDDLVGLQTPAATYSMAVRIVAVPGPMDGPRAEECQRMARDWRAAPPAAPVR DPLGEASWAPETKRGDACEEEPQKKK >gi568815596r:162044026_162249178|GENSCAN_predicted_CDS_6|261_bp atgactcttccagtaaagaccacagatgatgatctggttggactacagacaccggctgca acttattccatggctgtaagaatagtggctgtgcccggtcccatggatggcccaagggct gaggagtgccagcgcatggcgcgggactggcgggcagctccacctgcggctccggtgcga gatccactgggtgaggccagctgggctcctgagacgaagaggggtgatgcttgtgaagaa gaacctcaaaagaaaaagtga >gi568815596r:162044026_162249178|GENSCAN_predicted_peptide_7|46_aa IKNEVRLERSRYTVGGFKDKGQKAIKLEDWIDFDALTPIHAPRINA >gi568815596r:162044026_162249178|GENSCAN_predicted_CDS_7|141_bp attaagaatgaggtgagactggagagatcccggtatacagttggtgggtttaaggataaa ggtcagaaagccattaaactggaggactggattgactttgatgcactgactcccatacat gctcctaggataaatgcttag