GENSCAN 1.0 Date run: 6-Nov-116 Time: 18:12:18

Sequence gi568815596r:174648127_174864394 : 216268 bp : 42.30% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   5070   5147   78  2  0   25  113   115 0.832   8.81
 1.02 Intr +  11838  12005  168  2  0   37   62   113 0.058   2.82
 1.03 Intr +  20602  20671   70  0  1   96   49    39 0.013  -1.36
 1.04 Intr +  24857  25000  144  0  0   57   45   150 0.215   6.93
 1.05 Term +  29584  29702  119  2  2   44   37   119 0.265   0.12
 1.06 PlyA +  29872  29877    6                               1.05

 2.00 Prom +  32403  32442   40                              -6.15
 2.01 Sngl +  34966  35550  585  0  0   81   54   308 0.929  22.63
 2.02 PlyA +  36242  36247    6                               1.05

 3.00 Prom +  36657  36696   40                              -6.05
 3.01 Init +  41555  41736  182  1  2   82   20   143 0.177   5.60
 3.02 Intr +  48155  48262  108  2  0   66   70    95 0.316   3.98
 3.03 Term +  56242  56395  154  1  1  116   43   102 0.300   4.91
 3.04 PlyA +  57575  57580    6                               1.05

 4.02 PlyA -  60248  60243    6                               1.05
 4.01 Sngl -  61991  61602  390  2  0   59   37   273 0.706  13.28
 4.00 Prom -  67723  67684   40                              -6.85

 5.00 Prom +  68146  68185   40                              -4.15
 5.01 Sngl +  71782  72192  411  0  0   79   48   475 0.997  38.64
 5.02 PlyA +  72225  72230    6                              -0.45

 6.00 Prom +  72474  72513   40                              -5.35
 6.01 Init +  76400  76562  163  1  1   72    8    92 0.039  -0.40
 6.02 Intr +  77927  78010   84  0  0  123   62    55 0.070   5.27
 6.03 Intr +  81595  81814  220  2  1  115   28   198 0.692  12.94
 6.04 Intr +  82128  82270  143  0  2   29   42   145 0.582   3.08
 6.05 Term +  85106  85230  125  0  2   86   49   146 0.954   8.07
 6.06 PlyA +  87476  87481    6                               1.05

 7.03 PlyA -  88772  88767    6                               1.05
 7.02 Term -  89026  88935   92  2  2   51   48    71 0.042  -3.70
 7.01 Init -  96652  96484  169  1  1   64  100   137 0.473  12.25
 7.00 Prom -  98610  98571   40                              -3.35

 8.10 PlyA -  99495  99490    6                               1.05
 8.09 Term - 100129  99998  132  1  0   99   53   167 0.999  11.41
 8.08 Intr - 100693 100454  240  1  0   70   75   197 0.998  13.42
 8.07 Intr - 102043 101820  224  2  2  101  113   207 0.507  21.42
 8.06 Intr - 105614 105377  238  2  1   89   94   280 0.602  24.96
 8.05 Intr - 106288 106093  196  0  1   58   94   240 0.989  20.10
 8.04 Intr - 109549 109440  110  1  2   95   63    69 0.983   3.26
 8.03 Intr - 111249 111205   45  0  0  121   93    56 0.988   7.09
 8.02 Intr - 111553 111362  192  1  0   20   69   249 0.493  14.97
 8.01 Init - 122759 122706   54  2  0   69   86    89 0.374   6.13
 8.00 Prom - 122875 122836   40                              -7.25

 9.06 PlyA - 123111 123106    6                               1.05
 9.05 Term - 123745 123563  183  1  0   28   42   173 0.389   3.26
 9.04 Intr - 124853 124685  169  1  1   84   32   182 0.735  11.23
 9.03 Intr - 128140 127979  162  0  0   61   94    97 0.167   5.77
 9.02 Intr - 134878 134709  170  1  2   65   84    86 0.062   3.72
 9.01 Init - 142480 142298  183  1  0   61   56   129 0.204   5.07
 9.00 Prom - 145061 145022   40                              -5.75

10.08 PlyA - 148731 148726    6                               1.05
10.07 Term - 152228 151990  239  0  2   57   32   188 0.568   5.45
10.06 Intr - 160916 160779  138  0  0   42  100    88 0.919   4.91
10.05 Intr - 163462 163385   78  2  0   50   87   117 0.932   6.40
10.04 Intr - 164356 164183  174  2  0   92   83   153 0.795  14.19
10.03 Intr - 168089 168006   84  0  0   51   85    75 0.531   2.37
10.02 Intr - 168311 168254   58  2  1   90   55    90 0.673   3.44
10.01 Init - 172568 172482   87  2  0   91   56    55 0.667   3.30
10.00 Prom - 173509 173470   40                              -6.25

11.04 PlyA - 173746 173741    6                              -1.75
11.03 Term - 174801 174662  140  1  2   64   49   130 0.915   3.84
11.02 Intr - 176392 176308   85  1  1  161  110    10 0.965   8.97
11.01 Init - 199005 198754  252  0  0   67  116   103 0.600   8.39

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 18677 18502 176 2 2 52 96 185 0.886 14.57 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:174648127_174864394|GENSCAN_predicted_peptide_1|192_aa MSAPNTYGVNHSMLEYGALDERGSPKCALASHQYLDVDGNLYSTLCAFLGETERSRVELL PKSLARWFGTRRDVITVFKYLQWISGPNPEACMQQTEGHEQHWVSGLMDIPNQGQCEGGQ EDISLQASAYGGLKKMQVFTNSSAKTTVCRKRKASNDDQSEDSYSVQQTNEEKGQCGEEE WQCNSIRKESAV >gi568815596r:174648127_174864394|GENSCAN_predicted_CDS_1|579_bp atgtctgcccccaacacctatggtgttaaccactccatgttagaatatggtgctctggat gaacgtggctcccctaagtgcgcccttgcctctcaccagtacctggatgtggatggtaat ttatacagtactctttgtgcttttctgggagaaacagagagatccagggtagaactgctc cccaagagcttggcaaggtggtttggaaccaggagagatgtgataactgtcttcaaatat ttacagtggatcagtggccctaacccagaagcatgcatgcaacaaacagagggacatgaa caacactgggtcagtggattgatggacatcccaaaccagggacagtgtgaaggtgggcaa gaggacatcagccttcaggcatcagcatatgggggcctcaaaaagatgcaagtttttaca aattccagtgccaaaaccacagtatgccgcaagagaaaagcaagcaatgatgatcaatct gaagattcttatagcgttcaacaaacaaatgaagagaaggggcagtgtggagaggaggaa tggcaatgcaacagcatacgaaaggaaagtgcagtttag >gi568815596r:174648127_174864394|GENSCAN_predicted_peptide_2|194_aa MVGAGVRSFLSPAPLRGEGTEARGQEFAEGRELEGESSPGPFPPTLESSPPSCGCWRGDR VGVFPLFEPEGTPFPTLSRLLSKKECPTFRYLKFKLIFKNGFKIGVNPTEGQGPHASDSC PLCGDNVLLFPRKDAASPHKSVGIESFCCQWGWIYHQPPTSLNTVGSTANMWRKPKLYSI ALANSCSSGGSDQL >gi568815596r:174648127_174864394|GENSCAN_predicted_CDS_2|585_bp atggtcggggctggagtccgcagtttcctcagtcccgctcccctgcgcggcgaggggacc gaggcaagaggacaggagtttgcagaggggcgggaactggaaggggagtctagcccaggg cctttccccccaactcttgaatcttctccaccttcttgcggctgctggcggggggatcgg gtgggggtgttccctctgttcgagccggaaggaacgccttttcccacgctttcccgtctg ctctcaaaaaaagagtgtccaactttcagatatctgaaattcaaattaatttttaaaaat ggatttaaaatcggggtgaaccccacagagggccaaggtccacacgcatctgactcctgc ccactatgtggggacaacgtgctgctcttccctcggaaagatgctgcctctcctcacaag tcggttggaattgagtcgttttgctgccagtggggatggatttatcatcagccacccact tccttaaatacggtaggatccacagcaaacatgtggagaaaaccgaagctctactccatt 
gccctagctaactcctgctcatcagggggttctgaccagctctga >gi568815596r:174648127_174864394|GENSCAN_predicted_peptide_3|147_aa MVTQWNEGKRGKYPDFIPLLPARPSCELTQEPEDRKPTTLSMQISLPGLEQVENGDSGLE GPGLGHCGPLTALHLGAIVQQEPDCRIHTLAPSPGARKISVEFKYLMPHLLIDPKSYFAG KKDEEGDIYVESSTSKFVFMAEFGDQL >gi568815596r:174648127_174864394|GENSCAN_predicted_CDS_3|444_bp atggtgacccagtggaatgagggaaaaaggggtaaataccctgactttattcctctcctg ccagctcgcccatcatgtgagctcactcaggagccagaggacaggaaacccacaacgctg tccatgcagatcagcctccctggcctggaacaggtggagaatggtgacagtggcctggag ggcccaggcttgggccactgtgggcccctcacagccttacaccttggggccatagttcag caggagccagactgccgaattcacaccctggctccctcacctggagcaagaaaaatcagc gttgaatttaaatatcttatgccacatctattaatagaccctaaatcatattttgcagga aagaaggatgaagagggagatatttatgttgagagttcaacctctaaatttgtcttcatg gctgaatttggggatcagttataa >gi568815596r:174648127_174864394|GENSCAN_predicted_peptide_4|129_aa MTACWQPSLALGASSALGPILAELEEPFSLPLHPGSPSLGWLRPEPAPSACREVWRERHE WEPGLHPALAGQLEFWAGVGLTGPAVGAASQPRAVRGLAPGPAPAEGALGPPAALAHQRC ARFRFLAGP >gi568815596r:174648127_174864394|GENSCAN_predicted_CDS_4|390_bp atgacagcgtgctggcagccctcgcttgctctcggtgcctcctcggccttggggcccatt ctggccgagcttgaggagcccttcagcctgccgctgcaccctgggagcccttctctgggc tggctgaggccagagccggctccctcggcttgcagggaggtgtggagggagaggcacgag tgggaaccggggctgcaccccgcgcttgcgggccagctggagttctgggcgggcgtgggc ttgacgggacccgcagttggagcagccagccagccccgggcagtgaggggcttagcacct gggccagcccctgcggagggtgccctgggtcccccagcagccctggcccaccagcgctgc gctcgatttcgatttctcgcagggccttag >gi568815596r:174648127_174864394|GENSCAN_predicted_peptide_5|136_aa MARTKQTARKSTGGKAPRKQLATKAARKSAPSTGGVKKPHRYRPGTVALREIRRYQKSTE LLIRKLPFQRLVREIAQDFKTDLRFQSAAIGALQEASEAYLVGLFEDTNLCAIHAKRVTI MPKDIQLARRIRGERA >gi568815596r:174648127_174864394|GENSCAN_predicted_CDS_5|411_bp atggctcgtacaaagcagactgcccgcaaatcgaccggtggtaaagcacccaggaagcaa ctggctacaaaagccgctcgcaagagtgcgccctctactggaggggtgaagaaacctcat cgttacaggcctggtactgtggcgctccgtgaaattagacgttatcagaagtccactgaa cttctgattcgcaaacttcccttccagcgtctggtgcgagaaattgctcaggactttaaa 
acagatctgcgcttccagagcgcagctatcggtgctttgcaggaggcaagtgaggcctat ctggttggcctttttgaagacaccaacctgtgtgctatccatgccaaacgtgtaacaatt atgccaaaagacatccagctagcacgccgcatacgtggagaacgtgcttaa >gi568815596r:174648127_174864394|GENSCAN_predicted_peptide_6|244_aa MMSFDSMSHIQVTLMQEVGSHSLGQLHPCGFAGYSPFWLLSWAGVECLWLFQVHAGKTLG NTFLSLIVLGSEAKRETLDGVTGGVFGGCSPFHSKHAWEATLPKAFGQSSVNTASQVVRP AGTEPSDPGGESGMDEFVAVAAGGVGRSARFARVGVPPSPELSLTRISYAFKRVWSGEAV HLPHSSRAGKPGGEGLWRAGAPGTSRENVYYSPDRTQHKHEPSRLFPQVLCLLVEAFLSL SQTV >gi568815596r:174648127_174864394|GENSCAN_predicted_CDS_6|735_bp atgatgtcctttgactccatgtctcacatccaggtcacactgatgcaagaggtgggttcc catagtcttgggcagctccacccctgtggctttgcagggtacagccccttctggctgctt tcatgggctggcgttgagtgtctgtggcttttccaggtgcacgctgggaaaacccttgga aatacattcttgagccttattgtgttgggctcagaggctaagagggagaccctggatgga gtgacaggtggtgtctttggtggctgttcacccttccacagtaaacatgcttgggaagca actttgccaaaagccttcggacagtccagcgtcaacaccgcaagccaagtggtaaggccc gcggggacagaaccgtcggaccctggtggcgaatctgggatggatgaattcgttgctgtt gcggcgggcggtgttggcaggagcgctagatttgctcgggttggagtccctcccagcccg gagctgagtctgacgaggatttcttacgcgttcaagcgagtgtggagcggcgaggcagtg cacttgccccatagctcccgcgctgggaaacccggaggggaggggctgtggagggcggga gccccgggcacctctcgtgaaaatgtctattactccccagaccgcactcagcacaagcac gaaccttcccgtttattcccacaagtgctgtgcctgctggtggaagcatttctctccctt tctcagacagtttag >gi568815596r:174648127_174864394|GENSCAN_predicted_peptide_7|86_aa MSPWPHVFGYEVSSLVRVNAVWNIMMMDKASISPQMVVLAEALHAGKANPYTYVYSIVSV SISPILNIDIHQNPPTSGETAKPIPV >gi568815596r:174648127_174864394|GENSCAN_predicted_CDS_7|261_bp atgagcccatggccacatgtctttggctacgaagtgagttccctggtcagagtcaatgct gtatggaatatcatgatgatggataaggcatctataagtccacagatggtagttttggca gaagcattgcatgcaggaaaggcaaatccatatacttatgtctattccatagtgtcagtg tcaatctctcctatcctgaacattgacattcaccaaaatcccccaacatctggggaaaca gcaaagcccattccagtttga >gi568815596r:174648127_174864394|GENSCAN_predicted_peptide_8|476_aa MDRARWLAPVISALWEAKGGEKNLTVLVSSAVSAGLVLGSEHETRLVAKLFKDYSSVVRP VEDHRQVVEVTVGLQLIQLINVDEVNQIVTTNVRLKQQWVDYNLKWNPDDYGGVKKIHIP 
SEKIWRPDLVLYNNADGDFAIVKFTKVLLQYTGHITWTPPAIFKSYCEIIVTHFPFDEQN CSMKLGTWTYDGSVVAINPESDQPDLSNFMESGEWVIKESRGWKHSVTYSCCPDTPYLDI TYHFVMQRLPLYFIVNVIIPCLLFSFLTGLVFYLPTDSGEKMTLSISVLLSLTVFLLVIV ELIPSTSSAVPLIGKYMLFTMVFVIASIIITVIVINTHHRSPSTHVMPNWVRKVFIDTIP NIMFFSTMKRPSREKQDKKIFTEDIDISDISGKPGPPPMGFHSPLIKHPEVKSAIEGIKY IAETMKSDQESNNAAAEWKYVAMVMDHILLGVFMLVCIIGTLAVFAGRLIELNQQG >gi568815596r:174648127_174864394|GENSCAN_predicted_CDS_8|1431_bp atggaccgggcgcggtggctcgcgcctgtaatctcggcactttgggaggccaagggtggg gagaagaatctgacagtgttggtgtcatctgctgtctcagctggcctcgtcctgggctcc gaacatgagacccgtctggtggcaaagctatttaaagactacagcagcgtggtgcggcca gtggaagaccaccgccaggtcgtggaggtcaccgtgggcctgcagctgatacagctcatc aatgtggatgaagtaaatcagatcgtgacaaccaatgtgcgtctgaaacagcaatgggtg gattacaacctaaaatggaatccagatgactatggcggtgtgaaaaaaattcacattcct tcagaaaagatctggcgcccagaccttgttctctataacaatgcagatggtgactttgct attgtcaagttcaccaaagtgctcctgcagtacactggccacatcacgtggacacctcca gccatctttaaaagctactgtgagatcatcgtcacccactttccctttgatgaacagaac tgcagcatgaagctgggcacctggacctacgacggctctgtcgtggccatcaacccggaa agcgaccagccagacctgagcaacttcatggagagcggggagtgggtgatcaaggagtcc cggggctggaagcactccgtgacctattcctgctgccccgacaccccctacctggacatc acctaccacttcgtcatgcagcgcctgcccctctacttcatcgtcaacgtcatcatcccc tgcctgctcttctccttcttaactggcctggtattctacctgcccacagactcaggggag aagatgactctgagcatctctgtcttactgtctttgactgtgttccttctggtcatcgtg gagctgatcccctccacgtccagtgctgtgcccttgattggaaaatacatgctgttcacc atggtgttcgtcattgcctccatcatcatcactgtcatcgtcatcaacacacaccaccgc tcacccagcacccatgtcatgcccaactgggtgcggaaggtttttatcgacactatccca aatatcatgtttttctccacaatgaaaagaccatccagagaaaagcaagacaaaaagatt tttacagaagacattgatatctctgacatttctggaaagccagggcctccacccatgggc ttccactctcccctgatcaaacaccccgaggtgaaaagtgccatcgagggcatcaagtac atcgcagagaccatgaagtcagaccaggagtctaacaatgcggcggcagagtggaagtac gttgcaatggtgatggaccacatactcctcggagtcttcatgcttgtttgcatcatcgga accctagccgtgtttgcaggtcgactcattgaattaaatcagcaaggatga >gi568815596r:174648127_174864394|GENSCAN_predicted_peptide_9|288_aa 
MRKLASLGLGPPKYQNRGLALISTSPAKLSLGSAGEGDYKDDGLPATPTGKTSDGIPVFS MNALASVITPPSPPSTQGSGGDQTCCPDPGSCERSLGFGQQYPKLSNFFPMLGRGEALKI KLYIVHWLNSSIVLAVDHIKTGTGNVDQSQGQKLSHRTRTGPEQDCVTTLGREYEKLAME EFCHQSISCIDQVVGPQSCRGQGLASVRVDSAWSCSHSPRLLLPGVHWGLLGWLAGVSKT QALGAQQERHLIRREPLALGHRFNNHLLPPAGHWTKALHAGNRSLAKS >gi568815596r:174648127_174864394|GENSCAN_predicted_CDS_9|867_bp atgagaaagctggccagtctggggctggggcccccgaaatatcaaaatcgagggcttgct ttgatatctacaagccctgcaaagctttccttggggtcagcaggggagggtgactacaag gatgatggccttccggccactcccacaggaaagacctcagatgggatccccgtgttcagc atgaatgcattggcctctgtcatcacaccaccttctccgccatcaacacagggatcggga ggagaccagacatgttgtccagacccaggctcttgtgaaaggtctttgggttttggacag cagtatccaaagctttccaacttcttccctatgctagggagaggcgaggctctaaagatc aaactttatattgttcattggttgaactcgtctatagtcctggcagtggaccacataaag acaggaactggaaatgtggaccaaagtcaagggcagaaactctctcacaggactaggact ggacctgagcaagactgtgtcaccacactgggcagagaatatgagaagctggccatggaa gaattctgccatcaaagcatttcctgcattgatcaagtggttggaccccagagctgccgt ggccaaggcctggcttcggttcgtgttgactctgcgtggtcctgcagccacagtccacgc ttacttctgccaggggtccactggggcctgctgggctggcttgctggagtttccaaaaca caggccctgggggcacagcaggagagacacctcatccggagagagcctctagcgctcggc caccgctttaataaccatttgttgccacctgctggccactggaccaaggcactccatgct ggaaacaggagtttggctaagagttag >gi568815596r:174648127_174864394|GENSCAN_predicted_peptide_10|285_aa MRTTEGAAGTILRMLIITLHGLFKTTQCKVNRDPSATPSQWMLQQQSLVEAATSQPKSMG RAVSSSLTCSNEAAETDCGLNVHKQCSKMVPNDCKPDLKHVKKVYSCDLTTLVKAHTTKR PMVVDMCIREIESRGLNSEGLYRVSGFSDLIEDVKMAFDRDGEKADISVNMYEDINIITG ALKLYFRDLPIPLITYDAYPKFIESAMFKHNEEHMAYTKSFIPVLAFFRVTLHEKENLMN AENLGIVFGPTLMRSPELDAMAALNDIRYQRLVVELLIKNEDILF >gi568815596r:174648127_174864394|GENSCAN_predicted_CDS_10|858_bp atgaggacaactgagggagcagcagggaccatcctgagaatgctaatcattacattacac gggctattcaaaaccacgcagtgcaaggtaaacagagacccttctgctacaccttctcag tggatgctgcagcagcagagtctagtagaagcagcaacctctcagcccaagtctatgggg agagctgtatcttcatccctgacctgcagtaatgaggcagctgagactgattgtggtttg aatgttcataagcagtgttccaagatggtcccaaatgactgtaagccagacttgaagcat 
gtcaaaaaggtgtacagctgtgaccttacgacgctcgtgaaagcacataccactaagcgg ccaatggtggtagacatgtgcatcagggagattgagtctagaggtcttaattctgaagga ctataccgagtatcaggatttagtgacctaattgaagatgtcaagatggctttcgacaga gatggtgagaaggcagatatttctgtgaacatgtatgaagatatcaacattatcactggt gcacttaaactgtacttcagggatttgccaattccactcattacatatgatgcctaccct aagtttatagaatctgccatgttcaagcacaatgaagaacatatggcttacacaaagtca tttattcctgtacttgcatttttcagagtgaccctccacgaaaaggagaatcttatgaat gcagagaaccttggaatcgtctttggacccacccttatgagatctccagaactagacgcc atggctgcattgaatgatatacggtatcagagactggtggtggagctgcttatcaaaaac gaagacattttattttaa >gi568815596r:174648127_174864394|GENSCAN_predicted_peptide_11|158_aa MPSKESWSGRKTNRAAVHKSKQEGRQQDLLIAALGMKLGSPKSSVTIWQPLKLFAYSQLT SLVRRATLKENEQIPKYEKIHNFKVHTFRGPHWCEYCANFMWGLIAQGVKCAGSGIRILH VWTWSCAKVFVRERLSSESKLRVNEESLAVAGGYPELG >gi568815596r:174648127_174864394|GENSCAN_predicted_CDS_11|477_bp atgccatccaaagagtcttggtcagggaggaaaactaatagggctgcagttcacaaatca aaacaagagggccgtcagcaagatttattgatagcagccttgggaatgaaactgggttct ccaaagtcgtctgtgacaatctggcaacctctgaaactctttgcttattcgcagttgaca tcacttgttagaagagcaactctgaaagaaaacgagcaaattccaaaatatgaaaagatt cacaatttcaaggtgcatacattcagagggccacactggtgtgaatactgtgccaacttt atgtggggtctcattgctcagggagtgaaatgtgcaggaagtgggataagaatcctacac gtgtggacttggagctgtgcaaaggtgttcgtcagggagcgcctcagctcagagtcgaag ctcagagtgaatgaggaaagccttgctgtggctggtggttatccagagttgggatga