GENSCAN 1.0 Date run: 8-Nov-116 Time: 12:10:48 Sequence gi568815587r:124653649_124862154 : 208506 bp : 43.83% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 Intr - 1145 1007 139 2 1 101 91 43 0.510 5.94 1.03 Intr - 7155 6980 176 1 2 57 91 80 0.913 4.86 1.02 Intr - 15873 15712 162 1 0 82 116 162 0.989 18.35 1.01 Init - 20060 19994 67 2 1 91 94 89 0.892 9.05 1.00 Prom - 32569 32530 40 -4.66 2.00 Prom + 34366 34405 40 -4.76 2.01 Init + 40649 40794 146 1 2 28 61 134 0.183 4.29 2.02 Intr + 60133 60227 95 1 2 82 60 60 0.117 2.21 2.03 Intr + 86264 86451 188 0 2 15 103 153 0.479 8.91 2.04 Intr + 89672 89692 21 1 0 114 100 -7 0.434 0.84 2.05 Term + 91855 92076 222 0 0 80 47 291 0.698 21.12 2.06 PlyA + 93537 93542 6 1.05 3.18 PlyA - 93846 93841 6 -1.75 3.17 Term - 94019 93887 133 1 1 65 38 136 0.569 3.86 3.16 Intr - 94886 94742 145 0 1 61 97 69 0.804 4.44 3.15 Intr - 95115 94996 120 1 0 77 70 153 0.974 12.87 3.14 Intr - 96218 96060 159 0 0 116 116 12 0.988 6.76 3.13 Intr - 97273 97066 208 1 1 95 82 200 0.999 18.75 3.12 Intr - 97932 97775 158 1 2 85 55 186 0.942 14.73 3.11 Intr - 98517 98429 89 2 2 67 65 71 0.453 2.31 3.10 Intr - 100692 100566 127 1 1 73 108 94 0.927 9.64 3.09 Intr - 101115 100993 123 1 0 79 82 119 0.998 10.96 3.08 Intr - 102714 102559 156 1 0 62 66 63 0.452 1.48 3.07 Intr - 103094 102893 202 2 1 46 92 216 0.995 16.66 3.06 Intr - 104879 104701 179 0 2 93 94 89 0.693 9.64 3.05 Intr - 105409 105247 163 1 1 82 21 79 0.689 0.15 3.04 Intr - 105736 105447 290 2 2 53 28 135 0.560 0.96 3.03 Intr - 106187 106061 127 2 1 74 59 45 0.398 0.45 3.02 Intr - 108298 108219 80 2 2 92 94 8 0.650 1.07 3.01 Init - 108506 108437 70 2 1 72 109 115 0.996 11.22 3.00 Prom - 111652 111613 40 -4.86 4.10 PlyA - 113093 113088 6 1.05 4.09 Term - 114380 113528 853 1 1 78 42 337 0.616 20.39 4.08 Intr - 119406 119346 61 1 1 90 98 23 0.606 1.29 4.07 Intr - 121186 121071 116 0 2 43 86 94 0.245 4.79 4.06 Intr - 124863 124776 88 1 1 119 -4 49 0.007 -2.17 4.05 Intr - 132068 131917 152 1 2 80 65 61 0.030 2.81 4.04 Intr - 138024 137918 107 0 2 83 4 94 0.024 -0.49 4.03 Intr - 146977 146223 755 2 2 101 86 730 0.077 65.28 4.02 Intr - 149593 149433 161 0 2 46 56 51 0.022 -2.67 4.01 Init - 164662 164592 71 1 2 64 83 78 0.231 5.42 4.00 Prom - 180732 180693 40 -5.06 5.05 PlyA - 180979 180974 6 1.05 5.04 Term - 185573 185452 122 0 2 73 50 96 0.697 2.94 5.03 Intr - 185775 185623 153 1 0 92 59 45 0.702 2.04 5.02 Intr - 188294 188261 34 2 1 85 121 -2 0.585 0.60 5.01 Intr - 195365 195346 20 0 2 129 78 7 0.085 0.13 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 32237 32159 79 2 1 70 103 42 0.837 5.12 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:124653649_124862154|GENSCAN_predicted_peptide_1|182_aa MVAPGLVLGLVLPLILWADRSAGIGFRFASYINNDMVLQKEPAGAVIWGFGTPGATVTVT LRQGQETIMKKVTSVKAHSDTWMVVLDPMKPGGPFEVMAQQTLEKINFTLRVHDVLFGDV WLCSGQSNMQMTVLQIFNATRELSNTAAYQSVRILSVSPIQAEQELEDLVAVDLQWSKPT SX >gi568815587r:124653649_124862154|GENSCAN_predicted_CDS_1|546_bp atggtcgcgccggggcttgtactcgggctggtgctgccattaatcctgtgggccgacaga agtgcaggtattggttttcgctttgcttcatacatcaataatgatatggtgctgcagaag gagcctgctggggcagtgatatggggcttcggtacacctggagccacagtgaccgtgacc ctgcgccaaggtcaggaaaccatcatgaagaaagtgaccagtgtgaaagctcactctgat acgtggatggtggtactggatcctatgaagcctggaggacctttcgaagtgatggcacaa cagactttggagaaaataaacttcaccctgagagttcatgacgtcctgtttggagatgtc tggctctgtagtgggcagagtaacatgcagatgactgtgttacagatatttaatgctaca agggagttgtctaacactgcggcatatcagtctgtccgcatcctctctgtctctcccatt caagcagagcaggagctggaggaccttgttgcggttgacttgcagtggtctaagcccacc tcagnn >gi568815587r:124653649_124862154|GENSCAN_predicted_peptide_2|223_aa MKDSSEEDKEKEEVAAVKIQAAFRGHIAREEAKKMKTNSLQNEEKEENKYIHSSGIVFKL LKEIISNLEFCVTKLSYKYEGYIRERSRARETGPESRAAVSARVGWRPTAPEPPPGTTQT PPPPCASLRPRRGPPDTSMDCCTLRGQGGKENACSKPDDDILDIPLDDPGANAAAAKIQA SFRGHMARKKIKSGERGRKGPGPGGPGGAGVARGGAGGGPSGD >gi568815587r:124653649_124862154|GENSCAN_predicted_CDS_2|672_bp atgaaggactcttctgaggaagataaggaaaaagaagaggttgctgctgtcaaaatccaa gctgccttccggggacacatagccagagaggaggcaaagaaaatgaaaacaaatagtctt caaaatgaggaaaaagaggaaaacaaatacatacacagtagtgggattgtcttcaaactt ctcaaggaaattatttcaaacctagaattctgtgtcaccaaactgtcctacaagtatgag ggttacattcgcgagcggagccgagcgcgggagaccggacccgagagcagagctgctgtt tcggcgcgggtcggctggcggccgactgccccagagcccccacccggcaccacacagacc ccacccccgccctgcgccagccttcgtccccgcagaggaccccccgacaccagcatggac tgctgcaccctgcgaggccaaggaggaaaggagaacgcctgctccaagccggacgacgac attctagacatcccgctggacgatcccggcgccaacgcggccgccgccaaaatccaggcg agttttcggggccacatggcgcggaagaagataaagagcggagagcgcggccggaagggc ccgggccctggggggcctggcggagctggggtggcccggggaggcgcgggcggcggcccc agcggagactag >gi568815587r:124653649_124862154|GENSCAN_predicted_peptide_3|842_aa MISLPGPLVTNLLRFLFLGLSALGHHWLALTSRFVGDSPQFGCGGGWRGESLRSSGPPFI VFCLITGYKFPAQTTEILQEMGRGMLRRPSNRGQRMSISGRRGANDGPLLETNTPVHPLS AGHTQGRLHTCLETVVRAALNSLPPESSLPRSPEPQKGSEELSGFQPSVFRGKPASEIAR EGGEHRIHWLLASSEGAAGLIGNQTASQTPPPFENSGVRDPGGPEQHPYSPSGKLGKQVG YVAPPPSRAQLQLHLPANRLQAVEGGEVVLPAWYTLHGEVSSSQPWEVPFVMWFFKQKEK EDQVLSYINGVTTSKPGVSLVYSMPSRNLSLRLEGLQEKDSGPYSCSVNVQDKQGKSRGH SIKTLELNVLVPPAPPSCRLQGVPHVGANVTLSCQSPRSKPAVQYQWDRQLPSFQTFFAP ALDVIRGSLSLTNLSSSMAGVYVCKAHNEVGTAQCNVTLEVSTGPGAAVVAGAVVGTLVG LGLLAGLVLLYHRRGKALEEPANDIKSRTGRPDAAMAELPGPFLCGALLGFLCLSGLAVE VKVPTEPLSTPLGKTAELTCTYSTSVGDSFALEWSFVQPGKPISESHPILYFTNGHLYPT GSKSKRVSLLQNPPTVGVATLKLTDVHPSDTGTYLCQVNNPPDFYTNGLGLINLTVLVPP SNPLCSQSGQTSVGGSTALRCSSSEGAPKPVYNWVRLGTFPTPSPGSMVQDEVSGQLILT NLSLTSSGTYRCVATNQMGSASCELTLSVTEPSQGRVAGALIGVLLGVLLLSVAAFCLVR FQKERGKKPKETYGGSDLREDAIAPGISEHTCMRADSSKGFLERPSSASTVTTTKSKLPM VV >gi568815587r:124653649_124862154|GENSCAN_predicted_CDS_3|2529_bp atgatttccctcccggggcccctggtgaccaacttgctgcggtttttgttcctggggctg agtgccctcgggcatcactggctggcgctgacttctcgctttgtcggggacagcccccag tttgggtgtgggggcggatggcggggggagagcctccggtctagtggcccccccttcatt gtcttctgccttatcactgggtacaaatttcctgcccagaccacagagatcctccaggaa atgggaagggggatgctgcgaaggccaagtaacagagggcaaaggatgagtatttctgga cgccgtggtgcaaacgatggccccctccttgaaaccaacaccccggtccaccccctgagc gccggacacactcaggggaggcttcatacttgtctagagaccgtcgttcgcgctgccctt aattccttgccacccgagagttctctgcccagatcgcccgaaccccagaagggctccgag gagctttctgggtttcaaccctccgttttccgagggaagcccgcctctgagattgcgcgg gagggcggggagcatcgcattcactggctcctggcaagctcagaaggagccgcagggtta ataggtaaccagacggcctcccaaacccctccgccctttgagaactcgggagtgagggac cctggcggcccggagcagcatccctactctccctcaggaaaactagggaagcaagtgggc tacgtggcgccgcccccctcgcgggcccagctgcaactgcacttgcccgccaaccggttg caggcggtggagggaggggaagtggtgcttccagcgtggtacaccttgcacggggaggtg tcttcatcccagccatgggaggtgccctttgtgatgtggttcttcaaacagaaagaaaag gaggatcaggtgttgtcctacatcaatggggtcacaacaagcaaacctggagtatccttg gtctactccatgccctcccggaacctgtccctgcggctggagggtctccaggagaaagac tctggcccctacagctgctccgtgaatgtgcaagacaaacaaggcaaatctaggggccac agcatcaaaaccttagaactcaatgtactggttcctccagctcctccatcctgccgtctc cagggtgtgccccatgtgggggcaaacgtgaccctgagctgccagtctccaaggagtaag cccgctgtccaataccagtgggatcggcagcttccatccttccagactttctttgcacca gcattagatgtcatccgtgggtctttaagcctcaccaacctttcgtcttccatggctgga gtctatgtctgcaaggcccacaatgaggtgggcactgcccaatgtaatgtgacgctggaa gtgagcacagggcctggagctgcagtggttgctggagctgttgtgggtaccctggttgga ctggggttgctggctgggctggtcctcttgtaccaccgccggggcaaggccctggaggag ccagccaatgatatcaagagcaggacaggacggccggacgcggccatggccgagctcccg gggccctttctctgcggggccctgctaggcttcctgtgcctgagtgggctggccgtggag gtgaaggtacccacagagccgctgagcacgcccctggggaagacagccgagctgacctgc acctacagcacgtcggtgggagacagcttcgccctggagtggagctttgtgcagcctggg aaacccatctctgagtcccatccaatcctgtacttcaccaatggccatctgtatccaact ggttctaagtcaaagcgggtcagcctgcttcagaacccccccacagtgggggtggccaca ctgaaactgactgacgtccacccctcagatactggaacctacctctgccaagtcaacaac ccaccagatttctacaccaatgggttggggctaatcaaccttactgtgctggttcccccc agtaatcccttatgcagtcagagtggacaaacctctgtgggaggctctactgcactgaga tgcagctcttccgagggggctcctaagccagtgtacaactgggtgcgtcttggaactttt cctacaccttctcctggcagcatggttcaagatgaggtgtctggccagctcattctcacc aacctctccctgacctcctcgggcacctaccgctgtgtggccaccaaccagatgggcagt gcatcctgtgagctgaccctctctgtgaccgaaccctcccaaggccgagtggccggagct ctgattggggtgctcctgggcgtgctgttgctgtcagttgctgcgttctgcctggtcagg ttccagaaagagagggggaagaagcccaaggagacatatgggggtagtgaccttcgggag gatgccatcgctcctgggatctctgagcacacttgtatgagggctgattctagcaagggg ttcctggaaagaccctcgtctgccagcaccgtgacgaccaccaagtccaagctccctatg gtcgtgtga >gi568815587r:124653649_124862154|GENSCAN_predicted_peptide_4|787_aa MRRHRKMTSEVEMSGKVREKEQMEFKRFSCLWPFGESRPRSSPNQGSYTLFGTLQFLVSP SFWEPPHSLVPSAEAARARGFGAPRPRPSPRERSGRRGRARGERPRGVDDLRPAATSSAA ETRSHPPAGEPGPRAAPETAVAGCAGASLPGGAAAAAWKMAAPCGSELPANSPLKIPKME VLSPASPGGLSDGNPSLSDPSTPRGASPLGPGSAAGSGAAASGGLGLGLGGRSAASSSVS FSPGGGGGGAAAAAAAACRGMSWTPAETNALIAVWGNERLVEARYQQLEGAGTVFGSKAP GPAMYERVSRALAELGYERTPSQCRERIKKCPHQMLVQRKSGRMEAADPTDIGQRAYSSE TGSSGNETLVPPQLSGMLTAGGFWLISSEFHSAQGNCLTQGEGPPLSGGSLNPMTASFRI QYSLTTLPPGGDNSIMSIYRGDFQHSGLYQELESDGSTMEDYSQEDWGNHSQDLHGYPTD QELDEIPVTKRTLKIKQESSEEAQKRDIMQNIVQILESVQLKWELFQSWTDFSRLHLSNK LAIFGIGYNTRWKEDIRYHYAEISSQVPLGKRLREYFNSEKPEGRIIMTRVQKMNWKNVY YKFLEITISEARCLELHMEIDWIPIAHSKPTGGNVVQYLLPGGIPKSPGLYAIGYEECIE RPLSPHMEQSSLDPGKEGRVDLETLSAQASLQVEIEPTRIIYCYLGIAEVRTLQQCLFLH FQANTKTFSKDWVGINGFLSQNCIVDPGVSPKSIYIKFVEVERDFLSAGSLVECLEKAIG YPLKFNN >gi568815587r:124653649_124862154|GENSCAN_predicted_CDS_4|2364_bp atgaggaggcaccgaaaaatgacatccgaggttgagatgtcggggaaggtgagggagaag gagcagatggaattcaagcgattctcctgcctgtggccctttggggagtccagacctagg agctccccgaaccagggctcttacaccctctttgggactctgcagttcctggtgtcgcca agcttctgggagccaccacattccctggtaccatcagcagaagctgctcgtgctcggggc ttcggcgcgcctcgtccccgcccttcgccccgggagaggagcgggcggcgtgggagggct cgcggagaaaggcccaggggagtggacgacctccgcccggcagccacatcctcagcagca gagacccggagccatccgcccgcgggcgagccaggcccgagggcagccccggagaccgcg gtggccggatgcgcgggcgcgtcacttccgggcggtgcagcggcggccgcttggaagatg gctgcgccctgtggctcggagctgcccgccaactcgccgctaaaaattccgaagatggag gtgctttccccggcttctcctggtggcctgagcgacggaaatccatcgctgtccgaccct tccacgcctcggggtgcctccccgctcgggccgggcagtgcggcgggctcgggggcagcg gcgtccgggggtctcgggctggggctggggggccgcagcgccgcctcgtcctcggtctcc ttctcccctggtggcggcggtggcggggctgcggcagccgccgccgccgcctgccggggc atgtcgtggacgccagccgagacgaacgcgctcatcgcagtgtggggcaacgagcggctg gtggaggcgcggtaccagcagctggagggagccggcacggtgttcggcagcaaggccccc gggccagccatgtacgagcgcgtgtcccgggccctggccgagctgggctacgagcggacc ccgtcccagtgccgggagcgcatcaagaaatgccctcaccaaatgctggtgcaaagaaag tctggaaggatggaagctgccgatcccactgatattgggcagcgggcttactcctcggag actggcagctcagggaatgagacgctagtccccccccagctgtctggaatgctgactgct ggtggcttctggttgatttcttcagaatttcactctgcccaaggcaactgtctcacccaa ggtgagggccccccactctctggtggcagcctgaatccaatgactgcttctttcaggata cagtattctctcacaactcttcctccagggggagacaacagcatcatgtcaatttaccgg ggagatttccaacatagtggcttgtaccaggagctggagtcagatggcagcactatggag gactattcacaggaggactggggaaaccacagtcaggatctccatggctatccaacagat caggaattggatgaaatacctgtcacaaagagaacattaaaaataaaacaagagtcttct gaagaagcacagaagagagacatcatgcagaatattgtacagattttggaatcggtacag ttgaaatgggaactttttcagagctggacagacttttcaaggctccatctttctaataaa ctggccatttttggaattggttataacacccgttggaaagaggatatccgttaccattat gctgagatcagctcccaggtgccccttggcaagcgacttcgggagtacttcaactctgag aagcctgaaggacggatcattatgacccgagtgcagaaaatgaactggaaaaatgtttac tacaaatttttagagatcactattagcgaagctaggtgcttggagctgcacatggaaatt gactggatacccattgcccactccaaaccaactggtgggaatgttgttcaatatttattg cctgggggtattcctaaaagcccaggcctttatgccattggctatgaagaatgtattgag aggcccctctcaccacacatggagcaaagttccctggacccaggaaaagagggccgggtt gacctggaaaccctttcagcacaagcctcattacaggtggaaatagaacccacccgaatt atctattgctacctcgggattgctgaggtcaggactctacagcagtgcttatttttacat ttccaagcgaataccaaaaccttcagcaaagattgggttggtattaacgggtttttgtct cagaactgtattgtggatcccggagtttcccccaaatccatctacatcaaatttgtagaa gtagagagggattttctttccgcaggctctttagttgagtgcctggaaaaagccattgga taccccttaaaatttaacaactga >gi568815587r:124653649_124862154|GENSCAN_predicted_peptide_5|109_aa XEDQIHKAARRLVFQSQEAPAALRVPSLPAGGVSAGAAVGKEAVERPAAARGGAINLEQM RPDSHPARGAELDGFQAEVTRDFGHTEPVLLTPPPLPAAHCVPTCREKE >gi568815587r:124653649_124862154|GENSCAN_predicted_CDS_5|330_bp ngtgaggatcaaattcacaaagcagccaggagacttgtcttccagtctcaggaagccccg gccgcgcttcgtgttccctccctccccgcgggaggggtctcagctggggctgcagttggt aaagaggccgtggagaggccggcggcggcccgaggaggagccataaatttggagcagatg cggcctgacagtcacccagcccggggagctgagttagatggattccaggccgaggtgacc cgggactttggacacacggagcctgtgctcctcactcccccacctctgccggccgcccat tgtgtgcccacgtgcagggagaaggaatag