GENSCAN 1.0 Date run: 3-Nov-116 Time: 06:42:38 Sequence gi568815587r:40014373_40216292 : 201920 bp : 37.71% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 6305 6344 40 -3.55 1.01 Sngl + 24315 24839 525 2 0 44 43 229 0.567 9.80 1.02 PlyA + 25065 25070 6 1.05 2.00 Prom + 25789 25828 40 -4.65 2.01 Init + 28166 28231 66 1 0 27 86 71 0.019 1.92 2.02 Intr + 35561 35636 76 1 1 84 80 34 0.040 0.37 2.03 Intr + 45693 45812 120 1 0 67 113 60 0.718 5.95 2.04 Intr + 66742 66812 71 2 2 61 82 67 0.037 1.38 2.05 Intr + 88858 89105 248 0 2 59 68 99 0.002 0.33 2.06 Intr + 89415 89546 132 0 0 60 64 68 0.025 0.44 2.07 Intr + 90064 90194 131 1 2 52 27 102 0.069 -0.08 2.08 Term + 90688 91049 362 2 2 105 44 156 0.972 6.61 2.09 PlyA + 91128 91133 6 1.05 3.02 PlyA - 91420 91415 6 1.05 3.01 Sngl - 101908 99998 1911 1 0 77 41 1334 0.951 121.23 3.00 Prom - 102966 102927 40 -6.55 4.00 Prom + 123294 123333 40 -1.95 4.01 Init + 123649 123676 28 0 1 60 84 23 0.373 -1.30 4.02 Intr + 125909 125944 36 0 0 156 94 35 0.360 8.22 4.03 Intr + 137648 137827 180 0 0 -14 66 199 0.547 6.42 4.04 Term + 138950 139065 116 0 2 77 42 100 0.557 2.05 4.05 PlyA + 139227 139232 6 1.05 5.06 PlyA - 139577 139572 6 1.05 5.05 Term - 140883 140862 22 2 1 92 49 28 0.008 -3.89 5.04 Intr - 142756 142629 128 1 2 59 72 57 0.011 -0.24 5.03 Intr - 147937 147811 127 0 1 33 73 132 0.043 5.96 5.02 Intr - 165320 165210 111 1 0 85 80 70 0.119 4.48 5.01 Init - 178411 178071 341 1 2 74 47 180 0.164 9.28 5.00 Prom - 194263 194224 40 -2.25 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 86107 86017 91 1 1 89 35 63 0.908 0.70 S.002 Init + 90077 90194 118 1 1 45 27 112 0.880 1.21 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:40014373_40216292|GENSCAN_predicted_peptide_1|174_aa MTDRKNGTKLENTLQDTMQENFPNLARQANIQIQEIQRTPQRYSSRRATPRHIIVRFTKV EMKGKILRAAREKGRVTHKGKPIRLTADLLAETLQARREWGPIFNISKGKNFQPRISYPA KPSFISEGEIKSFKDKQMLRDFVTTRPALQELLKEALNMERNNRYQPLQKHAKL >gi568815587r:40014373_40216292|GENSCAN_predicted_CDS_1|525_bp atgactgacaggaagaatggaaccaagttggaaaacactctgcaggatactatgcaggag aacttccccaatctagcaaggcaggccaacattcaaattcaggaaatacagagaacgcca caaagatactcctcaaggagagcaactccaagacatataattgtcagattcaccaaagtt gaaatgaagggaaaaatattaagggcagccagagagaaaggtcgggttacccacaaaggg aaacccatcagactaacagcagatctcttggcagaaactctacaagccagaagagagtgg gggccaatattcaacatttctaaaggaaagaattttcaacccagaatttcatatccagcc aaaccaagcttcataagtgaaggagaaataaaatcctttaaagacaagcaaatgctgaga gattttgtcaccaccaggcctgccttacaagagctcctgaaggaagcactaaacatggaa aggaacaaccggtaccagccactgcaaaaacacgccaaattgtaa >gi568815587r:40014373_40216292|GENSCAN_predicted_peptide_2|401_aa MKEHSMLMDRKNQYRKNGHAAKPEDCKEKEGSACGKGTYFSSAYHTAVPTNEGPTLILKV VRKRQKLQQFSFHHELLAKGEHELLAKVQCFSAATANAQVDLVQLCQPLHKNTIGENGYF TKALTGRVCFDLRSQSPLAEPIKTQWGNWPHILAYTVPMQGSSPLVNKQYNFLTGPGLQV YLGTLRGEDHPTHRLTLIVPVNQPAISSCSSERTKGMGNVEIWTSILVLSNYPANLSRPP CFMGASRFNTHGSTCQGRWNSGMQGQKKGKRTLFLVSLAYPGHQTPNSHATRASDNGPFC WEPLNRPLGTAVSPNSSPVSRKQLRSVFVLIHNLMAVRCTSLKEGMRQPCGRGVPEETPT SLPTEMEPWEVHDVCSREEPEPGPSASSVETRIPTTWQEVP >gi568815587r:40014373_40216292|GENSCAN_predicted_CDS_2|1206_bp atgaaagaacattccatgctcatggataggaagaatcagtatcgtaaaaatggccatgct gccaagccagaagattgtaaagagaaggaagggagtgcatgtggaaagggaacttatttc tctagtgcataccacacagcagttcctactaatgaaggacccactcttatactcaaggta gtcaggaagagacagaagttacagcagttttcatttcatcatgaacttttggcaaagggt gagcatgaacttttggcaaaagtgcagtgcttctcagctgccacagccaatgctcaagtg gacctggtgcagctttgccagccactgcataagaacacaattggagaaaatggttatttt accaaggctttgactggaagggtatgctttgatttaaggagtcaatctccacttgcagag ccaataaaaacccaatggggaaactggcctcatatccttgcctacacagtccctatgcag ggttcctcacctctggtcaataaacagtataactttctaacaggtccagggctccaagtt tatcttgggaccttaaggggagaggatcacccaactcacagactaaccctgattgttcct gtgaaccaaccagcaatctccagctgtagctcagaaagaacaaaagggatgggtaatgta gaaatatggaccagtattctagttctgagcaattatcctgcaaatctttccagacccccc tgcttcatgggtgcaagccgctttaacactcatggcagcacctgccaaggtcgctggaac tcagggatgcaaggacagaagaagggaaagaggacactcttccttgtctcccttgcatac cccggccatcaaactccaaactctcatgcaaccagagcctctgacaatggccccttttgc tgggaacccttaaataggcctctggggactgctgtttccccaaacagcagccctgtcagc aggaagcagttaagatcggtctttgtccttatccataatctaatggcagttaggtgtact tctttaaaggagggaatgagacagccatgtgggaggggggtccctgaagaaactccaacc agcctgcccactgagatggagccctgggaagttcatgacgtttgcagcagggaggagcct gaacctggcccctctgcttcctctgtggaaactaggattccaactacctggcaggaagtg ccctag >gi568815587r:40014373_40216292|GENSCAN_predicted_peptide_3|636_aa MTLHPQQIMIGPRFNRALFDPLLVVLLALQLLVVAGLVRAQTCPSVCSCSNQFSKVICVR KNLREVPDGISTNTRLLNLHENQIQIIKVNSFKHLRHLEILQLSRNHIRTIEIGAFNGLA NLNTLELFDNRLTTIPNGAFVYLSKLKELWLRNNPIESIPSYAFNRIPSLRRLDLGELKR LSYISEGAFEGLSNLRYLNLAMCNLREIPNLTPLIKLDELDLSGNHLSAIRPGSFQGLMH LQKLWMIQSQIQVIERNAFDNLQSLVEINLAHNNLTLLPHDLFTPLHHLERIHLHHNPWN CNCDILWLSWWIKDMAPSNTACCARCNTPPNLKGRYIGELDQNYFTCYAPVIVEPPADLN VTEGMAAELKCRASTSLTSVSWITPNGTVMTHGAYKVRIAVLSDGTLNFTNVTVQDTGMY TCMVSNSVGNTTASATLNVTAATTTPFSYFSTVTVETMEPSQDEARTTDNNVGPTPVVDW ETTNVTTSLTPQSTRSTEKTFTIPVTDINSGIPGIDEVMKTTKIIIGCFVAITLMAAVML VIFYKMRKQHHRQNHHAPTRTVEIINVDDEITGDTPMESHLPMPAIEHEHLNHYNSYKSP FNHTTTVNTINSIHSSVHEPLLIRMNSKDNVQETQI >gi568815587r:40014373_40216292|GENSCAN_predicted_CDS_3|1911_bp atgaccttacatccacagcagataatgataggtcctaggtttaacagggccctatttgac cccctgcttgtggtgctgctggctcttcaacttcttgtggtggctggtctggtgcgggct cagacctgcccttctgtgtgctcctgcagcaaccagttcagcaaggtgatttgtgttcgg aaaaacctgcgtgaggttccggatggcatctccaccaacacacggctgctgaacctccat gagaaccaaatccagatcatcaaagtgaacagcttcaagcacttgagacacttggaaatc ctacagttgagtaggaaccatatcagaaccattgaaattggggctttcaatggtctggcg aacctcaacactctggaactctttgacaatcgtcttactaccatcccgaatggagctttt gtatacttgtctaaactgaaggagctctggttgcgaaacaaccccattgaaagcatccct tcttatgcttttaacagaattccttctttgcgccgactagacttaggggaattgaaaaga ctttcatacatctcagaaggtgcctttgaaggtctgtccaacttgaggtatttgaacctt gccatgtgcaaccttcgggaaatccctaacctcacaccgctcataaaactagatgagctg gatctttctgggaatcatttatctgccatcaggcctggctctttccagggtttgatgcac cttcaaaaactgtggatgatacagtcccagattcaagtgattgaacggaatgcctttgac aaccttcagtcactagtggagatcaacctggcacacaataatctaacattactgcctcat gacctcttcactcccttgcatcatctagagcggatacatttacatcacaacccttggaac tgtaactgtgacatactgtggctcagctggtggataaaagacatggccccctcgaacaca gcttgttgtgcccggtgtaacactcctcccaatctaaaggggaggtacattggagagctc gaccagaattacttcacatgctatgctccggtgattgtggagccccctgcagacctcaat gtcactgaaggcatggcagctgagctgaaatgtcgggcctccacatccctgacatctgta tcttggattactccaaatggaacagtcatgacacatggggcgtacaaagtgcggatagct gtgctcagtgatggtacgttaaatttcacaaatgtaactgtgcaagatacaggcatgtac acatgtatggtgagtaattccgttgggaatactactgcttcagccaccctgaatgttact gcagcaaccactactcctttctcttacttttcaaccgtcacagtagagactatggaaccg tctcaggatgaggcacggaccacagataacaatgtgggtcccactccagtggtcgactgg gagaccaccaatgtgaccacctctctcacaccacagagcacaaggtcgacagagaaaacc ttcaccatcccagtgactgatataaacagtgggatcccaggaattgatgaggtcatgaag actaccaaaatcatcattgggtgttttgtggccatcacactcatggctgcagtgatgctg gtcattttctacaagatgaggaagcagcaccatcggcaaaaccatcacgccccaacaagg actgttgaaattattaatgtggatgatgagattacgggagacacacccatggaaagccac ctgcccatgcctgctatcgagcatgagcacctaaatcactataactcatacaaatctccc ttcaaccacacaacaacagttaacacaataaattcaatacacagttcagtgcatgaaccg ttattgatccgaatgaactctaaagacaatgtacaagagactcaaatctaa >gi568815587r:40014373_40216292|GENSCAN_predicted_peptide_4|119_aa MKNIRIQMLCRTTTHGNSATEAPDGLQEQTGNPKRNHRPSEGNGLLLQDSRDTPNTVSAP SVEVGRGDPPLQTHTPAGEAEEPIQMRRSQKTNPGNMTKQGSSTPPENHTSSPAMDPNQ >gi568815587r:40014373_40216292|GENSCAN_predicted_CDS_4|360_bp atgaagaacattaggatccagatgctctgtagaactaccactcatggaaacagtgctaca gaagctccagatggcctgcaagaacaaaccggcaatcccaagaggaaccacagaccctct gaaggaaatggactgctcctgcaggactcgagagacaccccaaatactgtgagtgcccca tctgtggaagtaggaaggggagaccctcctctccaaacgcacacccccgctggagaagct gaagagcctatccaaatgagaaggagccagaaaaccaaccctggtaatatgacaaaacaa ggctcttcaacaccccctgaaaatcacactagttcaccagcaatggatccaaaccagtaa >gi568815587r:40014373_40216292|GENSCAN_predicted_peptide_5|242_aa MVDAPPPTKLKLPRSTSDCCAGSENFKPVDLSLLGSMGVGSIEQDHLAPWLQPTFQGSEW FCLAGVPGATGYEKKLLQLVQCLPKWPPSFVLETQGPGGKGTQGNLLVCRLRRPDHLTDI SKARAGCGGPDLCAYGPALEKHSVSGEKKTKKMPKCLSVCSCDAYELMKVKQTKSLWRDN KAGTGNDVWLVITRSVGCIWVYFWVLYSAAFVYVPIFIPVPLCFGDYGLIVKFEIRVFDV GI >gi568815587r:40014373_40216292|GENSCAN_predicted_CDS_5|729_bp atggtggacgcccctccccccaccaagctcaagcttcctaggtcaacttcagactgctgt gctggcagcgagaatttcaagccagtggatcttagcttgttgggctccatgggagtggga tccattgagcaagaccatctggctccctggcttcagcccactttccagggtagtgaatgg ttctgtctcgctggtgttccaggtgccactgggtatgaaaaaaaactcctgcaactagtt cagtgtctgcccaaatggccacccagttttgtgcttgaaacccagggtcctggtggtaaa ggcacccaagggaatctcctggtctgcagattgcgaagacctgatcatttaactgacatt tccaaggcacgtgctggatgcgggggtcctgatctctgtgcctatggccctgcccttgag aagcactcagtctccggggagaagaagacaaaaaaaatgccaaaatgtttgagcgtttgc agctgtgatgcctatgagctaatgaaggtcaagcagactaagtccctgtggagagacaac aaagctggcaccggaaatgatgtctggcttgttataacgagatcagttggctgtatctgg gtttatttctgggttctctattctgctgcattcgtctatgtgcctattttcataccagta ccactctgttttggtgactatggccttatagtgaagtttgaaatcagagtttttgatgtg ggcatttag