GENSCAN 1.0 Date run: 4-Nov-116 Time: 07:01:36 Sequence gi568815597r:54039523_54252922 : 213400 bp : 47.48% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 635 630 6 1.05 1.02 Term - 6264 6181 84 0 0 100 53 67 0.831 2.05 1.01 Init - 13666 13478 189 1 0 90 97 237 0.993 21.81 1.00 Prom - 23363 23324 40 -2.56 2.00 Prom + 28265 28304 40 -4.16 2.01 Init + 43215 43224 10 2 1 105 94 0 0.811 3.05 2.02 Intr + 49075 49268 194 2 2 113 90 151 0.993 17.01 2.03 Term + 56763 56951 189 2 0 86 43 174 0.879 10.15 2.04 PlyA + 60729 60734 6 1.05 3.06 PlyA - 62124 62119 6 1.05 3.05 Term - 72156 72046 111 0 0 67 42 148 0.897 6.66 3.04 Intr - 81474 81315 160 2 1 110 72 25 0.146 3.09 3.03 Intr - 82662 82574 89 0 2 81 81 17 0.149 -0.93 3.02 Intr - 82947 82780 168 0 0 65 64 128 0.932 8.24 3.01 Init - 91840 91673 168 1 0 71 38 65 0.237 -0.67 3.00 Prom - 92381 92342 40 -0.26 4.07 PlyA - 92652 92647 6 -1.75 4.06 Term - 93829 93446 384 1 0 23 42 552 0.597 38.99 4.05 Intr - 97286 97108 179 0 2 63 91 345 0.998 31.94 4.04 Intr - 100584 100231 354 1 0 91 60 638 0.981 56.56 4.03 Intr - 101911 101576 336 2 0 26 91 536 0.513 43.19 4.02 Intr - 105291 104944 348 1 0 99 42 502 0.957 42.03 4.01 Init - 113400 113322 79 0 1 66 102 187 0.998 17.12 4.00 Prom - 116418 116379 40 -7.66 5.13 PlyA - 119225 119220 6 1.05 5.12 Term - 120422 120321 102 2 0 84 49 34 0.009 -2.62 5.11 Intr - 121929 121822 108 0 0 114 48 42 0.025 3.28 5.10 Intr - 124071 123951 121 2 1 63 70 67 0.032 2.90 5.09 Intr - 135300 135146 155 0 2 111 49 75 0.489 4.77 5.08 Intr - 139830 139627 204 0 0 83 99 203 0.975 20.30 5.07 Intr - 144743 144639 105 2 0 129 82 38 0.887 7.71 5.06 Intr - 148217 148130 88 1 1 119 113 81 0.846 13.67 5.05 Intr - 151374 151226 149 0 2 111 38 205 0.638 16.83 5.04 Intr - 151889 151813 77 0 2 56 71 25 0.570 -3.17 5.03 Intr - 152346 152169 178 0 1 48 28 141 0.353 3.59 5.02 Intr - 156112 155897 216 1 0 51 94 152 0.198 10.80 5.01 Init - 159314 159312 3 2 0 108 81 0 0.222 1.30 5.00 Prom - 159628 159589 40 -5.56 6.00 Prom + 159667 159706 40 -10.64 6.01 Init + 160722 161067 346 2 1 49 72 416 0.999 31.48 6.02 Intr + 165496 165679 184 2 1 48 119 84 0.991 6.35 6.03 Intr + 165773 165888 116 2 2 67 73 94 0.973 5.89 6.04 Intr + 170424 170609 186 1 0 66 110 158 0.576 15.46 6.05 Intr + 172979 173136 158 0 2 114 63 197 0.999 19.53 6.06 Intr + 176619 176822 204 2 0 116 103 158 0.983 19.50 6.07 Intr + 181089 181224 136 2 1 70 55 23 0.228 -2.66 6.08 Intr + 181498 181639 142 2 1 81 44 68 0.479 1.11 6.09 Intr + 182344 182459 116 1 2 67 80 80 0.547 5.19 6.10 Intr + 183122 183223 102 0 0 96 80 44 0.533 4.55 6.11 Term + 184931 185043 113 0 2 67 40 49 0.232 -3.28 6.12 PlyA + 185606 185611 6 1.05 7.12 PlyA - 187019 187014 6 1.05 7.11 Term - 187764 187744 21 0 0 100 52 22 0.440 -1.89 7.10 Intr - 188834 188733 102 2 0 130 75 66 0.993 9.87 7.09 Intr - 188955 188927 29 1 2 89 87 2 0.107 -1.97 7.08 Intr - 189304 189226 79 1 1 82 95 34 0.014 2.62 7.07 Intr - 199677 199607 71 1 2 83 111 74 0.989 8.10 7.06 Intr - 201437 201383 55 2 1 79 110 44 0.789 4.25 7.05 Intr - 201987 201952 36 0 0 86 103 11 0.483 0.86 7.04 Intr - 204154 204092 63 1 0 111 87 -8 0.504 0.31 7.03 Intr - 208804 208724 81 1 0 119 80 1 0.647 2.23 7.02 Intr - 212170 212094 77 2 2 121 91 6 0.648 3.43 7.01 Init - 212734 212614 121 1 1 102 54 26 0.379 0.96 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:54039523_54252922|GENSCAN_predicted_peptide_1|90_aa MAAPKGSLWVRTQLGLPPLLLLTMALAGGSGTASAEAFDSVLGDTASCHRACQLTYPLHT YPKHVQKHIPNLMSNMLAILVARISCHSLN >gi568815597r:54039523_54252922|GENSCAN_predicted_CDS_1|273_bp atggcggcgccgaaggggagcctctgggtgaggacccaactggggctcccgccgctgctg ctgctgaccatggccttggccggaggttcggggaccgcttcggctgaagcatttgactcg gtcttgggtgatacggcgtcttgccaccgggcctgtcagttgacctaccccttgcacacc taccctaagcatgtacagaagcatattcccaatctgatgagcaatatgcttgccatcttg gttgccagaatcagctgccattcgctgaactga >gi568815597r:54039523_54252922|GENSCAN_predicted_peptide_2|130_aa MVTGHTVNKMRKHSDSEVASLAREVYTEWKTFTEKHSNRPSIEVRSDPKTESLRKNAQKL LSEALELKMDHLLVENIERETFHLCSRLINGPYRRTVRALVFTLKHRAEIRAQVKSGSLP VGTFVQTHKK >gi568815597r:54039523_54252922|GENSCAN_predicted_CDS_2|393_bp atggtgacaggtcacactgtgaacaagatgcgtaaacactcagattcagaagtggcttct cttgccagagaagtttacactgagtggaaaactttcactgaaaaacattcaaatagacct tctattgaagttagaagtgatcccaaaaccgagtcgttgaggaaaaatgctcagaaatta ctctcagaagccttggaattaaagatggatcacctactggttgaaaatattgaacgggaa acgtttcatctctgctcccgcctcattaatgggccgtaccggcggacggtgagagccctg gtcttcacattaaagcaccgagctgaaatccgggctcaggtgaagagcggctcgctgcca gtcggcacgtttgtacagacccacaaaaagtga >gi568815597r:54039523_54252922|GENSCAN_predicted_peptide_3|231_aa MLPTGIKLKKLWPNYNSLNNPTSIHWIPDFVSVAVAEAPGTQQHTRQMRFLPARRLFSFK AERDAGRGDPAGVRCPDRTCSSEQPELKFVSRVSVCPLLAFPGVPSYVAGAQSCEGHGRG SGKGEFKGKVEPQQSGAPDSPRSLIQLCPPTYHHLSSPLLSLSPHCHHQTTQLPPPAMPQ FRPDHDQATECGALRGCREGYMGELCDGYHHHIFIFIIIKCLGLTREQRGQ >gi568815597r:54039523_54252922|GENSCAN_predicted_CDS_3|696_bp atgctgcccactggcatcaagttaaaaaaactatggcccaattacaattcactcaacaac ccaacaagcatccactggattccagattttgtgtctgttgctgtggcagaggctccaggc acacagcagcacacacgccagatgagattcctacctgcaaggcgtttgtttagtttcaag gcagaaagagatgctgggaggggggaccccgctggtgtcagatgtccagaccggacctgc agctccgagcagcccgagctcaagtttgtgagccgcgtgtccgtgtgtcccctgctggca ttcccgggcgttccctcctatgtggctggggcacagagctgtgaggggcacggcaggggt agtgggaagggagagttcaaggggaaggtggagcctcagcaaagcggggcgcctgacagc ccaaggtctctcatccagctctgtccgcccacgtaccaccatctctccagtccactcctt tctctctcgccccactgccaccaccagaccacccagctcccgcctccagccatgccccaa tttcgtccagaccacgaccaagccacggagtgtggtgcactaaggggttgtcgtgagggt tacatgggagagctgtgtgatggctaccatcatcacatcttcatcttcatcatcatcaag tgtttaggcctcacccgggagcaacggggccagtaa >gi568815597r:54039523_54252922|GENSCAN_predicted_peptide_4|559_aa MLAEWGACLLLAVALLGPGLQAQAMEGVKCGGVLSAPSGNFSSPNFPRLYPYNTECSWLI VVAEGSSVLLTFHAFDLEYHDTCSFDFLEIYNGASPDKGNLLGRFCGKVPPPPFTSSWHV MSVIFHSDKHVASHGFSAGYQKDVCGGVLTGLSGVLTSPEYPNNYPNSMECHWVIRAAGP AHVKLVFVDFQVEGNEECTYDYVAVLGGPGPTRGHHYCGSTRPPTLVSLGHELQVVFKSD FNIGGRGFKAYYFSGECQEVYMAMRGNFSSPQYPSSYPNNIRCHWTIRLPPGYQVKVFFL DLDLEEPNSLTKTCDFDHLAAFDGASEEAPLLGNWCGHHLPPPVTSSHNQLLLLLHTDRS TTRRGFSVAYIGVVPMNVSCSRTDFQILISTQALAPLERTKVYLGSRSCAAQEVGGNLRI QARFDTCGTESQGNKFSPSSFRCDEYPVTPQRRNNTSVIVSVLYIDFSAAGREDIHEYEV RCEPRRKEASVHLLSGSHWLGPYAATAEHLQEAPPMDEAEALEGPVSMVAQDTSDIVFLG LCILAGILMVIAIVVLMLL >gi568815597r:54039523_54252922|GENSCAN_predicted_CDS_4|1680_bp atgctggcagagtggggggcttgcctgctgctggcagtggcactgctgggcccagggctc caggcccaagccatggaaggtgtcaaatgtgggggtgtgctctcagcaccttctggaaac ttctccagccccaacttccctagactgtacccctacaacacagagtgcagctggctgatc gtggtggccgagggatcctcggtgctgctcaccttccatgcctttgacctagagtaccac gacacctgcagcttcgactttctggagatctacaatggggcctcaccagacaagggcaac ctgctggggaggttctgcggcaaggtgcccccgccgcccttcacctcctcctggcatgtc atgtctgtcatcttccactcggacaagcatgtggccagccatggcttttctgcgggctac cagaaagatgtgtgtggcggcgtcctgactggcctgtcaggggtcctcaccagtcctgag tatcccaacaactacccgaacagcatggagtgccactgggtgatccgggccgctggccct gcccacgtcaagctggtgttcgtggacttccaggtggagggcaatgaagagtgcacctat gactacgtggctgtgcttggggggcctggccccacccgtgggcaccactactgtggcagc accaggccccccaccctcgtgtctctgggccacgaactgcaggtggtcttcaagtccgac ttcaacatcggaggccgtggcttcaaggcctactacttctcaggagaatgccaggaggta tacatggccatgcggggcaacttctccagcccacagtaccccagctcctaccccaacaac atccgctgccactggaccatccgcctgcccccgggctaccaggtcaaggtgttcttcctg gacctggacctggaggagcccaacagcctgaccaagacctgtgactttgaccatctggcg gccttcgatggggccagcgaggaggcacccctgctggggaattggtgtggacaccacctg ccaccacccgtgacctcaagccacaaccagcttctgcttctgctgcacacagaccgcagc accacccgcaggggcttctctgtggcctacatcggagtggtgcccatgaacgtgagctgc tcccgcacggacttccagatcctgatctccacgcaggcgctggccccgctggagcggacc aaggtctacctgggcagccggagctgtgccgcccaggaggtcggcggcaacctcaggatc caggcccgctttgatacctgcggcactgagtctcagggaaacaagttcagcccaagctcc ttcaggtgtgatgagtaccctgtgaccccacagagaagaaacaacacttcagtgattgtc agcgtgctgtacatcgacttctcagccgcggggcgggaggacatccatgagtacgaggtc cgctgtgagccacggcgcaaggaggcttctgtccacctgctgtctggctctcactggctg gggccctatgctgccactgcggagcaccttcaggaagcaccacccatggatgaggcggag gcactggagggcccagtgagcatggtggcccaggataccagtgacatcgtcttcctgggc ctttgcatcctggctggaatcctcatggttattgccatcgtggtcttgatgctgctttga >gi568815597r:54039523_54252922|GENSCAN_predicted_peptide_5|501_aa MAAQAPLMMAEREEDDDTEEAWMQLRPTEPLPSQCCGSGCSPCVFDLYHRDLARWEAAQA SKDRSLLRGPESQSSVLSKAPGSCAREYPGTLADDEEVILDGGCGKGHSGDKALNAKPKN FCQLSDTVSTTMDIGLDILRSHQLINYANLDFSSFEEWSCPSKLNPETFVAFCIIAMDRL TKDTYRVRFALPGNSQLGLRPGQHLILRGIVDDLEIQRAYTPISPANAEGYFEVLIKCYQ MGLMSRYVESWRVGDTAFWRGPFGDFFYKPNQYGELLLLAAGTGLAPMVPILQSITDNEN DETFVTLVGCFKTFESIYLKTFLQEQARFWNVRTFFVLSQESSSEQLPWSYQEKTHFGHL GQDLIKELVSCCRRKPFALVCGSAEFTKDIARFSAEFLRTLTRGVHRLLGAADAVKTNVK GLVLGLLPSRHSDPLPSDSPGGPIWWLRGPALQLDSPGFESWFCHLEDVSPDLCGQGGVR QHLAITWAPPLDPGTSDRIQI >gi568815597r:54039523_54252922|GENSCAN_predicted_CDS_5|1506_bp atggctgcccaagccccactgatgatggctgagagggaagaggacgacgacactgaggaa gcctggatgcagctacggcccacagaacccttgccttcccagtgctgcggcagtggctgc tcaccctgtgtgtttgacctctatcaccgagatctggcaaggtgggaggcagcccaagcc agcaaggacaggagcctgctgcgtgggccagagtcacagtccagtgttctttccaaagca ccaggcagttgtgccagagagtacccaggtactctggctgatgatgaagaggtcatcctg gatggtgggtgtgggaaggggcacagtggagacaaggccttgaatgccaagcccaagaat ttctgtcagctcagtgatactgtcagcactaccatggatattgggttagacattctgagg tcccatcaacttataaattatgcaaatctggattttagcagctttgaagaatggagctgc ccctccaagctgaacccagagaccttcgtggccttctgcatcattgccatggacaggctc actaaggacacctaccgtgtccggtttgctctacccgggaacagccagcttggcctgcgg cccggccagcacctcatcctacgagggatagtagatgacttagaaattcagagagcctat acgcccatcagccctgccaacgcagaaggatactttgaagtgttaattaagtgctaccag atggggctgatgtcccggtatgttgagtcctggagagtaggagacacagctttctggcga ggacctttcggagatttcttctataaaccaaaccagtatggtgagctcctcttgctggct gcgggcacgggcctggcccccatggtgcctatcctgcagagcatcacagacaatgagaat gacgagacttttgtcactctggtcggttgcttcaagacctttgagagcatctacctgaaa accttcctccaagagcaggcccgtttctggaatgtccgtaccttctttgtactcagccag gagagctcctcagagcagcttccctggagttaccaagagaaaacccactttggccacctg ggccaggacctaattaaagagctggtcagctgctgtcggagaaagccattcgcactggtc tgtggctcggctgagttcaccaaagacatagccaggttttctgctgagttcctgaggacc ctgactcgaggagtccaccgtctcctgggggctgcagatgctgtcaaaacaaatgtgaaa ggtctggtgctcggcctgctacccagtagacactcagaccctcttccatccgactcccct ggcggcccgatctggtggctcagaggaccagcattgcagctggacagtcccggatttgaa tcctggttctgccacttagaggatgtcagcccagacctctgtggacaaggaggtgttagg cagcaccttgctatcacctgggcaccacccctggacccgggaaccagtgacaggatacag atatga >gi568815597r:54039523_54252922|GENSCAN_predicted_peptide_6|600_aa MALASGPARRALAGSGQLGLGGFGAPRRGAYEWGVRSTRKSEPPPLDRVYEIPGLEPITF AGKMHFVPWLARPIFPPWDRGYKDPRFYRSPPLHEHPLYKDQACYIFHHRCRLLEGVKQA LWLTKTKLIEGLPEKVLSLVDDPRNHIENQDECVLNVISHARLWQTTEEIPKRETYCPVI VDNLIQLCKSQILKHPSLARRICVQNSTFSATWNRESLLLQVRGSGGARLSTKDPLPTIA SREEIEATKNHVLETFYPISPIIDLHECNIYDVKNDTGFQEGYPYPYPHTLYLLDKANLR PHRLQPDQLRAKMILFAFGSALAQARLLYGNDAKVLEQPVVVQSVGTDGRVFHFLVFQLN TTDLDCNEGVKNLAWVDSDQLLYQHFWCLPVIKKRVVVTGPCFSKSQMSTSFVLTPSFNA GEEVMQRNLRASASLPPSTGSPATALSDPPLPRWQNGDTAPPFQVSPPLDQIGSIRQGFK SMGLEFFLEARCCAGSELQNEGDMVPALWELLGYWVDTARVVSFGLAVADLRAQSFGLFS IHSLDDFSQSGGFKDPLNGNNSHAYEAANPPSPILQMRELRTEVSTNLPKATQQVAANKI >gi568815597r:54039523_54252922|GENSCAN_predicted_CDS_6|1803_bp atggcattggcgtccgggcccgcaaggcgggcgctagctggctccgggcagctcggcctt gggggcttcggggccccgagacgcggggcgtatgagtggggcgtgcgctccacgcggaag tcggagcctcctcccctggatagggtgtacgagatccctggactggagcccatcaccttt gcggggaagatgcacttcgtgccctggctggcgcggccgatctttccgccctgggaccgc ggctacaaggacccaaggttctaccgctcgccccctcttcacgagcatccgctgtacaaa gaccaggcctgctatatctttcaccaccgttgccgccttctcgagggtgtaaagcaggcc ctctggctcaccaagaccaagttaatagaaggccttcccgagaaagtgcttagccttgtt gatgatccaaggaaccacatagagaaccaagacgagtgcgttctgaatgtgatctctcac gcccgtctctggcagaccactgaggaaatccccaagagagagacctactgcccggtcatc gtggacaacctaatacagctgtgtaaatctcagattctcaagcatccttctctggccagg aggatctgtgtccaaaactccacgttttctgctacctggaaccgagagtctcttctcctt caagtccgtggttctggtggagcccgactgagcactaaggatcctctgcccaccatcgcc tccagagaggagattgaagctactaagaatcatgttctagagaccttctaccccatatca cccatcatcgatcttcatgaatgcaatatttatgatgtgaaaaatgacacaggattccag gaaggctatccttacccctatccccataccctgtacttactggacaaagccaatttacga ccacaccgccttcaaccagatcagctgcgggccaagatgatcctgtttgcttttggcagt gccctggctcaggcccggctcctctatgggaatgatgccaaggtcttggagcagcccgtg gtggtgcagagcgtgggcacggatggacgtgtcttccatttcctagtgtttcaactgaat accacagacctggactgtaacgagggtgtcaagaatttggcctgggtggactcagaccag ctcctctatcagcatttttggtgtctcccagtgatcaaaaagagagtggttgtgacaggt ccttgcttctccaaatcccaaatgtctaccagcttcgtcctcactccctcattcaacgca ggagaggaagtgatgcagcggaacttgagggcctcagccagcctcccgcccagtacaggc tccccagccactgccctgtcagatcctccacttcctcgctggcaaaatggggatacagca cctcccttccaagtttccccacctttggatcaaataggctccattcgccaaggtttcaag tcgatgggtttggagttcttcttagaagccaggtgttgtgctgggagcgagttacagaat gagggagacatggtccccgccctctgggagctcctggggtactgggtggacacggccaga gtggtctcctttggattggcagtagcagatctcagggcccagtccttcggcctcttctcc attcattccctagatgatttcagccagtctggtggctttaaagaccctctaaatggcaac aactcccatgcctatgaggcagccaacccgccatcccccattttgcagatgagagaactg agaacagaagtatcaactaatttgcccaaggccacacagcaagttgcagcaaacaagatt taa >gi568815597r:54039523_54252922|GENSCAN_predicted_peptide_7|244_aa MGFGQKEGVGTAASERKVVGATARARPRELGLLPHFRALIGHPNMGGSMQRMNPPRGMGP MGPGPQALSLWEGALPTATARAVRKGGQPLQHKSLSMGSKTKSIPSYPPEPRGKIPYSSS SPGTYVGPPGGGGPPGTPIMPSPADSTNSSDNIYTMINPVPPGGSRSNFPMGPGSDGPMG GMGGMEPHHMNGSLGSGDIDGLPKNSPNNISGISNPPGTPRDDGELGGNFLHSFQNDNLN HIPK >gi568815597r:54039523_54252922|GENSCAN_predicted_CDS_7|735_bp atggggtttgggcagaaggagggagtgggcactgctgcctctgagaggaaggtggttggt gctacagccagagcaagacctagagaacttggtctcctgcctcacttcagagctctgata ggccaccccaacatgggaggatcaatgcagagaatgaaccctccccgaggcatggggccc atgggtcccggcccacaggccctgagcttgtgggagggagccctgcctactgccacagcc cgggctgtcaggaagggcggccagccccttcagcataaaagcctctcaatgggttccaag accaagtccatcccctcttacccgcctgagcccagggggaagattccatactcctcctca tcacctggtacctatgtgggaccccctggtggtggcggtcctccaggaacacccattatg cccagtcccgcagattcaacaaattccagtgacaacatctacacaatgattaatccagtg ccgcctggaggcagccggtccaacttcccgatgggtcccggctcggacggtccgatgggc ggcatgggtggcatggagccacaccacatgaatggatcattagggtcaggcgacatagac ggacttccaaaaaattctcctaacaacataagtggcattagcaatcctccaggcacccct cgagatgacggcgagctaggagggaacttcctccactcctttcagaacgacaatctcaac cacatccccaaatga