GENSCAN 1.0 Date run: 7-Nov-116 Time: 01:20:02 Sequence gi568815593r:62412246_62612482 : 200237 bp : 37.98% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 53 48 6 -3.94 1.03 Term - 842 211 632 0 2 52 37 233 0.069 8.09 1.02 Intr - 15280 15177 104 0 2 101 18 70 0.051 0.20 1.01 Init - 20172 20015 158 0 2 73 47 124 0.227 6.23 1.00 Prom - 21555 21516 40 -5.85 2.00 Prom + 22950 22989 40 -5.75 2.01 Init + 25035 25172 138 2 0 49 119 95 0.502 8.89 2.02 Intr + 30738 30838 101 2 2 82 89 41 0.953 1.59 2.03 Intr + 37682 37754 73 2 1 108 92 63 0.996 7.19 2.04 Intr + 39485 39688 204 1 0 102 95 150 0.878 15.57 2.05 Intr + 53753 53860 108 1 0 49 89 57 0.262 1.46 2.06 Intr + 54886 55018 133 0 1 107 74 21 0.388 1.90 2.07 Intr + 58005 58063 59 1 2 77 115 52 0.393 4.48 2.08 Intr + 70856 71048 193 1 1 94 86 71 0.546 5.64 2.09 Intr + 71765 71917 153 0 0 84 46 64 0.501 0.92 2.10 Intr + 73174 73217 44 2 2 39 110 30 0.101 -2.66 2.11 Intr + 76198 76325 128 0 2 60 50 79 0.013 -0.14 2.12 Intr + 81753 81879 127 0 1 111 68 89 0.047 8.96 2.13 Intr + 93996 94112 117 2 0 89 91 42 0.223 4.34 2.14 Intr + 100022 100216 195 1 0 56 -7 201 0.511 6.09 2.15 Intr + 113897 114012 116 1 2 73 95 78 0.112 5.33 2.16 Intr + 118464 118540 77 0 2 103 76 -3 0.049 -1.76 2.17 Intr + 132603 132781 179 1 2 27 86 93 0.000 1.72 2.18 Intr + 138122 138217 96 1 0 106 94 22 0.188 3.89 2.19 Intr + 148891 149012 122 0 2 78 66 81 0.272 3.27 2.20 Intr + 156646 156740 95 1 2 99 69 77 0.394 5.59 2.21 Intr + 167828 169032 1205 0 2 78 41 400 0.076 20.78 2.22 Intr + 169754 169822 69 1 0 45 64 134 0.363 5.16 2.23 Intr + 172521 172597 77 2 2 77 82 36 0.340 -0.71 2.24 Intr + 179332 179427 96 1 0 100 39 95 0.090 4.01 2.25 Intr + 189519 189603 85 0 1 10 115 120 0.106 5.80 2.26 Term + 191691 191765 75 2 0 58 49 70 0.574 -3.04 2.27 PlyA + 192599 192604 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 77870 77985 116 0 2 95 38 113 0.910 4.75 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:62412246_62612482|GENSCAN_predicted_peptide_1|297_aa MGSQPASISHTPQLYAAEILYQANTARGLGLSSSIQAPLLRQKLYHRAAGKEYVAQFLAG HGLTDTGPWPEGWGHLPYDTSCRLSKTGCANSQVCRPPLPRYRYPSSPGSPTFLPPLGTR VPNPLPSTPARSVALGPGPHRLVRSGIARPSRSHFPTRSGVGWSLSPRCQTPRRRLLMPK AASRNTPCLASQLQTTRTRENYVTGDQRACDRQREKAIGERREGEGGALSLNQKARGIQI SCVRLRSALAQYKEPGVRTSRICHHGNHRSGFSDSQGAQLFPALIEASIRTISRFIF >gi568815593r:62412246_62612482|GENSCAN_predicted_CDS_1|894_bp atgggtagccagccagccagtatttctcatacaccccagctatatgctgcagaaattctg taccaggcaaacacagctagaggtctagggctctcttcttccatccaggcaccactcctg cggcagaagctctaccacagggcagcaggcaaagaatatgtggcccagttcctagcaggc cacggactgactgatactggtccatggcctgagggctggggacacctgccctatgacact tcatgcagactgtcaaaaactggttgcgcaaacagccaagtctgcaggccgcccttgcct cgctaccgttatccctcatctcctggtagccccactttcctcccgcctctcgggactcgg gtgcccaacccactcccttccactcccgcgcgctcggtggctctcgggccggggcctcac cgtttggttcggtcgggaattgcccggccctcccgctcccacttcccaacacgaagtggc gtcggttggtccctctccccacgctgccagaccccacgccgccgcctactgatgccgaaa gcggcttctaggaacacgccatgtttggcgtcgcagctccaaacgacgcggacgcgcgaa aactacgtcacaggagaccagcgcgcatgcgaccggcagagagagaaggcgataggcgaa cggcgggaaggggaaggaggggctttgtctttgaaccaaaaggcgcggggtatccaaatc agttgtgtgcgcttgcgcagtgcgcttgcgcagtataaagagccaggagtccggactagc cggatctgtcaccatggaaaccataggtctggtttctccgactcccagggagctcaattg tttcctgcgttgattgaagcttcaatcagaaccatttcacgctttatattctag >gi568815593r:62412246_62612482|GENSCAN_predicted_peptide_2|1354_aa MDLNSASTVVLQVLTQATSQDTAVLKPAEEQLKQWETQPGFYSVLLNIFTNHTLDINVRW LAVLYFKHGIDRYWRRVAPHALSEEEKTTLRAGLITNFNEPINQIATQIAVLIAKVARLD CPRQWPELIPTLIESVKVQDDLRQHRALLTFYHVTKTLASKRLAADRKLFYDGSRAENLI PLQNLRGDEKLSHDSLMVQIKRRVGDGLLASGIYNFACSLWNHHTDTFLQEVSSGNEAAI LSSLERTLLSLKVLRKLTVNGFVEPHKNMEVMLLDFLDQHPFSFTPLIQRSLEFSVSYVF TEVGEGVTFERFIVQCMNLIKMIVKNYAYKPSKNFEDSSPETLEAHKIKMAFFTYPTLTE ICRRLVSHYFLLTEEELTMWEEDPEGFTVEETGGDSWKYSLRQPIECARQGKVIKDDPKL LSLNDKETVTSLEVVGDRDILSECMYKPLRRRVIWLIGQWISVKFKSDLRPMLYEAICNL LQDQDLVYLETMFTLLFQLLQQVTECDTKMHVLHVLSCVIERVNMQWAPEQQDVRFWFMD HIMDPSLTLLNAKIPPFRFRHQMGFRDQLGYVLGQHDMSILKLLVIVFVRIGLGADSKNL YPFLLPVIQLSTDVSQPPHVYLLEDGLELWLVTLENSPCITPELLRIFQNMSPLLVASKR IKYLGIQLTRDVKELFRENYKPLLNEIKEDTNKWKNIPCSWIGRINIVKMAILSKVVENA LKVNPILGPQMFQPILPYVFKGIIEGEMDQLLGNMIEMWVDRMDNITQPERRKLSALALL SLLPSDNRGSSGIQKGLTVHFPKPVDTAAILALESALRPGSNNLTKVPSNAFEVLKSLRR LSLSHNPIEAIQPFAFKGLANLEYLLLKNSRIRNVTRDGFSGINNLKHLILSHNDLENLN SDTFSLLKNLIYLKLDRNRIISIDNDTFENMGASLKILNLSFNNLTALHPRVLKPLSSLI HLQANSNPWECNCKLLGLRDWLASSAITLNIYCQNPPSMRGRALRYINITNCVTSSINVS RAWAVVKSPHIHHKTTALMMAWHKVTTNGSPLENTETENITFWERIPTSPAGRFFQENAF GNPLETTAVLPVQIQLTTSVTLNLEKNSALPNDAASMSGKTSLICTQEVEKLNEAFDILL AFFILACVLIIFLIYKVVQFKQKLKASENSRENRLEYYSFYQSARYNVTASICNTSPNSL ESPGLEQIRLHKQIVPENEAQELKFSTAGVVNDGVRFKARKRDQCPLPLVITLLKLISLC GRTLLMWEACVIQDKFCGIINISVEGLHDVMTEDPETGTYKDCMLMSHLEEPKVTEDEEP PTEQDKRKKMIHRYYLVTVAYSIRYSNMLYRFVA >gi568815593r:62412246_62612482|GENSCAN_predicted_CDS_2|4065_bp atggatctcaatagtgccagcactgttgttcttcaggtgttaacacaggccaccagtcag gatactgctgtgttaaaaccagctgaggagcagttgaagcagtgggagacacagccaggt ttctattcagtgttgctgaatattttcaccaaccacactttggatataaatgtaaggtgg cttgctgtactgtattttaaacatggaattgatcgctactggagacgtgtagcacctcat gctctctcagaggaggagaaaactactctgcgtgcagggctcatcaccaacttcaatgaa ccaataaaccagattgcaactcagattgcagtgctcattgcaaaagttgctagattggat tgtcccagacagtggcctgaactaattcccactcttatagagtctgttaaagtccaggat gatcttcgacagcacagagcattacttaccttctatcatgttaccaagacactggcatct aaacgacttgctgctgatagaaaactattttatgatgggagtagggcagaaaaccttata cctcttcagaacctgagaggtgatgagaaactgtctcatgatagcctaatggtgcagata aagagacgagtaggagatggtcttttagcttctggaatttataattttgcctgctctctg tggaatcaccacacagacacattcctgcaagaagtttcttctggcaatgaagctgcaatt ttgagttcactagaacgaacactgctatcattgaaagtgctgcgtaagttaactgttaat ggatttgtggaacctcataagaatatggaggtgatgcttttggacttcttggatcagcat cctttttcatttactcctctaattcagagatcactggaattttctgtaagctatgttttt acagaagttggtgaaggcgttacatttgaacgattcattgtccaatgtatgaatcttatt aagatgattgtcaaaaattatgcttataagccatccaaaaattttgaagatagcagccct gaaactcttgaagcccataagattaagatggcattcttcacatatcctactttgacagag atatgtagaagattagtctctcattatttcctattaactgaagaagaactgacaatgtgg gaagaagacccagaaggctttacagtggaagaaacaggaggagattcttggaaatatagt ttgaggcaaccgattgaatgtgctaggcaaggaaaagtgatcaaagatgatcctaaactt ttgagccttaatgacaaggaaacagttacctcattagaagtggtaggagatagagatata ctttcggagtgcatgtataagccattgcgacgcagggtgatttggctcatcggtcagtgg atttctgtgaaattcaagtctgacttaagacccatgctttatgaagcaatctgtaacttg cttcaagatcaagatttagtgtatttggaaaccatgttcacactactttttcagttactg cagcaagttacagaatgtgacacaaagatgcatgttttgcatgtcctttcttgtgtgatc gaaagagtcaacatgcagtgggcgccggaacagcaagatgtgaggttctggttcatggat catataatggacccatccctgactctgctgaacgccaagattcctccattcagattcaga catcagatgggttttagggaccagcttggctatgtccttgggcagcatgacatgtcgata ctcaaactcctcgtcatcgtatttgtccgaataggattaggagcagacagcaagaacctg taccctttcctgctcccagttattcaactgagtacagatgtttcacagcctccacatgtt tatcttctggaagatggtttagaattatggttagtaactttggaaaacagtccatgtatt acaccagagttgcttcgtatatttcagaatatgtcaccacttcttgttgcttcaaagaga ataaaatacctaggaatccaacttacaagggatgtgaaggagctcttcagggagaactac aaaccactgctcaacgaaataaaagaggacacaaacaaatggaagaacattccatgctca tggataggaagaatcaatatcgtgaaaatggccatactatccaaggttgtggaaaatgcc cttaaagtgaacccaatactaggtccacaaatgtttcaaccgattttaccctatgttttc aagggtattatagaaggggagatggaccagcttttgggaaatatgattgaaatgtgggtt gatcgaatggacaacattacccagcctgaaagaagaaaactttcagctttggctttgctc tctcttctgccatctgataatagagggtcttcaggaattcaaaagggactgactgttcac ttccctaagcctgtggatactgcagccattttagcactagagagtgccctaaggccagga agtaataatttaacaaaagtaccatcaaatgcctttgaagtacttaaaagtcttagaaga ctttctttgtctcataatcctattgaagcaatacagccctttgcatttaaaggacttgcc aatctggaatacctcctcctgaaaaattcaagaattaggaatgttactagggatgggttt agtggaattaataatcttaaacatttgatcttaagtcataatgatttagagaatttaaat tctgacacattcagtttgttaaagaatttaatttaccttaagttagatagaaacagaata attagcattgataatgatacatttgaaaatatgggagcatctttgaagatccttaatctg tcatttaataatcttacagccttgcatccaagggtccttaagccgttgtcttcattgatt catcttcaggcaaattctaatccttgggaatgtaactgcaaacttttgggccttcgagac tggctagcatcttcagccattactctaaacatctattgtcagaatcccccatccatgcgt ggcagagcattacgttatattaacattacaaattgtgttacatcttcaataaatgtatcc agagcttgggctgttgtaaaatctcctcatattcatcacaagactactgcgctaatgatg gcctggcataaagtaaccacaaatggcagtcctctggaaaatactgagactgagaacatt actttctgggaacgaattcctacttcacctgctggtagattttttcaagagaatgccttt ggtaatccattagagactacagcagtgttacctgtgcaaatacaacttactacttctgtt accttgaacttggaaaaaaacagtgctctaccgaatgatgctgcttcaatgtcagggaaa acatctctaatttgtacacaagaagttgagaagttgaatgaggcttttgacattttgcta gcttttttcatcttagcttgtgttttaatcatttttttgatctacaaagttgttcagttt aaacaaaaactaaaggcatcagaaaactcaagggaaaatagacttgaatactacagcttt tatcagtcagcaaggtataatgtaactgcctcaatttgtaacacttccccaaattctcta gaaagtcctggcttggagcagattcgacttcataaacaaattgttcctgaaaatgaggca caggaactaaaattcagtacagctggtgtcgtgaatgatggtgtgcgatttaaagcccga aagagagatcagtgcccactaccactggtcatcactttgctgaagctgataagcttgtgt ggccgcacacttttaatgtgggaagcctgtgttatccaagataaattctgtgggattata aacatttcagtagaaggcctgcatgatgtcatgacggaagatcctgaaacaggaacttat aaagactgtatgttgatgtctcatcttgaggaaccaaaagtaacagaagatgaagaacca cccacagaacaagataagaggaaaaagatgatacacagatactacttggttacagttgcc tacagtattcgctacagtaacatgctgtacagatttgttgcctag