GENSCAN 1.0 Date run: 5-Nov-116 Time: 18:21:56 Sequence gi568815592r:88998581_89217887 : 219307 bp : 42.40% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2871 3148 278 0 2 79 68 130 0.632 6.34 1.02 Term + 10502 10527 26 0 2 110 47 21 0.070 -2.39 1.03 PlyA + 11676 11681 6 1.05 2.00 Prom + 38419 38458 40 -2.35 2.01 Init + 43366 43368 3 0 0 73 103 0 0.610 0.05 2.02 Intr + 47038 47145 108 0 0 63 64 90 0.797 3.66 2.03 Intr + 51236 51352 117 1 0 102 9 143 0.073 7.54 2.04 Intr + 66589 66762 174 0 0 42 66 121 0.068 4.51 2.05 Term + 67408 67644 237 0 0 19 48 235 0.071 7.78 2.06 PlyA + 69412 69417 6 1.05 3.00 Prom + 77016 77055 40 -5.35 3.01 Init + 77820 77880 61 2 1 63 73 52 0.264 2.67 3.02 Intr + 82217 82393 177 0 0 57 17 130 0.547 1.77 3.03 Intr + 82499 82854 356 0 2 120 78 206 0.524 16.88 3.04 Term + 85173 85616 444 2 0 102 33 224 0.998 12.55 3.05 PlyA + 89102 89107 6 1.05 4.06 PlyA - 91058 91053 6 1.05 4.05 Term - 100367 100207 161 0 2 69 49 121 0.084 3.42 4.04 Intr - 106634 106539 96 0 0 12 70 133 0.592 2.96 4.03 Intr - 106958 106847 112 2 1 53 106 150 0.977 12.33 4.02 Intr - 108211 108101 111 1 0 38 76 85 0.599 1.96 4.01 Init - 109079 109068 12 2 0 53 60 -6 0.162 -6.44 4.00 Prom - 110238 110199 40 -7.05 5.00 Prom + 110986 111025 40 -3.95 5.01 Init + 114380 114390 11 1 2 60 91 16 0.317 -2.33 5.02 Intr + 119229 119754 526 0 1 68 44 254 0.457 11.02 5.03 Intr + 120345 120461 117 2 0 69 54 133 0.200 7.74 5.04 Term + 142075 142317 243 0 0 76 38 175 0.410 6.22 5.05 PlyA + 145062 145067 6 1.05 6.00 Prom + 145676 145715 40 -2.95 6.01 Init + 147565 148029 465 0 0 93 100 425 0.599 39.89 6.02 Intr + 150685 150833 149 0 2 82 61 166 0.961 11.41 6.03 Intr + 154463 154605 143 2 2 82 60 82 0.950 3.88 6.04 Intr + 156168 156322 155 1 2 76 61 72 0.934 2.17 6.05 Intr + 159745 159880 136 0 1 63 91 73 0.948 4.32 6.06 Intr + 162848 163054 207 0 0 -19 105 117 0.593 0.83 6.07 Intr + 163203 163310 108 1 0 61 119 143 0.999 14.04 6.08 Term + 163529 163683 155 0 2 74 42 228 0.998 13.90 6.09 PlyA + 166165 166170 6 1.05 7.11 PlyA - 166281 166276 6 1.05 7.10 Term - 179983 179867 117 1 0 32 41 100 0.740 -2.74 7.09 Intr - 180483 180295 189 0 0 86 47 155 0.952 10.06 7.08 Intr - 181908 181712 197 1 2 103 57 302 0.992 26.81 7.07 Intr - 183477 183325 153 1 0 108 82 44 0.951 4.92 7.06 Intr - 186870 186730 141 1 0 99 111 144 0.999 17.20 7.05 Intr - 191667 191585 83 2 2 99 68 51 0.469 2.56 7.04 Intr - 199663 199440 224 1 2 112 91 262 0.814 24.90 7.03 Intr - 200849 200782 68 0 2 89 115 84 0.902 8.91 7.02 Intr - 202685 202579 107 1 2 94 64 80 0.483 5.14 7.01 Init - 204653 204532 122 2 2 55 43 95 0.310 1.41 7.00 Prom - 206584 206545 40 -6.15 8.00 Prom + 206599 206638 40 -7.75 8.01 Init + 209420 209530 111 1 0 39 73 114 0.251 5.39 8.02 Term + 213463 213669 207 0 0 40 48 181 0.189 5.66 8.03 PlyA + 213897 213902 6 1.05 9.03 PlyA - 214086 214081 6 1.05 9.02 Term - 215549 215380 170 0 2 78 39 148 0.977 5.96 9.01 Init - 217723 217528 196 1 1 49 89 69 0.657 2.48 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 67751 67661 91 2 1 78 101 69 0.815 7.90 S.002 Sngl - 100381 99998 384 1 0 51 47 205 0.808 8.64 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:88998581_89217887|GENSCAN_predicted_peptide_1|101_aa XVSVNAMTHGDQLELTQQKIWKSRATHLRESTKMTSFLMPPASRGTQRSCSKSRKRQMRR RNPSSFVASCATLLPFTSVPASAAGPFRASFPSATNSAFGV >gi568815592r:88998581_89217887|GENSCAN_predicted_CDS_1|306_bp nnggtgtcagtcaatgccatgacccatggtgaccagcttgagctcacccagcagaaaata tggaaaagcagagcaactcatttaagagaaagcaccaagatgaccagctttctgatgcca cctgcaagtagagggactcagagatcatgcagcaaaagcagaaaaaggcaaatgagaagg agaaacccaagtagctttgtggcttcatgcgcaaccctcctgccattcaccagtgtccct gcatctgcagcaggtccctttcgtgcttccttcccctcagccaccaactcagcttttggt gtgtga >gi568815592r:88998581_89217887|GENSCAN_predicted_peptide_2|212_aa MAFVSLCEKHGPEVPSSSRCPCSSNEDVKPWRWKSMMVLAEQHAEAYVLPTTITNLDVVV EAMSLMRSLKMSGKNKTHRNPWRRGKAQAWRAAARASPTNAAPCSRAPSPIDRPRAEECE CMAQDWQAAPPTAPVCSFTPEASETTSPPGGTNNSRRAALRAVTLTAKVCSFTPEPARLR THQKEETPNTSEHQKEQTPDATFGVNKLQTPP >gi568815592r:88998581_89217887|GENSCAN_predicted_CDS_2|639_bp atggcttttgtgtccctgtgtgagaaacatggaccagaagtgccatcaagttctcgctgt ccctgttcttcaaatgaggacgtgaagccatggagatggaaaagtatgatggtgcttgca gaacagcacgcagaagcttatgtactgcccacaacaatcaccaacttggacgtggtagtt gaagcaatgagtcttatgaggtccctcaagatgagcgggaagaacaaaactcacaggaac ccatggaggcgggggaaggctcaggcatggcgggctgcagcccgagcctccccgacaaat gccgccccctgctccagggcgcccagtcccatcgaccgcccaagggctgaggagtgcgag tgcatggcgcaggactggcaggcagctccacctacagccccggtctgcagcttcactcct gaagccagtgaaaccacgagcccaccaggaggaacgaacaactccagacgcgccgcctta agagctgtaacactcactgcgaaggtctgtagcttcactcctgaaccagcgagactacga acccaccagaaggaagaaactccgaacacatccgaacatcagaaggaacaaactccagac gcgacctttggagttaacaaactccaaactccaccttaa >gi568815592r:88998581_89217887|GENSCAN_predicted_peptide_3|345_aa MRQLGRMRCENLLNARGGGCSSPSSIHRRHPIFTGFIHLLLLSSLLSDKLPSAMTVVSVP QREPLVLGGRLAPLGFSSRGGDGPCLTPQPRAPAALPNRSLAVAGGTPRAAPKKRRKKKV RASPAGQLPSRFHQYQQHRPSLEGGRSPATGPSGAQEVPGPAAALAPSPAAAAGTEGASP DLAPLRPAAPGQTPLRKEVLKSKMGKSEKIALPHGQLVHGIHLYEQPKINRQKSKYNLPL TKITSAKRNENNFWQDSVSSDRIQKQEKKPFKNTENIKNSHLKKSAFLTEVSQKENYAGA KFSDPPSPSVLPKPPSHWMGSTVENSNQNRELMAVHLKTLLKVQT >gi568815592r:88998581_89217887|GENSCAN_predicted_CDS_3|1038_bp atgcgccagctagggaggatgaggtgtgagaatctcctgaacgccagaggtggaggctgc agctctcctagcagcatccatcgccgccaccctatcttcactggcttcattcaccttctc cttctctcttcgttgctgagcgacaagcttcctagcgctatgactgtcgtctccgtcccg cagcgggagccgctcgtcctgggtggccgccttgcgccgcttggcttttcctcccgaggg ggagatggcccgtgtctgaccccccagcctcgcgctccagcagctctgcccaaccgcagc ctcgccgtggcgggaggcactcctcgggcagcgccgaagaagcggcgaaagaagaaggtg cgggccagccccgcagggcagctgcccagccgcttccaccagtaccagcagcaccggccg agtctggagggcggccggagccccgcgaccggcccgagcggagcgcaggaggtcccgggc ccggccgccgccttggccccgagtcctgcagccgcagccggcacggagggagccagcccc gaccttgccccgctgcggcccgcggctcccggccaaacccccctcaggaaagaggtttta aaatcaaagatgggaaaatcggagaaaattgcccttccccatggccagcttgttcatggt atacacttgtatgagcaaccaaagataaacagacagaaaagcaaatataacttgccacta accaagatcacctctgcaaaaagaaatgaaaacaacttttggcaggattctgtttcatct gacagaattcagaagcaggaaaaaaagccttttaaaaataccgagaacattaaaaattcg catttgaagaaatcagcatttctaactgaagtgagccaaaaggaaaattatgctggggca aagtttagtgatccaccttctcctagtgttcttccaaagcctcctagtcactggatggga agcactgttgaaaattccaaccaaaacagggagctgatggcagtacacttaaaaacgctc ctcaaagttcaaacttag >gi568815592r:88998581_89217887|GENSCAN_predicted_peptide_4|163_aa MLTQSSRVKAKSDFVGSASLIKSILQALKSKGQGSRSSDIELVIFEDVRDAEDALYNLNR KWVCGRQIEIQFAQGDRKMITGDQEAPAKEELEVEVLHGEEIGGGQTALKSLDTGDFLIA SLNLVPNHYQGGLPQQGSQELQEGILALEDGQGPSPYKRGPSQ >gi568815592r:88998581_89217887|GENSCAN_predicted_CDS_4|492_bp atgctcactcagagttccagagtcaaagccaagtctgattttgttggttctgcgtctctt ataaagtccatcttgcaagccttaaagagtaaaggtcaaggttcaagatcaagtgacatt gagctagtcatatttgaagatgttcgagatgctgaagatgctctttataacctcaataga aagtgggtatgtggccgtcagattgaaatacagtttgcacaaggtgatcgcaaaatgatc acaggagatcaagaagccccagccaaagaagaactcgaagtagaagttcttcatggggaa gaaataggaggcggtcagacagccttaaagagtctcgacacaggcgattttcttatagcc agtctaaatctcgttccaaatcattaccaaggcggtctacctcagcaaggcagtcaagaa ctccaagaaggaattttggctctagaggacggtcaaggtccaagtccttacaaaagaggt ccaagtcaatag >gi568815592r:88998581_89217887|GENSCAN_predicted_peptide_5|298_aa MLYSRGRSPGGVRDVPDEQGGVGGPRVARHDRFLRSLRVCPRPLDSLRLPLPLLPPQELR RPPARPPPLGLSPASAQPQRPAQRGRSAAPAWTHGPGAGAAPARPRGRGNSEDRSAAVPR PYGIRVWGVLECDGIWGGGGEGQCMSGGRRPRGGLRPRRGRRGLEDPEVVAGLRLGGGGL DSSSPCSCGRQAQPVIQGWECNQATPKCEECPFSLAEVCCFVHTVGNQGKGKAYPLRTSL SSFPGIVMENRAANLTAKRREAKGSLRSSGTRTFRALKSLPSLCGPFEEIRTELENEL >gi568815592r:88998581_89217887|GENSCAN_predicted_CDS_5|897_bp atgctgtacagccgcggccgttcacctggtggcgtccgcgacgttcctgatgaacaggga ggtgttggggggcctcgtgtagcgagacatgaccgcttcctccgttccctccgggtctgc ccgcggccgctggactcgctccgtctcccgctaccgctgctaccaccacaggagctccgc cggcccccggcgcgacccccacccctcggcctcagccccgccagcgcgcagccgcagagg ccggcgcagagggggcgcagcgcggcgccagcctggacgcacgggccgggggcgggggcg gcgccggcgcggcccaggggccgcgggaattccgaagacaggagcgcggccgttcccagg ccctatgggatccgggtatggggggtcctggagtgcgacgggatttggggagggggcggc gagggccagtgtatgtcaggagggcggaggccaagagggggcttgaggccgcggcgggga cgccgggggttagaggacccagaggtggtggcggggctgcggctgggcggaggtgggtta gactcgtcttcgccttgctcctgtggccggcaggcacagcctgttattcagggctgggag tgtaaccaggcaactcctaagtgtgaagagtgccccttttcgctcgcagaagtgtgttgt tttgtacacactgtggggaatcagggcaaaggaaaagcctatcctctcagaacctccctg tcctcatttccaggaatagtgatggagaacagagcagctaatctcactgccaagagaagg gaagccaaagggagcctcaggtcatcaggtaccaggacattcagggctttaaagtcactg ccttcactttgtgggccatttgaagaaatccgcacagaacttgaaaacgagttgtaa >gi568815592r:88998581_89217887|GENSCAN_predicted_peptide_6|505_aa MRPGGERPVEGGACNGRSELELLKLRSAECIDEAAERLGALSRAIWSQPELAYEEHHAHR VLTHFFEREPPAASWAVQPHYQLPTAFRAEWEPPEARAPSATPRPLHLGFLCEYDALPGI GHACGHNLIAEVGAAAALGVRGALEGLPRPPPPVKVVVLGTPAEEDGGGKIDLIEAGAFT NLDVVFMAHPSQENAAYLPDMAEHDVTVKYYGKASHSASYPWEGLNALDAAVLAYNNLSV FRQQMKPTWRVHGIIKNGGVKPNIIPSYSELIYYFRAPSMKELQVLTKKAEDCFRAAALA SGCTVEIKGGAHDYYNVLPNKSLWKAYMENGRKLGIEFISEDTMLNGPSERQKKGPSERQ KGSHASAVSQKRRKGVGGSVECWPRGPVKAVRQGEGRFAVTFMRDVTDTGEWVADSVEGS TDFGNVSFVVPGIHPYFHIGSNALNHTEQYTEAAGSQEAQFYTLRTAKALAMTALDVIFK PELLEGIREDFKLKLQEEQFVNAVE >gi568815592r:88998581_89217887|GENSCAN_predicted_CDS_6|1518_bp atgaggcccggaggggagcggcccgtggaagggggcgcgtgcaatggccgctccgagctg gagctactgaagctgcgctcggcggagtgcatcgacgaggcggccgagcggctgggggcc ctgagccgcgcgatctggagccagcccgagctggcctacgaggagcaccatgcccaccgc gtgctgacgcacttcttcgagcgggagccgcccgcggcctcctgggcagtgcagccgcac taccagctgcccacggccttccgcgccgagtgggagccgccggaggcccgggcaccgagc gccacgccacgcccgctgcacctgggcttcctctgcgagtacgacgcgctgcccggcatc ggccacgcctgcggccacaacctcatcgctgaggtcggggcggcggccgcgctgggcgtg aggggggccttagagggcctccccaggccgcctccgcccgtgaaggtagttgtcctggga acccctgcagaagaagatggtggtggcaaaattgatttaattgaagcaggggcttttaca aatcttgatgttgtttttatggcccacccatcacaagagaatgctgcttatctaccagat atggctgaacatgatgtgactgtgaaatactatggaaaagcatctcattctgcttcttat ccctgggaaggattaaatgcattagatgctgctgtgctggcctataacaatctgtctgtg ttcagacagcaaatgaaaccaacctggagagttcatggtataataaaaaatggtggtgta aaacccaatatcattccctcttattctgaattaatctattacttccgtgcaccctcaatg aaagaacttcaagttttgaccaaaaaggcagaagattgcttcagagctgcagctttggct tcagggtgcacagtggaaattaaaggtggagcacatgattattacaatgttcttcccaat aagagcctatggaaagcctatatggaaaatggaagaaagctaggaatagagttcatttca gaagatacaatgttgaatggcccttcagagagacagaagaaggggccatcagagaggcag aagggaagccatgcaagtgctgtgtcacagaagagaagaaagggagtaggtggcagtgtg gagtgctggccaagagggccagtaaaagccgtgagacaaggtgagggcaggtttgcagtg accttcatgagagatgtgactgatacaggggagtgggtggcagactcagtggaaggatct acggattttggaaatgttagttttgtggttcctggaattcatccatattttcacattgga tctaatgccttgaatcatactgaacagtacactgaagctgctgggtcacaggaagctcag ttctacactctgcggacggccaaagctctggcaatgacggcactggatgttatttttaaa ccagagttactggaaggaatcagagaggactttaaactgaaacttcaagaagaacagttt gtaaatgcagtagaataa >gi568815592r:88998581_89217887|GENSCAN_predicted_peptide_7|466_aa MSAAMGAGVPPLDKPGFLPYICKQLTGSNSFPGPPPLLPARPILRRSPDITKSPLTKSEQ LLRIDDHDFSMRPGFGGPAIPVGVDVQVESLDSISEVDMDFTMTLYLRHYWKDERLSFPS TNNLSMTFDGRLVKKIWVPDMFFVHSKRSFIHDTTTDNVMLRVQPDGKVLYSLRVTVTAM CNMDFSRFPLDTQTCSLEIESYAYTEDDLMLYWKKGNDSLKTDERISLSQFLIQEFHTTT KLAFYSSTGWYNRLYINFTLRRHIFFFLLQTYFPATLMVMLSWVSFWIDRRAVPARVPLG ITTVLTMSTIITGVNASMPRVSYIKAVDIYLWVSFVFVFLSVLEYAAVNYLTTVQERKEQ KLREKLPCTSGLPPPRTAMLDGNYSDGEVNDLDNYMPENGEKPDRMMVQLTLASERSSPQ RKSQRSSYHYIRCCRKYDTVATDVSCYPDPLEKHTTSVVGTFSSTR >gi568815592r:88998581_89217887|GENSCAN_predicted_CDS_7|1401_bp atgtccgcggcaatgggagctggtgttcctcctcttgacaagcccggcttcctcccttat atctgtaagcagctcactggttccaattctttccctgggccacctcctttgctgccagca cgcccaattctgagacgaagtcctgacatcaccaaatcgcctctgacaaagtcagaacag cttctgaggatagatgaccatgatttcagcatgaggcctggctttggaggccctgccatt cctgttggtgtggatgtgcaggtggagagtttggatagcatctcagaggttgacatggac tttacgatgaccctctacctgaggcactactggaaggacgagaggctgtcttttccaagc accaacaacctcagcatgacgtttgacggccggctggtcaagaagatctgggtccctgac atgtttttcgtgcactccaaacgctccttcatccacgacaccaccacagacaacgtcatg ttgcgggtccagcctgatgggaaagtgctctatagtctcagggttacagtaactgcaatg tgcaacatggacttcagccgatttcccttggacacacaaacgtgctctcttgaaattgaa agctatgcctatacagaagatgacctcatgctgtactggaaaaagggcaatgactcctta aagacagatgaacggatctcactctcccagttcctcattcaggaattccacaccaccacc aaactggctttctacagcagcacaggctggtacaaccgtctctacattaatttcacgttg cgtcgccacatcttcttcttcttgctccaaacttatttccccgctaccctgatggtcatg ctgtcctgggtgtccttctggatcgaccgcagagccgtgcctgccagagtccccttaggt atcacaacggtgctgaccatgtccaccatcatcacgggcgtgaatgcctccatgccgcgc gtctcctacatcaaggccgtggacatctacctctgggtcagctttgtgttcgtgttcctc tcggtgctggagtatgcggccgtcaactacctgaccactgtgcaggagaggaaggaacag aagctgcgggagaagcttccctgcaccagcggattacctccgccccgcactgcgatgctg gacggcaactacagtgatggggaggtgaatgacctggacaactacatgccagagaatgga gagaagcccgacaggatgatggtgcagctgaccctggcctcagagaggagctccccacag aggaaaagtcagagaagcagctatcattatattaggtgctgcaggaaatacgacactgta gcgactgatgttagttgttacccagatcccctggaaaagcacactaccagtgttgtgggc acatttagttccacccgttag >gi568815592r:88998581_89217887|GENSCAN_predicted_peptide_8|105_aa MRWLRNLSSPFQKAAASLRCERRFAALVLLAEGDATRLSLGILQAHLGPYVHTKLITFTP KPNPPPALLPGITIYSVVQSKDLPNIHPKLLSHPEANQILTILAP >gi568815592r:88998581_89217887|GENSCAN_predicted_CDS_8|318_bp atgagatggcttcgaaacctttcaagtccattccagaaggcggcagcctccctgagatgt gagaggcgcttcgctgctctagtcctgttggctgaaggcgatgctacaaggctgtctctg ggtatcttgcaagcacatctaggtccatatgttcataccaaactcatcaccttcaccccc aaacctaatcctccacctgctctccttcctggcatcaccatctactcagtcgtacaatcc aaagacctgccgaacattcaccctaaacttctctctcaccctgaagccaatcagattctg accattctagctccttga >gi568815592r:88998581_89217887|GENSCAN_predicted_peptide_9|121_aa MKRYLAFGSLGLGGGLCLAPQCPPSGAGGSPHHMGSQTASCSSQHLFQAVLNNITKLGAE ISSGHVPEVLAKAVRQEKERKDIEIGKKEVKLLLFADDMILYIENPKDFTKKTVRTNEQI Q >gi568815592r:88998581_89217887|GENSCAN_predicted_CDS_9|366_bp atgaaaagatatctagcttttgggagtttggggttaggaggagggctgtgtctagcacct cagtgtcctccttctggggctggagggagccctcatcacatgggctcccaaacagccagc tgtagctcacagcatttgtttcaagctgttcttaacaacatcacaaaactaggagctgaa atatccagtggtcatgtaccagaagtccttgccaaagcagttagacaagagaaagaaaga aaagacatagaaataggaaagaaagaagtgaaattgttgctgtttgctgatgacatgatc ttatatatagaaaaccctaaagacttcaccaagaaaactgttagaactaatgaacaaata caataa