GENSCAN 1.0	Date run: 4-Nov-116	Time: 16:39:36

Sequence gi568815592f:88980895_89184193 : 203299 bp : 42.00% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1420   1584  165  0  0  106   57    54 0.265   3.34
 1.02 Intr +  64724  64831  108  1  0   63   64    90 0.797   3.66
 1.03 Intr +  68922  69038  117  2  0  102    9   143 0.073   7.54
 1.04 Intr +  84275  84448  174  1  0   42   66   121 0.068   4.51
 1.05 Term +  85094  85330  237  1  0   19   48   235 0.071   7.78
 1.06 PlyA +  87098  87103    6                               1.05

 2.00 Prom +  94702  94741   40                              -5.35
 2.01 Init +  95506  95566   61  0  1   63   73    52 0.264   2.67
 2.02 Intr +  99903 100079  177  1  0   57   17   130 0.547   1.77
 2.03 Intr + 100185 100540  356  1  2  120   78   206 0.524  16.88
 2.04 Term + 102859 103302  444  0  0  102   33   224 0.998  12.55
 2.05 PlyA + 106788 106793    6                               1.05

 3.06 PlyA - 108744 108739    6                               1.05
 3.05 Term - 118053 117893  161  1  2   69   49   121 0.084   3.42
 3.04 Intr - 124320 124225   96  1  0   12   70   133 0.592   2.96
 3.03 Intr - 124644 124533  112  0  1   53  106   150 0.977  12.33
 3.02 Intr - 125897 125787  111  2  0   38   76    85 0.599   1.96
 3.01 Init - 126765 126754   12  0  0   53   60    -6 0.162  -6.44
 3.00 Prom - 127924 127885   40                              -7.05

 4.00 Prom + 128672 128711   40                              -3.95
 4.01 Init + 132066 132076   11  2  2   60   91    16 0.317  -2.33
 4.02 Intr + 136915 137440  526  1  1   68   44   254 0.457  11.02
 4.03 Intr + 138031 138147  117  0  0   69   54   133 0.200   7.74
 4.04 Term + 159761 160003  243  1  0   76   38   175 0.410   6.22
 4.05 PlyA + 162748 162753    6                               1.05

 5.00 Prom + 163362 163401   40                              -2.95
 5.01 Init + 165251 165715  465  1  0   93  100   425 0.599  39.89
 5.02 Intr + 168371 168519  149  1  2   82   61   166 0.961  11.41
 5.03 Intr + 172149 172291  143  0  2   82   60    82 0.950   3.88
 5.04 Intr + 173854 174008  155  2  2   76   61    72 0.934   2.17
 5.05 Intr + 177431 177566  136  1  1   63   91    73 0.948   4.32
 5.06 Intr + 180534 180740  207  1  0  -19  105   117 0.593   0.83
 5.07 Intr + 180889 180996  108  2  0   61  119   143 0.999  14.04
 5.08 Term + 181215 181369  155  1  2   74   42   228 0.998  13.90
 5.09 PlyA + 183851 183856    6                               1.05

 6.05 PlyA - 183967 183962    6                               1.05
 6.04 Term - 197669 197553  117  2  0   32   41   100 0.740  -2.74
 6.03 Intr - 198169 197981  189  1  0   86   47   155 0.952  10.06
 6.02 Intr - 199594 199398  197  2  2  103   57   302 0.991  26.81
 6.01 Intr - 201163 201011  153  2  0  108   82    44 0.748   4.92

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  85437  85347   91  0  1   78  101    69 0.815   7.90
S.002 Sngl - 118067 117684  384  2  0   51   47   205 0.808   8.64

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:88980895_89184193|GENSCAN_predicted_peptide_1|266_aa
VQKLSSASQPTTLWEKASFVGLETLLLKPSLSRYTYNHSSPRLCKGAGPGIPNSRAFVSL
CEKHGPEVPSSSRCPCSSNEDVKPWRWKSMMVLAEQHAEAYVLPTTITNLDVVVEAMSLM
RSLKMSGKNKTHRNPWRRGKAQAWRAAARASPTNAAPCSRAPSPIDRPRAEECECMAQDW
QAAPPTAPVCSFTPEASETTSPPGGTNNSRRAALRAVTLTAKVCSFTPEPARLRTHQKEE
TPNTSEHQKEQTPDATFGVNKLQTPP

>gi568815592f:88980895_89184193|GENSCAN_predicted_CDS_1|801_bp
gtccagaagctctcttctgcttctcaaccaaccacactgtgggaaaaggcaagttttgta
ggactggaaaccctgttgctcaagccttctctttctagatacacctataatcattcttct
ccaaggctctgcaaaggagcggggcctggaatcccaaactctagggcttttgtgtccctg
tgtgagaaacatggaccagaagtgccatcaagttctcgctgtccctgttcttcaaatgag
gacgtgaagccatggagatggaaaagtatgatggtgcttgcagaacagcacgcagaagct
tatgtactgcccacaacaatcaccaacttggacgtggtagttgaagcaatgagtcttatg
aggtccctcaagatgagcgggaagaacaaaactcacaggaacccatggaggcgggggaag
gctcaggcatggcgggctgcagcccgagcctccccgacaaatgccgccccctgctccagg
gcgcccagtcccatcgaccgcccaagggctgaggagtgcgagtgcatggcgcaggactgg
caggcagctccacctacagccccggtctgcagcttcactcctgaagccagtgaaaccacg
agcccaccaggaggaacgaacaactccagacgcgccgccttaagagctgtaacactcact
gcgaaggtctgtagcttcactcctgaaccagcgagactacgaacccaccagaaggaagaa
actccgaacacatccgaacatcagaaggaacaaactccagacgcgacctttggagttaac
aaactccaaactccaccttaa

>gi568815592f:88980895_89184193|GENSCAN_predicted_peptide_2|345_aa
MRQLGRMRCENLLNARGGGCSSPSSIHRRHPIFTGFIHLLLLSSLLSDKLPSAMTVVSVP
QREPLVLGGRLAPLGFSSRGGDGPCLTPQPRAPAALPNRSLAVAGGTPRAAPKKRRKKKV
RASPAGQLPSRFHQYQQHRPSLEGGRSPATGPSGAQEVPGPAAALAPSPAAAAGTEGASP
DLAPLRPAAPGQTPLRKEVLKSKMGKSEKIALPHGQLVHGIHLYEQPKINRQKSKYNLPL
TKITSAKRNENNFWQDSVSSDRIQKQEKKPFKNTENIKNSHLKKSAFLTEVSQKENYAGA
KFSDPPSPSVLPKPPSHWMGSTVENSNQNRELMAVHLKTLLKVQT

>gi568815592f:88980895_89184193|GENSCAN_predicted_CDS_2|1038_bp
atgcgccagctagggaggatgaggtgtgagaatctcctgaacgccagaggtggaggctgc
agctctcctagcagcatccatcgccgccaccctatcttcactggcttcattcaccttctc
cttctctcttcgttgctgagcgacaagcttcctagcgctatgactgtcgtctccgtcccg
cagcgggagccgctcgtcctgggtggccgccttgcgccgcttggcttttcctcccgaggg
ggagatggcccgtgtctgaccccccagcctcgcgctccagcagctctgcccaaccgcagc
ctcgccgtggcgggaggcactcctcgggcagcgccgaagaagcggcgaaagaagaaggtg
cgggccagccccgcagggcagctgcccagccgcttccaccagtaccagcagcaccggccg
agtctggagggcggccggagccccgcgaccggcccgagcggagcgcaggaggtcccgggc
ccggccgccgccttggccccgagtcctgcagccgcagccggcacggagggagccagcccc
gaccttgccccgctgcggcccgcggctcccggccaaacccccctcaggaaagaggtttta
aaatcaaagatgggaaaatcggagaaaattgcccttccccatggccagcttgttcatggt
atacacttgtatgagcaaccaaagataaacagacagaaaagcaaatataacttgccacta
accaagatcacctctgcaaaaagaaatgaaaacaacttttggcaggattctgtttcatct
gacagaattcagaagcaggaaaaaaagccttttaaaaataccgagaacattaaaaattcg
catttgaagaaatcagcatttctaactgaagtgagccaaaaggaaaattatgctggggca
aagtttagtgatccaccttctcctagtgttcttccaaagcctcctagtcactggatggga
agcactgttgaaaattccaaccaaaacagggagctgatggcagtacacttaaaaacgctc
ctcaaagttcaaacttag

>gi568815592f:88980895_89184193|GENSCAN_predicted_peptide_3|163_aa
MLTQSSRVKAKSDFVGSASLIKSILQALKSKGQGSRSSDIELVIFEDVRDAEDALYNLNR
KWVCGRQIEIQFAQGDRKMITGDQEAPAKEELEVEVLHGEEIGGGQTALKSLDTGDFLIA
SLNLVPNHYQGGLPQQGSQELQEGILALEDGQGPSPYKRGPSQ

>gi568815592f:88980895_89184193|GENSCAN_predicted_CDS_3|492_bp
atgctcactcagagttccagagtcaaagccaagtctgattttgttggttctgcgtctctt
ataaagtccatcttgcaagccttaaagagtaaaggtcaaggttcaagatcaagtgacatt
gagctagtcatatttgaagatgttcgagatgctgaagatgctctttataacctcaataga
aagtgggtatgtggccgtcagattgaaatacagtttgcacaaggtgatcgcaaaatgatc
acaggagatcaagaagccccagccaaagaagaactcgaagtagaagttcttcatggggaa
gaaataggaggcggtcagacagccttaaagagtctcgacacaggcgattttcttatagcc
agtctaaatctcgttccaaatcattaccaaggcggtctacctcagcaaggcagtcaagaa
ctccaagaaggaattttggctctagaggacggtcaaggtccaagtccttacaaaagaggt
ccaagtcaatag

>gi568815592f:88980895_89184193|GENSCAN_predicted_peptide_4|298_aa
MLYSRGRSPGGVRDVPDEQGGVGGPRVARHDRFLRSLRVCPRPLDSLRLPLPLLPPQELR
RPPARPPPLGLSPASAQPQRPAQRGRSAAPAWTHGPGAGAAPARPRGRGNSEDRSAAVPR
PYGIRVWGVLECDGIWGGGGEGQCMSGGRRPRGGLRPRRGRRGLEDPEVVAGLRLGGGGL
DSSSPCSCGRQAQPVIQGWECNQATPKCEECPFSLAEVCCFVHTVGNQGKGKAYPLRTSL
SSFPGIVMENRAANLTAKRREAKGSLRSSGTRTFRALKSLPSLCGPFEEIRTELENEL

>gi568815592f:88980895_89184193|GENSCAN_predicted_CDS_4|897_bp
atgctgtacagccgcggccgttcacctggtggcgtccgcgacgttcctgatgaacaggga
ggtgttggggggcctcgtgtagcgagacatgaccgcttcctccgttccctccgggtctgc
ccgcggccgctggactcgctccgtctcccgctaccgctgctaccaccacaggagctccgc
cggcccccggcgcgacccccacccctcggcctcagccccgccagcgcgcagccgcagagg
ccggcgcagagggggcgcagcgcggcgccagcctggacgcacgggccgggggcgggggcg
gcgccggcgcggcccaggggccgcgggaattccgaagacaggagcgcggccgttcccagg
ccctatgggatccgggtatggggggtcctggagtgcgacgggatttggggagggggcggc
gagggccagtgtatgtcaggagggcggaggccaagagggggcttgaggccgcggcgggga
cgccgggggttagaggacccagaggtggtggcggggctgcggctgggcggaggtgggtta
gactcgtcttcgccttgctcctgtggccggcaggcacagcctgttattcagggctgggag
tgtaaccaggcaactcctaagtgtgaagagtgccccttttcgctcgcagaagtgtgttgt
tttgtacacactgtggggaatcagggcaaaggaaaagcctatcctctcagaacctccctg
tcctcatttccaggaatagtgatggagaacagagcagctaatctcactgccaagagaagg
gaagccaaagggagcctcaggtcatcaggtaccaggacattcagggctttaaagtcactg
ccttcactttgtgggccatttgaagaaatccgcacagaacttgaaaacgagttgtaa

>gi568815592f:88980895_89184193|GENSCAN_predicted_peptide_5|505_aa
MRPGGERPVEGGACNGRSELELLKLRSAECIDEAAERLGALSRAIWSQPELAYEEHHAHR
VLTHFFEREPPAASWAVQPHYQLPTAFRAEWEPPEARAPSATPRPLHLGFLCEYDALPGI
GHACGHNLIAEVGAAAALGVRGALEGLPRPPPPVKVVVLGTPAEEDGGGKIDLIEAGAFT
NLDVVFMAHPSQENAAYLPDMAEHDVTVKYYGKASHSASYPWEGLNALDAAVLAYNNLSV
FRQQMKPTWRVHGIIKNGGVKPNIIPSYSELIYYFRAPSMKELQVLTKKAEDCFRAAALA
SGCTVEIKGGAHDYYNVLPNKSLWKAYMENGRKLGIEFISEDTMLNGPSERQKKGPSERQ
KGSHASAVSQKRRKGVGGSVECWPRGPVKAVRQGEGRFAVTFMRDVTDTGEWVADSVEGS
TDFGNVSFVVPGIHPYFHIGSNALNHTEQYTEAAGSQEAQFYTLRTAKALAMTALDVIFK
PELLEGIREDFKLKLQEEQFVNAVE

>gi568815592f:88980895_89184193|GENSCAN_predicted_CDS_5|1518_bp
atgaggcccggaggggagcggcccgtggaagggggcgcgtgcaatggccgctccgagctg
gagctactgaagctgcgctcggcggagtgcatcgacgaggcggccgagcggctgggggcc
ctgagccgcgcgatctggagccagcccgagctggcctacgaggagcaccatgcccaccgc
gtgctgacgcacttcttcgagcgggagccgcccgcggcctcctgggcagtgcagccgcac
taccagctgcccacggccttccgcgccgagtgggagccgccggaggcccgggcaccgagc
gccacgccacgcccgctgcacctgggcttcctctgcgagtacgacgcgctgcccggcatc
ggccacgcctgcggccacaacctcatcgctgaggtcggggcggcggccgcgctgggcgtg
aggggggccttagagggcctccccaggccgcctccgcccgtgaaggtagttgtcctggga
acccctgcagaagaagatggtggtggcaaaattgatttaattgaagcaggggcttttaca
aatcttgatgttgtttttatggcccacccatcacaagagaatgctgcttatctaccagat
atggctgaacatgatgtgactgtgaaatactatggaaaagcatctcattctgcttcttat
ccctgggaaggattaaatgcattagatgctgctgtgctggcctataacaatctgtctgtg
ttcagacagcaaatgaaaccaacctggagagttcatggtataataaaaaatggtggtgta
aaacccaatatcattccctcttattctgaattaatctattacttccgtgcaccctcaatg
aaagaacttcaagttttgaccaaaaaggcagaagattgcttcagagctgcagctttggct
tcagggtgcacagtggaaattaaaggtggagcacatgattattacaatgttcttcccaat
aagagcctatggaaagcctatatggaaaatggaagaaagctaggaatagagttcatttca
gaagatacaatgttgaatggcccttcagagagacagaagaaggggccatcagagaggcag
aagggaagccatgcaagtgctgtgtcacagaagagaagaaagggagtaggtggcagtgtg
gagtgctggccaagagggccagtaaaagccgtgagacaaggtgagggcaggtttgcagtg
accttcatgagagatgtgactgatacaggggagtgggtggcagactcagtggaaggatct
acggattttggaaatgttagttttgtggttcctggaattcatccatattttcacattgga
tctaatgccttgaatcatactgaacagtacactgaagctgctgggtcacaggaagctcag
ttctacactctgcggacggccaaagctctggcaatgacggcactggatgttatttttaaa
ccagagttactggaaggaatcagagaggactttaaactgaaacttcaagaagaacagttt
gtaaatgcagtagaataa

>gi568815592f:88980895_89184193|GENSCAN_predicted_peptide_6|218_aa
XWYNRLYINFTLRRHIFFFLLQTYFPATLMVMLSWVSFWIDRRAVPARVPLGITTVLTMS
TIITGVNASMPRVSYIKAVDIYLWVSFVFVFLSVLEYAAVNYLTTVQERKEQKLREKLPC
TSGLPPPRTAMLDGNYSDGEVNDLDNYMPENGEKPDRMMVQLTLASERSSPQRKSQRSSY
HYIRCCRKYDTVATDVSCYPDPLEKHTTSVVGTFSSTR

>gi568815592f:88980895_89184193|GENSCAN_predicted_CDS_6|657_bp
ngctggtacaaccgtctctacattaatttcacgttgcgtcgccacatcttcttcttcttg
ctccaaacttatttccccgctaccctgatggtcatgctgtcctgggtgtccttctggatc
gaccgcagagccgtgcctgccagagtccccttaggtatcacaacggtgctgaccatgtcc
accatcatcacgggcgtgaatgcctccatgccgcgcgtctcctacatcaaggccgtggac
atctacctctgggtcagctttgtgttcgtgttcctctcggtgctggagtatgcggccgtc
aactacctgaccactgtgcaggagaggaaggaacagaagctgcgggagaagcttccctgc
accagcggattacctccgccccgcactgcgatgctggacggcaactacagtgatggggag
gtgaatgacctggacaactacatgccagagaatggagagaagcccgacaggatgatggtg
cagctgaccctggcctcagagaggagctccccacagaggaaaagtcagagaagcagctat
cattatattaggtgctgcaggaaatacgacactgtagcgactgatgttagttgttaccca
gatcccctggaaaagcacactaccagtgttgtgggcacatttagttccacccgttag