GENSCAN 1.0 Date run: 8-Nov-116 Time: 12:16:36 Sequence gi568815586r:90951349_91156281 : 204933 bp : 35.01% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 579 574 6 1.05 1.01 Sngl - 3394 2174 1221 1 0 83 42 1193 0.975 109.75 1.00 Prom - 3865 3826 40 -4.75 2.14 PlyA - 4327 4322 6 1.05 2.13 Term - 12978 12808 171 0 0 90 28 118 0.776 2.84 2.12 Intr - 18791 18696 96 2 0 52 91 58 0.762 1.79 2.11 Intr - 20654 20452 203 0 2 96 106 165 0.999 17.18 2.10 Intr - 21632 21474 159 0 0 106 16 88 0.842 2.44 2.09 Intr - 26462 26367 96 0 0 24 45 128 0.461 1.16 2.08 Intr - 26914 26740 175 1 1 100 77 187 0.580 17.39 2.07 Intr - 28011 27839 173 1 2 21 -2 153 0.323 -1.66 2.06 Intr - 32200 32043 158 0 2 68 49 71 0.366 -0.07 2.05 Intr - 32968 32864 105 0 0 8 77 144 0.494 3.71 2.04 Intr - 33858 33759 100 1 1 75 72 117 0.499 7.05 2.03 Intr - 36985 36861 125 0 2 87 27 85 0.117 1.61 2.02 Intr - 47073 46994 80 0 2 58 100 1 0.160 -4.27 2.01 Init - 51217 51053 165 1 0 65 74 207 0.902 16.68 2.00 Prom - 59440 59401 40 -3.65 3.03 PlyA - 59736 59731 6 1.05 3.02 Term - 100170 99998 173 1 2 93 41 136 0.751 6.41 3.01 Init - 104933 104048 886 2 1 65 106 430 0.492 37.12 3.00 Prom - 107812 107773 40 -3.15 4.00 Prom + 110102 110141 40 -4.95 4.01 Init + 126109 126127 19 0 1 77 110 11 0.933 2.51 4.02 Term + 127051 127211 161 2 2 24 37 322 0.988 17.82 4.03 PlyA + 128063 128068 6 1.05 5.03 PlyA - 128404 128399 6 1.05 5.02 Term - 137367 137272 96 0 0 12 45 105 0.181 -4.41 5.01 Init - 140496 140302 195 0 0 65 76 172 0.788 12.68 5.00 Prom - 150721 150682 40 -6.35 6.03 PlyA - 151558 151553 6 1.05 6.02 Term - 152971 152817 155 2 2 99 34 162 0.990 9.00 6.01 Init - 157631 156770 862 2 1 66 89 568 0.978 49.50 6.00 Prom - 158789 158750 40 -5.45 7.05 PlyA - 158977 158972 6 1.05 7.04 Term - 169447 169341 107 2 2 82 38 61 0.048 -1.91 7.03 Intr - 190241 190102 140 1 2 76 64 115 0.067 7.09 7.02 Intr - 194904 194714 191 0 2 99 -2 120 0.068 1.56 7.01 Intr - 200444 200306 139 1 1 111 95 78 0.994 10.35 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:90951349_91156281|GENSCAN_predicted_peptide_1|406_aa MTQTLDTREDPLNLGGGGGGGCGCGWAHSASLSSWSSCHRRRPGAPAYNRPHRYSPKTEY GPPRKQPKQQHGPGFWFQPPVCSNWGCWGGPWRPPPPGFWKFPCPVQVFRVYGLHPLCFC CCSCWSGSWNPGWVKPPGRKKRWGRRGRGLRHHPRHSYPRSPPADVSTLPRPVKLYEWRE PGMRAPPNTTQFIMNQIYEDMRQQEKVERQQEALRAQKATVSGEASPARSSGNDAPPGGS KETWGLQETLYGFVQNPSLAFSPNPEENQSLAPLLVEEEEEKKNDDEEEYDQEVCDAKEA SEEEEEVEDEEEEVEDEEEEEVEEAEYVEEGEEELEEEELEEEEEVLEENEQRGEEFHLP LEMPLSIFVEAEEKRENFISCTFLNPEQIIPKVPQESLFMAQDFNC >gi568815586r:90951349_91156281|GENSCAN_predicted_CDS_1|1221_bp atgactcagaccctcgacacaagggaagaccctctgaacctgggcggcggtggcggcggc ggctgtggctgtggctgggcacactcggcctccttgagctcctggtcgtcctgccatcga aggcgcccgggtgctccagcgtacaatagaccgcaccgatatagccccaagaccgagtat gggcccccaaggaagcagccgaagcaacagcacggcccgggcttttggttccaaccaccc gtgtgctctaactgggggtgctggggagggccctggcgcccaccccctccaggattctgg aaattcccctgcccggtgcaagtgtttcgggtgtatggcctgcaccctctctgcttttgc tgctgctcctgctggagcgggtcctggaaccctggctgggtgaagcccccaggcaggaag aagcgctggggccgcagaggccgcggcctgcgccaccaccctcgccactcctacccgcgg agcccgccagcggatgtgagcacgctgccgcggccggtcaagctgtatgagtggagagag cctggcatgcgagcgccgcccaacaccacccaattcatcatgaaccagatctacgaggac atgaggcagcaggagaaggtggagcgtcagcaggaggcgctgcgggcgcagaaggccacg gtgagcggcgaggcctccccagccagatcctccggaaacgacgcgccccctggcggcagc aaggaaacctggggactgcaggaaactctgtatggctttgtgcagaatccctctctagca ttcagtcccaacccagaggaaaaccagtctcttgccccgctgctggtggaagaagaggag gagaagaaaaatgatgatgaggaggagtatgaccaggaggtgtgtgatgcaaaggaggcg agcgaggaggaagaagaggtcgaagatgaggaggaagaggtcgaagatgaggaggaagaa gaggtcgaagaggctgaatatgtggaggagggagaggaggagctggaagaggaggagctg gaagaggaagaggaggtcctggaggagaacgagcagagaggggaagaatttcacttgcct ctggaaatgcctttatcaatcttcgtagaggctgaagaaaagagagagaactttataagc tgcacttttttaaacccagagcagataattcccaaagtgccacaggaatccctgttcatg gcacaggactttaactgttag >gi568815586r:90951349_91156281|GENSCAN_predicted_peptide_2|601_aa MKTLAGLVLGLVIFDAAVTAPTLESINYDSETYDATLEDLDNLYNYENIPVDKVENIYDP LFSILEASFRPNNAIRVLYITHYIVITNQLESRIKKLLLMASEESFPLIGHFKYHFSMMR TSEIFLEHRSADVYDSSPHDLGRGPSGDPYWGQLADQTHNGPWTLLIGIVALTNAAAETP VFLLDHKEDQGSSARKYKKSRSWFQANQRSQLRRVRVVKEPFPRKPDTHVFSLAAALVAF NWPTANKEGWKKAKGHIGVVKGKSGISGDTTRTLGHKKWVLDVAIPIAWSKFSAQAYETV REIEIATVMPSGNRELLTPPPQPEKAQEEEEEEESTPRLIDGSSPQEPEFTGVLGPHTNE EQRVIDEQDKIFASEIRPQQFKGIRDDEDEDDDFPTCLLCTCISTTVYCDDHELDAIPPL PKNTAYFYSRFNRIKKINKNDFASLSDLKRIDLTSNLISEIDEDAFRKLPQLRELVLRDN KIRQLPELPTTLTFIDISNNRLGRKGIKQEAFKDMYDLHHLYLTDNNLDHIPLPLPENLR ALHLQNNNILEMHEDTFCNVKNLTYIRKALEDIRLDGNPINLSKTPQAYMCLPRLPVGSL V >gi568815586r:90951349_91156281|GENSCAN_predicted_CDS_2|1806_bp atgaagacattagcaggacttgttctgggacttgtcatctttgatgctgctgtgactgcc ccaactctagagtccatcaactatgactcagaaacctatgatgccaccttagaagacctg gataatttgtacaactatgaaaacatacctgttgataaagttgagaatatctatgatcca ttgttttctatattagaagcaagttttagacccaacaatgccatcagagtgctatatatt acccactatatcgtcatcactaatcagctagagtcaagaataaagaagctgctcctaatg gcttcagaagaatcatttcctctcattgggcattttaaatatcacttttccatgatgaga acaagtgaaatctttttagaacacaggtcagcagatgtttatgactccagtccccatgat ctgggtcgaggtcccagtggggatccatactggggacagcttgctgaccaaacacacaat ggtccctggaccctgctgatcggaatagttgcactcaccaatgcagcagcagaaacacca gttttcctcttagaccacaaagaggaccaaggaagctcagcccggaagtataagaaaagc agaagctggtttcaagcaaaccaacgctctcaactccgaagagtcagggttgttaaagag ccctttcccagaaagcctgacacccatgtctttagtctggcagccgcactagttgctttt aactggccaacagcaaataaagaagggtggaaaaaagcaaaaggccatattggtgtggtt aagggaaagtctgggatttctggggataccacaaggacactgggccacaaaaagtgggtc ctggatgtcgccatcccaattgcctggagcaaattttcagcccaagcttacgagaccgtc agagaaattgaaatagccacagtgatgccttcagggaacagagagctcctcactccaccc ccacagcctgagaaggcccaggaagaggaagaggaggaggaatctactcccaggctgatt gatggctcttctccccaggagcctgaattcacaggggttctggggccacacacaaatgaa gaacagagagtaattgatgagcaggataaaatctttgcctctgagattaggccacagcag ttcaaggggataagagatgatgaggatgaagatgatgactttccaacctgtcttttgtgt acttgtataagtaccaccgtgtactgtgatgaccatgaacttgatgctattcctccgctg ccaaagaacaccgcttatttctattcccgctttaacagaattaaaaagatcaacaaaaat gactttgcaagcctaagtgatttaaaaaggattgatctgacatcaaatttaatatctgag attgatgaagatgcattccgaaaactgcctcaacttcgagagcttgtcctgcgtgacaac aaaataaggcagctcccagaattgccaaccactttgacatttattgatattagcaacaat agacttggaaggaaagggataaagcaagaagcatttaaagacatgtatgatctccatcat ctgtacctcactgataacaacttggaccacatccctctgccactcccagaaaatctacga gcccttcacctccagaataacaacattctggaaatgcacgaagatacgttctgcaatgtt aaaaatttgacttatattcgtaaggcactagaggacattcgattggatggaaaccctatt aatctcagcaaaactcctcaagcatacatgtgtctacctcgtctgcctgttgggagcctt gtctaa >gi568815586r:90951349_91156281|GENSCAN_predicted_peptide_3|352_aa MAGTICFIMWVLFITDTVWSRSVRQVYEVHDSDDWTIHDFECPMECFCPPSFPTALYCEN RGLKEIPAIPSRIWYLYLQNNLIETIPEKPFENATQLRWINLNKNKITNYGIEKGALSQL KKLLFLFLEDNELEEVPSPLPRSLEQLQLARNKVSRIPQGTFSNLENLTLLDLQNNKLVD NAFQRDTFKGLKNLMQLNMAKNALRNMPPRLPANTMQLFLDNNSIEGIPENYFNVIPKVA FLRLNHNKLSDEGLPSRGFDVSSILDLQLSHNQLTKVPRISAHLQHLHLDHNKIKSVNVS VICPSPSMLPAERDSFSYGPHLRYLRLDGNEIKPPIPMALMTCFRLLQAVII >gi568815586r:90951349_91156281|GENSCAN_predicted_CDS_3|1059_bp atggcaggcacaatctgtttcatcatgtgggtgttattcataacagacactgtgtggtct agaagtgtgaggcaggtctatgaagtacatgattcagatgattggactattcatgacttc gagtgtcccatggaatgtttctgcccacccagttttcctactgctttatattgtgaaaat agaggtctcaaagaaattcctgctattccttcaagaatttggtatctttatcttcaaaac aacctgatagaaaccattcctgaaaagccatttgagaatgccacccagctaagatggata aatctaaacaagaacaaaataaccaactacggaattgaaaaaggagccctaagccagctg aagaagttgctcttcttatttctggaagataatgagctagaggaggtaccttctccattg ccaagaagtttagaacaattacaattagctagaaataaggtgtccagaattcctcaaggg acctttagcaatctggagaacctgacccttcttgacctacagaacaacaaattagtggac aatgcctttcaaagagacacttttaaaggactcaagaatctcatgcagctaaacatggcc aagaatgccctgaggaatatgcctccaagattaccagccaatacaatgcagttgttttta gacaacaattccattgaaggaataccagaaaattattttaatgtgattcctaaagtggcc tttttgagactaaatcacaacaaactgtcagatgagggtctcccatcaagaggatttgat gtatcatcaattctagatcttcaactgtcgcacaatcaactcacaaaggttccccgaatc agtgctcatctgcagcaccttcaccttgatcataacaaaattaaaagtgtgaatgtctct gtaatatgtcccagcccatccatgctgcctgcagaacgagattccttcagttatggacct catcttcgctacctccgtctggatggaaatgaaatcaaaccaccaattccaatggcttta atgacctgcttcagacttctgcaggctgtcattatttaa >gi568815586r:90951349_91156281|GENSCAN_predicted_peptide_4|59_aa MRSLKKVLDKDESKDEDEEKGQEEKEEEEKGEKEEEKEEEENKNKNTVKKQPSPPKLTL >gi568815586r:90951349_91156281|GENSCAN_predicted_CDS_4|180_bp atgaggtcactgaaaaaggtccttgataaagatgaaagcaaagatgaagatgaggaaaaa ggacaggaagaaaaggaggaggaggagaagggagagaaagaggaggaaaaggaggaagag gagaataagaataagaatacagtgaagaagcagccatcaccaccaaaactcacactttag >gi568815586r:90951349_91156281|GENSCAN_predicted_peptide_5|96_aa MELPVKKGQLLSLRFGRISHPSLLALEIPNGPEKKGSPTKQHSCSTNKQPECFFNWVPDP VPHYRASIILTPKPGRDTTTTTTTTTTTTKTSGPYP >gi568815586r:90951349_91156281|GENSCAN_predicted_CDS_5|291_bp atggagctcccagtgaagaaggggcagctgctatctctgaggtttggtcgaatcagccat cccagcctgctagctttggagattccaaatggtccagaaaagaaagggtcccccacaaag cagcacagctgctctaccaataaacaaccagagtgcttctttaactgggtccctgatcct gttcctcattacagggccagcatcatcctgacaccaaagcctggcagagatacaacaaca acaacaacaacaacaacaacaacaacaaaaacttcaggcccatatccctga >gi568815586r:90951349_91156281|GENSCAN_predicted_peptide_6|338_aa MSLSAFTLFLALIGGTSGQYYDYDFPLSIYGQSSPNCAPECNCPESYPSAMYCDELKLKS VPMVPPGIKYLYLRNNQIDHIDEKAFENVTDLQWLILDHNLLENSKIKGRVFSKLKQLKK LHINHNNLTESVGPLPKSLEDLQLTHNKITKLGSFEGLVNLTFIHLQHNRLKEDAVSAAF KGLKSLEYLDLSFNQIARLPSGLPVSLLTLYLDNNKISNIPDEYFKRFNALQYLRLSHNE LADSGIPGNSFNVSSLVELDLSYNKLKNIPTVNENLENYYLEVNQLEKFDIKSFCKILGP LSYSKIKHLRLDGNRISETSLPPDMYECLRVANEVTLN >gi568815586r:90951349_91156281|GENSCAN_predicted_CDS_6|1017_bp atgagtctaagtgcatttactctcttcctggcattgattggtggtaccagtggccagtac tatgattatgattttcccctatcaatttatgggcaatcatcaccaaactgtgcaccagaa tgtaactgccctgaaagctacccaagtgccatgtactgtgatgagctgaaattgaaaagt gtaccaatggtgcctcctggaatcaagtatctttaccttaggaataaccagattgaccat attgatgaaaaggcctttgagaatgtaactgatctgcagtggctcattctagatcacaac cttctagaaaactccaagataaaagggagagttttctctaaattgaaacaactgaagaag ctgcatataaaccacaacaacctgacagagtctgtgggcccacttcccaaatctctggag gatctgcagcttactcataacaagatcacaaagctgggctcttttgaaggattggtaaac ctgaccttcatccatctccagcacaatcggctgaaagaggatgctgtttcagctgctttt aaaggtcttaaatcactcgaataccttgacttgagcttcaatcagatagccagactgcct tctggtctccctgtctctcttctaactctctacttagacaacaataagatcagcaacatc cctgatgagtatttcaagcgttttaatgcattgcagtatctgcgtttatctcacaacgaa ctggctgatagtggaatacctggaaattctttcaatgtgtcatccctggttgagctggat ctgtcctataacaagcttaaaaacataccaactgtcaatgaaaaccttgaaaactattac ctggaggtcaatcaacttgagaagtttgacataaagagcttctgcaagatcctggggcca ttatcctactccaagatcaagcatttgcgtttggatggcaatcgcatctcagaaaccagt cttccaccggatatgtatgaatgtctacgtgttgctaacgaagtcactcttaattaa >gi568815586r:90951349_91156281|GENSCAN_predicted_peptide_7|192_aa XLGLSFNSISAVDNGSLANTPHLRELHLDNNKLTRVPGGLAEHKYIQVVYLHNNNISVVG SSDFCPPGHNTKKASYSGVSLFSNPVQYWEIQPSTFRCVYVRSAIQLGNYNVLVAETQQC IMPCPLKQMLQQEETFQDMTIKVEECQAELGNKGCEGEGITERLPGRLSTYYDNLGSQCS YAPDLPVTCQPH >gi568815586r:90951349_91156281|GENSCAN_predicted_CDS_7|579_bp nngttgggattgagtttcaacagcatctctgctgttgacaatggctctctggccaacacg cctcatctgagggagcttcacttggacaacaacaagcttaccagagtacctggtgggctg gcagagcataagtacatccaggttgtctaccttcataacaacaatatctctgtagttgga tcaagtgacttctgcccacctggacacaacaccaaaaaggcttcttattcgggtgtgagt cttttcagcaacccggtccagtactgggagatacagccatccaccttcagatgtgtctac gtgcgctctgccattcaactcggaaactataatgttctagtggcagagacacagcagtgc ataatgccttgccctctgaagcagatgttgcagcaggaggagacattccaggacatgaca attaaagtggaggaatgtcaagcagaactggggaataagggatgtgaaggggagggcatt actgaaagattacctgggagactatccacatactacgacaaccttggttcacagtgcagt tatgctcctgatcttcctgtaacttgtcagccccattaa