GENSCAN 1.0 Date run: 7-Nov-116 Time: 02:11:38 Sequence gi568815597f:76599789_76800535 : 200747 bp : 38.34% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1210 1217 8 0 2 108 94 4 0.640 3.26 1.02 Term + 6628 6814 187 1 1 66 37 130 0.731 1.68 1.03 PlyA + 6933 6938 6 1.05 2.00 Prom + 12896 12935 40 -5.95 2.01 Init + 13717 13864 148 0 1 79 61 93 0.263 6.00 2.02 Intr + 17587 17707 121 2 1 55 55 56 0.037 -2.37 2.03 Intr + 27664 27771 108 1 0 91 71 79 0.399 4.98 2.04 Term + 28832 29018 187 2 1 139 44 61 0.899 2.78 2.05 PlyA + 31070 31075 6 1.05 3.00 Prom + 36797 36836 40 -2.05 3.01 Init + 44922 44954 33 2 0 81 108 13 0.419 2.52 3.02 Term + 50424 50597 174 2 0 99 42 120 0.565 5.28 3.03 PlyA + 50745 50750 6 1.05 4.09 PlyA - 51177 51172 6 1.05 4.08 Term - 68549 68359 191 0 2 37 43 158 0.387 2.83 4.07 Intr - 69616 69486 131 0 2 93 19 74 0.444 0.42 4.06 Intr - 75521 75481 41 2 2 107 106 34 0.457 3.20 4.05 Intr - 80730 80634 97 2 1 44 92 111 0.602 6.19 4.04 Intr - 85417 85386 32 1 2 127 116 3 0.601 3.01 4.03 Intr - 87704 87578 127 1 1 35 70 121 0.447 4.76 4.02 Intr - 91153 91106 48 0 0 81 98 36 0.292 0.78 4.01 Init - 93127 92979 149 1 2 100 14 41 0.167 -2.48 4.00 Prom - 98161 98122 40 -6.75 5.00 Prom + 98582 98621 40 -6.65 5.01 Sngl + 100001 100750 750 1 0 110 52 980 0.935 92.52 5.02 PlyA + 101175 101180 6 1.05 6.06 PlyA - 104106 104101 6 1.05 6.05 Term - 106264 106155 110 2 2 99 39 63 0.056 0.19 6.04 Intr - 109475 109367 109 2 1 84 47 110 0.063 5.44 6.03 Intr - 126018 125903 116 1 2 77 55 88 0.024 3.65 6.02 Intr - 135276 135125 152 2 2 74 60 79 0.484 2.59 6.01 Init - 137757 137417 341 0 2 74 72 143 0.807 8.08 6.00 Prom - 138530 138491 40 -5.45 7.00 Prom + 139055 139094 40 -10.75 7.01 Init + 139556 139864 309 1 0 37 33 257 0.400 12.46 7.02 Intr + 141110 141283 174 1 0 88 37 110 0.460 5.11 7.03 Term + 146139 146201 63 2 0 123 50 47 0.374 1.31 7.04 PlyA + 146675 146680 6 1.05 8.03 PlyA - 148906 148901 6 1.05 8.02 Term - 165409 165101 309 1 0 41 43 278 0.472 12.68 8.01 Init - 179231 179121 111 2 0 46 78 107 0.358 5.76 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 116224 116379 156 0 0 42 93 143 0.957 10.16 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:76599789_76800535|GENSCAN_predicted_peptide_1|64_aa MVRDMNGAGSHYPQQTNAGTEKQIPHVLTYKWELNNEDMDTGRETTHTGSCQRVEGEEGE HQQK >gi568815597f:76599789_76800535|GENSCAN_predicted_CDS_1|195_bp atggtgagggacatgaatggagctggaagccattatcctcagcaaactaatgcaggaaca gaaaagcaaataccacatgttctcacttataagtgggagctgaacaatgaggacatggac acagggagggaaacaacccatactgggtcctgtcagagggtggagggggaggagggagag catcagcagaaatag >gi568815597f:76599789_76800535|GENSCAN_predicted_peptide_2|187_aa MAVCEGYGLDLYPCPNLMSNCNPQCWRRGLVGGDRIMGTDIPLAVLRTVIIGKYTKKEWL IGELKEKAKIFNYLYQVSRETGTYALGRKRVQSGSYLSTGWFTFLLAMDACYGIHVYGMI NDTYCKTEGYRKVPYHYYEQGRDECDEYFLHEHAPYGGHRFITEKKVFAKWAKKHRIIFT HPNWTLS >gi568815597f:76599789_76800535|GENSCAN_predicted_CDS_2|564_bp atggctgtctgtgagggatatggtttggatttgtatccctgcccaaatctcatgtcaaat tgtaatccccagtgttggaggaggggcctggtgggaggtgaccggatcatggggacagac atcccccttgctgttctccggacagtgatcataggaaagtatacgaagaaggaatggctg ataggtgaactaaaagagaaagccaaaatctttaattacctttatcaggttagcagggaa acaggcacctatgccttaggtagaaaaagagtccagtctggctcatatctcagcacaggg tggtttaccttccttctggccatggacgcctgttatggcattcacgtctacgggatgata aatgacacctactgcaagacagaagggtatagaaaagtcccctaccattattatgaacaa ggaagagatgagtgtgatgaatattttcttcatgaacatgccccatatgggggtcatagg tttatcactgaaaagaaagtgtttgctaaatgggccaagaagcacaggataatatttaca catccaaactggacattgtcttga >gi568815597f:76599789_76800535|GENSCAN_predicted_peptide_3|68_aa MGRKRGILGSQVDPIEGSAMQLREANRLNVWTEYQKKVPLRARVYDRNHRDEESRERDYS NLYEKWHP >gi568815597f:76599789_76800535|GENSCAN_predicted_CDS_3|207_bp atgggccggaaaagagggatacttggttcccaagtagacccaattgagggaagtgcaatg caactcagggaggccaaccgcctgaatgtctggacagaataccagaaaaaggtacctctg agagccagagtgtatgacagaaatcacagagatgaagaaagtagagaaagggattattca aatctgtatgaaaaatggcatccctaa >gi568815597f:76599789_76800535|GENSCAN_predicted_peptide_4|271_aa MAQRGLVTAWAAASDVASHKHWQLPGGAKSVSAQSARVEAWEPLPRFQRMSAPEPYTCRH SLNSFRKVVEEAHTGNSSLTAEKIISCSENTFVHFQGKEAKQERKHGQDLPECYVKNRRK SHEDNKEDDGDINKPHRYLQTSSDLGGVTYIGSLWLFSTSKAVNGEIQRPLLTAIDSATL PACCVGRGSLAETASSGRNYCQLHWDILYKMDSQWLSERLEVKTEPIGHGWVREQSCALY SQKDVVKVSQDLAFYNQAKLVSVVSAKINCG >gi568815597f:76599789_76800535|GENSCAN_predicted_CDS_4|816_bp atggctcaaaggggactagttacagcttgggctgctgcttcagatgttgcaagccataaa cattggcagcttccaggtggtgctaagtctgtgagcgcacagagtgcaagagttgaggct tgggagcctctgcctagatttcagaggatgtctgccccagaaccctacacatgcaggcat tcactgaattccttcaggaaagtggtggaggaagctcacactggaaattccagtttgact gcagagaaaataatctcctgttcagaaaatacctttgttcacttccagggaaaagaggct aaacaggaaaggaaacatggacaggatctgcctgaatgctatgtcaaaaatagaaggaag tcccatgaggataataaagaagatgatggtgacatcaataagccccacaggtacctgcag acatccagtgacttaggtggggtcacatatataggtagcctgtggctgttcagtacttcc aaagctgtaaatggggaaattcagagacctttgctcactgccattgattctgccactctc ccagcttgctgtgtgggtcgtggcagccttgctgagactgccagctcaggacgcaattac tgccaattgcattgggatattttgtacaaaatggactcccagtggctaagtgagaggctc gaagttaaaacagaaccaattggtcatggatgggtgagggagcagtcatgcgctctgtat tctcagaaggatgttgtaaaagtgtcacaggacctcgctttctacaatcaagctaaacta gtttctgttgtcagtgccaagataaactgtggctag >gi568815597f:76599789_76800535|GENSCAN_predicted_peptide_5|249_aa MAPSRKFFVGGNWKMNGRKQSLRELVRTLNAAKVPADTEVVCTPPTAYIDFARQKLDPKI AVAAQNCYKVTNGAFTGEISPGMIKDCRATWVVLGHSERRHVFGESDELIGQKVAHALAE GLGVIACTGEKLDEREAGITEKVVFEQTKVIADNVKDWSKVVLAYEPVWAIGTGKTATPQ QDQEVHDKLRGWLKSNVSDAVAQSTRIIYGGSVTGATCKELASQPGVDGFLVGGASLKPE FVDIINAKQ >gi568815597f:76599789_76800535|GENSCAN_predicted_CDS_5|750_bp atggcgccctccaggaagttcttcgttggggggaactggaagatgaacgggcgaaagcag agtctgcgggagctcgtccgcactctgaacgcggccaaggtgccggccgacaccgaggtg gtttgtactccgcctactgcctatatcgacttcgcccggcagaagctagatcccaagatt gctgtggctgcgcagaactgctacaaagtgactaatggggcttttactggggagatcagc cctggcatgatcaaagactgcagagccacgtgggtggtcctggggcactcagagagaagg catgtctttggggagtcagatgagctgattgggcagaaagtggcccatgctctggcagag ggactcggagtaatcgcctgcactggggagaagctagatgaaagggaagctggcatcact gagaaggttgttttcgagcagacaaaggtcatcgcagataacgtgaaggactggagcaag gtcgtcctggcctatgagcctgtgtgggccattggtactggcaagactgcaacaccccaa caggaccaggaagtacacgacaagctccgaggatggcttaagtccaacgtctctgatgcg gtggctcagagcacccgtatcatttatggaggctctgtgactggggcaacctgcaaggag ctggccagccagcctggcgtggatggcttccttgtgggtggtgcttccctcaagcccgaa ttcgtggacatcatcaatgccaaacaatga >gi568815597f:76599789_76800535|GENSCAN_predicted_peptide_6|275_aa MNKLDRLLARLIKKKREKKKIDTIKYDKGDITTDPTEIQATIREYCKHLYANKLENLEEI DKLLDTYTLPRLNQEEVKSLNRPITSSEIEAVTNSLPTKKSSGPDGFTAKFNQRDIDEAG NHHSQQTNTRKENQTPRILTHKWESNDENTWTQGGEHHTPGPVRVTATVACNGIALAERY GRGQARKSLLGQQFVIMKIAEAQIYPFKCSYQNPGSRPNTTLDENNNLDKGSGTGEQLKF LSADSTEEYMQTKICRGHGVAPMGISQRVLLDTKI >gi568815597f:76599789_76800535|GENSCAN_predicted_CDS_6|828_bp atgaacaaactagatagactgctagccagactaataaagaagaaaagagagaagaaaaaa atagacacaataaaatatgataaaggggatatcaccactgatcctacagaaatacaagct accatcagagaatactgtaaacacctctatgcaaacaaattagaaaatctagaagaaata gataaactcctggacacctataccctcccaagactaaaccaggaagaagtcaaatccctg aatagaccaataacaagttctgaaattgaggcagtaactaatagcctaccaaccaaaaaa agctcaggaccagatggattcacagccaagttcaaccagagggacatagatgaagctgga aaccatcattctcagcaaactaacacaagaaaagaaaaccaaacaccacgtattctcact cataagtgggagtcgaatgatgagaacacatggacacagggaggggaacatcacacacca gggcctgtcagggtaacagccaccgttgcctgcaatggaattgctctagcagaacgatat ggtagaggacaagccagaaaatctctcctgggtcaacaattcgtcataatgaaaattgca gaggcccaaatatatcctttcaaatgtagttatcagaaccctgggagtcggccaaacacc acactggatgaaaacaataatctagacaaagggtcaggcaccggagagcagttgaagttc ctatcagctgatagtacagaagaatatatgcaaactaaaatatgccgaggacatggtgta gctcccatgggaatatcccaaagggttttattggatactaagatataa >gi568815597f:76599789_76800535|GENSCAN_predicted_peptide_7|181_aa MRREKNQIDAIKNDKGGITTDPTGMQTTSREYYKHLYANKLENLEEMDKFLDTYTLPRLN QEEVESLNRPITGSEIEAIINSLPTKKSPEPDGFTAEFYQRYKAGPSAAGLLEFAGGPLQ TLFAWVSPVEAAEQQRLLACSFLWKLHPRGAPARCQPELSSVAILCQTHQSSKTDTAQPL L >gi568815597f:76599789_76800535|GENSCAN_predicted_CDS_7|546_bp atgagaagagagaagaatcaaatagatgcaataaaaaatgataaagggggtatcaccacc gatcccacaggaatgcaaactaccagcagagaatactataaacacctctatgcaaataaa ctagaaaatctagaagaaatggataaattcctcgacacctacaccctcccaagactaaat caggaagaagttgaatctctgaatagaccaataacaggctctgaaattgaggcaataatt aatagcttaccaaccaaaaaaagtccagaaccagatggattcacagctgaattctaccag aggtacaaggcaggcccctctgctgcaggtctgctggagtttgctggaggtcctctccag accctgtttgcatgggtatcaccagtggaggctgcagaacagcaaagattgcttgcctgt tccttcctctggaagcttcatcccagaggggcacctgccagatgccagccggagctctcc tctgtggctatactgtgccagacccatcagagctccaagaccgacacggcacagccactc ctctag >gi568815597f:76599789_76800535|GENSCAN_predicted_peptide_8|139_aa MLKIARIQYKASTCSVLDLTNEKNHESERSLSQQGGGQGDVQSGISRASTHSAAAARVAA ELWPACAWGAEIGRCICPGQPAVNLTIGSSEGSSTARPTLTFLPTHETATPCMLYCFLTI LPALPVISNSLCARVHDLI >gi568815597f:76599789_76800535|GENSCAN_predicted_CDS_8|420_bp atgctgaagatagccaggatccagtacaaggcaagtacctgctccgtgcttgatttaacc aatgagaaaaaccatgaaagcgaaaggagtctgtcccaacaaggtggagggcagggggat gtgcagagtgggattagcagggcatcaacccacagcgcggcagctgcccgggtggcagca gagctctggcccgcgtgtgcctggggcgccgagatcggccgatgcatttgcccagggcag ccagctgtgaatttgactatcggcagcagcgagggcagctccacagctcgccccacactc actttccttcccactcatgaaacagccacaccatgcatgctctattgtttcctaacaatt cttcctgcgctgccggttatcagcaatagtctgtgcgctcgagtgcatgacctcatttaa