GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:20:05 Sequence gi568815578r:3834066_4064017 : 229952 bp : 44.97% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 20560 20676 117 0 0 91 99 57 0.994 7.30 1.02 Intr + 23570 23744 175 1 1 58 54 190 0.982 12.11 1.03 Intr + 27267 27439 173 1 2 112 58 90 0.999 8.06 1.04 Intr + 28189 28348 160 0 1 98 85 137 0.989 13.96 1.05 Intr + 30359 30723 365 0 2 105 105 256 0.894 23.90 1.06 Intr + 31618 31974 357 0 0 94 35 338 0.512 24.55 1.07 Intr + 32327 32498 172 1 1 61 91 33 0.392 0.42 1.08 Intr + 32788 32922 135 2 0 107 79 31 0.535 4.64 1.09 Term + 39862 39989 128 2 2 41 45 115 0.433 0.94 1.10 PlyA + 40315 40320 6 1.05 2.00 Prom + 41356 41395 40 -4.66 2.01 Init + 54774 54951 178 2 1 84 27 224 0.914 13.35 2.02 Intr + 55256 55663 408 0 0 37 96 234 0.883 13.34 2.03 Intr + 73861 74213 353 2 2 77 19 294 0.974 16.44 2.04 Intr + 76512 76765 254 2 2 87 97 146 0.957 11.63 2.05 Intr + 78393 78569 177 0 0 40 116 81 0.928 5.13 2.06 Intr + 82862 82985 124 2 1 54 72 113 0.983 6.89 2.07 Intr + 84606 84731 126 2 0 72 99 100 0.996 10.38 2.08 Intr + 88719 88884 166 2 1 99 78 -12 0.264 -1.57 2.09 Intr + 91881 91940 60 1 0 65 94 31 0.067 0.11 2.10 Term + 97431 97525 95 1 2 55 40 127 0.217 2.59 2.11 PlyA + 98001 98006 6 -0.45 3.06 PlyA - 98242 98237 6 -0.45 3.05 Term - 100030 99906 125 2 2 45 48 68 0.299 -2.95 3.04 Intr - 100285 100057 229 1 1 135 72 98 0.360 10.34 3.03 Intr - 111153 111112 42 0 0 98 107 17 0.545 3.04 3.02 Intr - 149066 149020 47 0 2 55 77 62 0.012 -0.07 3.01 Init - 181555 181372 184 1 1 24 91 189 0.973 10.08 3.00 Prom - 189449 189410 40 -4.76 4.00 Prom + 189629 189668 40 -12.78 4.01 Sngl + 189852 190379 528 2 0 90 48 818 0.962 74.06 4.02 PlyA + 190629 190634 6 1.05 5.04 PlyA - 192498 192493 6 1.05 5.03 Term - 212083 212016 68 2 2 46 41 109 0.201 0.10 5.02 Intr - 223487 223321 167 1 2 -4 92 122 0.024 3.00 5.01 Init - 225896 225565 332 2 2 64 4 194 0.058 5.39 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:3834066_4064017|GENSCAN_predicted_peptide_1|593_aa MPFAEDKTYKYICRNFSNFCNVDVVEILPYLPCLTARDQDRLRATCTLSGNRDTLWHLFN TLQRRPGWVEYFIAALRGCELVDLADEVASVYQSYQPRTSDRPPDPLEPPSLPAERPGPP TPAAAHSIPYNSCREKEPSYPMPVQETQAPESPGENSEQALQTLSPRAIPRNPDGGPLES SSDLAALSPLTSSGHQEQDTELGSTHTAGAAEGKQGAESDQAEPIICSSGAEAPANSLPS KVPTTLMPVNTVALKVPANPASVSTVPSKLPTSSKPPGAVPSNALTNPAPSKLPINSTRA GMVPSKVPTSMVLTKVSASTVPTDGSSRNEETPAAPTPAGATGGSSAWLDSSSENRGLGS ELSKPGVLASQVDSPFSGCFEDLAISASTSLGMGPCHGPEENEYKSEGTFGIHVAENPSI QLLEGNPGPPADPDGGPRPQADRKFQERECSRPSPLAIQVIIHVLQRSFLLQASALLAQV EQEGPLEHVVLGNASPVALVPEGLRAAPYMGWGGDTWWAELRQRFGFQLPHFRGCVALAD VRLLVLLLHVDKSYVHRDIGKNVSNHTLTVAPNLKQPKCPSTNPNVHPQLKLQ >gi568815578r:3834066_4064017|GENSCAN_predicted_CDS_1|1782_bp atgccgtttgctgaagacaagacctataagtatatctgccgcaatttcagcaatttttgc aatgtggatgttgtagagattctgccttacctgccctgcctcacagcaagagaccaggat cgactgcgggccacctgcacactctcagggaaccgggacaccctctggcatctcttcaat acccttcagcggcggcccggctgggtggagtacttcattgcggcactgaggggctgtgag ctagttgatctcgcggacgaagtggcctctgtctaccagagctaccagcctcggacctcg gaccgtcccccagacccactggagccaccgtcacttcctgctgagaggccagggcccccc acacctgctgcggcccacagcatcccctacaacagctgcagagagaaggagccaagttac cccatgcctgtccaggagacccaggcgccagagtccccaggagagaattcagagcaagcc ctgcagacgctcagccccagagccatcccaaggaatccagatggtggccccctggagtcc tcctctgacctggcagccctcagccctctgacctccagcgggcatcaggagcaggacaca gaactgggcagtacccacacagcaggggctgcagagggtaaacagggtgcagagagtgac caggccgagcctatcatctgctccagtggggcagaggcacctgccaactctctgccctcc aaagtgcctaccaccttgatgcctgtgaacacagtggccctgaaagtgcctgccaaccca gcatctgtcagcacagtgccctccaagttgccaactagctcaaagccccctggtgcagtg ccttctaatgcgctcaccaatccagcaccatccaaattgcccatcaactcaacccgtgct ggcatggtgccatccaaagtgcctactagcatggtgctcaccaaggtgtctgccagcaca gtccccactgacgggagcagcagaaatgaggagaccccagcagctccaacacccgccggc gccactggaggcagctcagcctggctagacagcagctctgagaataggggccttgggtcg gagctgagtaagcctggcgtgctggcatcccaggtagacagcccgttctcgggctgcttc gaggatcttgccatcagtgccagcacctccttgggcatggggccctgccatggcccagag gagaatgagtataagtccgagggcacctttgggatccacgtggctgagaaccccagcatc cagctcctggagggcaaccctgggccacctgcggacccggatggcggccccaggccacaa gccgaccggaagttccaggagagggagtgctccagaccttccccactggcaatccaggtt atcatccatgtcctccagaggagcttcctcctccaggcctcagccctgttggcccaggtg gagcaggagggaccactggaacatgtggtgcttgggaatgcctctcctgttgcattggtc cctgaaggcctcagggcagctccctacatgggctggggaggagacacctggtgggcagag ctcaggcagaggtttggatttcagctccctcacttccggggctgtgtggctttggcagat gtcagacttctggtcttgcttctccacgtggacaaaagctatgtccaccgagacattggc aagaatgtttctaaccacacgctgactgtagccccaaacctgaaacaacccaaatgtcca tccaccaacccaaatgtccatccacagttgaagctacagtga >gi568815578r:3834066_4064017|GENSCAN_predicted_peptide_2|646_aa MLGEGLAASTAAAELGRGTKAKVSRGSSPFVCRRPAARRCGSTAGLEEGSSCEDGTEQRA SSDWLPARWRNGRGGRPRARLCSGWTAAEEARRNPTLGGLLGRQRLLLRMGGGRLGAPME RHGRASATSVSSAGEQAAGDPEGRRQEPLRRRASSASVPAVGASAEGTRRDRLGSYSGPT SVSRQRVESLRKKRPLFPWFGLDIGGTLVKLVYFEPKDITAEEEEEEVESLKSIRKYLTS NVAYGSTGIRDVHLELKDLTLCGRKGNLHFIRFPTHDMPAFIQMGRDKNFSSLHTVFCAT GGGAYKFEQDFLTIGDLQLCKLDELDCLIKGILYIDSVGFNGRSQCYYFENPADSEKCQK LPFDLKNPYPLLLVNIGSGVSILAVYSKDNYKRVTGTSLGGGTFFGLCCLLTGCTTFEEA LEMASRGDSTKVDKLVRDIYGGDYERFGLPGWAVASSFGNMMSKEKREAVSKEDLARATL ITITNNIGSIARMCALNENINQVVFVGNFLRINTIAMRLLAYALDYWSKGQLKALFSEHE LSRALTLCDPPHNFTCFIALCIRPRLHSLVVWKCSRPLLLLTSFPRNSQVLSLTRAVPFS LPSTSGHEGGCDRGPAMQMRRPILVEDPDAGSKYSGCVAKSRRNYC >gi568815578r:3834066_4064017|GENSCAN_predicted_CDS_2|1941_bp atgctgggggaggggctggcggcctcgacggcagctgcggaactaggccgagggacaaag gctaaggtcagccgcggttcaagccctttcgtctgccgacgaccagcggccagacgctgc gggagcactgctgggctggaggagggctcgagctgcgaggacggcacggagcagcgggcc tcgtcggattggcttcctgcgcgttggcgcaacggaagaggcggccggccgagggcgcgc ctctgctctggctggactgccgcggaggaggcgagaaggaatccgacgctggggggcttg ctcgggcggcagcgactgctgctgcggatgggagggggccggctcggcgcgcccatggag cgccacggcagggcttccgccacctccgtctcgtcggctggggagcaggcggccggggac cccgaagggcggcggcaggagccactgcggcgccgggcgagcagcgcgtcggtgcccgcg gtcggggcctcggctgagggcacgaggcgggatcgactgggctcttacagcggccccacc tcggtctcccgccagcgcgtcgaaagcctgaggaaaaagcggccgctttttccatggttt ggactggatatcggtggaactctggtcaagctggtatattttgaacccaaagacatcact gctgaagaagaagaggaagaagtggaaagtcttaaaagcattcggaagtacctgacctcc aatgtggcttatgggtctacaggcattcgggacgtgcacctcgagctgaaggacctgact ctgtgtggacgcaaaggcaatctgcactttatacgctttcccactcatgacatgcctgct tttattcaaatgggcagagataaaaacttctcgagtctccacactgtcttttgtgccact ggaggtggagcgtacaaatttgagcaggattttctcacaataggtgatcttcagctttgc aaactggatgaactagattgcttgatcaaaggaattttatacattgactcagtcggattc aatggacggtcacagtgctattactttgaaaaccctgctgattctgaaaagtgtcagaag ttaccatttgatttgaaaaatccgtatcctctgcttctggtgaacattggctcaggggtt agcatcttagcagtatattccaaagataattacaaacgggtcacaggtactagtcttgga ggaggaactttttttggtctctgctgtcttcttactggctgtaccacttttgaagaagct cttgaaatggcatctcgtggagatagcaccaaagtggataaactagtacgagatatttat ggaggggactatgagaggtttggactgccaggctgggctgtggcttcaagctttggaaac atgatgagcaaggagaagcgagaggctgtcagtaaagaggacctggccagagcgactttg atcaccatcaccaacaacattggctcaatagcaagaatgtgtgcccttaatgaaaacatt aaccaggtggtatttgttggaaatttcttgagaattaatacgatcgccatgcggcttttg gcatatgctttggattattggtccaaggggcagttgaaagcacttttttcggaacacgag ctcagtcgggctctcaccctctgtgacccacctcacaacttcacctgtttcattgcactt tgcatccgacctcgccttcattccttagttgtctggaaatgctccaggcctctgctgctt ctcacctcattccccagaaacagtcaggttctgtctcttaccagagctgtccctttcagt ctgccctccactagtggccatgaagggggctgtgaccgtggtccagcaatgcagatgcgc aggcccatcctggtggaggacccagatgcagggagcaaatattcgggttgtgttgctaag agtcgcaggaactactgctag >gi568815578r:3834066_4064017|GENSCAN_predicted_peptide_3|208_aa MRGAAPAPPPAGPPLARGRRAQSAAATSAALCPDGRSPPAACRPPQGADGPARLPLPPAR PILEAKKFKIKILASGKVILKEKVKELNLHEGVTSVHHLGGELLGPYSRRCLGDRYGDVQ QNSRAMSIGLTSSPCGVSFCRCLIKWLEVRKVCPLCNMPVLQLAQLHRAPSWGREHCIAY RKDQTVAGHDVCVEPGGTHVVSVWLLST >gi568815578r:3834066_4064017|GENSCAN_predicted_CDS_3|627_bp atgagaggcgcggcgcccgcccctcccccggcgggcccgcccctcgcgaggggccgccgg gcgcagtcagccgccgccacctccgccgcgctctgtccggacggacgctcgccgcccgca gcctgccgcccgccgcagggcgctgacggtcccgcacgtctgccgctgccaccggcgcgc ccgattctagaagccaagaagttcaagatcaagatactggcatctggcaaggttatatta aaagagaaagtaaaagaattgaatttacatgagggggtgacatccgttcaccacctagga ggggagctccttggaccatacagcagacgttgtctgggtgacagatatggagatgttcag cagaacagccgtgccatgagcataggactgacatcttccccctgtggtgtctccttctgc aggtgccttattaagtggctggaggttcgtaaagtgtgtcccctgtgcaacatgccagtt ctacagctggcccagttgcacagggccccttcctggggcagagaacattgtatagcttac cgcaaggatcagactgttgctggacacgacgtctgtgtggagccaggaggaacacatgtg gtgtctgtatggctgctctctacctag >gi568815578r:3834066_4064017|GENSCAN_predicted_peptide_4|175_aa MSSQIRQNYSTDVEAAVNSLVNLYLQASYTYLSLGFYFDRDDAALEGVSHFFRELTEEKR EGYERLLKMQNQRGGRALFQDIKKPAEDEWGKTPDAMKAAMALEKKLNQALLDLHALDSA HMDPHLCDFLETHFLDEEVKLIKKMGDHLTNLHRLGGPEAGLGEYLFERLTLKHV >gi568815578r:3834066_4064017|GENSCAN_predicted_CDS_4|528_bp atgagctcccagattcgtcaaaattattccaccgacgtggaggcagccgtcaacagcctg gtcaatttgtacctgcaggcctcctacacctacctctctctgggcttctatttcgaccgc gatgatgcggctctggaaggcgtgagccacttcttccgcgaattgaccgaggagaagcgc gagggctacgagcgtctcctgaagatgcaaaaccagcgtggcggccgcgctctcttccag gacatcaagaagccagctgaagatgagtggggtaaaaccccagatgccatgaaagctgcc atggccctggagaaaaagctgaaccaggcccttttggatcttcatgccctggattctgcc cacatggacccccatctctgtgacttcctggagactcacttcctagatgaggaagtgaag ctcatcaagaagatgggtgaccacctgacgaacctccacaggctgggaggcccagaggct gggctgggcgagtatctcttcgaaaggctcactctcaagcacgtctaa >gi568815578r:3834066_4064017|GENSCAN_predicted_peptide_5|188_aa MPLKCHHGKTGRVCNVPQHAVGIVLNKQVKGKILAKRINVHIEHRKHSKSRDGFLRRVKE NDQTKKEVKEKGTWVQPKRQPAPPREAHFVRTNEKEPELLEPLPYEFRHNRACSEIPQRN DAAREVGTAGMMGIGQAERPGNDPKVMGSHGRVKIREDMIGFEIKKGRSSPLSATDNAKI LWDTFEGS >gi568815578r:3834066_4064017|GENSCAN_predicted_CDS_5|567_bp atgcccctcaagtgtcaccatggcaaaactggaagagtctgcaatgttccccagcatgcc gttggcattgttctaaacaaacaagttaagggcaagattcttgccaagagaattaatgtg catattgagcacaggaagcactctaagagccgagatggcttcctgagacgcgtgaaggaa aatgatcagacaaagaaagaagtcaaagagaaaggcacctgggttcaaccaaagcgccag cctgctccacccagagaagcacactttgtgagaaccaatgagaaggagcctgagctgctg gaacctcttccctatgaattcaggcataatagggcatgcagtgagatcccccagaggaat gatgccgccagagaggttggcacagctggaatgatgggaataggccaagctgaaagacct ggaaatgatcctaaggtaatgggaagccatggaagagttaagatcagggaggacatgatt ggattcgagatcaagaaagggaggtcctcacccctgtcagctacggacaatgcgaagatc ctgtgggacacctttgaaggcagctag