GENSCAN 1.0 Date run: 4-Nov-116 Time: 11:24:37 Sequence gi568815595r:156709551_156910732 : 201182 bp : 38.72% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.10 PlyA - 1855 1850 6 1.05 1.09 Term - 4743 4334 410 1 2 9 38 330 0.715 14.49 1.08 Intr - 5352 4787 566 2 2 50 -7 304 0.438 8.61 1.07 Intr - 8622 8582 41 0 2 106 92 59 0.628 4.20 1.06 Intr - 10041 9913 129 0 0 59 73 86 0.755 4.17 1.05 Intr - 31399 31221 179 2 2 47 68 126 0.135 5.22 1.04 Intr - 32050 31886 165 2 0 43 49 173 0.548 7.91 1.03 Intr - 32404 32077 328 1 1 101 68 148 0.611 8.45 1.02 Intr - 41633 41545 89 0 2 63 101 56 0.013 3.17 1.01 Init - 61950 61857 94 0 1 55 62 82 0.039 2.89 1.00 Prom - 88193 88154 40 -4.15 2.06 PlyA - 89278 89273 6 1.05 2.05 Term - 89487 89335 153 0 0 59 42 130 0.353 2.44 2.04 Intr - 97180 97082 99 1 0 101 3 143 0.525 6.49 2.03 Intr - 97645 97577 69 1 0 58 103 39 0.628 0.86 2.02 Intr - 100788 99818 971 1 2 -5 -18 931 0.312 62.98 2.01 Init - 101182 101092 91 1 1 67 32 146 0.590 7.60 2.00 Prom - 102717 102678 40 -10.65 3.00 Prom + 102772 102811 40 -11.04 3.01 Init + 103815 103997 183 2 0 101 30 65 0.430 1.16 3.02 Term + 106490 106720 231 1 0 71 54 259 0.971 16.29 3.03 PlyA + 107748 107753 6 1.05 4.05 PlyA - 108027 108022 6 1.05 4.04 Term - 113612 113284 329 0 2 58 37 157 0.178 1.49 4.03 Intr - 117354 117082 273 1 0 57 60 145 0.289 5.19 4.02 Intr - 120774 120622 153 1 0 62 55 87 0.528 1.92 4.01 Init - 128353 128266 88 1 1 90 90 121 0.928 13.35 4.00 Prom - 132067 132028 40 -6.85 5.00 Prom + 136525 136564 40 -5.65 5.01 Init + 139382 139786 405 1 0 41 86 133 0.118 5.04 5.02 Intr + 143218 143437 220 0 1 72 27 174 0.169 6.65 5.03 Term + 151766 151881 116 0 2 91 32 107 0.561 3.15 5.04 PlyA + 152790 152795 6 1.05 6.02 PlyA - 152807 152802 6 1.05 6.01 Sngl - 166882 166484 399 1 0 77 41 194 0.993 9.61 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:156709551_156910732|GENSCAN_predicted_peptide_1|666_aa MSVGYLTGDIKMSVGHLTGGMKTGEIRVDVIDSFGSSAFLSEGLKSLSRISQEKGMRMRV MTPPSRFASPGKLNCPWSVIPRGQATLGWNFLHCPPVSQQKKPRALPSDVGQHAESPSSC PSVSGIGGFLVSLTLTMKPRTLAVSVTALKVVRLELVPSDVQMCSEFLPSGVKLQTFAGS VTALKAVRLEFFIPPGGLVVALASEVKLQTFVVSVTAHKSSVDPKPLGWSMGLGAMEQGV ALVGEARAAQEPTEGMGGSGMAAAGPEPCPTGRQLRTGEELSAAPKVLSEYIESAITSYH KLGNYKQQKFISHSLKSGKSKIEVLADLPTQREDDEPHNFVRMYKECLSCWLESGIPNLG VWPKRIHTTAEKYREYEAREQTDQTQVQELHRSQDRDFETMAKLHIPVMVDEVVHCLSPQ KGQIFLDMTFGSGGHTKAILQKESDIVLYALDRDPTAYALAEHLSELYPKQIRAMLGQFS QAEALLMKAGVQPGTFDGVLMDLGCSSMQLDTPERGSSLRKDGPLDIRMDDLLQQSTYIA TKTFQALRIFVNNELNELYTGLKTAQKFLRLGGRLVALSFHSLEDRIVKRFLLGISMTER FNLSVRQQVMKTSQLGSDHENTEEVSMRRAPLMWELIHKKVFSPQDQDVQDNPRGRSAKL RAAIKL >gi568815595r:156709551_156910732|GENSCAN_predicted_CDS_1|2001_bp atgtctgttggatatctgactggagatataaagatgtctgttggacatctgactggaggt atgaagactggagaaataagagtggatgtgattgactcatttggaagcagtgcattttta tcagagggactgaaaagcctttcgagaatcagtcaagaaaaaggaatgcgcatgagagtc atgaccccaccttcaaggtttgcttcaccagggaaactcaactgcccctggtctgtgatc ccaaggggtcaggccactttaggctggaacttcctccactgcccacctgtgagtcaacaa aagaagccaagagctttaccctcagacgtaggccagcatgctgaaagcccgtcttcctgc cctagtgtgtctggaattggtgggttcttggtctcactgactttaacaatgaagccgcgg accctcgcggtgagtgttacagctcttaaggtggtgcgtctggagcttgttccttctgat gttcagatgtgttcggagtttcttccttctggagtgaagctgcagacctttgcggggagt gttacagctcttaaggcagtgcgtctggagtttttcattcctcccggtgggctcgtggtc gcgctggcttcagaagtgaagctacagaccttcgtggtgagtgttacagctcataaaagc agtgtggacccaaagccccttgggtggtcgatgggactgggtgccatggagcagggggtg gcactcgtcggggaggctcgggccgcacaggagcccacggagggtatgggaggctcaggc atggcggctgcaggtcccgagccctgccccacgggaaggcagctaaggacaggcgaggaa ttgagcgcagcgccgaaagtcctgtctgagtacattgagagtgctataacaagttaccat aaactgggaaattataaacaacagaaatttatttctcacagtttgaagtctggaaagtcc aagattgaggtgctggcagatttgcctactcaacgtgaagatgatgagcctcataacttt gtgagaatgtataaagaatgcctttcatgttggttggaatctggcatacctaatttaggt gtctggccaaaaagaatacatactacagcagaaaaatatagagaatatgaagcccgggag caaacagatcaaactcaagtccaggagttacacagatctcaagatagagattttgaaact atggctaaattacatattccagtaatggtggatgaagttgttcattgtttgtcaccacaa aaaggacagatttttctagatatgacatttggttcgggagggcacacaaaagccattctg cagaaggagtcagatattgttctctatgccttggacagagacccaacagcttatgcatta gctgaacatctttcagagttgtatcctaaacaaatccgagctatgctgggccagttcagc caggcagaagccttattaatgaaagctggagtgcagccaggaacttttgatggagttctt atggatcttgggtgttcctccatgcaacttgatactcctgaaagaggttcttcccttcgg aaagatggccctttggacataagaatggatgacttactacagcaatctacctatattgcc accaagactttccaggctcttcgcatatttgtgaacaatgagctcaatgaactctacacg ggactgaagacagctcagaagtttctgagacttggtggtcgtcttgttgccctctccttc cattcactagaggatcgcatcgtcaaaagatttttgcttggaataagcatgacagaaaga tttaacctaagtgttagacaacaagtgatgaaaacatctcaattgggttcagatcatgaa aacacggaagaagtctctatgagaagagctcctttaatgtgggaactgatacacaagaag gtatttagtccacaagatcaggatgtacaagataaccccagagggcgctcagccaagctt agagcagctatcaaattataa >gi568815595r:156709551_156910732|GENSCAN_predicted_peptide_2|460_aa MSGKDEQQEQTIAEDLVVTKYKMEGDIANRGDQVTGRKADVIKAAHLCAEAALRLVKPGN QNTQVTQAWNKVAHSFNCTPVEGMLSHQLKQHVIDGEKTIIQNPTDQQKKDHEKAEFDIH KVYAVDVLVSSGEGKPKDAGQRTTIYKQDPSKQYGLKMKTSRAFFSEVETRFDAMPFTLR AFEDEKKAQMGVVECAKHELLQPFNVLYEKEGEFVAQFKFTVLLMPSGPMRITSGPFEPD LCKSEMEVQDAELKALLQSSTSRKTQKKKTKKASKTAENVASGETLEENEAGDRWVPSPQ LAAPASSPSHHTPDSVKRSSSSPPRTASRAGVSLPPPQFPNPLPSNNNQLQLTLEEAAVH LRIGVELGEDKKRSWQQGRKIILEMSEEVVGGGGGGGSGRGGGGGVVCTCVPNIGQGATL HEHLLNAIDLLNELIFLFKEKEDNCEGLKEGKVNYQKKAL >gi568815595r:156709551_156910732|GENSCAN_predicted_CDS_2|1383_bp atgtcgggcaaggatgagcagcaggagcaaactatcgctgaggacctggtcgtgaccaag tataagatggagggtgacatcgccaacaggggggaccaagtaacagggaggaaggcagat gttattaaggcagctcacctttgtgctgaagctgccctacgcctggtcaaacctggaaat cagaacacacaagtgacacaagcctggaacaaagttgcccactcatttaactgcacgcca gtagaaggtatgctgtcacaccagttgaagcagcatgtcattgatggagaaaaaaccatt atccagaatcccacagaccagcagaagaaggaccatgaaaaagctgaatttgacatacat aaagtgtatgctgtggatgttcttgtcagctcaggagagggcaagcccaaggatgcagga cagagaaccactatttacaaacaagacccttctaaacagtatggactgaaaatgaaaact tcacgtgccttcttcagtgaggtggaaacgcgttttgatgccatgccgtttactttaaga gcatttgaagatgagaagaaggctcagatgggtgtggtggagtgcgccaaacatgaactg ctgcaaccatttaatgttctctatgagaaggagggtgaatttgttgcccagtttaaattt acagttctgctcatgcccagtggccccatgcggataaccagtggtcccttcgagcctgac ctctgcaagtctgagatggaggtccaggatgcagagctaaaggccctcctccagagttct acaagtcgaaaaacccagaaaaagaaaacaaagaaggcctccaagactgcagagaatgtc gccagtggggaaacattagaagaaaatgaagctggggacaggtgggtcccatctccccag cttgctgctcctgcctcatccccttcccaccacaccccggactctgtgaagcgcagttct tcttctccacctaggaccgccagcagagcaggggtctccctgcccccaccccagttcccc aacccactcccttccaacaacaaccagctccaactgactctggaagaagcagcagtacac ttaagaattggagtagagttgggggaagataaaaagagaagttggcagcaggggagaaaa attatcttagaaatgtctgaagaagtagtaggaggaggaggaggaggaggaagtgggaga ggaggaggaggaggagtagtgtgcacgtgtgtcccaaatataggccaaggtgccacatta catgagcatctactaaatgcaattgatcttctaaatgaattgatcttcttgtttaaagaa aaggaagacaactgtgaaggtttgaaggaaggcaaagtgaactaccagaaaaaggcattg tga >gi568815595r:156709551_156910732|GENSCAN_predicted_peptide_3|137_aa MQSANCTSAYDSGKTYTHRLNMMAERIKKHEANFSGENDGKQSIVCFCLMIFLTGDFDEG LNSSDAPTGDHLEGADPPPPPSAKRGAAAATRPEPALSCAALRDWLHFSAPTAVVTQTDL IAGRHDTRFPSPARGVA >gi568815595r:156709551_156910732|GENSCAN_predicted_CDS_3|414_bp atgcagtcagcaaactgcacgagtgcctatgactctggtaaaacttatacacatagactt aatatgatggcagagagaattaagaaacatgaggccaactttagtggggagaatgatggg aagcaaagtattgtatgcttttgtcttatgattttcttaacaggagactttgatgaagga ctgaatagttctgacgcccccaccggcgaccacctggaaggggctgatccacccccacct ccctctgcgaagcgcggagcggctgcggccacccggccagagccagcgctgagctgcgct gcgctgcgtgactggctgcatttctcagccccgacggctgtggttactcaaactgatctc atcgcaggtcgtcacgacacgcgcttcccctcgcctgcacgcggcgtggcgtga >gi568815595r:156709551_156910732|GENSCAN_predicted_peptide_4|280_aa MAPTAMVQQAASPKLLRDKEPGKLACRQKAYEIALNMMKNTQELQDHFFLKGRTAHMAMI ITQCFKEIITMPELTAKATGVSRQAINSRSAFKPLRKEPFPWRESPTTSRSGAGEGGSSH PKTFLKRQALPHRNQPQGQTKLLEWLSNLSLSNPVPPGPAVAGRGGGGGCGVALEEPFSP LLHCGSPFLGWPRPELAPSACREVWRERGERELGLRAVLAGQLEFRVGVGLAGPALGAAS RPCQPRAMRGLAPRPVAVEVVLGPPAVPAHWRCTRFLAGP >gi568815595r:156709551_156910732|GENSCAN_predicted_CDS_4|843_bp atggcccccacagctatggtacagcaagcagcatcaccaaagctgctgagagataaggaa cctggaaagctagcatgccggcaaaaggcttatgaaattgcactaaacatgatgaaaaat acacaagaacttcaagatcactttttcctgaagggaagaactgctcacatggccatgatt atcacacagtgttttaaggagataatcacaatgcctgagctcactgcaaaagcaacagga gtgtcacgtcaggccattaactcacgttcagctttcaagcccttacgtaaagaaccattc ccatggcgtgaatcacccacaacgagcaggtcaggggcaggtgaaggcgggagttcacat cctaagactttcctgaagcggcaggctcttccacacagaaaccaaccgcaaggccagaca aaactgctggaatggctttcaaatctgtccctttctaaccccgttccccctggacctgca gtggccggaagaggaggaggaggagggtgcggggtggcacttgaggaacccttcagccca ctgctgcactgtgggagcccctttctgggctggccaaggccagagctggctccctcagct tgcagggaggtgtggagggagaggggcgagcgggaactggggctgcgcgccgtgcttgcg ggccagctggagttccgggtgggcgtgggcttggcaggccccgcactcggagcagccagc cggccctgccagccacgggcaatgaggggcttagcacccaggccagtggctgtggaggtt gtactgggtccccccgcagtgccagcccactggcgctgcactcgatttctcgctgggcct tag >gi568815595r:156709551_156910732|GENSCAN_predicted_peptide_5|246_aa MCIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLNLISNFSKVSGYKINVQKSQAFLYTN NRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKSLLNEIKEDRNKWKNISCS WVGRINIMKMAILPKMLPEEKVCKYCGVSYLILHEFKAMEEKVKAMEKEMKFYQGSVDRE KRLQEKLHSLSQELEQYKIDNKSKTERLTLFGVYGSSVKGIYAKNDWDNLILNNQDLIEQ RTDAEL >gi568815595r:156709551_156910732|GENSCAN_predicted_CDS_5|741_bp atgtgcattcagttaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgatt gtatatctagaaaaccccattgtctcagcccaaaatctccttaacctgataagcaacttc agcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattcttatacaccaat aacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttcaaagaga ataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactac aaatcacttctcaatgaaataaaagaggatagaaacaaatggaagaacatttcatgctca tgggtaggaagaatcaatatcatgaaaatggccatattgcccaagatgttgcctgaagaa aaagtttgtaagtactgtggagtcagctatctaattcttcatgaatttaaggctatggaa gaaaaagtgaaagcaatggaaaaagagatgaaattttatcaaggaagtgtagatcgtgaa aagagacttcaagaaaagctgcattctcttagccaagaacttgaacagtacaaaattgac aacaaatccaaaacagaaaggttgacactttttggtgtttatggaagtagtgtaaagggc atttatgctaaaaatgattgggacaacttaattttgaataatcaagatttgatagaacaa agaactgatgcagaactgtag >gi568815595r:156709551_156910732|GENSCAN_predicted_peptide_6|132_aa MTILPKAIHRFKAISIKVPSSSFTELEKTILKFLWNQKKTCTAKASLGKKKKCEDITLPN FKLYYKAMVTKTAWYWYKNRHVDQWNRRENPEIKPNTYSHLIFNNKKQNQSRERTPYSTN GATIIGKPHVEE >gi568815595r:156709551_156910732|GENSCAN_predicted_CDS_6|399_bp atgaccatactgccaaaagcaatccatagattcaaagcaatttctatcaaagtaccatca tcatccttcacagaactagaaaaaacaatcttaaaattcttatggaaccaaaaaaagacc tgcacagctaaagccagtctaggcaaaaagaaaaaatgtgaagacatcacattacccaac ttcaaactatactacaaggctatggttaccaaaacagcatggtactggtataaaaacagg catgtagaccaatggaacagaagagagaacccagaaataaagccaaatacttacagccac ctgattttcaacaataaaaaacaaaaccaaagtagggaaaggacaccctattccacaaat ggtgccacgataattggcaagccacatgtagaagaataa