GENSCAN 1.0 Date run: 5-Nov-116 Time: 20:56:53 Sequence gi568815586r:101913111_102161962 : 248852 bp : 38.55% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 658 653 6 1.05 1.02 Term - 4221 4088 134 1 2 114 48 79 0.854 3.77 1.01 Init - 5508 5202 307 0 1 66 22 191 0.723 7.80 1.00 Prom - 8370 8331 40 -4.95 2.00 Prom + 9200 9239 40 -4.75 2.01 Init + 11196 11226 31 2 1 53 86 29 0.264 -0.85 2.02 Intr + 11305 11468 164 2 2 46 32 159 0.333 4.87 2.03 Intr + 11575 11756 182 0 2 71 86 86 0.586 4.44 2.04 Term + 23372 23621 250 2 1 49 48 308 0.518 17.09 2.05 PlyA + 24558 24563 6 1.05 3.04 PlyA - 27593 27588 6 1.05 3.03 Term - 31066 30935 132 1 0 52 46 87 0.010 -1.99 3.02 Intr - 42540 42264 277 2 1 71 51 213 0.116 12.50 3.01 Init - 43504 43419 86 1 2 49 113 42 0.861 3.17 3.00 Prom - 45393 45354 40 -9.15 4.22 PlyA - 45721 45716 6 1.05 4.21 Term - 46867 46646 222 1 0 28 50 156 0.539 1.73 4.20 Intr - 51253 51042 212 2 2 54 110 168 0.437 13.31 4.19 Intr - 51357 51330 28 0 1 132 106 -8 0.120 2.17 4.18 Intr - 53406 53354 53 1 2 75 78 51 0.045 0.51 4.17 Intr - 67173 67108 66 1 0 97 105 18 0.028 2.36 4.16 Intr - 93473 93218 256 2 1 56 43 165 0.545 4.89 4.15 Intr - 126868 126758 111 1 0 42 91 70 0.301 2.36 4.14 Intr - 131102 130995 108 2 0 57 111 99 0.728 8.66 4.13 Intr - 131925 131773 153 0 0 58 76 86 0.367 3.75 4.12 Intr - 133009 132944 66 1 0 106 81 45 0.370 3.78 4.11 Intr - 140084 139768 317 0 2 8 53 223 0.006 5.66 4.10 Intr - 146350 146269 82 1 1 105 61 33 0.240 0.59 4.09 Intr - 148236 148138 99 0 0 84 92 95 0.844 8.89 4.08 Intr - 161984 161891 94 1 1 105 95 45 0.944 5.95 4.07 Intr - 164393 164212 182 2 2 111 80 87 0.979 7.94 4.06 Intr - 166106 165949 158 0 2 90 78 49 0.552 2.91 4.05 Intr - 176197 175995 203 0 2 -1 -2 227 0.107 2.81 4.04 Intr - 186090 185996 95 0 2 106 91 102 0.749 10.14 4.03 Intr - 187994 187922 73 1 1 83 95 52 0.859 3.89 4.02 Intr - 199122 198998 125 0 2 49 108 96 0.979 6.16 4.01 Init - 205408 205253 156 1 0 70 92 101 0.977 8.66 4.00 Prom - 209516 209477 40 -3.65 5.03 PlyA - 209806 209801 6 1.05 5.02 Term - 226652 225755 898 1 1 34 40 364 0.934 17.05 5.01 Init - 227620 226968 653 1 2 69 72 234 0.791 14.71 5.00 Prom - 228081 228042 40 -6.45 6.02 PlyA - 228252 228247 6 1.05 6.01 Sngl - 229196 228651 546 2 0 46 42 266 0.686 13.65 6.00 Prom - 233051 233012 40 -5.55 7.00 Prom + 233489 233528 40 -5.65 7.01 Init + 235210 235353 144 0 0 93 95 84 0.758 9.77 7.02 Term + 238460 238801 342 1 0 46 34 288 0.950 12.73 7.03 PlyA + 239079 239084 6 1.05 8.02 PlyA - 239399 239394 6 1.05 8.01 Term - 242923 242813 111 1 0 94 45 133 0.646 7.18 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 35457 35562 106 0 1 119 106 28 0.839 7.00 S.002 Init - 52402 52207 196 1 1 48 80 113 0.835 5.64 S.003 Term - 100082 99998 85 1 1 101 43 118 0.944 4.75 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:101913111_102161962|GENSCAN_predicted_peptide_1|146_aa MVTVVFFDPVFIQTDRVEAHQALRASLFTLLVMVTLTHCFHARSHRGGPDQPTVGDTEQR LPPSGTSTRGPATSRIIPEQGRWLLKIKAELWGEFLKHDISESFHYLPEFSILGNPAVSS SHPGVPPLPLTSSPEQDSINIWLIAE >gi568815586r:101913111_102161962|GENSCAN_predicted_CDS_1|441_bp atggtgacagttgttttctttgatcctgttttcatccagacagatcgagtggaggcacac caagccctccgagcgtcactcttcactctccttgtcatggtcacactcacccactgtttt catgctaggtcacaccgaggaggacctgaccagcctacagtaggagatacagaacagagg cttcctccttctggtaccagtaccaggggacctgccacatcccgtatcatccctgaacaa ggaagatggcttctaaaaataaaggcagaactttggggggaatttcttaagcatgatatt agtgaaagcttccattacctgccagagttctccatattgggaaatcctgctgtctcctct agccaccctggtgttcctcctttacctcttacctccagtccagaacaggactcaataaac atttggttgatagctgaatga >gi568815586r:101913111_102161962|GENSCAN_predicted_peptide_2|208_aa MIPDPFGLYKDSQTLASEVYVTQDSEMKRSEIQQEGEKGIRNPTRRRKGNQKSNKKGKRE SILPQAGFPLIEVTGHLEPPDPQPIHKKRWKERTSQQFQPQAHELSPIGWLGSHAIGRSE ALIGQTLYQMLLVVAPQAQKDELILEGNYFELASNSEALIQQAMAIENKAIRRAVGGSPV SEKGTGPLADGCIPERLQMQRASLSVML >gi568815586r:101913111_102161962|GENSCAN_predicted_CDS_2|627_bp atgattccagatccatttggactttataaagattctcaaacgttagcatcagaagtttat gtcactcaagattcagagatgaagagatcagaaatccaacaagaaggagaaaagggaatc agaaatccaacaagaaggagaaaagggaatcagaaatccaacaagaaggggaaaagggaa tctatattgcctcaagcaggctttcccctgatagaagtgacagggcatctagaaccccca gacccacaacctatccacaaaaagaggtggaaagaaaggacttctcaacagttccaacca caagcccatgaattgtctcccattggctggcttggatcacatgccattgggaggtctgag gctctgattggtcagacgctataccagatgttgcttgttgtagctcctcaggcccaaaaa gatgagttaatccttgaaggaaattattttgaacttgcatcaaactcagaggctttgatc caacaggccatggcaattgaaaacaaggctatcagaagagctgtgggtggtagtcctgtc tctgagaaaggaacagggccactggctgacggctgcattccagaacgcctccagatgcag agggcctcacttagtgtgatgttgtga >gi568815586r:101913111_102161962|GENSCAN_predicted_peptide_3|164_aa MGHIWGPQAPGHGPVLVRGLLGTVPHSRSRHIIPPHVDPQSSRSGISCDAEAFRDHRKPI LMMNPGLSLPLEQSWFHTFTSKRALNSCHVAALRFGLKQHDRHTFPESSQEWSLGSPSHS GARSGGSEAKATLRLWPITVSSSPTPPSLLGLSGDTNQKPPTSI >gi568815586r:101913111_102161962|GENSCAN_predicted_CDS_3|495_bp atgggccatatctggggtccccaagccccaggtcacggaccagtactggtccgtggcctg ttaggaactgtgccacacagcagaagcaggcacatcatcccgcctcatgtggatccccag agcagcagatctggaatctcctgtgatgctgaggccttcagggaccacaggaaacccatc ttaatgatgaatcctggactttctttgcccttggaacagtcctggtttcacacatttacc tccaaaagagcacttaacagctgccacgtggctgccctgaggtttgggctgaaacaacat gaccgtcataccttcccagagtcctcccaagaatggtctctaggttctccttcgcacagt ggggctcgctctggaggttcagaggcaaaagccactctgaggctgtggcccatcacagtg tcctcctcccccacaccaccttcccttttggggctttctggagacaccaatcagaagcca cccacctccatttga >gi568815586r:101913111_102161962|GENSCAN_predicted_peptide_4|952_aa MKQDASRNAAYTVDCEDYVHVVEFNPFENGDSGNLIAYGGNNYVVIGTCTFQEEEADVEG IQYKTLRTFHHGVRVDGIAWSPETRLDSLPPVIKFCTSAADMKIRLFTSDLQDKNEYKVL EGHTDFINGLVFDPKEGQEIASVSDDHTCSEERLCLATVQSSKCEVTAFLQVYPTAPKRQ RPSRTGHDDDGGFVEKKRGKCGEKKERSDCYCVCVERNMKDNSRSLNRFKWYSSGVLMPN PTFLPIYHYYLHPQGFFNLIEPELKEHCLKLMVAEKNGTIRFYDLLAQQAILSLESEQVP LMSAHWCLKNTFKVGAVAGNDWLIWDITRSRWSTISENLFATTGYPGKMASQFQIHHLGH PQVPAIQQKRTVAFLNQFVVHTVQFLNRFSTVCEELFWLYLYSSTSLNSREKFVHQNLMK FKGSKVGLVLGTPGVDLEPEYVGVGLGPGCTVTSLEPESAGVVLQPPSMGANLAMQSTGV VPDSGSAGTYLDPDSPQGFVWGQLGAGASVQPGVTGARLEPEIIGAGLKLADLSLRIQQI ETTLNILDAKEFLGVRVQGRTLNDHMDKLTLVISMAIESSPKLAKHRQMRVIDIRMNSEK KLSSIPGLDDVTVEVSPLNVTSVTNGAHPEATSEQPQQNSTQDSGLQESEVSAENILTVA KDPRYARYLKMVQVVAFTSGLLSLCCRWLNATTCLLPLRNPIISISTKEPSSTASYPTVN PGLLGTASDDFIKDVDVTLSDILYYLNQESVLWALPGKNGQNPTHPFYEIQCSVLASLLI PEFCTKIELSTNTDIEKMKVKHLFYSNLEPAPADETSEPLLLVKKGLVFSLEVQVGAAKL IENCLSPVADNWVFGFVHTGEDIIHGLQLLGYETCGVSQLPSYERYMEKHGFVPLWTVTA VLPAENQHQLPAIQGSHLACSSPVEPSDKCSPRQQHVGQKNHPAEPSQLTES >gi568815586r:101913111_102161962|GENSCAN_predicted_CDS_4|2859_bp atgaagcaagatgcctcaagaaatgctgcctacactgtggattgtgaagattatgtgcat gtggtagaatttaatccctttgagaatggggattcaggaaacctaattgcatatggtggc aataattatgtggtcattggcacgtgtacgtttcaggaagaagaagcagacgttgaaggc attcagtataaaacacttcgaacatttcaccatggagtcagggttgatggcatagcttgg agcccagagactagacttgattcattgcctccagtaatcaaattttgtacttcagctgct gatatgaaaattagattatttacttcagatcttcaggataaaaatgaatataaggtttta gagggccataccgatttcattaatggtttggtgtttgatcccaaagaaggccaagaaatt gcaagtgtgagtgacgatcacacctgcagtgaggagcgcctctgcctggccactgtgcaa tcttccaagtgtgaagtgacagcctttctgcaggtgtacccaacagctccgaagagacag cgaccatcgagaacgggccatgatgacgatggcggttttgtcgaaaagaaaagggggaaa tgtggggaaaagaaagagagatcagattgttactgtgtctgtgtagaaagaaatatgaaa gataactccaggtcactcaacaggtttaaatggtacagctcaggagttttgatgccaaat cccacattcttgccaatctaccactactatttacaccctcaaggatttttcaacttaata gaaccagaattaaaagaacactgcttaaaactaatggttgcagagaagaatggaacaatc cggttttatgatcttttggcccaacaggctattttatctcttgaatcagaacaagtgcca ttaatgtcagcacactggtgcttaaaaaacaccttcaaagttggagccgttgcaggaaat gattggttaatttgggatattactcggtccaggtggtccacaattagtgaaaatctgttt gcaaccactggttatcctggcaaaatggcaagccagtttcaaattcatcatttaggacac cctcaggtgccagctattcaacagaaaagaacggtggcttttctaaaccaatttgtggtg cacactgtacagttcctcaaccgcttttctacagtttgtgaggagctgttctggctgtat ctgtattctagtacttctttgaactctagagaaaagtttgttcatcaaaacctcatgaag tttaaaggttccaaggtgggcctggtgcttggaacccctggggtggacctggagccagag tatgtgggggtgggcctgggtcctggttgtactgtgactagcctggagcctgagtctgca ggagtggtcctgcagcctccgtccatgggggccaatctggcaatgcaatctactggggtg gtcccagactctgggtctgctggaacatacctagaccctgactcaccacagggatttgtc tggggccagcttggtgctggggcaagtgtgcagcctggggtcacaggggccagactggag cctgagattataggtgctggcctgaaactggcagacctttcacttcgtatccaacaaatt gaaacaactctcaatattttagatgcaaaggaattcctgggagttagagtccaaggtcgc acattaaatgaccatatggacaagttaaccttagttatttccatggccatagagtcatcc ccaaagttagccaaacatcgacaaatgcgtgtcattgatatcaggatgaactcagaaaag aagttgtcatctatcccaggcctagatgatgtcacagttgaagtatctcctttaaatgtc accagtgtcacaaatggagcacatcctgaagccacttcagagcaaccacagcagaacagt acacaagactctggactacaggaaagtgaagtatcagcagaaaatatcttaactgtagcc aaggatccaagatatgccagatatctcaaaatggttcaagtggtagcatttacttcaggc ttgctgtcactttgctgccgatggttgaatgccactacctgcctcctgccattaaggaat cctatcatctcaattagtactaaagagcccagttctacagcatcatatcctactgtgaat cctggcctgttgggtacagcttctgatgattttatcaaagatgttgatgtcaccctcagt gacatcctttattatcttaatcaagagagtgttctctgggcccttcctgggaaaaatggt caaaatcctacacatccattctatgaaatccagtgcagtgtcttagcttccctacttatt cctgaattttgcactaaaatagagctctcaaccaatactgatatagagaagatgaaagtt aagcatctgttttattcaaacctggaacctgctcctgcagatgaaacctcagaacccctc cttcttgtcaaaaaggggctggtcttcagccttgaagtccaggttggggctgccaaactc attgagaattgtctgagccctgtagctgacaattgggtgtttggctttgtacacactggt gaagacatcatccatggactacaacttcttggctatgagacctgtggtgtgtcacagctg ccatcctatgagaggtacatggagaagcatgggtttgtgcccttatggacggtcacagct gtgctcccagctgaaaaccagcatcaactgccagccatacaagggagccatcttgcatgt tccagcccagttgagccttcagataaatgcagccccagacaacagcatgtggggcagaag aaccacccagccgagcccagtcaactcacagaatcatga >gi568815586r:101913111_102161962|GENSCAN_predicted_peptide_5|516_aa MKAEIKMFFETNKNKDTTHQNLWDTFKAVCRGKFIALIAHKRKQERSKIDTLTSQLKELQ KQEQTHSKASRRQEIAKIRAQLKEIETQKTLQKINEPRSCFFERINKIDGPLVRLIKKKR EKNQIATIKNDKGDITTNPTEIQTIIRQYYKHLYANELENLEEMDQFLDTYTLPRLNQEE VESLNRPITGSEIEAIINSLPTKKIPGPDGFTAKFYQRTKDKNHMIISTDAEKAFHKIQQ PFRLKTLNKLGTDGKYLKIIRANYDKATANIILNGQKLEAFPLKTCTRQGCPLSPLLVNI VLEVLARAIRQEKEIKGIHLGKEEVKLSLFADDIIVYLENPIVSAQNLLKLISNFSSLRI QNQCAKLTSILIHQSQTESQIMSELSFTIATKRIKYLGTQLTRDVKDLFKENYKPLLNEI KEDTDKWKNIPCSWTGRINMVKMAILSKVIYRFNAIPMKLPMTFFTELEKTTLKFIWNQK RAHIAKTILSKKNKAGGITLPDFKLYYKATVTKTAW >gi568815586r:101913111_102161962|GENSCAN_predicted_CDS_5|1551_bp atgaaggcagaaataaagatgttctttgaaaccaataagaacaaagacacaacacaccag aatctctgggacacatttaaagcagtgtgtagagggaaatttatagcactaattgcccac aagagaaagcaggaaagatctaaaatcgacaccctaacatcacaattaaaagaactacag aagcaagagcaaacacattcaaaagctagcagaaggcaagaaatagctaagatcagagca caactgaaggaaatagagacacaaaaaaccctccaaaaaatcaatgaacccaggagctgt ttttttgaaaggatcaacaaaattgatggaccactagtaagactaataaagaagaaaaga gagaagaatcaaatagccacaataaaaaacgataaaggggatatcaccaccaatcccaca gaaatacaaactatcatcagacaatactataaacacctctatgcaaatgaactagaaaat ctagaagaaatggatcaattcctggacacatataccctcccaagactaaaccaggaagaa gttgaatccctgaatagaccaataacaggctctgaaattgaggcaataattaatagccta ccaaccaaaaaaattccaggaccagatggattcacagctaaattctaccagagaaccaaa gacaaaaaccacatgattatctcaacagatgcagaaaaggccttccacaaaattcaacag cccttcaggctaaaaactctcaataaactaggtactgatgggaagtatctcaaaataata agagctaattatgacaaagccacagccaatatcatactgaatgggcaaaaactggaagca ttccctttgaaaacttgcacaagacagggatgccctctctcaccactcctagtcaacata gtgttggaagttctggccagggcaatcaggcaggagaaagaaataaagggtattcattta ggaaaagaagaagtcaaattgtccctgtttgcagatgacattattgtatatttagaaaac cccatcgtctcagcccaaaatctccttaagctgataagcaacttcagcagtctcaggata caaaatcaatgtgcaaaactcacaagcattcttatacaccaatcacagacagagagccaa atcatgagtgaactctcattcacaattgctacaaagagaataaaatacctaggaacccaa cttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaata aaagaggacacagacaaatggaagaacattccatgctcatggacaggaagaatcaatatg gtgaaaatggccatactgtccaaggtaatttatagatttaatgccatccccatgaagcta ccaatgactttcttcacagaattggaaaaaactactttaaaattcatatggaaccaaaaa agagcccacattgccaagacaatcctaagcaagaagaacaaagctggaggcatcacgctt cctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtag >gi568815586r:101913111_102161962|GENSCAN_predicted_peptide_6|181_aa MRCMHKLQWLIHQVEERVSVIEDQINEMKREDKFREKRVKRNKQSLQEIWDYMKRPNLHL IGVPESDGENGTKLENTLQDIIQENFPNLAGQANIQIQEIQRTPQRYSSRRPTPRHIIIR FTKVELKEKMLRAAREKGQVTHKGKPIRLTADLSAETLQARRQWGPIFNILKEKNFQPRI S >gi568815586r:101913111_102161962|GENSCAN_predicted_CDS_6|546_bp atgagatgcatgcacaagcttcagtggctgattcatcaagtggaagaaagggtatcagtg attgaagatcaaattaatgaaatgaagcgagaagacaagtttagagaaaaaagagtaaaa agaaacaaacaaagcctccaagaaatatgggactatatgaaaagaccaaatttacatctg attggtgtacctgaaagtgacggggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaacctagcagggcaggccaacattcaaattcaggaaata cagagaactccacaaagatactcctcgagaagaccaactccaagacacataattatcaga ttcaccaaagttgaactgaaggaaaaaatgttaagggcagccagagagaaaggtcaggtt actcacaaagggaagcccatcagactaacagcagatctctcggcagaaacgctacaagcc agaagacagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt tcatag >gi568815586r:101913111_102161962|GENSCAN_predicted_peptide_7|161_aa MDVTDHYEDVRKIYDDFLKNSNMLDLIDVYQKCRALTSNCENYNTVSPGSTIQPPSERGS LLLSRPDLLAPGEDPPVSASLTLERQKGESLLAWSWSLNFRLPGIPITVQKKRRDLKKLA AVMRQVRQWESHPLQNIEEATWHQLTVQAEKKEPGTQGASV >gi568815586r:101913111_102161962|GENSCAN_predicted_CDS_7|486_bp atggacgtgactgaccattatgaggacgttaggaagatttatgatgatttcttgaagaac agtaatatgttagatctgattgatgtttatcaaaaatgtagggctttgacttctaattgt gaaaattataacacagtatctcctgggtcaactatacagccaccttcagaaagaggaagc ttgctgctgtctaggccagatctcttggctcctggggaagatccaccagtgtcagcctca ttgactctggagcggcagaagggagagtccctcttagcatggagctggtccctaaacttc agattgcctgggattcccatcacggtccagaagaagagacgagacctgaagaagctggca gctgtcatgcggcaagtgagacaatgggaaagccacccgcttcagaacattgaagaagcc acctggcaccaactcactgtccaagctgagaagaaggaaccagggacccaaggagcttca gtttaa >gi568815586r:101913111_102161962|GENSCAN_predicted_peptide_8|36_aa IPRNGIWGHAVSVIALLEAVGHGREPWNPATSVQLD >gi568815586r:101913111_102161962|GENSCAN_predicted_CDS_8|111_bp attccaaggaatggaatctggggccacgcagtgagtgttatagctctattagaagccgtg ggtcacggaagagaaccgtggaacccagcgactagtgttcagctcgattag