GENSCAN 1.0 Date run: 3-Nov-116 Time: 17:18:34 Sequence gi568815597r:23210457_23441008 : 230552 bp : 45.23% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 16587 16780 194 2 2 64 58 77 0.053 0.74 1.02 Intr + 22135 22223 89 1 2 110 80 15 0.144 2.51 1.03 Term + 34408 34775 368 2 2 -7 44 447 0.840 25.57 1.04 PlyA + 34872 34877 6 1.05 2.12 PlyA - 35074 35069 6 1.05 2.11 Term - 100610 99998 613 1 1 95 42 461 0.752 36.16 2.10 Intr - 100866 100745 122 0 2 70 66 95 0.999 4.79 2.09 Intr - 103246 103097 150 1 0 92 97 82 0.999 9.86 2.08 Intr - 108232 108027 206 2 2 53 94 166 0.999 12.62 2.07 Intr - 111207 111072 136 0 1 95 98 73 0.998 9.14 2.06 Intr - 113276 113100 177 2 0 24 84 96 0.747 2.92 2.05 Intr - 123175 123062 114 1 0 32 107 69 0.640 3.84 2.04 Intr - 127405 127298 108 1 0 83 93 45 0.924 4.98 2.03 Intr - 128152 128034 119 2 2 82 110 104 0.999 12.18 2.02 Intr - 130564 130396 169 1 1 55 75 109 0.037 5.82 2.01 Init - 134390 134328 63 2 0 80 37 99 0.066 3.16 2.00 Prom - 139987 139948 40 -2.66 3.29 PlyA - 143156 143151 6 1.05 3.28 Term - 152765 151513 1253 0 2 76 43 1045 0.995 90.55 3.27 Intr - 156712 156586 127 1 1 -14 97 101 0.825 1.05 3.26 Intr - 157609 157517 93 1 0 109 115 26 0.581 7.56 3.25 Intr - 171075 170987 89 1 2 112 100 59 0.716 9.19 3.24 Intr - 173961 173886 76 0 1 119 31 70 0.916 3.49 3.23 Intr - 176963 176817 147 2 0 95 110 245 0.988 27.83 3.22 Intr - 183565 183423 143 2 2 30 78 295 0.944 22.77 3.21 Intr - 184589 184392 198 0 0 81 100 30 0.696 2.82 3.20 Intr - 187145 187029 117 0 0 88 51 103 0.811 7.04 3.19 Intr - 187499 187336 164 1 2 91 93 154 0.997 15.82 3.18 Intr - 198270 198208 63 2 0 78 98 33 0.220 1.13 3.17 Intr - 206934 206793 142 1 1 86 66 148 0.917 11.81 3.16 Intr - 207553 207448 106 1 1 77 66 161 0.970 12.59 3.15 Intr - 208683 208621 63 0 0 108 75 76 0.960 7.21 3.14 Intr - 212378 212175 204 2 0 96 43 96 0.937 5.30 3.13 Intr - 214228 214109 120 1 0 42 89 162 0.354 12.39 3.12 Intr - 220669 220579 91 0 1 92 74 10 0.321 -0.00 3.11 Intr - 221462 221240 223 0 1 72 77 95 0.406 3.99 3.10 Intr - 222816 222621 196 0 1 85 77 93 0.942 6.89 3.09 Intr - 223237 222969 269 2 2 100 98 166 0.528 16.05 3.08 Intr - 223913 223798 116 1 2 106 75 180 0.999 18.59 3.07 Intr - 224162 224077 86 2 2 35 80 110 0.999 3.52 3.06 Intr - 225572 225395 178 1 1 49 94 192 0.978 15.82 3.05 Intr - 226198 226104 95 1 2 65 75 65 0.999 1.66 3.04 Intr - 226588 226455 134 2 2 103 73 164 0.999 16.76 3.03 Intr - 226864 226674 191 0 2 109 67 252 0.999 24.43 3.02 Intr - 227141 226968 174 1 0 86 69 191 0.762 16.05 3.01 Intr - 228378 228284 95 0 2 70 -23 163 0.480 2.16 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 130552 130396 157 1 1 55 75 117 0.924 7.17 S.002 Init + 134273 134403 131 1 2 110 49 140 0.855 9.95 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:23210457_23441008|GENSCAN_predicted_peptide_1|216_aa MPILWMNNLRLRGLKKLSPGYLGELQNNNNNNNNNKSHLLTGDYWIPGAVFRSTLHMKSP SPPVRQEECREEMRQFVQRTATVSMSQKEIYTTLRGGPQVPEEHVFAKKHNKKVLKKMQA NSAKAVSARAEAIEALVNPKEVKPKIPKGVSRKLDRLAYIAHPKLGKRAHARIVKGLRLC RPKAKAKDQMKAQATAAASVPVRAPRGAQAPTKASE >gi568815597r:23210457_23441008|GENSCAN_predicted_CDS_1|651_bp atgcccattttatggatgaacaatctgaggctcagaggactgaaaaagctctccccaggt tatctcggggagctgcaaaacaacaacaacaacaacaacaacaacaagagccacctgtta acaggggattactggataccaggtgctgtcttcagaagtactttgcatatgaagtctccc agtccacccgtgaggcaggaagaatgcagagaagagatgaggcaatttgtacaaaggaca gcaactgtatccatgtcccagaaggaaatctatacaaccttgaggggtggaccccaagtt cctgaggaacacgtgtttgccaagaagcacaacaagaaggtcctaaagaagatgcaggcc aacagtgccaaggccgtgagtgcacgtgccgaggctatcgaggccctcgtaaaccccaag gaggttaagcccaagatcccaaagggtgttagccgcaagcttgatcgacttgcctacatt gcccaccccaagcttgggaagcgtgctcatgcccgcattgtcaaggggctcaggctgtgc cggccaaaggccaaggccaaggatcaaatgaaggcccaggctacagctgcagcttcagtt ccagttcgggctcccagaggtgcccaggcccctacaaaggcttcagagtag >gi568815597r:23210457_23441008|GENSCAN_predicted_peptide_2|658_aa MTERGPNAWLWACLGGTPAAGQHNKMANQVNGNAVQLKEEEEPMDTSSVTHTEHYKTLIE AGLPQKVAERLDEIFQTGLVAYVDLDERAIDALREFNEEGALSVLQQFKESDLSHVQNKS AFLCGVMKTYRQREKQGSKVQESTKGPDEAKIKALLERTGYTLDVTTGQRKYGGPPPDSV YSGVQPGIGTEVFVGKIPRDLYEDELVPLFEKAGPIWDLRLMMDPLSGQNRGYAFITFCG KEAAQEAVKLCDSYEIRPGKHLGVCISVANNRLFVGSIPKNKTKENILEEFSKVTEGLVD VILYHQPDDKKKNRGFCFLEYEDHKSAAQARRRLMSGKVKVWGNVVTVEWADPVEEPDPE VMAKVKVLFVRNLATTVTEEILEKSFSEFGKLERVKKLKDYAFVHFEDRGAAVKAMDEMN GKEIEGEEIEIVLAKPPDKKRKERQAARQASRSTAYEDYYYHPPPRMPPPIRGRGRGGGR GGYGYPPDYYGYEDYYDDYYGYDYHDYRGGYEDPYYGYDDGYAVRGRGGGRGGRGAPPPP RGRGAPPPRGRAGYSQRGAPLGPPRGSRGGRGGPAQQQRGRGSRGSRGNRGGNVGGKRKA DGYNQPDSKRRQTNNQQNWGSQPIAQQPLQQGGDYSGNYGYNNDNQEFYQDTYGQQWK >gi568815597r:23210457_23441008|GENSCAN_predicted_CDS_2|1977_bp atgaccgaaagaggcccgaacgcgtggctatgggcgtgtctggggggaacgccggccgcg gggcagcataataaaatggctaatcaggtgaatggtaatgcggtacagttaaaagaagag gaagaaccaatggatacttccagtgtaactcacacagaacactacaagacactgatagag gcaggcctcccacagaaggtggcagaaagacttgatgaaatatttcagacaggattggta gcttatgtcgatcttgatgaaagagcaattgatgctctcagggaatttaatgaagaagga gctctgtctgtactacagcagttcaaggaaagtgacttatcacatgttcagaacaaaagt gcatttttatgtggagttatgaagacctacaggcagagagagaaacaggggagcaaggtg caagagtccacaaagggacctgatgaagcgaagatcaaggccttgcttgagagaactggt tatactctggatgtaaccacaggacagaggaagtatggtggtcctccaccagacagtgtg tactctggcgtgcaacctggaattggaacggaggtatttgtaggcaaaataccaagggat ttatatgaggatgagttggtgcccctttttgagaaggccggacccatttgggatctacgt cttatgatggatccactgtccggtcagaatagagggtatgcatttatcaccttctgtgga aaggaagctgcacaggaagccgtgaaactgtgtgacagctatgaaattcgccctggtaaa caccttggagtgtgcatttctgtggcaaacaacagactttttgttggatccattccgaag aataagactaaagaaaacattttggaagaattcagtaaagtcacagagggtttggtggac gttattctctatcatcaacccgatgacaaaaagaagaatcgggggttctgcttccttgaa tatgaggatcacaagtcagcagcacaagccagacgccggctgatgagtggaaaagtaaaa gtgtggggaaatgtagttacagttgaatgggctgaccctgtggaagaaccagatccagaa gtcatggctaaggtaaaagttttgtttgtgagaaacttggctactacggtgacagaagaa atattggaaaagtcattttctgaatttggaaaactcgaaagagtaaagaagttgaaagat tatgcatttgttcattttgaagacagaggagcagctgttaaggctatggatgaaatgaat ggcaaagaaatagaaggggaagaaattgaaatagtcttagccaagccaccagacaagaaa aggaaagagcgccaagctgctagacaggcctccagaagcactgcgtatgaagattattac taccaccctcctcctcgcatgccacctccaattagaggtcggggtcgtggtggggggaga ggtggatatggctaccctccagattactacggctatgaagattactatgatgattactat ggttatgattatcacgactatcgtggaggctatgaagatccctactacggctatgatgat ggctatgcagtaagaggaagaggaggaggaaggggagggcgaggtgctccaccaccacca agggggaggggagcaccacctccaagaggtagagctggctattcacagaggggggcacct ttgggaccaccaagaggctctaggggtggcagagggggtcctgctcaacagcagagaggc cgtggttcccgtggatctcggggcaatcgtgggggcaatgtaggaggcaagagaaaggca gatgggtacaaccagcctgattccaagcgtcgtcagaccaacaaccaacagaactggggt tcccaacccatcgctcagcagccgcttcagcaaggtggtgactattctggtaactatggt tacaataatgacaaccaggaattttatcaggatacttatgggcaacagtggaagtag >gi568815597r:23210457_23441008|GENSCAN_predicted_peptide_3|1650_aa INRPPVKLTLLTCQVRPNPEEKKCFDLVTRECLRVRLVRDTSPCGPSGFGGGSLGSGKAS SISVGQKPPSLPTDNRTYHFQAEDEHECEAWVSVLQNSKDEALSSAFLGEPSAGPGSWGS AGHDGEPHDLTKLLIAEVKSRPGNSQCCDCGAADPTWLSTNLGVLTCIQCSGVHRELGVR FSRMQSLTLDLLGPSELLLALNMGNTSFNEVMEAQLPSHGGPKPSAESDMGTRRDYIMAK YVEHRFARRCTPEPQRLWTAICNRDLLSVLEAFANGQDFGQPLPGPDAQAPEELVLHLAV KVANQASLPLVDFIIQNGGHLDAKAADGNTALHYAALYNQPDCLKLLLKGRALVGTVNEA GETALDIARKKHHKECEELVRTRGGKGPLTLSLSLLSPQPSEPPSLCFWQLEQAQAGTFA FPLHVDYSWVISTEPGSDSEEDEEEKRCLLKLPAQAHWASGRLDISNKTYETVASLGAAT PQGESEDCPPPLPVKNSSRTLVQGCARHASGDRSEVSSLSSEAPETPESLGSPASSSSLM SPLEPGDPSQAPPNSEEGLREPPGTSRPSLTSGTTPSEMYLPVRFSSESTRSYRRGARSP EDGPSARQPLPRRNVPALLLPLCPSPRRASRANMGQEEELLRIAKKLEKMVARKNTAKAT QPSLSGPRVSSSVKWEEEMVSFGDGFLWKCPELSPEKMQVWPQGPCSMEPVYSHVCAWAL GSVWEGALDLLKKLHSCQMSIQLLQTTRIGVAVNGVRKHCSDKEVVSLAKVLIKNWKRLL DSPGPPKGEKGEEREKAKKKEKGLECSDWKPEAGLSPPRKKREDPKTRRDSVDSKSSASS SPKRPSVERSNSSKSKAESPKTPSSPLTPTFASSMCLLAPCYLTGDSVRDKCVEMLSAAL KADDDYKDYGVNCDKMASEIEDHILELCRGCGCLHRLAAPIQVSLLYMALPSNSFVFLIF FGSGDQENEAVFPELLQRVEGVGGSEACLGCSVWTETLWLVARDDFLPELKSTDMKYRNR VRSRISNLKDPRNPGLRRNVLSGAISAGLIAKMTAEEMASDELRELRNAMTQEAIREHQM AKTGGTTTDLFQCSKCKKKNCTYNQVQTRSADEPMTTFVLCNECGNRWKVSSLEDAEKPF KFCCLSGFVVLLMEQPAMNKRCLLAAFPALSPWRRTLVLEMAATLLMAGSQAPVTFEDMA MYLTREEWRPLDAAQRDLYRDVMQENYGNVVSLDFEIRSENEVNPKQEISEDVQFGTTSE RPAENAEENPESEEGFESGDRSERQWGDLTAEEWVSYPLQPVTDLLVHKEVHTGIRYHIC SHCGKAFSQISDLNRHQKTHTGDRPYKCYECGKGFSRSSHLIQHQRTHTGERPYDCNECG KSFGRSSHLIQHQTIHTGEKPHKCNECGKSFCRLSHLIQHQRTHSGEKPYECEECGKSFS RSSHLAQHQRTHTGEKPYECNECGRGFSERSDLIKHYRVHTGERPYKCDECGKNFSQNSD LVRHRRAHTGEKPYHCNECGENFSRISHLVQHQRTHTGEKPYECNACGKSFSRSSHLITH QKIHTGEKPYECNECWRSFGERSDLIKHQRTHTGEKPYECVQCGKGFTQSSNLITHQRVH TGEKPYECTECEKSFSRSSALIKHKRVHTD >gi568815597r:23210457_23441008|GENSCAN_predicted_CDS_3|4953_bp ataaaccggcccccggtgaagctgaccctgctgacgtgccaagtgaggccaaaccctgag gagaaaaagtgcttcgacctggtgacccgtgagtgcttgagagtccgacttgtcagggac acatctccatgtggccccagcggctttggtgggggctccctgggctccgggaaggccagc agcatttctgtgggtcagaagcccccttctctcccgacagacaaccggacgtaccacttt caggcagaggacgagcacgagtgtgaggcgtgggtgtcagtgttgcagaacagcaaggac gaagccctgagcagcgccttcctcggggagcccagcgctggcccggggtcctgggggtcc gccggccatgatggggagccgcacgacctcacaaagctgctcatcgcggaggtgaagagc aggcctgggaatagccagtgctgcgactgcggggctgcagaccccacgtggctcagcacc aacctgggcgtgctcacctgcatccagtgctcgggcgtccaccgcgaactgggcgtgcgc ttttcgcgcatgcagtcactcaccttggacctgctgggcccctccgagttgttgctggcc ttgaacatgggaaacacgagcttcaatgaggtcatggaggcccagctaccctcacacggc ggccctaaaccctcagctgagagtgacatgggcacccgcagggactacattatggccaag tatgtggagcataggtttgcacgccggtgcacacctgagcctcagcgactctggacagcc atttgcaacagggacctcctgtcggtactggaggcctttgccaatgggcaggactttgga cagccgctgccagggcctgatgcacaggcacctgaagaactcgtcttgcatttggctgtc aaagtcgccaaccaggcttccctgcctctggtggatttcatcatccagaacggtggtcac ctggatgccaaggctgctgacgggaacacggctctgcactacgcagcactctacaaccag cccgactgcctcaagctgctgctgaaggggagagctttggttggcacagtaaatgaagca ggcgagacagctctggacatagccaggaagaagcaccacaaggagtgtgaggagctggtg aggactcggggaggcaaggggcccctgactctttctctctccctgctgagcccccaaccc tctgaaccaccaagcctttgtttttggcagctggagcaggcccaggcggggacctttgcc ttccctctacatgtggactactcctgggtaatttccacagagcctggctctgacagtgag gaggatgaggaagagaagcgctgcttgctgaagctcccggcccaggctcactgggccagt gggaggctggacatcagcaacaagacctatgagactgtcgccagcctgggagcagccacc cctcagggcgagagtgaggactgtcccccgcccttgccagtcaaaaactcttctcggact ttggtccaagggtgtgcaagacatgccagtggagatcgttctgaagtctccagcctgagt tcagaggcccctgagacccctgagagcctgggcagtccagcctcctcctccagtctgatg agccccttggaacctggggatcccagccaagccccacccaactctgaagagggcctccga gagcccccaggcacctccagacccagcctgacatccgggaccaccccttcggagatgtac ctccccgtcagattcagctccgagagcactcgctcctatcggcggggggcgcggagccct gaagatggtccctcagccaggcagcctctgcccagaaggaacgtgccggccctactgctg cccctgtgcccctcgccccgccgggcgtcgcgggccaacatgggccaggaagaggagctg ctgaggatcgccaaaaagctggagaagatggtggccaggaagaacacggcaaaagccact caaccttccctttctggacctcgagtttcctcttctgtgaaatgggaagaggaaatggtg tcttttggagatggattcctgtggaagtgcccagagctgtcccctgaaaagatgcaagtc tggccgcagggtccctgcagcatggagccagtttacagccacgtctgcgcctgggctctc ggcagtgtctgggaaggggccctggaccttctgaagaagctgcacagctgccagatgtcc atccagctactacagacaaccaggattggagttgctgttaatggggtccgcaagcactgc tcagacaaggaggtggtgtccttggccaaagtccttatcaaaaactggaagcggctgcta gactcccctggacccccaaaaggagaaaaaggagaggaaagagaaaaggcaaagaagaag gaaaaagggcttgagtgttcagactggaagccagaagcaggcctttctccaccaaggaaa aaacgagaagaccccaaaaccaggagagactctgtggactccaagtcttctgcctcctcc tctccaaaaagaccatcggtggaaagatcaaacagcagcaaatcaaaagcggagagcccc aaaacacctagcagccccttgacccccacgtttgcctcttccatgtgtctcctggccccc tgctatctcacaggggactctgtccgggacaagtgtgtggagatgctgtcagcagccctg aaggcggacgatgattacaaggactatggagtcaactgtgacaagatggcatcagaaatc gaagatcatatccttgaactgtgccggggctgtgggtgtctgcaccgtctagcagcaccc atccaagtctctttattgtatatggctcttcccagcaacagctttgtcttcctgatcttc tttggctcaggagaccaagaaaatgaggctgtgttccccgagctcctccagagggtggag ggagtgggaggctcggaagcctgcctaggatgcagtgtttggacagaaactctgtggttg gtggccagggatgacttcctgccggagctcaagagcacggacatgaagtaccggaaccgc gtgcgcagccgcataagcaacctcaaggaccccaggaaccccggcctgcggcggaacgtg ctcagtggggccatctccgcagggcttatagccaagatgacggcagaggaaatggccagt gatgaactgagggagttgaggaatgccatgacccaggaggccatccgtgagcaccagatg gccaagactggcggcaccaccactgacctcttccagtgcagcaaatgcaagaagaagaac tgcacctataaccaggtgcagacacgcagtgctgatgagcccatgactacctttgtctta tgcaatgaatgtggcaatcgctggaaggtctcctccttggaggatgccgagaaacctttc aaattttgttgcctttctggttttgtagttctgctgatggaacagccagccatgaacaag cgctgtctgctagctgcttttcctgctctctctccctggaggcgaacccttgtgctcgag atggcagccaccctgctcatggctgggtcccaggcacctgtgacgtttgaagatatggcc atgtatctcacccgggaagaatggagacctctggacgctgcacagagggacctttaccgg gatgttatgcaggagaattatggaaatgttgtctcactagattttgagatcaggagtgag aacgaggtaaatcccaagcaagagattagtgaagatgtacaatttgggactacatctgaa agacctgctgagaatgctgaggaaaatcctgaaagtgaagagggctttgaaagcggagat aggtcagaaagacaatggggagatttaacagcagaagagtgggtaagctatcctctccaa ccagtcactgatctacttgtccacaaagaagtccacacaggcatccgctatcatatatgt tctcattgtggaaaggccttcagtcagatctcagaccttaatcgacatcagaagacccac actggagacagaccctataaatgttatgaatgtggaaaaggcttcagtcgcagctcacac cttattcagcatcaaagaacacatactggggagaggccttatgactgtaacgagtgtggg aaaagttttggaagaagttctcacctgattcagcatcagacaatccacactggagagaag cctcacaaatgtaatgagtgtggaaaaagtttctgccgtctctctcacctaatccaacac caaaggacccacagtggtgagaaaccctatgagtgtgaggagtgtgggaaaagcttcagc cggagctctcacctagctcagcaccagaggacccacacgggtgagaaaccttatgaatgt aacgaatgtggccgaggcttcagtgagagatctgatctcatcaaacactatcgagtccac acaggggagaggccctacaagtgtgatgagtgtgggaagaatttcagtcagaactccgac cttgtgcgtcatcgcagagcccacacgggagagaagccataccactgtaacgaatgtggg gaaaatttcagccgcatctcacacttggttcagcaccagagaactcacactggagagaag ccatatgaatgcaatgcttgtgggaaaagcttcagccggagctctcatctcatcacacac cagaaaattcacactggagagaagccttatgagtgtaatgagtgttggcgaagctttggt gaaaggtcagatctaattaaacatcagagaacccacacaggggagaagccctacgagtgt gtgcagtgtgggaaaggtttcacccagagctccaacctcatcacacatcaaagagttcac acgggagagaaaccttatgaatgtaccgaatgtgagaagagtttcagcaggagctcagct cttattaaacataagagagttcatacggactaa