GENSCAN 1.0 Date run: 3-Nov-116 Time: 19:07:54 Sequence gi568815597r:23261972_23467171 : 205200 bp : 44.98% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.12 PlyA - 2696 2691 6 1.05 1.11 Term - 49095 48483 613 2 1 95 42 461 0.752 36.16 1.10 Intr - 49351 49230 122 1 2 70 66 95 0.999 4.79 1.09 Intr - 51731 51582 150 2 0 92 97 82 0.999 9.86 1.08 Intr - 56717 56512 206 0 2 53 94 166 0.999 12.62 1.07 Intr - 59692 59557 136 1 1 95 98 73 0.998 9.14 1.06 Intr - 61761 61585 177 0 0 24 84 96 0.747 2.92 1.05 Intr - 71660 71547 114 2 0 32 107 69 0.640 3.84 1.04 Intr - 75890 75783 108 2 0 83 93 45 0.924 4.98 1.03 Intr - 76637 76519 119 0 2 82 110 104 0.999 12.18 1.02 Intr - 79049 78881 169 2 1 55 75 109 0.037 5.82 1.01 Init - 82875 82813 63 0 0 80 37 99 0.066 3.16 1.00 Prom - 88472 88433 40 -2.66 2.40 PlyA - 91641 91636 6 1.05 2.39 Term - 101250 99998 1253 1 2 76 43 1045 0.995 90.55 2.38 Intr - 105197 105071 127 2 1 -14 97 101 0.825 1.05 2.37 Intr - 106094 106002 93 2 0 109 115 26 0.581 7.56 2.36 Intr - 119560 119472 89 2 2 112 100 59 0.716 9.19 2.35 Intr - 122446 122371 76 1 1 119 31 70 0.916 3.49 2.34 Intr - 125448 125302 147 0 0 95 110 245 0.988 27.83 2.33 Intr - 132050 131908 143 0 2 30 78 295 0.944 22.77 2.32 Intr - 133074 132877 198 1 0 81 100 30 0.696 2.82 2.31 Intr - 135630 135514 117 1 0 88 51 103 0.811 7.04 2.30 Intr - 135984 135821 164 2 2 91 93 154 0.997 15.82 2.29 Intr - 146755 146693 63 0 0 78 98 33 0.220 1.13 2.28 Intr - 155419 155278 142 2 1 86 66 148 0.917 11.81 2.27 Intr - 156038 155933 106 2 1 77 66 161 0.970 12.59 2.26 Intr - 157168 157106 63 1 0 108 75 76 0.960 7.21 2.25 Intr - 160863 160660 204 0 0 96 43 96 0.937 5.30 2.24 Intr - 162713 162594 120 2 0 42 89 162 0.354 12.39 2.23 Intr - 169154 169064 91 1 1 92 74 10 0.321 -0.00 2.22 Intr - 169947 169725 223 1 1 72 77 95 0.406 3.99 2.21 Intr - 171301 171106 196 1 1 85 77 93 0.942 6.89 2.20 Intr - 171722 171454 269 0 2 100 98 166 0.528 16.05 2.19 Intr - 172398 172283 116 2 2 106 75 180 0.999 18.59 2.18 Intr - 172647 172562 86 0 2 35 80 110 0.999 3.52 2.17 Intr - 174057 173880 178 2 1 49 94 192 0.978 15.82 2.16 Intr - 174683 174589 95 2 2 65 75 65 0.999 1.66 2.15 Intr - 175073 174940 134 0 2 103 73 164 0.999 16.76 2.14 Intr - 175349 175159 191 1 2 109 67 252 0.999 24.43 2.13 Intr - 175626 175453 174 2 0 86 69 191 0.779 16.05 2.12 Intr - 176863 176769 95 1 2 70 -23 163 0.528 2.16 2.11 Intr - 177259 177190 70 0 1 65 89 28 0.831 -0.22 2.10 Intr - 179246 179131 116 2 2 15 114 97 0.693 4.25 2.09 Intr - 179502 179416 87 0 0 78 60 107 0.991 7.07 2.08 Intr - 179759 179684 76 1 1 39 92 99 0.990 4.92 2.07 Intr - 180300 180215 86 0 2 79 58 77 0.989 2.42 2.06 Intr - 180641 180530 112 1 1 70 94 97 0.992 8.88 2.05 Intr - 189557 189508 50 2 2 107 101 45 0.982 5.18 2.04 Intr - 190800 190726 75 0 0 73 109 33 0.888 3.61 2.03 Intr - 194055 193910 146 1 2 105 81 205 0.955 21.50 2.02 Intr - 194223 194151 73 0 1 77 111 67 0.996 6.88 2.01 Init - 198815 198783 33 2 0 49 91 54 0.311 1.67 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 79037 78881 157 2 1 55 75 117 0.924 7.17 S.002 Init + 82758 82888 131 2 2 110 49 140 0.855 9.95 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:23261972_23467171|GENSCAN_predicted_peptide_1|658_aa MTERGPNAWLWACLGGTPAAGQHNKMANQVNGNAVQLKEEEEPMDTSSVTHTEHYKTLIE AGLPQKVAERLDEIFQTGLVAYVDLDERAIDALREFNEEGALSVLQQFKESDLSHVQNKS AFLCGVMKTYRQREKQGSKVQESTKGPDEAKIKALLERTGYTLDVTTGQRKYGGPPPDSV YSGVQPGIGTEVFVGKIPRDLYEDELVPLFEKAGPIWDLRLMMDPLSGQNRGYAFITFCG KEAAQEAVKLCDSYEIRPGKHLGVCISVANNRLFVGSIPKNKTKENILEEFSKVTEGLVD VILYHQPDDKKKNRGFCFLEYEDHKSAAQARRRLMSGKVKVWGNVVTVEWADPVEEPDPE VMAKVKVLFVRNLATTVTEEILEKSFSEFGKLERVKKLKDYAFVHFEDRGAAVKAMDEMN GKEIEGEEIEIVLAKPPDKKRKERQAARQASRSTAYEDYYYHPPPRMPPPIRGRGRGGGR GGYGYPPDYYGYEDYYDDYYGYDYHDYRGGYEDPYYGYDDGYAVRGRGGGRGGRGAPPPP RGRGAPPPRGRAGYSQRGAPLGPPRGSRGGRGGPAQQQRGRGSRGSRGNRGGNVGGKRKA DGYNQPDSKRRQTNNQQNWGSQPIAQQPLQQGGDYSGNYGYNNDNQEFYQDTYGQQWK >gi568815597r:23261972_23467171|GENSCAN_predicted_CDS_1|1977_bp atgaccgaaagaggcccgaacgcgtggctatgggcgtgtctggggggaacgccggccgcg gggcagcataataaaatggctaatcaggtgaatggtaatgcggtacagttaaaagaagag gaagaaccaatggatacttccagtgtaactcacacagaacactacaagacactgatagag gcaggcctcccacagaaggtggcagaaagacttgatgaaatatttcagacaggattggta gcttatgtcgatcttgatgaaagagcaattgatgctctcagggaatttaatgaagaagga gctctgtctgtactacagcagttcaaggaaagtgacttatcacatgttcagaacaaaagt gcatttttatgtggagttatgaagacctacaggcagagagagaaacaggggagcaaggtg caagagtccacaaagggacctgatgaagcgaagatcaaggccttgcttgagagaactggt tatactctggatgtaaccacaggacagaggaagtatggtggtcctccaccagacagtgtg tactctggcgtgcaacctggaattggaacggaggtatttgtaggcaaaataccaagggat ttatatgaggatgagttggtgcccctttttgagaaggccggacccatttgggatctacgt cttatgatggatccactgtccggtcagaatagagggtatgcatttatcaccttctgtgga aaggaagctgcacaggaagccgtgaaactgtgtgacagctatgaaattcgccctggtaaa caccttggagtgtgcatttctgtggcaaacaacagactttttgttggatccattccgaag aataagactaaagaaaacattttggaagaattcagtaaagtcacagagggtttggtggac gttattctctatcatcaacccgatgacaaaaagaagaatcgggggttctgcttccttgaa tatgaggatcacaagtcagcagcacaagccagacgccggctgatgagtggaaaagtaaaa gtgtggggaaatgtagttacagttgaatgggctgaccctgtggaagaaccagatccagaa gtcatggctaaggtaaaagttttgtttgtgagaaacttggctactacggtgacagaagaa atattggaaaagtcattttctgaatttggaaaactcgaaagagtaaagaagttgaaagat tatgcatttgttcattttgaagacagaggagcagctgttaaggctatggatgaaatgaat ggcaaagaaatagaaggggaagaaattgaaatagtcttagccaagccaccagacaagaaa aggaaagagcgccaagctgctagacaggcctccagaagcactgcgtatgaagattattac taccaccctcctcctcgcatgccacctccaattagaggtcggggtcgtggtggggggaga ggtggatatggctaccctccagattactacggctatgaagattactatgatgattactat ggttatgattatcacgactatcgtggaggctatgaagatccctactacggctatgatgat ggctatgcagtaagaggaagaggaggaggaaggggagggcgaggtgctccaccaccacca agggggaggggagcaccacctccaagaggtagagctggctattcacagaggggggcacct ttgggaccaccaagaggctctaggggtggcagagggggtcctgctcaacagcagagaggc cgtggttcccgtggatctcggggcaatcgtgggggcaatgtaggaggcaagagaaaggca gatgggtacaaccagcctgattccaagcgtcgtcagaccaacaaccaacagaactggggt tcccaacccatcgctcagcagccgcttcagcaaggtggtgactattctggtaactatggt tacaataatgacaaccaggaattttatcaggatacttatgggcaacagtggaagtag >gi568815597r:23261972_23467171|GENSCAN_predicted_peptide_2|1958_aa MNKADVNIRVQILEGDQAILQRIKKAVRAIHSSGLGHVENEEQYREAVESLGNSHLSQNS HELSTGFLNLAVFTREVAALFKNLIQNLNNIVSFPLDSLMKGQLRDGRQDSKKQLEKAWK DYEAKMAKLEKERDRARVTGGIPGEVAQDMQRERRIFQLHMCEYLLKAGESQMKQGPDFL QSLIKFFHAQHNFFQDGWKAAQSLFPFIEKLAASVHALHQAQEDELQKLTQLRDSLRGTL QLESREGQEHLSRKNSGCGYSIHQHQGNKQFGTEKVGFLYKKSDGIRRVWQKRKCGVKYG CLTISHSTINRPPVKLTLLTCQVRPNPEEKKCFDLVTRECLRVRLVRDTSPCGPSGFGGG SLGSGKASSISVGQKPPSLPTDNRTYHFQAEDEHECEAWVSVLQNSKDEALSSAFLGEPS AGPGSWGSAGHDGEPHDLTKLLIAEVKSRPGNSQCCDCGAADPTWLSTNLGVLTCIQCSG VHRELGVRFSRMQSLTLDLLGPSELLLALNMGNTSFNEVMEAQLPSHGGPKPSAESDMGT RRDYIMAKYVEHRFARRCTPEPQRLWTAICNRDLLSVLEAFANGQDFGQPLPGPDAQAPE ELVLHLAVKVANQASLPLVDFIIQNGGHLDAKAADGNTALHYAALYNQPDCLKLLLKGRA LVGTVNEAGETALDIARKKHHKECEELVRTRGGKGPLTLSLSLLSPQPSEPPSLCFWQLE QAQAGTFAFPLHVDYSWVISTEPGSDSEEDEEEKRCLLKLPAQAHWASGRLDISNKTYET VASLGAATPQGESEDCPPPLPVKNSSRTLVQGCARHASGDRSEVSSLSSEAPETPESLGS PASSSSLMSPLEPGDPSQAPPNSEEGLREPPGTSRPSLTSGTTPSEMYLPVRFSSESTRS YRRGARSPEDGPSARQPLPRRNVPALLLPLCPSPRRASRANMGQEEELLRIAKKLEKMVA RKNTAKATQPSLSGPRVSSSVKWEEEMVSFGDGFLWKCPELSPEKMQVWPQGPCSMEPVY SHVCAWALGSVWEGALDLLKKLHSCQMSIQLLQTTRIGVAVNGVRKHCSDKEVVSLAKVL IKNWKRLLDSPGPPKGEKGEEREKAKKKEKGLECSDWKPEAGLSPPRKKREDPKTRRDSV DSKSSASSSPKRPSVERSNSSKSKAESPKTPSSPLTPTFASSMCLLAPCYLTGDSVRDKC VEMLSAALKADDDYKDYGVNCDKMASEIEDHILELCRGCGCLHRLAAPIQVSLLYMALPS NSFVFLIFFGSGDQENEAVFPELLQRVEGVGGSEACLGCSVWTETLWLVARDDFLPELKS TDMKYRNRVRSRISNLKDPRNPGLRRNVLSGAISAGLIAKMTAEEMASDELRELRNAMTQ EAIREHQMAKTGGTTTDLFQCSKCKKKNCTYNQVQTRSADEPMTTFVLCNECGNRWKVSS LEDAEKPFKFCCLSGFVVLLMEQPAMNKRCLLAAFPALSPWRRTLVLEMAATLLMAGSQA PVTFEDMAMYLTREEWRPLDAAQRDLYRDVMQENYGNVVSLDFEIRSENEVNPKQEISED VQFGTTSERPAENAEENPESEEGFESGDRSERQWGDLTAEEWVSYPLQPVTDLLVHKEVH TGIRYHICSHCGKAFSQISDLNRHQKTHTGDRPYKCYECGKGFSRSSHLIQHQRTHTGER PYDCNECGKSFGRSSHLIQHQTIHTGEKPHKCNECGKSFCRLSHLIQHQRTHSGEKPYEC EECGKSFSRSSHLAQHQRTHTGEKPYECNECGRGFSERSDLIKHYRVHTGERPYKCDECG KNFSQNSDLVRHRRAHTGEKPYHCNECGENFSRISHLVQHQRTHTGEKPYECNACGKSFS RSSHLITHQKIHTGEKPYECNECWRSFGERSDLIKHQRTHTGEKPYECVQCGKGFTQSSN LITHQRVHTGEKPYECTECEKSFSRSSALIKHKRVHTD >gi568815597r:23261972_23467171|GENSCAN_predicted_CDS_2|5877_bp atgaataaagctgatgtaaacatccgtgtgcagatcttggaaggagaccaagccatcctg cagagaataaagaaggctgtgcgggcaatccatagctccggccttggccatgtggagaat gaagagcagtaccgagaggccgtggaatccttaggcaacagccacctgtcccagaacagc catgagctgtccacaggcttcctaaacttggccgtgttcacccgcgaggttgctgcgctc ttcaagaacctgattcagaacttgaacaacattgtctctttccccctggacagtctgatg aaggggcagctgagggacggtcgacaggattccaaaaaacagctggagaaggcatggaag gactatgaagccaaaatggccaagctggagaaggagcgcgatcgggccagggtgacagga gggatccctggggaggtggcccaggacatgcagagagagcggcgcatcttccagctgcac atgtgtgagtatctgctcaaagccggggagagccagatgaagcaaggtcctgacttcctt cagagcctcatcaagttcttccacgcccagcacaactttttccaagatggctggaaggct gcccagagcctgttccccttcatcgagaagctggcggcctcagtacatgcactccatcag gcccaggaggacgagctacagaagctgacccagctccgggactccctccgagggacactg cagcttgagagcagagagggacaggaacacctgagccggaagaactcaggatgtggctat agcatccaccagcaccaaggcaacaagcagtttgggacggagaaagtgggctttctatac aagaaaagtgacggaattcgaagagtctggcagaaaaggaagtgtggagtcaagtatggc tgcctgaccatctcacacagcacgataaaccggcccccggtgaagctgaccctgctgacg tgccaagtgaggccaaaccctgaggagaaaaagtgcttcgacctggtgacccgtgagtgc ttgagagtccgacttgtcagggacacatctccatgtggccccagcggctttggtgggggc tccctgggctccgggaaggccagcagcatttctgtgggtcagaagcccccttctctcccg acagacaaccggacgtaccactttcaggcagaggacgagcacgagtgtgaggcgtgggtg tcagtgttgcagaacagcaaggacgaagccctgagcagcgccttcctcggggagcccagc gctggcccggggtcctgggggtccgccggccatgatggggagccgcacgacctcacaaag ctgctcatcgcggaggtgaagagcaggcctgggaatagccagtgctgcgactgcggggct gcagaccccacgtggctcagcaccaacctgggcgtgctcacctgcatccagtgctcgggc gtccaccgcgaactgggcgtgcgcttttcgcgcatgcagtcactcaccttggacctgctg ggcccctccgagttgttgctggccttgaacatgggaaacacgagcttcaatgaggtcatg gaggcccagctaccctcacacggcggccctaaaccctcagctgagagtgacatgggcacc cgcagggactacattatggccaagtatgtggagcataggtttgcacgccggtgcacacct gagcctcagcgactctggacagccatttgcaacagggacctcctgtcggtactggaggcc tttgccaatgggcaggactttggacagccgctgccagggcctgatgcacaggcacctgaa gaactcgtcttgcatttggctgtcaaagtcgccaaccaggcttccctgcctctggtggat ttcatcatccagaacggtggtcacctggatgccaaggctgctgacgggaacacggctctg cactacgcagcactctacaaccagcccgactgcctcaagctgctgctgaaggggagagct ttggttggcacagtaaatgaagcaggcgagacagctctggacatagccaggaagaagcac cacaaggagtgtgaggagctggtgaggactcggggaggcaaggggcccctgactctttct ctctccctgctgagcccccaaccctctgaaccaccaagcctttgtttttggcagctggag caggcccaggcggggacctttgccttccctctacatgtggactactcctgggtaatttcc acagagcctggctctgacagtgaggaggatgaggaagagaagcgctgcttgctgaagctc ccggcccaggctcactgggccagtgggaggctggacatcagcaacaagacctatgagact gtcgccagcctgggagcagccacccctcagggcgagagtgaggactgtcccccgcccttg ccagtcaaaaactcttctcggactttggtccaagggtgtgcaagacatgccagtggagat cgttctgaagtctccagcctgagttcagaggcccctgagacccctgagagcctgggcagt ccagcctcctcctccagtctgatgagccccttggaacctggggatcccagccaagcccca cccaactctgaagagggcctccgagagcccccaggcacctccagacccagcctgacatcc gggaccaccccttcggagatgtacctccccgtcagattcagctccgagagcactcgctcc tatcggcggggggcgcggagccctgaagatggtccctcagccaggcagcctctgcccaga aggaacgtgccggccctactgctgcccctgtgcccctcgccccgccgggcgtcgcgggcc aacatgggccaggaagaggagctgctgaggatcgccaaaaagctggagaagatggtggcc aggaagaacacggcaaaagccactcaaccttccctttctggacctcgagtttcctcttct gtgaaatgggaagaggaaatggtgtcttttggagatggattcctgtggaagtgcccagag ctgtcccctgaaaagatgcaagtctggccgcagggtccctgcagcatggagccagtttac agccacgtctgcgcctgggctctcggcagtgtctgggaaggggccctggaccttctgaag aagctgcacagctgccagatgtccatccagctactacagacaaccaggattggagttgct gttaatggggtccgcaagcactgctcagacaaggaggtggtgtccttggccaaagtcctt atcaaaaactggaagcggctgctagactcccctggacccccaaaaggagaaaaaggagag gaaagagaaaaggcaaagaagaaggaaaaagggcttgagtgttcagactggaagccagaa gcaggcctttctccaccaaggaaaaaacgagaagaccccaaaaccaggagagactctgtg gactccaagtcttctgcctcctcctctccaaaaagaccatcggtggaaagatcaaacagc agcaaatcaaaagcggagagccccaaaacacctagcagccccttgacccccacgtttgcc tcttccatgtgtctcctggccccctgctatctcacaggggactctgtccgggacaagtgt gtggagatgctgtcagcagccctgaaggcggacgatgattacaaggactatggagtcaac tgtgacaagatggcatcagaaatcgaagatcatatccttgaactgtgccggggctgtggg tgtctgcaccgtctagcagcacccatccaagtctctttattgtatatggctcttcccagc aacagctttgtcttcctgatcttctttggctcaggagaccaagaaaatgaggctgtgttc cccgagctcctccagagggtggagggagtgggaggctcggaagcctgcctaggatgcagt gtttggacagaaactctgtggttggtggccagggatgacttcctgccggagctcaagagc acggacatgaagtaccggaaccgcgtgcgcagccgcataagcaacctcaaggaccccagg aaccccggcctgcggcggaacgtgctcagtggggccatctccgcagggcttatagccaag atgacggcagaggaaatggccagtgatgaactgagggagttgaggaatgccatgacccag gaggccatccgtgagcaccagatggccaagactggcggcaccaccactgacctcttccag tgcagcaaatgcaagaagaagaactgcacctataaccaggtgcagacacgcagtgctgat gagcccatgactacctttgtcttatgcaatgaatgtggcaatcgctggaaggtctcctcc ttggaggatgccgagaaacctttcaaattttgttgcctttctggttttgtagttctgctg atggaacagccagccatgaacaagcgctgtctgctagctgcttttcctgctctctctccc tggaggcgaacccttgtgctcgagatggcagccaccctgctcatggctgggtcccaggca cctgtgacgtttgaagatatggccatgtatctcacccgggaagaatggagacctctggac gctgcacagagggacctttaccgggatgttatgcaggagaattatggaaatgttgtctca ctagattttgagatcaggagtgagaacgaggtaaatcccaagcaagagattagtgaagat gtacaatttgggactacatctgaaagacctgctgagaatgctgaggaaaatcctgaaagt gaagagggctttgaaagcggagataggtcagaaagacaatggggagatttaacagcagaa gagtgggtaagctatcctctccaaccagtcactgatctacttgtccacaaagaagtccac acaggcatccgctatcatatatgttctcattgtggaaaggccttcagtcagatctcagac cttaatcgacatcagaagacccacactggagacagaccctataaatgttatgaatgtgga aaaggcttcagtcgcagctcacaccttattcagcatcaaagaacacatactggggagagg ccttatgactgtaacgagtgtgggaaaagttttggaagaagttctcacctgattcagcat cagacaatccacactggagagaagcctcacaaatgtaatgagtgtggaaaaagtttctgc cgtctctctcacctaatccaacaccaaaggacccacagtggtgagaaaccctatgagtgt gaggagtgtgggaaaagcttcagccggagctctcacctagctcagcaccagaggacccac acgggtgagaaaccttatgaatgtaacgaatgtggccgaggcttcagtgagagatctgat ctcatcaaacactatcgagtccacacaggggagaggccctacaagtgtgatgagtgtggg aagaatttcagtcagaactccgaccttgtgcgtcatcgcagagcccacacgggagagaag ccataccactgtaacgaatgtggggaaaatttcagccgcatctcacacttggttcagcac cagagaactcacactggagagaagccatatgaatgcaatgcttgtgggaaaagcttcagc cggagctctcatctcatcacacaccagaaaattcacactggagagaagccttatgagtgt aatgagtgttggcgaagctttggtgaaaggtcagatctaattaaacatcagagaacccac acaggggagaagccctacgagtgtgtgcagtgtgggaaaggtttcacccagagctccaac ctcatcacacatcaaagagttcacacgggagagaaaccttatgaatgtaccgaatgtgag aagagtttcagcaggagctcagctcttattaaacataagagagttcatacggactaa