GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:53:00 Sequence gi568815581f:48793418_48995766 : 202349 bp : 46.71% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 Intr - 2729 2644 86 2 2 121 94 7 0.123 3.22 1.08 Intr - 3787 3644 144 1 0 125 93 109 0.557 15.58 1.07 Intr - 6343 6187 157 0 1 55 75 69 0.978 2.31 1.06 Intr - 7968 7838 131 0 2 93 61 70 0.952 4.29 1.05 Intr - 8245 8108 138 1 0 66 68 50 0.636 1.46 1.04 Intr - 10514 10485 30 2 0 130 76 20 0.834 3.33 1.03 Intr - 11574 11318 257 1 2 103 18 165 0.668 8.16 1.02 Intr - 18850 18745 106 1 1 112 99 72 0.631 10.49 1.01 Init - 23951 23865 87 2 0 55 62 97 0.331 2.45 1.00 Prom - 36226 36187 40 -3.76 2.00 Prom + 45306 45345 40 -6.36 2.01 Init + 48291 48470 180 2 0 92 89 151 0.915 14.79 2.02 Intr + 49985 50143 159 1 0 40 77 59 0.527 0.18 2.03 Intr + 54647 54749 103 1 1 118 100 45 0.994 8.45 2.04 Intr + 54905 55038 134 0 2 52 72 216 0.983 16.76 2.05 Intr + 55835 55960 126 1 0 46 80 171 0.997 12.98 2.06 Intr + 57672 57760 89 2 2 59 70 86 0.997 2.67 2.07 Intr + 58142 58211 70 2 1 75 96 69 0.926 5.58 2.08 Intr + 59089 59211 123 0 0 80 80 134 0.990 12.58 2.09 Intr + 61416 61490 75 2 0 65 75 85 0.836 4.61 2.10 Intr + 62675 62770 96 1 0 80 95 9 0.586 1.01 2.11 Intr + 66897 67032 136 2 1 49 99 65 0.883 3.84 2.12 Intr + 68859 68887 29 1 2 121 131 -8 0.994 4.63 2.13 Term + 69421 69588 168 0 0 114 48 115 0.982 7.98 2.14 PlyA + 70400 70405 6 1.05 3.00 Prom + 72121 72160 40 -5.16 3.01 Init + 78134 78146 13 1 1 110 95 8 0.824 3.40 3.02 Intr + 87703 87903 201 2 0 114 72 167 0.981 16.96 3.03 Intr + 88282 88348 67 2 1 92 68 44 0.393 0.76 3.04 Intr + 89929 90082 154 1 1 43 117 41 0.303 2.57 3.05 Intr + 99213 99380 168 2 0 -13 31 187 0.116 3.04 3.06 Intr + 99992 100039 48 1 0 103 95 -8 0.417 0.28 3.07 Intr + 101739 101917 179 2 2 100 111 230 0.983 25.22 3.08 Term + 102238 102352 115 1 1 116 54 138 0.812 11.24 3.09 PlyA + 106670 106675 6 1.05 4.00 Prom + 108394 108433 40 -5.46 4.01 Init + 115087 115403 317 0 2 108 89 288 0.999 25.71 4.02 Intr + 117391 117463 73 1 1 90 95 69 0.998 7.21 4.03 Intr + 119417 119604 188 1 2 71 90 180 0.999 14.99 4.04 Intr + 122659 122770 112 1 1 89 110 89 0.999 11.58 4.05 Intr + 127743 127855 113 2 2 38 92 53 0.969 -0.12 4.06 Intr + 129430 129520 91 1 1 97 90 138 0.952 14.90 4.07 Term + 133547 133717 171 1 0 116 41 119 0.958 7.83 4.08 PlyA + 135618 135623 6 1.05 5.15 PlyA - 136701 136696 6 1.05 5.14 Term - 137195 137058 138 2 0 152 42 69 0.999 6.66 5.13 Intr - 138300 138226 75 0 0 106 72 110 0.983 10.91 5.12 Intr - 139929 139788 142 2 1 47 69 222 0.692 16.56 5.11 Intr - 142825 142753 73 2 1 114 39 19 0.853 -1.94 5.10 Intr - 143707 143603 105 2 0 87 78 104 0.974 9.49 5.09 Intr - 147645 147507 139 0 1 105 58 239 0.892 22.64 5.08 Intr - 149140 149051 90 1 0 72 42 119 0.976 5.89 5.07 Intr - 150558 150508 51 0 0 123 111 33 0.996 8.20 5.06 Intr - 151421 151264 158 0 2 46 83 82 0.056 3.23 5.05 Intr - 160994 160972 23 1 2 94 90 27 0.074 0.69 5.04 Intr - 167570 167469 102 1 0 51 91 115 0.864 7.39 5.03 Intr - 168402 168310 93 2 0 92 105 79 0.998 9.08 5.02 Intr - 171063 170893 171 2 0 102 61 167 0.998 14.46 5.01 Init - 173815 173730 86 1 2 73 91 196 0.999 16.69 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 151317 151264 54 0 0 62 83 79 0.929 6.12 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:48793418_48995766|GENSCAN_predicted_peptide_1|379_aa MGWDCGLARWARVGLRERAAVQPLAPGCAAMSFAFPPFIPQGYKTAFGVGTNKIVTQDNR WELPGAWYFPRASSQAREMPQCPTLESQEGENSEEKGDSSKEDPKETVALAFVRENPGAQ NGLQNAQQQGKKKRKKKRLGLKAGEWGAMLMIGDQSIQLPAFLSSIVRRAAQQYGFREGG EDDDWTLYWTDYSVSLERVMEMKSYQKINHFPGMSEICRKDLLARNMSRMLKMFPKDFRF FPRTWCLPADWGDLQTYSRSRKNKTYICKPDSGCQGKGIFITRTVKEIKPGEDMICQLYI SKPFIIDGFKFDLRIYVLVTSCDPLRIFVYNEGLARFATTSYSRPCTDNLDDICMHLTNY SINKHSSNFSRDAHSGSKS >gi568815581f:48793418_48995766|GENSCAN_predicted_CDS_1|1137_bp atgggctgggactgcggcttggctcgctgggcaagggtagggctgcgggagcgagcggcg gtccagcccctggcgcccgggtgcgcggccatgtcttttgcctttcctccttttattcct caaggctacaagactgcttttggtgttggcaccaacaaaattgttacgcaagacaatagg tgggaactaccaggggcctggtatttccccagagcctcctcccaggccagggagatgcca cagtgcccgactttggaaagccaggaaggggaaaactccgaagagaagggggacagttcc aaagaagatccaaaagaaaccgtcgcgctggcttttgtgagagagaacccaggggcacaa aacggacttcagaatgcccagcagcaaggcaagaagaagaggaagaaaaagaggttagga ttgaaagctggggaatggggagccatgttgatgattggtgatcaatctatccagctgccg gcctttctttcttcaatagtgcgcagggctgcccaacagtacggctttagagagggaggg gaagacgatgactggactctctattggacagattactcagtgtcactggagcgggtgatg gaaatgaaaagttaccagaagatcaatcacttccccgggatgagtgaaatctgccggaag gacttgctggccaggaacatgagccgcatgttaaagatgttccctaaagatttccgcttt ttccctaggacctggtgtcttcctgctgactggggagatttgcagacctacagcaggtca agaaaaaataagacatacatttgtaagccggattcgggctgccaagggaaaggtatattc atcacccggacagtgaaagaaatcaaaccaggggaggatatgatctgtcagctgtatatt tcaaagccctttatcattgatgggtttaagtttgacctacggatttatgtactggtgaca tcctgtgaccctctcaggatttttgtgtacaatgaaggactggcccgctttgcgacgacc tcttactcccgcccttgcacagacaacctggatgatatctgcatgcacctgactaattat tccattaataagcacagttcaaatttcagtcgagatgcacactctggcagtaagagn >gi568815581f:48793418_48995766|GENSCAN_predicted_peptide_2|495_aa MEETIKDPPTSAVLLDHCHFSQVIFNSVEKFYIPGGDVTCHYTFTQHFIPRRKDWIGIFR YVMCLFASAIHPEIRHHSQLHCEGAVAYLYPSLNSFSIIVWNIKLIDEQRKLEVGWKTTR EYYTFMWVTLPIDLNNKSAKQQEVQFKAYYLPKDDEYYQFCYVDEDGVVRGASIPFQFRP ENEEDILVVTTQGEVEEIEQHNKELCKENQELKDSCISLQKQNSDMQAELQKKQEELETL QSINKKLELKVKEQKDYWETELLQLKEQNQKMSSENEKMGIRVDQLQAQLSTQEKEMEKL VQGDQDKTEQLEQLKKENDHLFLSLTEQEPRNKKAALTVEEQLAQEVERLKAKDENFDLS KRLSENEIICNALQRQKERLEGENDLLKRENSRLLSYMGLDFNSLPYQVPTSDEGGARQN PGLAYGNPYSGIQESSSPSPLSIKKCPICKADDICDHTLEQQQMQPLCFNCPICDKIFPA TEKQIFEDHVFCHSL >gi568815581f:48793418_48995766|GENSCAN_predicted_CDS_2|1488_bp atggaggagaccatcaaagatccccccacatcagctgtcttgctggatcactgtcatttc tctcaggtcatctttaacagtgtggagaagttctacatccctggaggggacgtcacatgt cattataccttcacccagcatttcatccctcgtcgaaaggattggattggcatctttaga tatgtcatgtgcctatttgccagtgctattcatcctgaaatcagacaccattctcaactg cactgtgaaggagctgttgcttatttatatccttcattgaattccttttccataattgtt tggaacatcaagcttattgatgagcaacggaagctggaggtggggtggaagacaacccgt gagtattacaccttcatgtgggttactttgcccattgacctaaacaacaaatcagctaaa cagcaggaagtccaattcaaagcttactacctgcccaaggatgatgagtattaccagttc tgctatgtggatgaggatggtgtggtccggggagcaagtattcctttccaattccgtcca gaaaatgaggaagacatcctggttgttaccactcagggagaggtggaagagattgagcag cacaacaaggagctttgcaaagaaaaccaggagctgaaggacagctgtatcagcctccag aagcagaactcagacatgcaggctgagctccaaaagaagcaggaggagctagaaacccta cagagcatcaataagaagttggaactgaaagtgaaagaacagaaggactattgggagaca gagctgcttcaactgaaagaacaaaaccagaagatgtcctcagaaaatgagaagatggga atcagagtggatcagcttcaggcccagctgtcaactcaagagaaagaaatggagaagctt gttcagggagatcaagataagacagagcagttagagcagctgaaaaaggaaaatgaccac ctctttctcagtttaactgaacaggagcctaggaacaagaaagcggcactaactgtggag gagcagttagcacaagaggtggaacgccttaaggcaaaggatgaaaactttgacctgtca aaaagactgagtgagaacgaaattatatgtaatgctctgcagagacagaaagagagattg gaaggagaaaatgatcttttgaagagggagaacagcagattgctcagttacatgggtctg gattttaattctttgccgtatcaagtacctacttcagatgaaggaggcgcaagacaaaat ccaggacttgcctatggaaacccatattctggtatccaagaaagttcttcccccagcccg ctctccatcaagaaatgccctatctgcaaagcagatgatatttgtgatcacaccttggag caacagcagatgcagcccctttgtttcaattgtccaatttgtgacaagatcttcccagct acagagaagcagatctttgaagaccacgtgttctgccactctctctga >gi568815581f:48793418_48995766|GENSCAN_predicted_peptide_3|314_aa MPGLDRDAGTQHRTWAHFIQIMKVEETAASKCNWPAVTQLHDSKVEFLLPHWVLCGQQQA TCHHQEARHLTGILSSPSTGCETLRKSLNDALAGLCSSEPSLRYMTLTNQSVARKRNGQH SLQALPLCELLTHRTSQRRNTGPKPESPASIRMEAGIATKLLWRERKQLRKANKSGESMT STNGDAGILRPMRMEKVQDTWTEKMQTAGALFISPALPSYSNFPLQVARREFQTSVVSRD IDTAAKFIGAGAATVGVAGSGAGIGTVFGSLIIGYARNPSLKQQLFSYAILGFALSEAMG LFCLMVAFLILFAM >gi568815581f:48793418_48995766|GENSCAN_predicted_CDS_3|945_bp atgcctggcctagaccgagacgcgggcacccagcaccgcacctgggcccacttcatccag atcatgaaggtggaggagactgcagccagcaagtgcaactggccagcagtcacacagctc catgactccaaggttgagttcctgctgcctcactgggtcctctgtggccagcaacaagcc acgtgtcaccaccaagaggcccgacatcttacaggaatcctgtcttcaccatcaactggc tgcgagaccttgagaaagtcacttaatgatgcactggctgggctatgctctagtgaaccc agcctaagatacatgacgcttactaaccagtcagtagcaaggaaacggaatggccagcac tcgcttcaggctctgcctctgtgtgagttgctgacacacagaacatcgcagaggaggaac accgggcccaaaccggaaagccccgcctctatccgcatggaggcgggaattgccacgaag ctcctgtggagggagaggaagcagctgcggaaagccaataagagtggggaatcgatgacg tcaaccaatggggacgcggggatattacggccaatgagaatggagaaggtccaggacacg tggactgaaaaaatgcagaccgccggggcattattcatttctccagctctgccttcctac agcaacttcccactccaggtggccagacgggagttccagaccagtgttgtctcccgggac attgacacagcagccaagtttattggtgctggggcagccacagttggtgtggctggttca ggggctggcattggaaccgtgtttggcagcttgatcattggctatgccaggaacccgtct ctcaagcagcagctcttctcctatgccattcttggctttgccctgtctgaggccatgggg cttttctgtttgatggtcgccttcctcatcctcttcgccatgtga >gi568815581f:48793418_48995766|GENSCAN_predicted_peptide_4|354_aa MAESPTEEAATAGAGAAGPGASSVAGVVGVSGSGGGFGPPFLPDVWAAAAAAGGAGGPGS GLAPLPGLPPSAAAHGAALLSHWDPTLSSDWDGERTAPQCLLRIKRDIMSIYKEPPPGMF VVPDTVDMTKIHALITGPFDTPYEGGFFLFVFRCPPDYPIHPPRVKLMTTGNNTVRFNPN FYRNGKVCLSILGTWTGPAWSPAQSISSVLISIQSLMTENPYHNEPGFEQERHPGDSKNY NECIRHETIRVAVCDMMEGKCPCPEPLRGVMEKSFLEYYDFYEVACKDRLHLQGQTMQDP FGEKRGHFDYQSLLMRLGLIRQKVLERLHNENAEMDSDSSSSGTETDLHGSLRV >gi568815581f:48793418_48995766|GENSCAN_predicted_CDS_4|1065_bp atggcggagagtccgactgaggaggcggcaacggcgggcgccggggcggcgggccccggg gcgagcagcgttgctggtgttgttggcgttagcggcagcggcggcgggttcgggccgcct ttcctgccggatgtgtgggcggcggcggcggcagcgggcggggccgggggcccggggagc ggcctggctccgctgcccgggctcccgccctcagccgctgcccacggggccgcgctgctt agccactgggaccccacgctcagctccgactgggacggcgagcgcaccgcgccgcagtgt ctactccggatcaagcgggatatcatgtccatttataaggagcctcctccaggaatgttc gttgtacctgatactgttgacatgactaagattcatgcattgatcacaggcccatttgac actccttatgaagggggtttcttcctgttcgtgtttcggtgtccgcccgactatcccatc cacccacctcgggtcaaactgatgacaacgggcaataacacagtgaggtttaaccccaac ttctaccgcaatgggaaagtctgcttgagtattctaggtacatggactggacctgcctgg agcccagcccagagcatctcctcagtgctcatctctatccagtccctgatgactgagaac ccctatcacaatgagcccggctttgaacaggagagacatccaggagacagcaaaaactat aatgaatgtatccggcacgagaccatcagagttgcagtctgtgacatgatggaaggaaag tgtccctgtcctgaacccctacgaggggtgatggagaagtcctttctggagtattacgac ttctacgaggtggcctgcaaagatcgcctgcaccttcaaggccaaactatgcaggaccct tttggagagaagcggggccactttgactaccagtccctcttgatgcgcctgggactgata cgtcagaaagtgctggagaggctccataatgagaatgcagaaatggactctgatagcagt tcatctgggacagagacagaccttcatgggagcctgagggtttag >gi568815581f:48793418_48995766|GENSCAN_predicted_peptide_5|481_aa MVATKTFALLLLSLFLAVGLGEKKEGHFSALPSLPVGSHAKVSSPQPRGPRYAEGTFISD YSIAMDKIHQQDFVNWLLAQKGKKNDWKHNITQREARALELASQANRKEEEAVEPQSSPA KNPSDEDLLRDLLIQELLACLLDQTNLCRLRLRNRRERDDARLGLPPWGAGGGVRDVETR GPGSRAARGPRVGMHRRGVGAGAIAKKKLAEAKYKERGTVLAEDQLAQPFPSLYVLRVYP VIVIGEEVEEAVKQENGAMSKQLDMFKTNLEEFASKHKQEIRKNPEFRVQFQDMCATIGV DPLASGKGFWSEMLGVGDFYYELGVQIIEVCLALKHRNGGLITLEELHQQVLKGRGKFAQ DVSQDDLIRAIKKLKALGTGFGIIPVGGTYLIQSVPAELNMDHTVVLQLAEKNGYVTVSE IKASLKWETERARQVLEHLLKEGLAWLDLQAPGEAHYWLPALFTDLYSQEITAEEAREAL P >gi568815581f:48793418_48995766|GENSCAN_predicted_CDS_5|1446_bp atggtggccacgaagacctttgctctgctgctgctgtccctgttcctggcagtgggacta ggagagaagaaagagggtcacttcagcgctctcccctccctgcctgttggatctcatgct aaggtgagcagccctcaacctcgaggccccaggtacgcggaagggactttcatcagtgac tacagtattgccatggacaagattcaccaacaagactttgtgaactggctgctggcccaa aaggggaagaagaatgactggaaacacaacatcacccagagggaggctcgggcgctggag ctggccagtcaagctaataggaaggaggaggaggcagtggagccacagagctccccagcc aagaaccccagcgatgaagatttgctgcgggacttgctgattcaagagctgttggcctgc ttgctggatcagacaaacctctgcaggctcaggctgaggaaccgtcgtgaaagagatgac gcgcggctcgggcttccgccttggggagccggcggcggagtccgggacgtggagacccgg ggtcccggcagccgggcggcccgcgggcccagggtggggatgcaccgccgcggggtggga gctggcgccatcgccaagaagaaacttgcagaggccaagtataaggagcgagggacggtc ttggctgaggaccagctagcccagcccttcccttctctgtacgtgctccgagtttaccca gtgattgtgattggggaagaagtggaggaagccgttaagcaggaaaatggggctatgtca aagcagttggacatgttcaagaccaacctggaggaatttgccagcaaacacaagcaggag atccggaagaatcctgagttccgtgtgcagttccaggacatgtgtgcaaccattggcgtg gatccgctggcctctggaaaaggattttggtctgagatgctgggcgtgggggacttctat tacgaactaggtgtccaaattatcgaagtgtgcctggcgctgaagcatcggaatggaggt ctgataactttggaggaactacatcaacaggtgttgaagggaaggggcaagttcgcccag gatgtcagtcaagatgacctgatcagagccatcaagaaactaaaggcacttggcactggc ttcggcatcatccctgtgggcggcacttacctcattcagtctgttccagctgagctcaat atggatcacaccgtggtgctgcagctggcagagaagaatggctacgtgactgtcagtgag atcaaagccagtcttaaatgggagaccgagcgagcgcggcaagtgctggaacacctgctg aaggaagggttggcgtggctggacttacaggccccaggggaggcccactactggctgcca gctctcttcactgacctctactcccaggagattacagctgaggaggccagagaagccctc ccctga