GENSCAN 1.0	Date run: 4-Nov-116	Time: 02:34:26

Sequence gi568815591f:30811910_31023626 : 211717 bp : 50.01% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  16770  16828   59  0  2   95   86    62 0.192   5.23
 1.02 Intr +  24749  24855  107  0  2   61  115    81 0.946   8.03
 1.03 Intr +  27560  27654   95  1  2   59   68    86 0.780   2.46
 1.04 Intr +  28226  28325  100  2  1   57   76    40 0.006  -0.19
 1.05 Intr +  28851  28939   89  2  2   94   89    66 0.006   6.07
 1.06 Intr +  38545  38646  102  1  0  124   93    49 0.015   8.29
 1.07 Intr +  39774  40006  233  0  2   71   72    51 0.010  -0.78
 1.08 Intr +  43406  43610  205  0  1  -11   81   200 0.005   7.66
 1.09 Intr +  45343  45427   85  1  1  110   23    16 0.005  -2.88
 1.10 Intr +  47348  47415   68  1  2   82   68    97 0.013   4.80
 1.11 Intr +  49516  49538   23  1  2   91  105    -2 0.007  -1.01
 1.12 Intr +  51716  51837  122  0  2   66   68    76 0.014   3.61
 1.13 Intr +  53300  53325   26  1  2  123  106   -43 0.121  -2.18
 1.14 Intr +  60839  60917   79  2  1   94   61    60 0.460   3.45
 1.15 Intr +  63586  63747  162  0  0  129   80   170 0.984  20.47
 1.16 Intr +  70272  70452  181  2  1   80   46   210 0.834  15.44
 1.17 Intr +  71012  71084   73  0  1  115   89    58 0.997   7.06
 1.18 Intr +  73926  74046  121  0  1  106   92    31 0.853   5.80
 1.19 Intr +  76163  76231   69  1  0   78   91    74 0.961   6.08
 1.20 Intr +  77572  77826  255  0  0   11   83   111 0.283   0.54
 1.21 Intr +  80932  80988   57  0  0   32   60   109 0.097   1.58
 1.22 Intr +  99923 100384  462  1  0   59   86   835 0.023  73.75
 1.23 Intr + 105619 105639   21  0  0  133   57    15 0.535   0.64
 1.24 Intr + 110157 110321  165  2  0  131   68   190 0.864  21.46
 1.25 Intr + 110655 110735   81  2  0  139   49    39 0.652   4.93
 1.26 Term + 111541 111720  180  0  0   87   43   371 0.877  30.11
 1.27 PlyA + 113582 113587    6                               1.05

 2.04 PlyA - 116051 116046    6                               1.05
 2.03 Term - 120930 120774  157  2  1   96   37    71 0.611   0.21
 2.02 Intr - 122468 122381   88  0  1  110   99    26 0.612   4.93
 2.01 Init - 130258 130171   88  1  1   64   41    57 0.221  -0.60
 2.00 Prom - 130421 130382   40                              -3.26

 3.00 Prom + 139703 139742   40                              -4.26
 3.01 Init + 152160 152216   57  2  0   99   84   100 0.982  10.24
 3.02 Intr + 156922 157027  106  0  1   71   47   133 0.418   7.29
 3.03 Intr + 157154 157261  108  0  0   85   94    13 0.600   1.86
 3.04 Intr + 157958 158055   98  0  2   15   94    67 0.791  -0.27
 3.05 Intr + 159210 159307   98  2  2  115   72   178 0.891  17.71
 3.06 Intr + 160054 160216  163  1  1   73   75   161 0.991  13.28
 3.07 Intr + 162076 162229  154  0  1   58   72   122 0.930   7.25
 3.08 Intr + 162520 162580   61  2  1  101   77    93 0.997   7.29
 3.09 Intr + 163062 163131   70  0  1  129   44   118 0.974  10.68
 3.10 Intr + 163868 163995  128  1  2  115   55    64 0.992   5.28
 3.11 Intr + 164520 164649  130  0  1   77   94   128 0.999  13.00
 3.12 Intr + 165372 165413   42  2  0   72   85    66 0.860   3.14
 3.13 Term + 171660 171752   93  2  0   98   53   110 0.433   6.33
 3.14 PlyA + 173285 173290    6                               1.05

 4.06 PlyA - 173376 173371    6                               1.05
 4.05 Term - 175811 175650  162  2  0  119   46    50 0.100   1.84
 4.04 Intr - 184307 184093  215  0  2   28   89   119 0.134   4.43
 4.03 Intr - 185968 185889   80  0  2  108    8    17 0.032  -5.11
 4.02 Intr - 191958 191825  134  0  2   58   70    71 0.324   1.74
 4.01 Init - 192136 191987  150  1  0   38  101   120 0.744   8.34
 4.00 Prom - 197888 197849   40                              -7.06

 5.03 PlyA - 198348 198343    6                               1.05
 5.02 Term - 204695 204564  132  2  0   65   49   107 0.836   2.59
 5.01 Init - 207980 207822  159  2  0   71   68   110 0.827   7.13

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  29661  29728   68  1  2  131   42    53 0.961   3.10
S.002 Intr -  42908  42661  248  2  2  147  100    66 0.937   9.96
S.003 Intr -  47517  47383  135  0  0   54   84    73 0.878   4.26
S.004 Init + 100001 100384  384  1  0  111   86   808 0.960  79.74

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591f:30811910_31023626|GENSCAN_predicted_peptide_1|1073_aa
XKNQLLPSDKVDGELGALRLEDVEDELIREEVILSPVPSVLKLQTASKPIDLSVAKALQM
WLVQQELGNKTLCFGDVKIEHGFKEHLLPTSLCQCVPGVRSCAKPRDAYRDELLKKPLHH
RGGPCGVLAAVQGCVLQKLLFEGDSKADCAQGLQPSDAHRTRCLVLALADIVWRAGGRER
AVVALLGNRSPEREGGWSKATQPLKEHGAPNPAPIPGRLGLHTPGDCTPRGSLGEDGGGV
PPSVKTKGKSAFRSLETHLSVVVLGSLKEQKGDLLLIRSVVRVQQSQERRAEPDGIRVWD
LARVARNGITWSHSQLKQNVEPPEPGKGSDWFLFRHRFLQKPFLDTPSPNLKSVLKALPF
EVGPYGCILLTLSAILSRSTELGGSKAEGDQSQGEDIMDRCKPSRVLPTTACLATSLSEA
PRRVEKQTFEVTCEAGWVGGALNGPHRGKEDPEHSCLSSVLSVTQELVNLLLTGKAVSNV
FNDVVELDSGDGNITLLRGIAARSDIGFLSLFEHYNMCQVGCFLKTPRFPIWVVCSESHF
SILFSLQPGLLRDWRTERLFDLYYYDGLANQQEQIRLTIDTTQTISEDTDNDLVPPLELC
IRTKSVFHVCENFADTSCLPVPLPPSIPHHGGSTLAVVPRKAEWALSGTFFPMKNTGSIN
VEKQLRQYQQWALTAEEKSDFTKPSQFVSKYGTSVSYLSSEDERSDMDKFHQSGSTPKPN
RLLWICLTHRSPVNSLKHLDSIEALNSSTSNGFQKEDENMQKCGGLLMKKGAQPRLWLSS
QRELSTRQRSQAKPPASMASEFKKKLFWRAVVAEFLATTLFVFISIGSALGFKYPVGNNQ
TAVQDNVKVSLAFGLSIATLAQSVGHISGAHLNPAVTLGLLLSCQISIFRALMYIIAQCV
GAIVATAILSGITSSLTGNSLGRNDLVVAAMWLADGVNSGQGLGIEIIGTLQLVLCVLAT
TDRRRRDLGGSAPLAIGLSVALGHLLAIDYTGCGINPARSFGSAVITHNFSNHWIFWVGP
FIGGALAVLIYDFILAPRSSDLTDRVKVWTSGQVEEYDLDADDINSRVEMKPK

>gi568815591f:30811910_31023626|GENSCAN_predicted_CDS_1|3222_bp
nngaaaaatcagttgctgccgtctgacaaggtggatggtgagctgggtgccctgcggctc
gaggatgtggaggatgagttgataagggaagaggtcatcctgtcgccagtcccatcagtg
ctcaagttgcagacagcatcaaaaccaattgacctctcagtagcaaaggcacttcagatg
tggctggtgcagcaagagctggggaataagaccctctgctttggagatgttaaaattgaa
catggatttaaggagcacttgcttcccactagcctttgccaatgtgtcccaggagtcaga
tcctgtgctaagccccgtgatgcctacagagatgagctcctcaagaagcctctgcaccac
aggggtggtccttgcggagtcctggcagctgtccaaggctgtgtcctacagaaactcctg
tttgaaggagatagcaaagccgactgtgctcagggactgcagccttcagatgcccaccgg
acccgctgcctcgtcctggccctcgcagacattgtgtggcgggcagggggccgagagaga
gccgttgttgcactattaggaaacagaagcccagagagggagggtggctggtccaaggct
acacagccactaaaggagcacggggccccaaacccggctccgattcctggacgtcttggg
ctacacacccctggggactgcacacccagggggagcctcggagaggatggaggtggagtt
cctccatcagtgaaaacaaagggcaagtcagcattccgcagtctggaaacacatttgtca
gttgtggtactaggcagtctgaaggagcagaaaggagacctgctgttgatccggagtgtt
gtgagagtgcagcagtcacaggagaggcgagcagagcctgatggcatccgtgtgtgggat
ctggccagggtggccaggaatgggatcacctggtcacactcccagctgaagcagaacgtg
gagccaccagagccaggaaaaggaagtgactggtttctgttcagacatcgctttctccaa
aaacccttccttgatactccctccccgaacttgaagtctgtgttgaaggcccttcctttt
gaagtgggcccctatggctgcatcctgctcaccctttctgccatcctgtccaggtctaca
gagctaggaggaagcaaggctgagggagaccagagccagggtgaggacatcatggacaga
tgtaaacctagccgagtgctgccgaccactgcctgcctggccacttcattgtcagaggcc
ccaagaagagtcgagaaacagacctttgaggtcacatgtgaggcagggtgggttggaggg
gccctcaatggacctcacaggggcaaagaagaccctgagcacagttgcttgtcatcagtc
ctctcagtcacccaggaacttgtcaatctgctcctgactgggaaagctgtgtccaacgtt
ttcaacgatgtggttgagctggattctggggatgggaacatcacacttctcagaggcatt
gctgcacgcagtgatattggcttcttatctctctttgagcattacaacatgtgccaggtt
ggctgcttcctgaagaccccgaggttccccatctgggtggtttgcagtgagagccacttc
agcatcctctttagcctgcagccggggctcctgcgtgactggaggactgagaggctcttt
gacttgtactactacgatggcctggccaaccagcaggagcagatccggctgaccattgac
accacccaaaccatctctgaggacacagacaacgaccttgtcccacccctcgagctctgc
atcagaaccaagtctgtgttccatgtctgcgagaactttgctgacaccagctgcctgcct
gttccacttcccccatcaatcccccatcatggtgggtccaccttggcagttgttcctaga
aaagctgagtgggctctatcaggcacctttttccccatgaaaaacactgggtccatcaac
gttgaaaagcagttaaggcagtatcagcaatgggcactcactgccgaggagaagtctgac
ttcaccaagccaagccagtttgtctccaaatacggcaccagtgtctcttatttgagcagt
gaagatgaacgctctgatatggataaattccatcaaagtggctcaaccccaaaacccaat
agactcctctggatctgcttgacacacagaagcccagtaaatagtctgaagcacttggat
agcatagaagcacttaattcttccacatccaatggatttcaaaaagaagatgagaacatg
cagaagtgcggtggcctgctgatgaagaagggggcccagcccaggctgtggctcagctct
cagagggaattgagcacccggcagcggtctcaggccaagccccctgccagcatggccagc
gagttcaagaagaagctcttctggagggcagtggtggccgagttcctggccacgaccctc
tttgtcttcatcagcatcggttctgccctgggcttcaaatacccggtggggaacaaccag
acggcggtccaggacaacgtgaaggtgtcgctggccttcgggctgagcatcgccacgctg
gcgcagagtgtgggccacatcagcggcgcccacctcaacccggctgtcacactggggctg
ctgctcagctgccagatcagcatcttccgtgccctcatgtacatcatcgcccagtgcgtg
ggggccatcgtcgccaccgccatcctctcaggcatcacctcctccctgactgggaactcg
cttggccgcaatgacctggtagtggcagctatgtggctggctgatggtgtgaactcgggc
cagggcctgggcatcgagatcatcgggaccctccagctggtgctatgcgtgctggctact
accgaccggaggcgccgtgaccttggtggctcagccccccttgccatcggcctctctgta
gcccttggacacctcctggctattgactacactggctgtgggattaaccctgctcggtcc
tttggctccgcggtgatcacacacaacttcagcaaccactggattttctgggtggggcca
ttcatcgggggagccctggctgtactcatctacgacttcatcctggccccacgcagcagt
gacctcacagaccgcgtgaaggtgtggaccagcggccaggtggaggagtatgacctggat
gccgacgacatcaactccagggtggagatgaagcccaaatag

>gi568815591f:30811910_31023626|GENSCAN_predicted_peptide_2|110_aa
MKLTVLQTYDTITLEMKKKGLDLSTSPGCPNVSLPLELSPGTHSGREASRDGGAQSKDSS
RSLARTPAPAGSSTIEGQSMIHGGFVPGDLLPQEKLLAPVQATPAGQHRD

>gi568815591f:30811910_31023626|GENSCAN_predicted_CDS_2|333_bp
atgaaactgactgtattgcaaacatatgacacaatcaccctggagatgaaaaagaaagga
ttggatctaagcacttcgcctggatgcccgaatgtttctttgcccctggaactgtctcct
gggactcacagtggaagagaggccagcagggatggcggtgctcagagcaaggacagctcc
aggagcttggcacggacaccagccccagctggttccagcacaatagaaggacagtcaatg
attcatggaggcttcgttcctggggaccttctcccccaggagaagctgctggctcctgta
caagctaccccagcagggcagcacagggattaa

>gi568815591f:30811910_31023626|GENSCAN_predicted_peptide_3|435_aa
MDRRMWGAHVFCVLSPLPTQVLGHMHPECDFITQLREDESACLQAAEEMPNTTLGCPATW
DGLLCWPTAGSGEWVTLPCPDFFSHFSSESGAVKRDCTITGWSEPFPPYPVACPVPLELL
AEEESYFSTVKIIYTVGHSISIVALFVAITILVALRRLHCPRNYVHTQLFTTFILKAGAV
FLKDAALFHSDDTDHCSFSTVMAMGEGAGQVLCKVSVAASHFATMTNFSWLLAEAVYLNC
LLASTSPSSRRAFWWLVLAGWGLPVLFTGTWVSCKLAFEDIACWDLDDTSPYWWIIKGPI
VLSVGVNFGLFLNIIRILVRKLEPAQGSLHTQSQYWYCVFVSGDTGKGRLSKSTLFLIPL
FGIHYIIFNFLPDNAGLGIRLPLELGLGSFQGFIVAILYCFLNQEGFADWAYDADSWADP
LTGKLFLSQDSSFHN

>gi568815591f:30811910_31023626|GENSCAN_predicted_CDS_3|1308_bp
atggaccgccggatgtggggggcccacgtcttctgcgtgttgagcccgttaccgacccag
gtattgggccacatgcacccagaatgtgacttcatcacccagctgagagaggatgagagt
gcctgtctacaagcagcagaggagatgcccaacaccaccctgggctgccctgcgacctgg
gatgggctgctgtgctggccaacggcaggctctggcgagtgggtcaccctcccctgcccg
gatttcttctctcacttcagctcagagtcaggggctgtgaaacgggattgtactatcact
ggctggtctgagccctttccaccttaccctgtggcctgccctgtgcctctggagctgctg
gctgaggaggaatcttacttctccacagtgaagattatctacaccgtgggccatagcatc
tctattgtagccctcttcgtggccatcaccatcctggttgctctcaggaggctccactgc
ccccggaactacgtccacacccagctgttcaccacttttatcctcaaggcgggagctgtg
ttcctgaaggatgctgcccttttccacagcgacgacactgaccactgcagcttctccact
gtaatggccatgggtgaaggggctgggcaggttctatgcaaggtctctgtggccgcctcc
catttcgccaccatgaccaacttcagctggctgttggcagaagccgtctacctgaactgc
ctcctggcctccacctcccccagctcaaggagagccttctggtggctggttctcgctggc
tgggggctgcccgtgctcttcactggcacgtgggtgagctgcaaactggccttcgaggac
atcgcgtgctgggacctggacgacacctccccctactggtggatcatcaaagggcccatt
gtcctctcggtcggggtgaactttgggctttttctcaatattatccgcatcctggtgagg
aaactggagccagctcagggcagcctccatacccagtctcagtattggtactgtgtgttt
gtgtctggggatactgggaaagggcgtctctccaagtcgacacttttcctgatcccactc
tttggaattcactacatcatcttcaacttcctgccagacaatgctggcctgggcatccgc
ctccccctggagctgggactgggttccttccagggcttcattgttgccatcctctactgc
ttcctcaaccaagagggattcgcagactgggcctacgatgcagattcctgggcagatcct
ttgactggcaagctcttcctctctcaggactccagcttccacaactga

>gi568815591f:30811910_31023626|GENSCAN_predicted_peptide_4|246_aa
MDASQRYKIPGSETKDFITHSKSTSQSCIGLLRFSDITQVPQRQCKGPMMESNTELGTFA
TSLASGKKPGLFLVGGVTASLMAAHGKYSPAKQPGEEQEMKASICVSPAERPAAGPLICP
PTVLQAGVKRVTREAPRPRFKLSYDVSMSVMTEQEGEEPAPVSSSAERSEHATYFMRLNR
RWKWSIQHRAWHMNSSSAPPPSPSTEYSTPFQQTLQQGPCGIAQMLSDKHLQVPGASAFG
AKPNPE

>gi568815591f:30811910_31023626|GENSCAN_predicted_CDS_4|741_bp
atggatgccagccaaagatacaagattcctgggtcagagacaaaggacttcatcactcac
agcaaaagcactagccagagctgcataggattgcttcggttctccgacatcacccaagtc
ccacaaagacaatgcaaagggcccatgatggagagtaacactgagcttggaacatttgcc
acttctctagcgagtggaaagaaacctggtcttttcctggtgggaggtgttactgcatct
ctcatggctgctcatggcaaatacagccctgcgaaacagcccggtgaggagcaagaaatg
aaagccagcatctgcgtgtcacctgcagagcgccctgcagctggtcctctcatttgccct
ccaacagtgctgcaggctggtgtgaagagagtgacccgggaagctccaagacccagattt
aaactcagttatgacgtttccatgagtgtcatgactgagcaagaaggtgaagagcccgcg
cctgtttcctcatctgcggaacggagcgaacacgccacctacttcatgagacttaacaga
cgatggaagtggagcattcagcacagagcctggcacatgaactcttcctctgcaccccca
cccagccctagcaccgaatactccactcccttccagcagaccctccagcagggaccctgt
ggcattgcacaaatgctctctgacaaacacctacaagtccctggtgcctcagcctttggg
gcaaaacccaaccctgaataa

>gi568815591f:30811910_31023626|GENSCAN_predicted_peptide_5|96_aa
MGDKEEEKVQSITWCQVLLPHIFPGPPHSPHRPTPEWSAAPPAGIGGAMEEVWPPEQGEN
WMNSGGEHGMMMSHQTAPQWDFAFTSDPRLPIQVVE

>gi568815591f:30811910_31023626|GENSCAN_predicted_CDS_5|291_bp
atgggggacaaggaggaggagaaagtccagagcatcacctggtgtcaagtactcctgcct
cacatcttcccaggccctccccacagcccccaccgccccacccctgaatggtcagctgct
cctcctgctggcattggtggtgctatggaggaagtgtggcctcctgaacaaggagagaat
tggatgaacagtggtggagagcatggcatgatgatgtcccaccagactgcacctcagtgg
gacttcgccttcacatcggaccccaggcttccaattcaggttgtggaatga