GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:43:28 Sequence gi568815588r:46361511_46562635 : 201125 bp : 47.22% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4358 4519 162 2 0 88 94 36 0.256 3.29 1.02 Intr + 5915 6054 140 2 2 58 -4 144 0.623 2.31 1.03 Intr + 9629 9691 63 0 0 97 87 85 0.982 7.99 1.04 Intr + 11867 11996 130 0 1 64 79 166 0.679 13.05 1.05 Intr + 13615 13747 133 1 1 58 34 88 0.898 1.05 1.06 Intr + 14255 14362 108 1 0 58 106 33 0.814 2.58 1.07 Intr + 18375 18465 91 2 1 90 109 120 0.988 13.87 1.08 Intr + 19636 19745 110 2 2 65 44 226 0.619 15.90 1.09 Intr + 21069 21182 114 2 0 82 86 134 0.999 13.24 1.10 Intr + 21946 22036 91 0 1 46 116 172 0.998 15.37 1.11 Intr + 22696 22775 80 2 2 112 38 111 0.574 7.77 1.12 Intr + 23264 23323 60 1 0 81 94 73 0.986 6.13 1.13 Intr + 23870 23963 94 1 1 50 66 161 0.999 9.64 1.14 Intr + 24202 24297 96 2 0 10 86 160 0.984 7.98 1.15 Intr + 24458 24540 83 0 2 112 57 39 0.626 2.56 1.16 Intr + 26883 27005 123 2 0 48 103 127 0.989 10.98 1.17 Term + 29361 29420 60 2 0 101 54 138 0.927 9.50 1.18 PlyA + 30035 30040 6 1.05 2.04 PlyA - 31193 31188 6 1.05 2.03 Term - 34077 33986 92 1 2 31 49 119 0.525 0.18 2.02 Intr - 35212 35101 112 1 1 50 71 65 0.497 0.95 2.01 Init - 36466 36422 45 1 0 93 47 48 0.554 1.98 2.00 Prom - 39650 39611 40 -2.06 3.00 Prom + 42841 42880 40 -2.76 3.01 Init + 50090 50095 6 1 0 61 81 0 0.142 -2.39 3.02 Term + 54430 54846 417 0 0 52 47 343 0.434 21.88 3.03 PlyA + 55227 55232 6 1.05 4.06 PlyA - 55248 55243 6 -1.95 4.05 Term - 55430 55387 44 0 2 96 49 42 0.722 -1.58 4.04 Intr - 56577 56454 124 0 1 95 60 64 0.732 4.46 4.03 Intr - 58793 58717 77 0 2 56 117 72 0.903 6.13 4.02 Intr - 64554 64507 48 1 0 116 63 34 0.088 2.35 4.01 Init - 69996 69948 49 0 1 100 68 -6 0.060 -0.29 4.00 Prom - 84684 84645 40 -5.46 5.00 Prom + 85131 85170 40 -4.16 5.01 Init + 87001 87119 119 0 2 68 82 102 0.781 7.27 5.02 Term + 89603 89837 235 2 1 90 54 86 0.376 1.19 5.03 PlyA + 91776 91781 6 1.05 6.08 PlyA - 92191 92186 6 1.05 6.07 Term - 101184 99998 1187 1 2 95 41 1473 0.013 135.22 6.06 Intr - 106121 105978 144 0 0 110 46 44 0.002 2.65 6.05 Intr - 111670 111532 139 1 1 100 101 16 0.000 4.14 6.04 Intr - 126428 126401 28 1 1 77 92 1 0.018 -2.48 6.03 Intr - 127309 127240 70 2 1 124 85 14 0.034 2.94 6.02 Intr - 137493 137360 134 2 2 59 96 75 0.609 5.69 6.01 Init - 141799 141735 65 1 2 64 100 43 0.598 3.72 6.00 Prom - 142362 142323 40 -6.96 7.00 Prom + 143713 143752 40 -2.46 7.01 Init + 149242 149355 114 0 0 67 71 146 0.367 9.13 7.02 Intr + 154904 155116 213 1 0 85 49 67 0.046 1.41 7.03 Term + 156714 156740 27 2 0 127 36 27 0.062 -0.53 7.04 PlyA + 156900 156905 6 1.05 8.00 Prom + 159274 159313 40 -2.06 8.01 Init + 179869 180031 163 0 1 75 53 171 0.684 10.20 8.02 Intr + 186532 186719 188 2 2 36 -11 129 0.085 -2.79 8.03 Term + 187224 187322 99 2 0 60 49 98 0.210 1.23 8.04 PlyA + 187485 187490 6 1.05 9.05 PlyA - 187558 187553 6 -0.45 9.04 Term - 189232 187850 1383 1 0 137 55 902 0.153 83.08 9.03 Intr - 191353 191269 85 0 1 105 29 33 0.115 -1.08 9.02 Intr - 194661 194544 118 1 1 131 66 32 0.313 4.82 9.01 Intr - 197070 196965 106 0 1 69 76 41 0.144 0.79 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 101125 99998 1128 1 0 67 41 1482 0.828 138.03 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:46361511_46562635|GENSCAN_predicted_peptide_1|579_aa XPREITAPTLAGPGHSSSLTGSLNSVHTSVNSPFPQVPIRICPLASTRSLTDTHRPLQCQ PNALGKASWTLAVTKGGQGHLDPPELQAPDFHSPFWVQKFILHAVEEVVKEVVGHAKETG EKAIAEAIKKAQESGDKKMKEITETVTNTVTNAITHAAESLDKLGHDASEWSRGAVVAGQ SQAGARVSLGGDGAEAITGLTVDQYGMLYKLQEPDVWSPSRGQPVSLHLREKGAPEVKEM AWWKAWIEQEGVTVKSSSHFNPDPDAETLYKAMKGIGTNEQAIIDVLTKRSNTQRQQIAK SFKAQFGKARGRLDLTETLKSELSGKFERLIVALMYPPYRYEAKELHDAMKGLGTKEGVI IEILASRTKNQLREIMKAYEEDYGSSLEEDIQADTSGYLERILVCLLQGSRDDVSSFVDP ALALQDAQDLYAAGEKIRGTDEMKFITILCTRSATHLLRVFEEYEKIANKSIEDSIKSET HGSLEEAMLTVGTAPPLESQCVLPKIVLCAAHVVPSAVWGAGTRDGTLIRNIVSRSEIDL NLIKCHFKKMYGKTLSSMIMEDTSGDYKNALLSLVGSDP >gi568815588r:46361511_46562635|GENSCAN_predicted_CDS_1|1740_bp nnacccagggagataacagctcccactcttgctggtccaggacattcctcatctcttact ggttcccttaactcggtccacacttctgtgaacagcccctttcctcaagtacccattcga atatgtcctctggcttccaccaggagcctgactgatacccacaggcctctgcagtgccag cccaatgccctgggcaaagcatcctggactctggctgttacaaaaggaggacagggtcat ctggacccgcctgagctccaggcccccgactttcacagccccttctgggtccagaagttc atacttcatgccgtggaggaagtggtgaaggaggtggtgggacatgccaaggagactgga gagaaagccattgctgaagccataaagaaagcccaggagtcaggggacaaaaagatgaag gaaatcaccgagacagtgaccaacacagtcacaaatgccatcacccatgcagcagaaagt ctggacaaacttggacatgatgcctctgaatggtcccgaggggctgtggtggctgggcag agccaggcaggagccagagtcagcctggggggtgatggagctgaggccatcaccggtctg acagtggaccagtatggcatgctgtataagctgcaggagccagacgtgtggagtcccagc agaggccaacctgtgtctcttcatctccgtgagaaaggtgcccccgaagtgaaagagatg gcctggtggaaagcctggattgaacaggagggtgtcacagtgaagagcagctcccacttc aacccagaccctgatgcagagaccctctacaaagccatgaaggggatcgggaccaacgag caggctatcatcgatgtgctcaccaagagaagcaacacgcagcggcagcagatcgccaag tccttcaaggctcagttcggcaaggcaaggggaaggctggacctcactgagaccttgaag tctgagctcagtggcaagtttgagaggctcattgtggcccttatgtatccgccatacaga tacgaagccaaggagctgcatgacgccatgaagggcttaggaaccaaggagggtgtcatc attgagatcctggcctctcggaccaagaaccagctgcgggagataatgaaggcgtatgag gaagactatgggtccagcctggaggaggacatccaagcagacacaagtggctacctggag aggatcctggtgtgcctcctgcagggcagcagggatgatgtgagcagctttgtggacccg gcactggccctccaagacgcacaggatctgtatgcggcaggcgagaagattcgtgggact gatgagatgaaattcatcaccatcctgtgcacgcgcagtgccactcacctgctgagagtg tttgaagagtatgagaaaattgccaacaagagcattgaggacagcatcaagagtgagacc catggctcactggaggaggccatgctcactgtggggactgctccacctctagagtcccag tgtgtgctgccaaagattgttctctgtgctgcccacgtggtgcccagtgctgtgtgggga gcagggacgcgtgatgggaccctgataagaaacatcgtttcaaggagcgagattgactta aatcttatcaaatgtcacttcaagaagatgtacggcaagaccctcagcagcatgatcatg gaagacaccagcggcgactacaagaacgccctgctgagcctggtgggcagcgacccctga >gi568815588r:46361511_46562635|GENSCAN_predicted_peptide_2|82_aa MESGSNCVGMGRLPQGGDPKHAKSNTLALPPESMRTTRMGMCPDLALPSWTAGDLLKKGQ GDQTSPGKVEDEDPDRILGVVR >gi568815588r:46361511_46562635|GENSCAN_predicted_CDS_2|249_bp atggagagcggcagcaactgcgttgggatggggcggctgccccagggaggtgaccctaaa catgccaagtccaacacactagctctaccccctgagagcatgagaacaacacggatgggc atgtgcccggatctggctttgccctcatggactgcaggagacctgctgaagaaaggccag ggtgaccagacatcacctgggaaggtggaggatgaggaccctgatcggatattgggagtg gtcagatag >gi568815588r:46361511_46562635|GENSCAN_predicted_peptide_3|140_aa MEIVIQKYHTVNDHNCEVRKALSKQEMASASSSQRGRSGSGNFGGGRGGGFSGNDNFGIG GNFSGLGGFGGSRGGGGYGGSGDGCNGFGNDGSNFGGGGSYNDFGNYNNESSNFGPMKGG NFGGRSSGPCGNGGLYFAKP >gi568815588r:46361511_46562635|GENSCAN_predicted_CDS_3|423_bp atggaaattgtcattcagaaataccatactgtgaatgaccacaactgtgaagttaggaaa gccctgtcaaagcaagagatggctagtgcttcatccagccaaagaggtcgaagtggttct ggaaactttggtggtggtcgtggaggtggtttcagtgggaatgacaactttggtattgga ggaaacttcagtggtcttggtggctttggtggcagtcgtggtggtggtggatatggtggc agtggggatggctgtaatggatttggtaatgatggaagcaattttggaggtggtggaagc tacaatgattttggcaattacaacaatgagtcttcaaattttggacccatgaagggagga aattttggaggcagaagctctggcccctgtggcaatggaggcctatactttgcaaaacca tga >gi568815588r:46361511_46562635|GENSCAN_predicted_peptide_4|113_aa MKKMKGQVADQEKIYTGLWSDVESMWPPVYAGAFGGRHCRYPTEEESDTVGGEGHSYPTH GRRPDWQGQEAGAKWDMSSVLAKHTLDADEPYAWETEKWGHNSNYQFNYNKGF >gi568815588r:46361511_46562635|GENSCAN_predicted_CDS_4|342_bp atgaagaaaatgaaagggcaagttgcagaccaggagaaaatatacacaggcctgtggtct gatgttgagtccatgtggcccccagtctatgccggagccttcggaggcaggcactgccgc tatcccaccgaggaggagtctgacactgtgggcggtgaaggccacagttacccgacccac ggccgtaggccagactggcaggggcaagaggcaggagccaagtgggacatgtccagtgtc ctggccaaacatacactggacgcagatgagccatatgcctgggaaacagagaaatggggt cacaacagtaactatcagttcaactacaacaaaggtttctga >gi568815588r:46361511_46562635|GENSCAN_predicted_peptide_5|117_aa MKDNIQYYSNYMAFRKRQNYGDGKKIGGCQGLTGKEVINRGCEVYQNLEEILWEERPSQT ASPQRCKNQISSMGDFQVSSAFSQSSLYPCHSVNHSNNSQLPAEKLDPEAGLDSLQK >gi568815588r:46361511_46562635|GENSCAN_predicted_CDS_5|354_bp atgaaagacaacatacagtattattccaactacatggctttccggaaaaggcaaaactac ggggatggtaaaaagatcggtggttgccagggtttaacaggaaaggaggtgataaatagg ggttgtgaggtgtatcagaaccttgaagagatactctgggaagaaaggccttcccagact gcctctccgcagcgctgtaagaatcagatttcttctatgggggacttccaggtctcttca gcattttctcagtcatcactgtatccctgccactcagtgaaccacagcaacaactcccag cttccagccgagaagcttgaccctgaagctggcctcgattctctacagaagtga >gi568815588r:46361511_46562635|GENSCAN_predicted_peptide_6|588_aa MVEEFHYRETSGTQSVVVGAGRQNQAVSGIYSFWWVLGLGDFKNEAVDLTVSVTALKGGR LEFAPSESCHITLNGPENTRTPRRMKEGICPGDSLHCLQMTGQLFCSMSLTWVEADASSG LDSLACTFSRKITEAVLRFPCILSCDAVCGLVLGSGGGDKSCDLDIGEWGDKAAQADNQL GCYICFCSRAWEGGVIPQVYHLVQESWNLFTSTMNTSHLLALLLPKSPQGENRSKPLGTP YNFSEHCQDSVDVMVFIVTSYSIETVVGVLGNLCLMCVTVRQKEKANVTNLLIANLAFSD FLMCLLCQPLTAVYTIMDYWIFGETLCKMSAFIQCMSVTVSILSLVLVALERHQLIINPT GWKPSISQAYLGIVLIWVIACVLSLPFLANSILENVFHKNHSKALEFLADKVVCTESWPL AHHRTIYTTFLLLFQYCLPLGFILVCYARIYRCLQRQGRVFHKGTYSLRAGHMKQVNVVL VVMVVAFAVLWLPLHVFNSLEDWHHEAIPICHGNLIFLVCHLLAMASTCVNPFIYGFLNT NFKKEIKALVLTCQQSAPLEESEHLPLSTVHTEVSKGSLRLSGRSNPI >gi568815588r:46361511_46562635|GENSCAN_predicted_CDS_6|1767_bp atggtggaggaattccactacagggagacatcaggaacacaaagtgtggttgttggagct ggaagacaaaatcaagctgtgtcaggaatttattccttctggtgggttctcggtcttggt gacttcaagaatgaagctgtggacctcacagtgagtgttacagctcttaaaggtgggcgt ctggaatttgctccttcagaatcctgtcacattactcttaatggtccagaaaatacaagg actcctcgaaggatgaaagaagggatttgccctggagacagccttcactgcttgcagatg acaggccagttattctgtagtatgtccctcacctgggttgaggctgatgcttcttcaggg ttggattcacttgcatgcaccttcagcaggaagatcacagaagctgtgctgcgttttcct tgcatcctgtcatgcgatgctgtgtgtggacttgtgctgggatctggaggcggagacaag agctgtgacctagacataggggagtggggagacaaagctgctcaggcagacaaccaacta ggctgctacatctgcttctgcagcagagcatgggagggtggcgtcatccctcaagtgtat cacttagttcaagagtcctggaatcttttcacatccactatgaacacctctcacctcctg gccttgctgctcccaaaatctccacaaggtgaaaacagaagcaaacccctgggcacccca tacaacttctctgaacattgccaggattccgtggacgtgatggtcttcatcgtcacttcc tacagcattgagactgtcgtgggggtcctgggtaacctctgcctgatgtgtgtgactgtg aggcagaaggagaaagccaacgtgaccaacctgcttatcgccaacctggccttctctgac ttcctcatgtgcctcctctgccagccgctgaccgccgtctacaccatcatggactactgg atctttggagagaccctctgcaagatgtcggccttcatccagtgcatgtcggtgacggtc tccatcctctcgctcgtcctcgtggccctggagaggcatcagctcatcatcaacccaaca ggctggaagcccagcatctcacaggcctacctggggattgtgctcatctgggtcattgcc tgtgtcctctccctgcccttcctggccaacagcatcctggagaatgtcttccacaagaac cactccaaggctctggagttcctggcggataaggtggtctgtaccgagtcctggccactg gctcaccaccgcaccatctacaccaccttcctgctcctcttccagtactgcctcccactg ggcttcatcttggtctgttatgcacgcatctaccggtgcctgcagaggcaggggcgcgtg tttcacaagggcacctacagcttgcgagctgggcacatgaagcaggtcaatgtggtgctg gtggtgatggtggtggcctttgccgtgctctggctgcctctgcatgtgttcaacagcctg gaagactggcaccatgaggccatccccatctgccatgggaacctcatcttcttagtgtgc cacttgcttgccatggcctccacctgtgtcaacccattcatctatggctttctcaacacc aacttcaagaaggagatcaaggccctggtgctgacttgccagcagagcgcccccctggag gagtcagagcatctgcccctgtccacagtacatacggaagtctccaaagggtccctgagg ctaagtggcaggtccaatcccatttaa >gi568815588r:46361511_46562635|GENSCAN_predicted_peptide_7|117_aa MHRVLRGAAVLAGFAVHICPFARSLLSLKCSDLSTGNQPQAQPVVTGATRPHLCALHMGP LTQLPPTWICLKNSSIYSFCYQTLISGGQLPQVWPLQPEIQVQILILPLVSGEGQQQ >gi568815588r:46361511_46562635|GENSCAN_predicted_CDS_7|354_bp atgcaccgggtgctcagaggagccgctgtcctggcaggctttgctgtccacatttgcccc tttgcacggagcctgctgtccctgaagtgctctgacctgtccacaggaaaccagccccaa gcccagccagtggtcactggggccacacgtccccatctttgtgctttgcacatgggtcct ctgacccagctgcctcccacctggatctgcctgaagaactcttcaatatacagcttctgt tatcagacgctcatttcggggggacagctgccacaagtgtggcctctgcagccagagatc caagttcaaatcctgattctgcctctagtgtctggggaaggacagcagcagtaa >gi568815588r:46361511_46562635|GENSCAN_predicted_peptide_8|149_aa MPTRRRRVLSRAAPPRTTPGPGPTPVRGHTPSRKPDRGISRKPPSPAPPCTADQGAEEAP ETGQVVGHDPTLALWSLTRVDWGAVSRDLRIFKTKTQDKNTSPLPFFTGLEKSLGNEGNT HGSAGPGFGSQSHLFKLSSVGHIGYSLAK >gi568815588r:46361511_46562635|GENSCAN_predicted_CDS_8|450_bp atgcccaccaggaggcgtcgggtcctgtcacgggccgcgccgcccaggaccacgcctggc ccgggacccacgccagtccggggacacacacctagccgcaaaccagaccgcgggatcagc cgcaagccaccctctccagcgccgccctgcaccgcagatcagggtgcagaagaggctcca gaaacaggccaggtcgtgggccatgaccccacactagccctctggtccctcacacgggtg gattggggggctgtgtcacgggatcttaggatcttcaagacaaagacccaggacaagaac acaagcccactcccattcttcacaggcctcgagaaatctttgggcaatgaaggcaacacc cacggctcagctggccctggctttggatctcagtcccacctcttcaagctgtccagcgtg gggcacatcggctacagccttgccaaatga >gi568815588r:46361511_46562635|GENSCAN_predicted_peptide_9|563_aa LPQPGEPPCQGMNGRRGPVGQGVMPWGEGQPAVFTAYCAAYPAPSLAALWLLLSRIITPA HSPICLQSLPLLLRRDKSGFSPVKQLGWDILTRPGLCSELHSPAAMSSSRPEPGPWAPLS PRLQPLSQSSSSLLGEGREQRPELRKTASSTVWQAQLGEASTRPQAPEEEGNPPESMKPA RASGPKARPSAGGHWWSSTVGNVSTMGGSDLCRLRAPSAAAMQRSHSDLVRSTQMRGHSG ARKASLSCSALGSSPVHRAQLQPGGTSGQGGQAPAGLERDLAPEDETSNSAWMLGASQLS VPPLDLGDTTAHSSSAQAEPKAAEQLATTTCHALPPAALLCGMREVRAGGCCHALPATGI LAFPKLVASVSESGLQAQHGVKIHCRLSGGLPGHSHCCAHLWGPAGLVPEPGSRTKDVWT MTSANDLAPAEASPLSAQDAGVQAAPVAACKAVATSPSLEAPAALHVFPEVTLGSSLEEV PSPVRDVRWDAEGMTWEVYGAAVDLEVLGVAIQKHLEMQFEQLQRAPASEDSLSVEGRRG PLRAVMQSLRRPSCCGCSGAAPE >gi568815588r:46361511_46562635|GENSCAN_predicted_CDS_9|1692_bp ctgccacagccaggggagcctccttgccagggcatgaatggaagaagggggccagttgga caaggagtgatgccctggggtgagggccagccagctgtgttcacagcctactgcgccgcc taccccgcccccagccttgctgccctttggcttctcctgtcccgcatcattaccccggcc cacagccccatttgcctgcagtcgctgcctttgctcctgagaagggacaagtctggcttc agccccgtgaagcagctgggatgggacatccttaccaggccggggctgtgttcagagctg cacagtcctgcagccatgagctccagccgccccgagccgggtccctgggcacccctgagc ccccgccttcagcccctgtcccagagctcttccagcctgctgggtgaaggccgggaacag aggccagagctccgcaagactgccagcagcaccgtgtggcaggcccagctgggcgaggcc agcaccagaccccaggccccggaggaagaggggaacccgcctgagagcatgaagccagca cgggcctctggccccaaggcgcgacccagtgctggaggccactggtggagcagcactgtg ggcaatgtgtccaccatgggcggcagtgacctgtgtcgcctgcgggcccctagtgctgct gctatgcagaggagccattcagacctggtccgtagcacccagatgcggggacacagtggt gctcggaaggccagtctcagctgctcagcccttggcagcagccctgtccacagggctcag ctgcagccaggtggtacttctggccagggtggccaggcccctgcaggcctggaaagggac ctggctcctgaggatgagacttctaactcagcctggatgctgggggcgagtcagttgtca gtgccaccactagacctgggggacacaactgcccacagcagcagtgcccaggctgagccc aaagctgctgaacagctggctaccaccacctgccatgctctgcccccagctgctctactc tgtggcatgagggaggtgagggctggtggctgctgccatgccctacctgccacagggatc ctggcctttcccaaactagtggcgtcagtgagcgagtctgggctgcaggctcagcatggg gtgaagatccactgtaggttgtctggggggctccctgggcattcccattgctgtgcccac ctttggggtcccgctgggttagtcccagagcctggctctaggaccaaagatgtgtggacc atgacctcagccaatgacttggcccctgcagaggcatccccgctgtcagcccaggatgct ggtgtgcaggcggccccagtggcggcctgcaaggctgtggccaccagtccgtccctggaa gcgcctgcagccctgcatgtgttcccagaggtaactctggggtccagcctggaggaggtg ccgtcccctgtgcgggatgtgcgatgggatgctgagggcatgacatgggaggtgtacgga gctgcggtggacctggaggtgctcggtgtggccatccagaagcacctggagatgcagttt gagcagctgcagcgggcgcccgccagcgaggacagcctgtctgtggagggccggaggggg ccactgcgggctgtcatgcagtccctgcggcgccccagctgctgcggctgctccggcgcg gcccccgagtga