GENSCAN 1.0 Date run: 5-Nov-116 Time: 13:43:27 Sequence gi568815595r:128520414_128750800 : 230387 bp : 47.82% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1528 1560 33 0 0 84 87 16 0.531 0.98 1.02 Intr + 2705 2875 171 1 0 99 69 41 0.721 3.44 1.03 Term + 2950 3090 141 0 0 130 55 41 0.713 2.93 1.04 PlyA + 4562 4567 6 1.05 2.00 Prom + 10643 10682 40 -7.16 2.01 Init + 13705 13863 159 0 0 86 88 64 0.440 6.02 2.02 Term + 17688 17885 198 2 0 23 34 290 0.972 14.50 2.03 PlyA + 18874 18879 6 1.05 3.04 PlyA - 19196 19191 6 1.05 3.03 Term - 20802 20629 174 0 0 76 44 69 0.220 -0.94 3.02 Intr - 26420 26266 155 0 2 31 96 82 0.258 3.09 3.01 Init - 27254 27182 73 2 1 104 17 130 0.316 6.73 3.00 Prom - 28808 28769 40 -5.96 4.15 PlyA - 32723 32718 6 1.05 4.14 Term - 34856 34788 69 2 0 92 52 59 0.747 0.64 4.13 Intr - 35381 35320 62 0 2 110 90 58 0.801 6.65 4.12 Intr - 35636 35500 137 1 2 31 80 77 0.523 1.41 4.11 Intr - 38100 37996 105 2 0 18 91 77 0.241 0.33 4.10 Intr - 42906 42805 102 2 0 74 66 46 0.051 0.29 4.09 Intr - 47828 47680 149 2 2 85 19 92 0.195 1.03 4.08 Intr - 50185 50108 78 1 0 87 56 90 0.507 5.45 4.07 Intr - 53226 53083 144 0 0 79 81 23 0.295 1.18 4.06 Intr - 55439 55402 38 0 2 72 99 26 0.270 0.08 4.05 Intr - 56575 56487 89 0 2 109 47 63 0.337 3.91 4.04 Intr - 63842 63689 154 0 1 87 66 57 0.143 2.53 4.03 Intr - 83869 83846 24 2 0 143 75 13 0.559 3.50 4.02 Intr - 87894 87589 306 1 0 -8 36 218 0.177 3.42 4.01 Init - 92672 92552 121 2 1 79 80 90 0.315 5.68 4.00 Prom - 93383 93344 40 -5.86 5.03 PlyA - 94001 93996 6 1.05 5.02 Term - 97646 97171 476 0 2 81 48 176 0.087 7.95 5.01 Init - 98739 98604 136 0 1 83 100 11 0.116 2.12 5.00 Prom - 99688 99649 40 -6.76 6.11 PlyA - 99783 99778 6 -0.45 6.10 Term - 100180 99998 183 1 0 77 41 425 0.999 34.24 6.09 Intr - 101996 101751 246 2 0 116 89 313 0.999 31.86 6.08 Intr - 105240 105121 120 0 0 112 81 313 0.999 33.69 6.07 Intr - 105599 105461 139 1 1 92 89 203 0.994 21.27 6.06 Intr - 106419 106320 100 1 1 81 89 152 0.999 13.77 6.05 Intr - 109706 109538 169 2 1 65 52 169 0.847 10.52 6.04 Intr - 111744 111535 210 0 0 121 86 194 0.999 21.61 6.03 Intr - 117692 117386 307 1 1 104 115 310 0.999 31.75 6.02 Intr - 124570 124506 65 1 2 111 121 79 0.998 11.02 6.01 Init - 130387 130127 261 1 0 89 109 480 0.999 45.26 6.00 Prom - 157784 157745 40 -5.76 7.00 Prom + 158561 158600 40 -1.26 7.01 Sngl + 160308 160559 252 2 0 80 42 202 0.760 7.91 7.02 PlyA + 161020 161025 6 1.05 8.00 Prom + 165962 166001 40 -5.56 8.01 Init + 166932 167042 111 2 0 76 52 63 0.606 1.71 8.02 Term + 172461 172613 153 2 0 55 55 116 0.780 2.92 8.03 PlyA + 175223 175228 6 1.05 9.05 PlyA - 177165 177160 6 1.05 9.04 Term - 181134 181052 83 1 2 79 45 68 0.450 -0.54 9.03 Intr - 181407 181219 189 1 0 82 47 78 0.380 2.66 9.02 Intr - 193536 193434 103 0 1 87 90 43 0.212 4.15 9.01 Init - 205986 205696 291 0 0 80 45 175 0.218 7.56 9.00 Prom - 207654 207615 40 -5.86 10.05 PlyA - 208048 208043 6 1.05 10.04 Term - 216173 216088 86 0 2 39 43 75 0.069 -4.08 10.03 Intr - 222646 222511 136 1 1 64 85 54 0.534 2.84 10.02 Intr - 222904 222768 137 2 2 82 45 84 0.444 3.79 10.01 Init - 225557 225362 196 2 1 87 84 112 0.592 9.79 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 197679 197559 121 2 1 128 41 62 0.858 3.45 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:128520414_128750800|GENSCAN_predicted_peptide_1|114_aa MAKPKSLKPIKRFLTSGWEHHKAMWITLIPGTMGPKCLWTNVWTNAPRALMLSGDSQALS TGHSQLSVVAMGPPVLRTSSGAPPPMPPGTMLSFPRVSAGPFQACGLCGIQPVD >gi568815595r:128520414_128750800|GENSCAN_predicted_CDS_1|345_bp atggccaaacccaagtcactcaagccaattaaaagattcctcacttctggttgggaacac cacaaagccatgtggatcaccctcatccctggaacgatggggcccaaatgcctatggaca aatgtgtggacaaatgccccccgggctctgatgctctctggagacagccaagccctaagc actggccattcacagctctctgtggtggctatgggtccacctgtcctccggactagctct ggagccccccctcccatgcctcctggaacaatgctgagcttcccccgtgtgtctgcaggt cctttccaggcttgtggcctgtgtgggatccagcctgtggactga >gi568815595r:128520414_128750800|GENSCAN_predicted_peptide_2|118_aa MMRRRDHEFNSGYSKFEMPVWHYTVQEMMQYRGLKYRRGAYTSDQDLKIINKEFLHEEIH KDLLVMGAYEISNKSGGAGSLHSHLKVTDSAGHILYSKEDATKGKFAFILEDYDMFEV >gi568815595r:128520414_128750800|GENSCAN_predicted_CDS_2|357_bp atgatgagaaggagagatcatgagttcaactctgggtattccaaatttgagatgcctgtg tggcattacactgttcaggaaatgatgcagtatcgaggtctgaagtacaggagaggagcc tacacttcagaccaagatttgaaaattatcaacaaagagttcctccatgaggagatccac aaggacctgctagtgatgggtgcgtacgagatctccaacaagtctgggggtgctggcagc ctgcacagccacctcaaggtcacagattctgctggccacattctctactccaaagaggat gcaaccaaagggaaatttgcctttatcctggaagattatgacatgtttgaagtgtga >gi568815595r:128520414_128750800|GENSCAN_predicted_peptide_3|133_aa MGAALLLLRLIWTLVMAGLSQHVGASQLPDMRVRLSWLVQPPAVYRHMKELSRNQSRLPR TQEPFSCSTELRAKQKEPFRGASLVLGPGPSSGATVLPDMSLLSTSGGESQPVLESPSMA PHGCIAKKKQKCV >gi568815595r:128520414_128750800|GENSCAN_predicted_CDS_3|402_bp atgggggccgccttgctcctgttgagactgatttggaccttagtcatggctggcctgagt cagcacgtgggagccagccaattgccagatatgcgagtgaggttatcctggcttgttcag ccaccagctgtctataggcacatgaaagagctcagcaggaatcagtccaggttacccaga acacaagaaccgttcagctgctccacagaactgcgagcaaaacaaaaggagccattcagg ggtgcatccttagttctaggccctggaccaagctctggggctacagtattgccagatatg tcccttctttctacaagtggaggtgaaagtcagccagtcctggaaagccccagcatggct ccacatggatgcattgccaagaagaagcagaagtgtgtgtga >gi568815595r:128520414_128750800|GENSCAN_predicted_peptide_4|525_aa MSLALGIQSGVGRVPATGTLCSPQRCLQSGNDKEPVHALPAGGDGNPRPHALAAPGCPAG RQLPVQETQPREGRRKVPRLRGAQRGLAPDDRLRNGLLPRVSRSLQRPKSPADLEHLQAL GRGPGAIPAALAWELRLLVVAPGVQSPAGAGPFPSTGREPSRTINNDYKTNSLVTNDGSL QVFTLRRSTPLRKKLNNEASLRKHSACCLGGPEDTGWVPVVEVPGQHLHIADVLPVEGDP QMTQGPPDSQPLGTLGQGGWKLLGIVGSLAPETLGGLGTEFGPCTHPLPFDMYPVAIKVA PNGSQAMTKLNIGTGILQQALSWGHAAPRPQQPAHIAALILQNGPSSCNGLAGQQSAIAK GKDSGAWSTIFTLMTSKWIIFSPNFSPDSRLINPTADSSIVRSLGQQQDGSSHGEAEEST PAQEASGSPHQGSEMDRQPARPQEMGGFREAPQLELTQPVQLDLAQFPGGSRGGGRWGDC GAGPMALAALPSENKASCRNSGRMRLFPMPLLRLALRGKHPESGP >gi568815595r:128520414_128750800|GENSCAN_predicted_CDS_4|1578_bp atgtccctggccctggggatacagtcgggagtgggcagagtccccgccacagggaccttg tgcagcccacagagatgcctgcagagtggaaatgataaggagcccgttcacgctctgcca gccgggggtgacgggaacccacgtcctcacgccctggctgctcccggctgccccgcgggc cggcagcttcccgttcaggaaactcagcctcgggaagggcggagaaaggtcccgcggctg cgaggagcccagcggggcttggcgcctgatgaccggctccggaacgggctgttaccccgc gtatctcggtctctgcagcggccaaagtcgcccgctgacctggagcacctgcaggccctg ggcagaggacccggggccattcccgcagccctggcctgggagctccggcttctagtggtg gctccgggtgtccagagcccggctggcgcagggcccttcccctccacggggagagagcct tctagaaccataaacaatgactacaagacaaacagccttgtgaccaatgatgggagcttg caagtgtttacactgcgtagatcaacacctttaagaaaaaaacttaacaatgaagcaagt cttaggaagcacagtgcctgctgcctgggggggcctgaagacacaggatgggtccccgtt gtggaggtgcccgggcagcacctgcatatagcagatgtgcttcctgtggaaggagacccc cagatgacacagggacccccagactctcagcccctaggcaccctgggccagggtggttgg aagcttctaggcattgtggggtctctggcaccagagacactcgggggtctggggaccgag tttgggccctgtacccacccactaccatttgacatgtatcctgtggccattaaagtcgca ccgaatggctctcaggccatgaccaaactgaacattgggactggaattttgcagcaagcc ctcagctggggccatgctgccccccgcccccaacagccagcacacatcgctgctttgatt ctgcaaaatggaccaagcagctgcaatggacttgccggacagcagagcgcaattgccaaa ggcaaggactctggagcgtggagcaccatcttcacactgatgacttccaaatggattatc ttcagccccaatttctcccctgactccaggctcataaatccaactgccgactccagcatt gtcagaagcctggggcaacagcaagacggcagcagccatggcgaggcagaagagtcaacg ccagcccaggaagcatcaggaagtccacatcaggggagtgagatggacagacagcctgca cggccgcaggaaatgggtggcttccgcgaagccccccagctggagttgacacagcctgtc cagctggacctggcccagtttccaggcggctccaggggcggggggcgctggggggactgt ggcgccggtccgatggctctagcagcgctgccatctgagaacaaagcgtcctgcaggaat tccggccggatgaggctctttcccatgccccttctgcggctggctctgcgcggaaagcat cctgaatccggcccctga >gi568815595r:128520414_128750800|GENSCAN_predicted_peptide_5|203_aa MAKWRCSGHPSGSVPGQVDIQAWNYKVWCASGKCGAAGPRHGYGKGNPRAAVPHGGALLY RGCIPPRAIWSAASPALFILLLSLNLFLRFLEQSESLAEIKITRWARVQIGGRPLLGRRA SRRPEERAAPAPWCPFVTLRHVTRRAPGSMGEIAGADGGLGLEKSRMRGANLKAARLQQN QSQFSGCGATYGCGLVNGLLWPD >gi568815595r:128520414_128750800|GENSCAN_predicted_CDS_5|612_bp atggcaaaatggaggtgctcgggacatccaagtgggagtgtccctgggcaggtggatatt caggcctggaattataaggtctggtgtgcaagtggtaaatgtggggctgccgggcctaga catggttatggaaaagggaacccccgggcggctgtgcctcacggaggggccctgctttac cgcggctgcataccacctcgggccatttggtcagctgcttcaccagctctcttcatcctc ttactctccctgaatctttttcttcgttttttggagcagtcagaaagcttggccgaaatc aagataactcgctgggcccgcgtgcagattggtgggcgccctctgctgggccggcgggcc tctcgcaggcctgaggagcgagctgcgcctgcgccctggtgtcccttcgtaacactgcgg cacgtcacgaggcgggcaccggggagtatgggcgaaatcgcaggcgcagacggcgggctc gggctagaaaagtcgcgcatgcgtggggctaatttaaaggcggcgcggctccaacagaac caaagccaattctccggctgtggcgccacctacgggtgtgggctcgtaaacggcctcctc tggccggactag >gi568815595r:128520414_128750800|GENSCAN_predicted_peptide_6|599_aa MEAPAAGLFLLLLLGTWAPAPGSASSEAPPLINEDVKRTVDLSSHLAKVTAEVVLAHLGG GSTSRATSFLLALEPELEARLAHLGVQVKGEDEEENNLEVRETKIKGKSGRFFTVKLPVA LDPGAKISVIVETVYTHVLHPYPTQITQSEKQFVVFEGNHYFYSPYPTKTQTMRVKLASR NVESYTKLGNPTRSEDLLDYGPFRDVPAYSQDTFKVHYENNSPFLTITSMTRVIEVSHWG NIAVEENVDLKHTGAVLKGPFSRYDYQRQPDSGISSIRSFKDVYYRDEIGNVSTSHLLIL DDSVEMEIRPRFPLFGGWKTHYIVGYNLPSYEYLYNLGDQYALKMRFVDHVFDEQVIDSL TVKIILPEGAKNIEIDSPYEISRAPDELHYTYLDTFGRPVIVAYKKNLVEQHIQDIVVHY TFNKVLMLQEPLLVVAAFYILFFTVIIYVRLDFSITKDPAAEARMKVACITEQVLTLVNK RIGLYRHFDETVNRYKQSRDISTLNSGKKSLETEHKALTSEIALLQSRLKTEGSDLCDRV SEMQKLDAQVKELVLKSAVEAERLVAGKLKKDTYIENEKLISGKRQELVTKIDHILDAL >gi568815595r:128520414_128750800|GENSCAN_predicted_CDS_6|1800_bp atggaggcgccagccgccggcttgtttctgctcctgttgcttgggacttgggccccggcg ccgggcagcgcctcctccgaggcaccgccgctgatcaatgaggacgtgaagcgcacagtg gacctaagcagccacctggctaaggtgacggccgaggtggtcctggcgcacctgggcggc ggctccacgtcccgagctacctctttcctgctggctttggagcctgagctcgaggcccgg ctggcgcacctgggcgtgcaggtaaagggagaagatgaggaagagaacaatttggaagta cgtgaaaccaaaattaagggtaaaagtgggagattcttcacagtcaagctcccagttgct cttgatcctggggccaagatttcagtcattgtggaaacagtctacacccatgtgcttcat ccgtatccaacccagatcacccagtcagagaaacagtttgtggtgtttgaggggaaccat tatttctactctccctatccaacgaagacacaaaccatgcgtgtgaagcttgcctctcga aatgtggagagctacaccaagctggggaaccccacgcgctctgaggacctactggattat gggcctttcagagatgtgcctgcctatagtcaggatacttttaaagtacattatgagaac aacagccctttcctgaccatcaccagcatgacccgagtcattgaagtctctcactggggt aatattgctgtggaagaaaatgtggacttaaagcacacaggagctgtgcttaaggggcct ttctcacgctatgattaccagagacagccagatagtggaatatcctccatccgttctttt aaggatgtttattaccgggatgagattggcaatgtttctaccagccacctccttattttg gatgactctgtagagatggaaatccggcctcgcttccctctctttggcgggtggaagacc cattacatcgttggctacaacctcccaagctatgagtacctctataatttgggtgaccag tatgcactgaagatgaggtttgtggaccatgtgtttgatgaacaagtgatagattctctg actgtgaagatcatcctgcctgaaggagccaagaacattgaaattgatagtccctatgaa atcagccgtgccccagatgagctgcactacacctatctggacacatttggccgccctgtg attgttgcctacaagaaaaatctggtagaacagcacattcaggacattgtggtccactac acgttcaacaaggtgctcatgctgcaggagcccctgctggtggtggcggccttctacatc ctgttcttcaccgttatcatctatgttcggctggacttctccatcaccaaggatccagcc gcagaagccaggatgaaggtagcctgcatcacagagcaggtcttgaccctggtcaacaag agaataggcctttaccgtcactttgacgagaccgtcaataggtacaagcaatcccgggac atctccaccctcaacagtggcaagaagagcctggagactgaacacaaggccttgaccagt gagattgcactgctgcagtccaggctgaagacagagggctctgatctgtgcgacagagtg agcgaaatgcagaagctggatgcacaggtcaaggagctggtgctgaagtcggcggtggag gctgagcgcctggtggctggcaagctcaagaaagacacgtacattgagaatgagaagctc atctcaggaaagcgccaggagctggtcaccaagatcgaccacatcctggatgccctgtag >gi568815595r:128520414_128750800|GENSCAN_predicted_peptide_7|83_aa MTPIWLTAQAHTQSQEASPTSPQLCIHSRQAVPEAECRIGQRQRRCSGTAARGRGVAVLA CTSLVTSAERLRQWRILPYATTG >gi568815595r:128520414_128750800|GENSCAN_predicted_CDS_7|252_bp atgacccccatatggctcactgcgcaagctcacacccagtcccaggaagcgagccccacg tcgccccagctctgcattcacagccgccaagcggtcccggaagccgaatgccggataggt caaagacagcgccgctgctccggcactgccgccagagggcgcggagtcgccgtgttggcc tgcacctcccttgtcacgtcagccgaacggctacggcaatggaggattttgccgtatgcg acgacaggatga >gi568815595r:128520414_128750800|GENSCAN_predicted_peptide_8|87_aa MVRKDLIELVILDLGPQEKSASGRKNKYKFLDMGMCLSLNTANSRYAGVISPHTKHRFCG FPSRHQLGVLQFSSDTIYLEIVSDPTG >gi568815595r:128520414_128750800|GENSCAN_predicted_CDS_8|264_bp atggtcagaaaagacctcattgagttggtgatacttgatctgggaccccaagagaagagt gcttctggcagaaagaacaagtataaattcctcgacatgggaatgtgcttgtcactcaac actgccaacagcagatatgcgggggttatttccccacacaccaaacatcgattctgcgga ttccccagtagacatcagctgggtgtcctccagttcagttctgacaccatctacctggag atagtgtcagatcccacaggttga >gi568815595r:128520414_128750800|GENSCAN_predicted_peptide_9|221_aa MGPGRRPGWARCLPRPLRWGLRAGRVARAPAEEETKRRTEARRSKFWFQGTLPSSPSRQL RRCRRLRLYGQDSRPAPTSATAAAPSGSTCHETLDARAFTTCLMESHSRTQGLPQSGLRG FGFPSDCPDEPDPGATRNNIFGEVKTLPGHFLMAVVICQIPVPLWWAWSVAESTQEICQA LRLSLQPGCFLNGAVSGLQSAFQEDSTERAAVMVQGDGDAA >gi568815595r:128520414_128750800|GENSCAN_predicted_CDS_9|666_bp atggggccaggccgccggccgggctgggcccgctgcttaccgcggccgctgcgctggggg ctccgggccgggcgcgtcgcgagggctcccgccgaggaggagactaaacggaggacagaa gcgagaaggtccaagttctggttccagggaactctcccgagctctccaagccgccaactc cgccgctgccgccgcctcaggctttatggccaagactccaggcccgctcccacttccgcc accgccgccgccccgagcggaagtacctgtcacgagacgctcgacgccagggcctttacc acttgcttaatggaaagtcacagccgcacccaggggctgcctcagagtggactcagaggc tttggcttccccagtgactgcccagatgagccagaccctggagctaccaggaacaatatc tttggagaagtgaaaactcttccaggccacttcctgatggctgtggtcatttgccagatc ccagttcccctctggtgggcctggtctgttgcagagagcactcaggaaatctgccaggcc ctccgcctgagtctccaacctggctgcttcctcaatggtgcagtctctggcctgcagtct gcattccaggaagacagtacagaaagagcagcagtgatggtgcagggggatggggatgct gcctga >gi568815595r:128520414_128750800|GENSCAN_predicted_peptide_10|184_aa MDLNRCQGALVKKPTLLCDQHTQLRGLNQQRMDVCKLPTKSLHGIQVKRTTGTPVAFKSI LFKVRGSCGGRGMGGKLGLRAALADQREFRVGVGLEGPTLGVAGWRHRPGAGSGPAARHA RALLRWAPVKPEPPRRAPPPALQRLVPSAAQGLRSAVLFPFAKTAINTPITAAPLVGVRM MQSL >gi568815595r:128520414_128750800|GENSCAN_predicted_CDS_10|555_bp atggacttaaataggtgccagggggcgctggtcaagaagcccacattgctgtgtgaccag cacactcagctcagaggactcaaccagcagaggatggatgtgtgcaaactgccaacaaag tctctgcatgggattcaagtcaagaggacaactggcacccctgtggccttcaaaagcatc ctattcaaagtaagggggagttgtggagggagaggcatgggcgggaaactggggctgcgc gcggcactcgcggaccagcgggagttccgggtgggtgtgggcttggagggccccacactc ggagtggccggctggcgccaccggcctggggcaggctctggacctgcagcccgccatgcc cgagccctcctgcggtgggctcccgtgaagcccgagcctccccgacgggcaccgccccct gctctgcagcgcctggtcccatcggcagcccaagggctgaggagtgcagttctttttccc tttgccaagacagccatcaatactcctatcacggctgctcctttggttggagtaagaatg atgcagagcctctag