GENSCAN 1.0 Date run: 8-Nov-116 Time: 07:56:00 Sequence gi568815595f:98049386_98250315 : 200930 bp : 36.80% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 6617 6680 64 1 1 39 94 60 0.066 3.06 1.02 Intr + 15698 15864 167 0 2 92 73 63 0.219 3.96 1.03 Intr + 33191 33303 113 1 2 57 70 64 0.022 -0.24 1.04 Intr + 38114 38270 157 2 1 90 84 87 0.057 7.59 1.05 Term + 38995 39009 15 0 0 82 32 31 0.292 -5.84 1.06 PlyA + 39113 39118 6 1.05 2.03 PlyA - 40730 40725 6 1.05 2.02 Term - 42724 42301 424 0 1 55 48 128 0.592 -0.72 2.01 Init - 46151 45928 224 2 2 88 72 201 0.774 16.58 2.00 Prom - 55155 55116 40 -3.95 3.00 Prom + 57280 57319 40 -4.65 3.01 Init + 77462 77527 66 1 0 65 108 6 0.068 1.42 3.02 Intr + 83295 83577 283 2 1 60 52 130 0.109 2.67 3.03 Intr + 83923 84128 206 2 2 102 4 112 0.273 2.20 3.04 Intr + 99983 100265 283 1 1 61 52 147 0.071 4.47 3.05 Intr + 100611 100888 278 1 2 124 29 70 0.112 1.01 3.06 Intr + 105526 105649 124 0 1 53 63 97 0.360 2.94 3.07 Intr + 114932 115035 104 0 2 65 50 80 0.094 0.87 3.08 Intr + 119297 119579 283 1 1 52 52 122 0.103 1.07 3.09 Intr + 119925 120087 163 1 1 102 48 57 0.191 1.21 3.10 Intr + 128141 128266 126 2 0 31 72 105 0.147 2.07 3.11 Intr + 128962 130400 1439 1 2 44 60 281 0.289 9.19 3.12 Intr + 137505 137615 111 1 0 69 68 73 0.375 2.83 3.13 Intr + 142068 142225 158 1 2 66 106 22 0.758 0.61 3.14 Intr + 144044 144113 70 1 1 31 110 100 0.664 4.34 3.15 Intr + 144554 144673 120 0 0 79 92 57 0.892 4.75 3.16 Intr + 146271 146347 77 1 2 63 65 64 0.193 -0.08 3.17 Term + 146675 146785 111 1 0 78 35 55 0.300 -3.22 3.18 PlyA + 147344 147349 6 1.05 4.00 Prom + 150609 150648 40 -2.55 4.01 Init + 151285 151370 86 0 2 59 108 41 0.907 3.54 4.02 Term + 152442 152625 184 0 1 81 54 143 0.956 6.23 4.03 PlyA + 154908 154913 6 1.05 5.00 Prom + 160971 161010 40 -5.15 5.01 Init + 179441 179528 88 1 1 85 71 118 0.816 10.65 5.02 Intr + 194935 195015 81 2 0 55 56 91 0.355 1.29 5.03 Intr + 195199 195419 221 2 2 78 30 121 0.535 2.40 5.04 Intr + 195918 196064 147 2 0 106 47 48 0.632 2.01 5.05 Intr + 196179 196392 214 2 1 27 80 130 0.386 3.47 5.06 Term + 198694 198887 194 2 2 81 42 89 0.492 0.20 5.07 PlyA + 200287 200292 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 28414 28043 372 1 0 36 38 201 0.830 5.77 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:98049386_98250315|GENSCAN_predicted_peptide_1|171_aa MARTCKFLQRQKEAKRLTHATAFIQVFTFMTLIVSYSYILSAILKKKSEKGRSKAFSTCS AHLLSVSLFYGTLFFMYLSGVEELLQVDLAMRKFWQIPPLSAPCSRRLYGGSWRDATTEC FLLVMMAYDRYVAICNPLLYPVMMSNKLSAQLLSISYVIGFLHPLVHGPYI >gi568815595f:98049386_98250315|GENSCAN_predicted_CDS_1|516_bp atggccaggacctgcaaattcctgcagcgacagaaagaagccaagagactcactcatgca acagcatttatacaagtcttcacttttatgactcttatcgtctcttactcctatattctc tctgccatcctgaaaaagaagtctgagaagggtagaagcaaagccttctctacttgcagt gcccatctgctctctgtctctttgttctacggcaccctcttcttcatgtatctctctgga gtagaagagctattgcaagtagatctggcaatgaggaaattttggcagattccacctctc tcagctccatgcagcagaagattatatggcggcagttggagagatgcaactacagaatgc ttcctcctggtgatgatggcctatgaccgctatgtagccatatgtaatcccttgctttat ccagtgatgatgtccaacaaactcagcgctcagttgctaagtatttcatatgtaattggt ttcctgcatcctctggttcatggaccctacatctaa >gi568815595f:98049386_98250315|GENSCAN_predicted_peptide_2|215_aa MGRNQTRKAENSKNQSASSPPKDRSSSPVREQNWMESVFDKLTEVGFRRSLITNFSELKK HVLTHRKEAKILEKSVGSSGQGNQARERNKEYSIRKRGSQNVSVCRDMIVYLENPIVSGQ NLLKLISNFSKVSGYKISVQKSQAFLYTNNRQTESQIMSQLPFTIATKRIKYLGIQLTRD VKDLFKENYKPLLNEIKQDMYKWKNIPCSWIESIS >gi568815595f:98049386_98250315|GENSCAN_predicted_CDS_2|648_bp atggggagaaatcagaccagaaaagctgaaaattccaaaaaccagagtgcttcttctcct ccaaaggaccgcagctcctcaccagtaagggaacaaaactggatggagagcgtgtttgac aagttgacagaagtaggcttcagaaggtcactaataacaaacttctctgagctaaagaag catgttctaacccatcgcaaggaagctaaaatccttgaaaaaagtgttggaagttctggc cagggcaatcaggcaagagaaagaaataaagagtattcaattaggaaaagaggaagtcaa aacgtctctgtttgcagagacatgattgtatatttagaaaaccccatcgtctcaggccaa aatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcagtgtgcaa aaatcacaagcattcctatacacaaacaatagacaaacagagagccaaatcatgagccaa ctcccattcacaattgctacaaagagaataaaatacctaggaatccaacttacaagggat gtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaacaggacatg tacaaatggaagaacattccatgctcatggatagaatcaatatcgtga >gi568815595f:98049386_98250315|GENSCAN_predicted_peptide_3|1333_aa MGQAQNKYLNSKREKRRNNRSQRACCEDMEEENATLLTEFVLTGFLYQPQWKIPLFLAFL VIYLITIMGNLGLIAVIWKDPHLHIPMYLLLGNLAFVDAWISSTVTPKMLNNFLAKSSIQ VFSIVTILVSYTFVLFAILKKKSDKGVRKAFSTCGAHLFSVSLYYGPLLFIYVGPASPQA DDQDMRTCSEDMEEENATLLTEFVLTGFLYQPQWKIPLFLAFLVIYLITIMGNLGLIAVI WKDPHLHIPMYLLLGNLAFVDALLSSSVTLKMLINFLAKSSIQVFTIGTVLISYIFVLYT ILKKKSVKGMRKAFSTCGAHLLSVSLYYGPLAFMYMGSASPQADDQDMMESLFYTVIVPL LNPMIYSLRNKQIITRLEIMAPDHHATGGHGILTFLTAPTDNIPIVKSKIGVQERSCSTA ATYFGQCLNMLYNHRYPGPRLLFCSQLIRTCSEDMEEENATLLTEFVLTGFLYQPQWKIP LFLAFLVIYLITIMGNLGLIAVIWKDPHLHIPMYLLLGNLAFVDAWISSTVTPKMLNNFL AKSSIQVFSIVTILISYTFVLFTVLEKKSDKGVRKAFSTCGAHLFSVCLYYGPLLLILNQ EEVEFLNRPTGFEIVAIINSLLTKKSPGPDGFTAEFYQRPKSPSVYKQLQQSLRIQNHVQ KSQAFLYTNNRQTESQIMSDLPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDT NKWKNIPCSWVGRTNIMKMAILPKVIYRFNAIPNKLPMAFFTELEKTTLKFIWSQKRAHI TKSVLSQKNKAGGSTLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTQPSEIMPHIYNHLI FDKPDKNKKWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKVNSRWIKDLHVRHKTIKT LEENLGNTIQDIGMGKDFMSKTPKATATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTKW EKIFATYSSDKGLISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCS SSLAIREMQIKTTMRYHLTPVRMTIIKMSGNNRCWRRCGEIGTLLHCWLDCKLVQPLWKS VWRFLRDLELEIPFDPAIPLLGIYPKDYKSCCYKDTCTRTDLAQEDSHHCKMAETKTQYC HAVTGHVPKDMKQDGDPCEIWCRDSDRGTSLGRSIPGPPALCYMRKIHLRPLVLRPTSPR NISPILNWQLKTEAARTPQKPPGPSQMLTVEAYFNRIKACYHSPATAWASKTYKLSLNSC ILPVQNRTSLTGSWSHSLQRLSKASDPIALGNAYADKGLFRPPPSPTHQDGGFAPAQDWQ IDFTHVPRVKKLK >gi568815595f:98049386_98250315|GENSCAN_predicted_CDS_3|4002_bp atgggccaagcacagaacaaatatttaaattccaaaagagagaaaagaaggaataacagg tctcaaagggcatgctgtgaggacatggaagaggaaaatgcaacattgctgacagagttt gttctcacaggatttttatatcaaccacagtggaaaatacccctgttcctggcattcttg gtaatatatctcatcaccatcatggggaatcttggtctgattgctgtcatctggaaagac cctcaccttcatatcccaatgtacttactccttgggaatttagcttttgtggatgcttgg atatcatccacagtgaccccaaagatgctgaataacttcttagctaagagttcaattcag gtattcagcattgtgactattcttgtatcttatacatttgttctcttcgcaatcttaaaa aagaaatctgataaaggtgtaaggaaagccttttccacctgtggagcccatctcttctct gtctctttatactatggaccccttctcttcatttatgtgggccctgcatctccgcaagca gatgatcaagatatgaggacatgcagtgaggacatggaagaggaaaatgcaacattgctg acagagtttgttctcacaggatttttatatcaaccacagtggaaaatacccctgttcctg gcattcttggtaatatatctcatcaccatcatggggaatcttggtctgattgctgtcatc tggaaagaccctcatcttcatatcccaatgtacttactccttgggaatttagcttttgtg gatgctttgttatcatcctcagtgactctgaagatgctgatcaacttcttagctaagagt tcaattcaagtttttaccatagggactgttcttatatcttacatatttgtcctctataca atcttgaaaaagaagtctgtcaaaggtatgagaaaagccttctccacctgtggagctcat ctcttatctgtatctttatactatgggcccctcgccttcatgtatatgggctctgcatcc ccacaggctgatgaccaagatatgatggagtctctattttacactgtcatagttccttta ttaaatcccatgatctacagcctgagaaacaagcaaattatcacaagattagaaattatg gctcctgatcatcatgcaactggaggccatggaattctaaccttcctaactgctcccaca gataatattcctattgtgaaatctaagattggtgttcaagaaaggagctgtagtactgca gccacctactttgggcagtgcctgaatatgttgtacaatcatcggtatccaggtccacgt cttctcttctgctctcaattgattaggacatgcagtgaggacatggaagaggaaaatgca acattgctgacagagtttgttctcacaggatttttatatcaaccacagtggaaaataccc ctgttcttggcattcttggtaatatatctcatcaccatcatggggaatcttggtctgatt gctgtcatctggaaagaccctcaccttcatatcccaatgtacttactccttgggaattta gcttttgtggatgcttggatatcatccacagtgaccccaaagatgctgaataacttctta gctaagagttcaattcaggtattcagcattgtgactattcttatatcttacacatttgtt ctcttcacagtcttagaaaagaaatctgataagggtgtaaggaaagccttttccacctgt ggagcccatctcttctctgtctgtttatactatggcccccttctcttaatactaaaccag gaagaagttgaatttctgaatagaccaacaggctttgaaattgtggcaataattaatagc ttgctaaccaaaaaaagtccaggaccagatggattcacagctgaattctaccagaggcca aaatctccttcagtttataagcaacttcagcaaagtctcaggatacaaaatcatgtacaa aaatcacaagcattcttatacaccaacaacagacaaacagagagccaaatcatgagtgac ctcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagggat gtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggataca aacaaatggaagaacattccatgctcatgggtaggaagaaccaatatcatgaaaatggcc atactgcccaaggtaatttatagattcaatgccatccccaacaagctaccaatggctttc ttcacagaattggaaaaaactactttaaagttcatatggagccaaaaaagagcccacatc accaagtctgtcctaagccaaaagaacaaagctggaggcagcacgctacctgacttcaaa ctatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatata gatcaatggaacagaacacagccctcagaaataatgccgcatatctacaaccatctgatc tttgacaaacctgacaaaaacaagaaatggggaaaggattccctatttaataaatggtgc tgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttacaccttat acaaaagttaattcaagatggattaaagacttacatgttagacataaaaccataaaaacc ctagaagaaaacctagggaataccattcaggacataggcatgggcaaggacttcatgtct aaaacaccaaaagcaacggcaacaaaagccaaaattgacaaatgggatctaattaaacta aagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctacaaaatgg gagaaaatttttgcaacctactcatctgacaaagggctaatatccagaatctacaatgaa ctcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcaaaggatatg aacagacacttctcaaaagaagacatttatgcagcaaaaaaacacatgaaaaaatgctca tcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctcacacca gttagaatgacgatcattaaaatgtcaggaaacaataggtgctggagaagatgtggagaa ataggaacacttttacactgttggttggactgtaaactagttcaaccattgtggaagtca gtgtggcgattccttagggatctagaactggaaataccatttgacccagccatcccatta ctgggaatatacccaaaggattataaatcatgctgctataaagacacatgcacacgaact gacttagcacaagaagacagccaccattgtaaaatggcggagactaaaacacagtattgc catgcggttacaggtcatgttcccaaagacatgaaacaagacggagacccatgtgaaatt tggtgccgtgactcagatcgggggacctcccttgggagatcaatccctggtcctcctgct ctttgctacatgagaaagatccacctacgacctctggtcctcagaccaaccagtccaagg aatatctcaccaattttaaattggcagctgaagactgaggctgcccgaacacctcagaag cctcctggaccatcacagatgcttacagtggaggcatactttaataggattaaagcctgt tatcactcgcctgctacagcatgggcttctaaaacctataaactttccttaaattcctgc attttacctgtccaaaaccggacaagtcttacaggaagctggagtcattcactacaaagg ctatcaaaggcatcagatcccattgctctaggcaacgcttatgctgataagggattgttc aggccccctccctcccctacacatcaagatgggggatttgcccctgcccaggactggcaa attgactttactcacgtgccccgagtcaagaaactaaaataa >gi568815595f:98049386_98250315|GENSCAN_predicted_peptide_4|89_aa MTVKKEFYRSMDGSFNRRIVHREDISVSKSYGLQSSACSSQDATVASGYTGLRSPGSSVS IDLRKIAPFGSLCGGSDPVTRLCLGVEII >gi568815595f:98049386_98250315|GENSCAN_predicted_CDS_4|270_bp atgacagtcaagaaggaattctataggtccatggatggtagcttcaacagaagaattgtg catagggaagacatttctgtatccaagtcctatggattgcagtccagtgcctgtagttct caggatgccactgtagctagtggctataccggtctgaggtctcctggcagctctgtttcc atagatctacggaaaattgccccatttgggagtctctgtggtggctctgaccctgtgaca cgtctctgcttgggtgttgagattatctga >gi568815595f:98049386_98250315|GENSCAN_predicted_peptide_5|314_aa MQPEATGFLLCPVTPMDIITIVKSEFGVQDTGNIWCRRPVTGELLRESSPLSSCSHQTKG DTFYPCIQKLRRRSRTWEDSLPLVSDHRGDACCGHSPTFPWWQVNRKDACFGCSPTLQPR TQSGVPTGSLAESGANSSAASTPPSYNPSITSPPHTESGLQFHSTTSSPQPAQQFPLREE FQYLTQSYSLTWSDLNVILTSTLSPYERERVHFLAQSYSDTCWLHEPGLQEGTRAVPRED LHWQYQTDSPATHYQMGAFQLCITDKPSIIIDKLKNIGSHYCLGRHLPCILGYRPAAHLS LLLAPPLACLYPDP >gi568815595f:98049386_98250315|GENSCAN_predicted_CDS_5|945_bp atgcagccagaagccacaggattcctactctgcccagttactcctatggatatcattact attgtcaaatccgagtttggtgttcaagacacgggtaacatttggtgtcgaagacccgtg acaggggaactcctacgggagagcagtcccctgtcctcgtgctcccaccagacaaaagga gacacattttatccgtgcattcaaaaactccgacgtcggtcacggacatgggaagacagt cttcccttggtgtctgatcaccgcggtgacgcctgctgtggtcattcacccaccttccct tggtggcaggtcaatcgcaaggacgcctgctttggctgctcacccacattacagcccagg actcagtcaggggtgcctactggaagcctggctgaatcaggtgccaattcttccgcagcc tccactccaccatcctataacccttctattacctcccctcctcacaccgagtctggctta cagtttcattccacgactagctccccgcaacctgcccaacaatttcctcttagagaggaa ttccaatatctaactcagtcctacagtttaacctggagtgacttaaatgtcatcctgacc tctaccctctccccgtatgaacgagaaagagttcattttctagcccagtcctactcagac acctgctggcttcatgagccaggcctccaagagggcaccagggcagttccccgagaggat ctccattggcaataccagacggactcaccagccacacattaccagatgggagcattccaa ctttgcattactgataagccctctatcattattgacaaactaaaaaacattggcagtcac tactgtttaggaagacacctaccgtgcatccttggctaccgtcccgctgctcatctgtct ctcctcctggcccctcctcttgcttgcttatacccagacccatga