GENSCAN 1.0 Date run: 7-Nov-116 Time: 19:50:43 Sequence gi568815596f:156376106_156682915 : 306810 bp : 38.02% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 305 300 6 1.05 1.01 Sngl - 24455 24000 456 2 0 56 42 247 0.567 10.90 1.00 Prom - 54117 54078 40 -4.45 2.04 PlyA - 55299 55294 6 1.05 2.03 Term - 59910 59625 286 2 1 10 47 206 0.857 2.59 2.02 Intr - 60807 60478 330 2 0 104 40 175 0.828 8.12 2.01 Init - 68170 68103 68 1 2 32 90 59 0.641 1.10 2.00 Prom - 68721 68682 40 -5.35 3.00 Prom + 78815 78854 40 -3.75 3.01 Init + 100001 100102 102 1 0 53 108 60 0.127 4.79 3.02 Intr + 119939 120110 172 1 1 42 107 244 0.526 20.29 3.03 Intr + 131889 132014 126 1 0 94 89 3 0.444 0.73 3.04 Intr + 134691 134815 125 1 2 66 91 121 0.962 9.58 3.05 Intr + 136115 136212 98 1 2 102 82 38 0.865 2.49 3.06 Intr + 137228 137391 164 2 2 65 92 86 0.728 5.40 3.07 Term + 138468 138481 14 1 2 104 46 -15 0.157 -6.71 3.08 PlyA + 140211 140216 6 1.05 4.03 PlyA - 141049 141044 6 1.05 4.02 Term - 150935 150914 22 1 1 120 42 11 0.296 -3.49 4.01 Init - 154586 152992 1595 2 2 42 53 357 0.411 19.54 4.00 Prom - 155133 155094 40 -6.15 5.00 Prom + 155377 155416 40 -7.65 5.01 Init + 157703 157759 57 1 0 102 84 31 0.565 5.51 5.02 Intr + 164158 164263 106 0 1 89 37 102 0.186 4.07 5.03 Intr + 173503 173667 165 2 0 120 85 185 0.714 20.41 5.04 Intr + 174497 174641 145 0 1 98 91 219 0.981 21.62 5.05 Intr + 181284 181477 194 0 2 73 88 154 0.953 12.11 5.06 Intr + 192720 192854 135 1 0 71 80 150 0.993 12.12 5.07 Intr + 193258 193433 176 2 2 110 105 156 0.999 18.24 5.08 Intr + 193982 194113 132 1 0 59 87 158 0.988 12.72 5.09 Intr + 195029 195187 159 1 0 72 67 143 0.997 9.86 5.10 Intr + 202784 202896 113 1 2 87 80 67 0.995 4.06 5.11 Intr + 202981 203059 79 1 1 89 76 106 0.999 8.13 5.12 Intr + 203585 203683 99 1 0 77 109 81 0.992 8.49 5.13 Intr + 206688 206809 122 2 2 67 68 80 0.188 2.27 5.14 Intr + 241055 241125 71 2 2 65 101 78 0.164 4.71 5.15 Intr + 241336 241398 63 2 0 46 44 126 0.469 1.77 5.16 Intr + 241844 242119 276 0 0 -4 77 173 0.244 3.57 5.17 Term + 243195 243379 185 1 2 63 43 139 0.156 3.62 5.18 PlyA + 245141 245146 6 1.05 6.04 PlyA - 247096 247091 6 1.05 6.03 Term - 250551 250382 170 1 2 -28 48 192 0.023 0.66 6.02 Intr - 254318 254219 100 2 1 58 53 76 0.003 -0.14 6.01 Init - 269152 269036 117 1 0 54 101 89 0.725 7.09 6.00 Prom - 270372 270333 40 -2.75 7.04 PlyA - 270526 270521 6 1.05 7.03 Term - 272464 272431 34 0 1 92 47 62 0.036 -1.42 7.02 Intr - 288011 287775 237 1 0 43 115 81 0.399 2.11 7.01 Init - 290140 289791 350 1 2 67 99 102 0.519 5.90 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 92424 92242 183 0 0 63 47 162 0.819 6.16 S.002 Intr + 173243 173321 79 1 1 72 76 55 0.847 0.91 S.003 Term - 273986 273802 185 0 2 16 50 179 0.894 3.62 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:156376106_156682915|GENSCAN_predicted_peptide_1|151_aa MKPHSRALLLDVGSLGSWRDPVPCAATAAGLPGAQCLPELAPPQAYCAGSADPGGRGVSG LWLCCRVEPRSKPRARAQRSGRAASVASGAHGDWFMEQKGGDLLFAHENVMDLPEHSSPP KINAHLRKTTCSHQIPIPCLCKGYSQITPCR >gi568815596f:156376106_156682915|GENSCAN_predicted_CDS_1|456_bp atgaagcctcactccagagcacttctgctggacgtcgggagcctgggatcctggagggat ccggtgccctgcgcggccacagcggcgggactcccgggtgcccaatgcctgccggagctg gcacctccccaggcttattgcgcaggcagcgcagatccaggaggccgtggtgtgtccggc ctctggctttgctgccgggtagaaccccggtccaaacctcgagcccgggcccagcgctcg ggccgcgccgcgagcgtcgcctcgggtgctcatggagactggttcatggagcagaaaggt ggagaccttctctttgcacacgaaaatgttatggatttacctgaacactcttcacccccg aaaataaatgcacatctcaggaaaaccacgtgttcacaccaaatccccataccttgctta tgtaagggttacagccagataaccccttgcagataa >gi568815596f:156376106_156682915|GENSCAN_predicted_peptide_2|227_aa MWQPLDVNAIPGEAESKETPNSRTVTRQQLSFSKKPQPSSRLVLTLASEGSNNPYQTSRI YKLKVCMNFQWGTNPRPKSPATLLFYKCRLQNINNETLEKSESIFCWWEEILNLSRFPPP PTAKTGVPEHSPGSCFVPRRCRTPDLRPALPAGWRTGKPELGQAGEDGGGVRFPVPSPMR AALAICLRERRHSSAGTLAGLAYLTTSRCPPWDLASPTLYLPAGRLP >gi568815596f:156376106_156682915|GENSCAN_predicted_CDS_2|684_bp atgtggcagcctctagatgtgaatgctatacctggagaggctgagagcaaggaaacccca aactccaggacggtcacacgccagcagctgagtttctcaaagaagcctcagccctcctcc agactcgttctcactcttgcttcggaagggagcaacaatccttatcagactagccgcatc tacaagctaaaagtttgcatgaactttcaatggggtaccaaccctcgccccaaaagtcct gccactttactattttacaaatgccgcctccaaaacataaacaacgaaaccctagagaag agcgaatccatcttctgctggtgggaagaaattctaaatctttcccgtttcccgccacca cctaccgccaagactggagtcccagaacactcgcccggttcctgctttgtgcctcggcga tgccggactccagacctccgccccgccttgccagcgggctggcggactggaaagcctgag ctaggccaggctggggaagacggtggcggtgtccgatttcctgtgccctcgcccatgcgg gccgccctggccatctgcctgcgggagaggagacactcgtccgccggcacgctggcgggg ctggcttacctgacgacttctcgttgtccgccctgggatctggcttctcctactctgtac ctcccggccggcaggcttccctag >gi568815596f:156376106_156682915|GENSCAN_predicted_peptide_3|266_aa MAFQKAVKGTILVGGGALATVLGLSQFAHYRRKQMNLAYVKAADCISEPVNREPPSREAQ LLTLQNTSEFDILVIGGGATGSGCALDAVTRGTRKTPISPILAIVNLHTGNCVTTKLSPK LGGVKQQLSSASFRLKTALVERDDFSSGTSSRSTKLIHGGVRYLQKAIMKLDIEQYRMVK EALHERANLLEIAPHLSAPLPIMLPVYKWWQLPYYWVGIKLYDLVAGSNCLKSSYVLSKS RALEHFPMLQKDKLVGAIVYYDVLHF >gi568815596f:156376106_156682915|GENSCAN_predicted_CDS_3|801_bp atggcatttcaaaaggcagtgaaagggacgattcttgttggaggaggtgctcttgcaact gttttaggactttctcagtttgctcattacagaaggaaacaaatgaacctggcctatgtt aaagcagcagactgcatttcagaaccagttaacagggagcctccttccagagaagctcag ctactgactttgcaaaacacatctgaatttgatatccttgttattggaggaggagcaaca ggaagtggctgtgcgctagatgctgtcaccagaggtacaaggaaaactcccatctctccc atcttggctatagttaatcttcacacagggaattgtgtcactacaaaactatccccaaaa cttggtggcgtaaaacaacaattatcatcagccagtttcagactaaaaacagcccttgta gaaagagatgatttctcatcagggaccagcagcagaagcactaaattgatccatggtggt gtgagatatctgcagaaggccatcatgaagttggatattgagcagtataggatggtaaaa gaagcccttcatgagcgtgccaacctgctagaaattgctccccatttatcagctccattg cctataatgcttccagtttacaagtggtggcagttaccttactactgggtaggaatcaag ctgtatgatttggttgcaggaagcaattgcctaaaaagcagttatgtcctcagcaaatca agagcccttgaacatttcccaatgctccagaaggacaaactggtaggagcaattgtctac tatgacgtgttacatttctga >gi568815596f:156376106_156682915|GENSCAN_predicted_peptide_4|538_aa MIISIDAEKAFDKIQQRFMLKTLNKLGIDGTYFKIIRAIHDKPTANIILNGQKLEAFPLK TGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYPENPIVS AQNLLKLISKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDV KDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPITFF TELENSTLKFIWNQKRAHIAKAILSQKNKAGGITLPDFKLYYKATVTKTAWYWYENRDID QWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTAYT KINSRWIKDLNVRIKTIKTLEENLGITIQDIGMGKDFMSKTPKAMATKDKIDKWDLIKLK SFCTAKETTIRVNRQPTKWEKIFATYSSDKGLISRIYNELKQIYKKKTNNPIKKWAKEMN RHFSKEDIYAAKKHMKKCSPSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRYLGKDS >gi568815596f:156376106_156682915|GENSCAN_predicted_CDS_4|1617_bp atgattatctcaatagatgcagaaaaggcctttgacaaaattcaacaacgcttcatgcta aaaactctcaataaattaggtattgatggaacgtatttcaaaataataagagctatccat gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa actggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagtt ctggccagggcaattagacaggagaaagaaataaagggtattcagttaggaaaagaggaa gtcaaattgtccctgtttgcagatgacatgatcgtatatccagaaaaccccattgtctca gcccaaaatctccttaagctgataagcaaagtctcaggatacaaaatcaatgtacaaaaa tcacaagcattcttatacaccaacaacagacaaacagagagccaaatcatgagtgaactc ccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagggatgtg aaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggatacaaac aaatggaagaacattccatgctcatgggtaggaagaatcaatatagtgaaaatggccata ctgcccaaggtaatttacagattcaatgccatccccatcaagctaccaatcactttcttc acagaattggaaaatagtactttaaagttcatatggaaccaaaaaagagcccacatcgcc aaggcaatcctaagccaaaagaacaaagctggaggcatcacgctacctgacttcaaacta tactacaaggctacagtaaccaaaacagcatggtactggtacgaaaacagagatatagat caatggaacagaacagagccctcagaaataacgccacatatctacaactatctgatcttt gacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgg gaaaactggctagccatatgtagaaagctgaaactggatcccttccttacagcttataca aaaatcaattcaagatggattaaagacttaaacgttagaattaaaaccataaaaacccta gaagaaaacctaggcattaccattcaggacataggcatgggcaaggacttcatgtctaaa acaccaaaagcaatggcaacaaaagacaaaattgacaaatgggatctaattaaactaaag agcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctacaaaatgggag aaaattttcgcaacctactcatctgacaaagggctaatatccagaatctacaatgaactc aaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgaaggaaatgaac agacacttctcaaaagaagacatttatgcagccaaaaaacatatgaaaaaatgctcacca tcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctcacaccagtt agaatggcaatcattaaaaagtcaggaaacaacaggtatctgggcaaagactcctga >gi568815596f:156376106_156682915|GENSCAN_predicted_peptide_5|758_aa MLKPLMGGGAHRRAAAGARKWDGTAIIASTSGLEVLWVTTSRSKTVESVGLAKPGQHNDA RMNLAIALTAARYGAATANYMEVVSLLKKTDPQTGKVRVSGARCKDVLTGQEFDVRAKCV INATGPFTDSVRKMDDKDAAAICQPSAGVHIVMPGYYSPESMGLLDPATSDGRVIFFLPW QKMTIAGTTDTPTDVTHHPIPSEEDINFILNEVRNYLSCDVEVRRGDVLAAWSGIRPLVT DPKSADTQSISRNHVVDISESGLITIAGGKWTTYRSMAEDTINAAVKTHNLKAGPSRTVG LFLQGGKDWSPTLYIRLVQDYGLESEVAQHLAATYGDKAFEVAKMASVTGKRWPIVGVRL VSEFPYIEAEVKYGIKEYACTAVDMISRRTRLAFLNVQAAEEALPRIVELMGRELNWDDY KKQEQLETARKFLYYEMGYKSRSEQLTDRSEISLLPSDIDRYKKRFHKFDADQKGFITIV DVQRVLESINVQMDENTLHEILNEVDLNKNGQVELNEFLQLMSAIQKGRVSGSRLAILMK TAEENLDRRVPIPVDRSCGGLYFCVRNWWVLGLTDFKNEAADPSGVKLQTFTVSVTARKG SVDPKNSGAQLASPSGSRTGAAGGAACQSRALRLHSSALGWSMGLGAVEHGAVLIGEARA AQEPMEGVGGSGMAGCRSQVLPRGKAAKTPRNHLRFLGAASSQLLKLKMLTSAKIYFFIP VSSAKIDYTIQGFSLNANTQGLCTDLADTSSVHAALEV >gi568815596f:156376106_156682915|GENSCAN_predicted_CDS_5|2277_bp atgctcaaaccccttatgggagggggagcacacagaagggcagctgcaggagccaggaaa tgggatggtactgctatcattgcatctacctctggattagaggttctttgggtcactacc tctagatcgaagactgttgagagcgtcggactggcaaagccaggacaacataacgatgca cggatgaaccttgccattgctctgactgctgccaggtatggggctgccacagccaattac atggaggtagtgagcttgctcaagaagacagacccccagacagggaaagtgcgtgtgagc ggcgcacggtgcaaggatgtcctcacagggcaggaatttgacgtgagagccaaatgtgtt atcaatgccacgggacctttcacggactctgtgcgcaaaatggatgataaagacgcagca gctatctgccagccaagtgctggtgtccatattgtgatgcctggttattacagcccagag agcatgggacttcttgacccagcgaccagtgatgggcgagttattttcttcttaccctgg caaaagatgacgatcgctggcactactgatactccaactgatgttacacaccatccaatt ccttcagaagaagatatcaacttcattttgaatgaagtgcgtaattacctgagttgtgat gttgaagtgagaagaggggatgtcctggcagcatggagtggaatccgtcctcttgttaca gaccccaaatctgcagatactcagtctatctcccgaaatcatgttgttgatatcagtgag agtggccttattactatagcaggtggaaagtggacaacttatcggtctatggcagaagat accataaatgctgctgtcaaaactcataatttaaaagcaggaccaagtagaacagttggg cttttccttcaagggggtaaagattggagccccacactctacattaggcttgtgcaggat tatggacttgaaagcgaggtggcacagcatcttgccgccacctatggtgataaggccttt gaggtggccaaaatggcaagtgtgactggcaaaaggtggcctattgttggagtacgtctt gtgtcagaatttccatatattgaagcagaggtgaaatatgggattaaggagtatgcctgc actgctgtggatatgatttcacgtcgtactcgcctggcctttctaaatgtccaggcagca gaggaagccctacccaggattgttgaactgatgggcagggaactgaattgggatgattat aagaagcaggaacaacttgaaacagccaggaagtttctatattatgaaatgggctataaa tctcgatcagaacagttaacagatcgctctgaaattagcctactgccttcagacattgac aggtataagaagagatttcataagtttgatgcagaccagaaaggctttattaccattgtt gatgttcagcgtgtattagagagtatcaatgtccaaatggatgaaaatacactccatgaa attctaaatgaagttgatttgaataaaaatggacaggttgaactcaatgaatttttgcag ctgatgagtgctattcaaaaaggaagggtatctggaagccggcttgctatactaatgaaa actgcagaagagaacctcgacagaagagttccaattccagtggaccgtagttgtggagga ttgtatttctgtgtccggaattggtgggttcttggtctcactgacttcaagaatgaagcc gcggaccctagcggagtgaagctgcagaccttcacggtgagtgttacagctcgtaaaggc agtgtggacccaaagaactcaggagcccagttggcttcacccagtggatcccgcactggg gctgcaggtggagctgcctgccagtcccgcgccctgcgcctgcactcctcagcccttggg tggtcgatgggactgggcgccgtggagcacggggcagtgctcatcggggaggctcgggcg gcacaggagcccatggagggggtgggaggctcaggcatggcgggctgcaggtcccaagtc ctgccccgcgggaaggcagctaagacaccgagaaatcacctaagatttcttggagctgcc tcctctcagctcctcaaactaaaaatgcttacatcggccaaaatctatttcttcatccca gttagttcagcaaagatagactatactattcaaggcttcagtctcaatgccaatacacaa gggctctgcacagaccttgcagatacctcatctgtgcacgcagctttagaggtatga >gi568815596f:156376106_156682915|GENSCAN_predicted_peptide_6|128_aa MGSRKRCLAGSGAQRTSCQIRRDGSQWWVCDGGKQQWWMLLGFTNSSISYPYYGLTAEAQ QQETQRLTESNYRSCDKVICGFGFRVALAEPQELKTERWGKVMHKETSAIVTVEINVKSS VKATGHIP >gi568815596f:156376106_156682915|GENSCAN_predicted_CDS_6|387_bp atggggagccgcaaaaggtgcctcgctggatcaggagcacagaggacatcctgccagatc cggagggatgggagtcagtggtgggtctgcgatggtggcaaacagcagtggtggatgtta ctgggttttactaattcttctatttcttacccctattatggtctaacagcagaagctcaa caacaagaaacacaacgactgactgaatcaaattacagatcatgtgacaaggtgatatgt ggctttggcttcagagtggctttagcagaaccccaggaattgaaaactgagagatggggc aaggttatgcacaaagaaacatcagccatcgtgactgtggagatcaatgtgaaaagcagt gtcaaggccacaggtcacattccttga >gi568815596f:156376106_156682915|GENSCAN_predicted_peptide_7|206_aa MSKHHKIQKIKNKINATCLSEWLKFKTLTVSNAGKTVVQQESIHQWKRKILQPLWKIIWQ FLIKLNISLSYEPPIKLPGIYLNELKICVYTKTCIRIYNNLIQNCQKLEATKMSFDSKEG CPKSPLGMIKWEWEGSLYQDSPCSQKTLSAELSLLTKPKRKVKALFLFTHPSNTDVAIQL LNKPVVLHICTVLIGRNCIPKAPVDA >gi568815596f:156376106_156682915|GENSCAN_predicted_CDS_7|621_bp atgtcaaagcatcataaaatacagaaaataaaaaacaaaattaatgcaacatgcctatca gaatggctaaaattcaaaacactgacagtatcaaatgctggcaagactgtggtgcaacag gaatctattcatcagtggaaacgcaaaattttacagccactttggaagataatttggcag tttcttataaagctaaacataagcttatcttatgaaccaccaatcaagctgccaggtatt tacctaaatgagttgaaaatttgtgtttatacaaaaacctgcatacgaatttataataac cttattcaaaattgccaaaaactggaagcaaccaagatgtccttcgacagcaaggaaggc tgtcccaaatctccacttggaatgatcaaatgggaatgggagggttctctctatcaggac tccccttgttctcagaaaacactatcagctgagctgtcattactaacaaaacccaaaagg aaagtgaaagctttgtttctctttacccacccctccaacactgacgtagctattcagttg ttaaataaacctgttgtgttgcacatctgcactgtgctgataggaaggaactgcattcca aaagccccagtggatgcctga