GENSCAN 1.0 Date run: 3-Nov-116 Time: 23:10:49 Sequence gi568815586f:32476358_32740530 : 264173 bp : 40.30% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4647 4712 66 2 0 83 89 56 0.234 4.39 1.02 Intr + 17295 17389 95 2 2 85 97 32 0.091 1.64 1.03 Term + 21610 21751 142 1 1 80 32 84 0.065 -1.58 1.04 PlyA + 22155 22160 6 1.05 2.07 PlyA - 23089 23084 6 1.05 2.06 Term - 27971 27766 206 0 2 122 48 65 0.447 2.45 2.05 Intr - 28066 27976 91 1 1 40 55 47 0.459 -4.75 2.04 Intr - 29154 29022 133 2 1 86 75 123 0.896 10.53 2.03 Intr - 32489 32405 85 0 1 79 37 39 0.436 -4.14 2.02 Intr - 36303 36181 123 1 0 74 67 84 0.589 4.54 2.01 Init - 38949 38853 97 0 1 74 31 94 0.312 2.82 2.00 Prom - 39211 39172 40 -6.75 3.04 PlyA - 40332 40327 6 1.05 3.03 Term - 46068 45890 179 1 2 80 39 146 0.967 5.77 3.02 Intr - 49686 49532 155 2 2 73 97 90 0.840 7.19 3.01 Init - 50506 50202 305 1 2 60 54 211 0.764 11.48 3.00 Prom - 53639 53600 40 -6.55 4.00 Prom + 55022 55061 40 -8.85 4.01 Init + 58900 59035 136 0 1 53 53 111 0.472 4.45 4.02 Intr + 87780 87932 153 1 0 97 91 134 0.011 13.72 4.03 Intr + 99909 100092 184 1 1 82 110 128 0.261 12.32 4.04 Intr + 105603 106110 508 0 1 85 75 412 0.042 31.65 4.05 Intr + 122140 122229 90 0 0 73 110 85 0.994 8.47 4.06 Intr + 124921 125066 146 0 2 81 95 59 0.992 4.06 4.07 Intr + 125804 125960 157 2 1 31 90 157 0.924 9.29 4.08 Intr + 131600 131738 139 1 1 78 33 141 0.958 6.72 4.09 Intr + 134419 134477 59 2 2 76 115 6 0.867 -0.22 4.10 Intr + 134780 134926 147 1 0 85 64 157 0.999 12.51 4.11 Intr + 143341 143513 173 0 2 105 110 139 0.394 15.62 4.12 Intr + 148065 148095 31 0 1 63 121 35 0.879 1.51 4.13 Intr + 148619 148711 93 1 0 58 65 92 0.894 3.14 4.14 Intr + 149297 149422 126 1 0 115 92 116 0.943 14.66 4.15 Intr + 157192 157332 141 0 0 77 94 38 0.732 2.93 4.16 Intr + 162298 162438 141 0 0 96 70 57 0.936 4.33 4.17 Intr + 163919 164096 178 1 1 105 94 189 0.094 19.77 4.18 Intr + 169984 170066 83 2 2 53 80 48 0.023 -1.06 4.19 Term + 171110 171232 123 1 0 61 43 124 0.049 2.60 4.20 PlyA + 173811 173816 6 1.05 5.00 Prom + 174158 174197 40 -5.15 5.01 Init + 178541 178613 73 1 1 52 47 127 0.220 6.28 5.02 Intr + 183721 183773 53 2 2 75 115 32 0.222 2.31 5.03 Intr + 202845 203108 264 2 0 105 94 169 0.165 15.99 5.04 Intr + 225058 225205 148 0 1 91 54 112 0.924 6.99 5.05 Intr + 231010 231056 47 2 2 93 84 30 0.989 0.31 5.06 Intr + 231796 231867 72 0 0 21 102 139 0.990 7.28 5.07 Intr + 234572 234658 87 1 0 68 116 55 0.951 5.55 5.08 Intr + 236852 237014 163 1 1 73 91 113 0.994 8.73 5.09 Intr + 242286 242406 121 1 1 86 71 145 0.999 11.23 5.10 Intr + 244307 244438 132 2 0 67 91 116 0.995 8.64 5.11 Term + 246070 246280 211 1 1 102 42 162 0.955 8.78 5.12 PlyA + 246651 246656 6 1.05 6.03 PlyA - 246813 246808 6 1.05 6.02 Term - 250990 250356 635 2 2 -16 50 611 0.250 40.36 6.01 Init - 251640 251481 160 0 1 68 66 58 0.594 1.63 6.00 Prom - 252090 252051 40 -11.04 7.00 Prom + 253557 253596 40 -8.95 7.01 Init + 254568 254777 210 2 0 48 74 -1 0.476 -6.67 7.02 Intr + 254999 255154 156 1 0 47 97 127 0.923 8.79 7.03 Intr + 255497 255586 90 1 0 57 89 74 0.922 3.67 7.04 Intr + 257358 257450 93 2 0 84 84 63 0.957 4.74 7.05 Intr + 260748 260804 57 2 0 102 93 30 0.873 3.06 7.06 Intr + 263707 263883 177 0 0 38 91 188 0.927 13.29 7.07 Intr + 264052 264161 110 0 2 63 92 127 0.997 8.76 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 89837 89709 129 2 0 89 37 136 0.896 8.80 S.002 Sngl - 112278 112114 165 0 0 77 38 160 0.910 4.33 S.003 Term + 163919 164176 258 1 0 105 43 285 0.904 20.27 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:32476358_32740530|GENSCAN_predicted_peptide_1|100_aa MGSPYVARAGLELLGSSNPRLAQRFNEHLLHARHWRFLGKQDKAPCPYGFYLLRAEETFG DALSQHPPFVTEEHRTRQSKWLACVIGLAKQELESRFLNL >gi568815586f:32476358_32740530|GENSCAN_predicted_CDS_1|303_bp atggggtctccctatgttgcccgggctggccttgaactcctgggctcaagcaatccccgg cttgcgcaaagatttaatgagcacctactgcatgccagacactggagatttcttggtaaa caagacaaagccccctgtccttatggattttaccttttaagagctgaagagacctttgga gatgctctaagccagcatcctccttttgtgactgaagagcacaggaccagacagagtaaa tggcttgcctgtgtcatagggctggctaaacaggaactagagtctaggtttctgaattta taa >gi568815586f:32476358_32740530|GENSCAN_predicted_peptide_2|244_aa MEQEPQDLSRSCGILLIGVGPFPEAGTVELEQGRQAIGFTGCSPTWVFPRDETTTLTIFC DKVAVTKPDRGLALGIRQTEIMNLLLLQFLLVHMSHLVTLKVNSRAVIKTEDDDNSEETA ADTQINWHTYREKACQMKPALSYSFCVSLLTVLCMEVDISPGPSCAGQSYDHALLSRTLV GITDAPGTVPAAHSSPISVKGYPQEPLSFLLVVDWTKLFKKHFEKPVVLLLITLEIWSMQ HAYT >gi568815586f:32476358_32740530|GENSCAN_predicted_CDS_2|735_bp atggagcaagaaccacaagatttgagcaggagctgtggaattctccttattggagtgggg ccctttccagaagcagggacagtggagctggagcaaggcaggcaagctattggatttact ggctgctcacccacctgggtctttccaagagatgaaaccactaccctgacaattttctgt gacaaggtggcagtgacaaaacctgataggggcttggcgttgggaataagacagacagaa atcatgaacctgctgctcctgcaattcctcttggttcacatgagtcacttggtgactctc aaagtaaactcaagagcagttataaaaacagaagacgatgacaactcagaagaaacagct gccgatactcaaatcaattggcacacatatagagaaaaagcctgccaaatgaaaccagct ttgagctatagcttttgtgtctcactgctgacagtgttgtgcatggaagtggacatctca ccaggtccttcatgtgcaggacagtcttatgaccatgcactcctgtcccgtaccctagtt gggatcacagatgctccagggactgtccctgctgcccactcttcacctatttctgtgaag ggttaccctcaagaaccactaagtttcttgttagtggtggattggacaaagctttttaaa aaacattttgaaaaacctgtggttttattacttattactctggaaatttggtcaatgcaa cacgcttacacttga >gi568815586f:32476358_32740530|GENSCAN_predicted_peptide_3|212_aa MKPRTLAVSVTAFKAGHLELFVPPSGLVVLLGSGVKLQIFAVSVTAHKSSMDPKSEQQQD LSQIAKEQSSHSVEGDLSRLPMLARAACFYSLIWPHPHPADCPCVVDGTGRRGAGGGARR RLGPHRSPWSGWEAQAWRAVGPEPCPVGRQLRPDPNAQLAGVTAATWAFCVGPNILKGQI QNVERKDTLNITRIFLSFANHPLYVSTPTHNR >gi568815586f:32476358_32740530|GENSCAN_predicted_CDS_3|639_bp atgaagccacggacccttgcggtgagtgttacagcttttaaggcggggcatctggagttg ttcgttcctcccagtgggctcgtggtcttgctgggctcaggagtgaagctgcagatcttc gcggtgagtgttacagctcataaaagcagcatggacccaaagagtgagcagcagcaagat ttatcgcaaatagcgaaagaacaaagctcccacagtgtggaaggggacctgagcaggttg ccaatgctggctcgggcagcctgcttttattctcttatctggccccacccacatcctgct gattgcccttgcgtggtcgatgggactgggcgccgtggagcagggggtggtgctcgtcgg aggctcgggccgcacaggagcccatggagtgggtgggaggctcaggcatggcgggctgta ggtcccgagccctgccccgtgggaaggcagctaaggcccgaccctaacgcacagcttgct ggggtaacagcagcaacttgggccttttgtgttggcccgaatattcttaaaggccaaatt cagaatgtggaaagaaaagataccctaaatataacgaggatcttcctgtcttttgcaaat catccactctacgtatccacccccacccacaacagataa >gi568815586f:32476358_32740530|GENSCAN_predicted_peptide_4|935_aa MSTLLNKINYNLEPAAFTVTSILSDGGVALSFLKCQGHTLSVEAQGSSTCPKIALVPPCS TSSTTTLVGENVSEEEAQGINGNRPAKHSAASPKPQVPPKPLHLQNSPSSNIHQTPRHKA LPSAKPRMEEIKPASASCVSKEKPSKVSDLISRFEGGSSLSNYSDLKKESAVNLNAPRTP GRHGLTTTPQQKLLSQHLPQRQGNDTDKTQGAQTCVANGVMAAQNQMECEEEKAATLSSD TSIQASEPLLDTHIVNGERDETATAPASPTTDSCDGNASDSSYRTPGIGPVLPLEERGAE TETKVQERENGESPLELEQLDQHHEMKETNEQKLHKIANELLLTERAYVNRLDLLDQVFY CKLLEEANRGSFPAEMVNKIFSNISSINAFHSKFLLPELEKRMQEWETTPRIGDILQKLA PFLKMYGEYVKGFDNAMELVKNMTERIPQFKSVVEEIQKQKICGSLTLQHHMLEPVQRIP RYEMLLKDYLRKLPPDSLDWNDAKKSLEIISTAASHSNSAIRKMENLKKLLEIYEMLGEE EDIVNPSNELIKEGQILKLAARNTSAQERYLFLFNNMLLYCVPKFSLVGSKFTVRTRVGI DGMKIVETQNEEYPHTFQVSGKERTLELQASSAQDKEEWIKALQETIDAFHQRHETFRNA IAKDNDIHSEVSTAELGKRAPRWIRDNEVTMCMKCKEPFNALTRRRHHCRACGYVVCWKC SDYKAQLEYDGGKLSKVCKDCYQIISGFTDSEEKKRKGILEIESAEVSGNSVVCSFLQYM EKSKPWQKAWCVIPKQDPLVLYMYGAPQDVRAQATIPLLGYVVDEMPRSADLPHSFKLTQ SKSVHSFAADSEELKQKWLKVILLAVTVCGVCLHSRGLSSRPTVSMGDWFQDHSQTDIVA CDAVLTSSRGVLLLLKPSLEYQAYSGKDLVSETID >gi568815586f:32476358_32740530|GENSCAN_predicted_CDS_4|2808_bp atgtccacactgctgaacaagataaactacaacttagaaccggcagccttcacagttacc agtattctttctgatggtggtgtggctctttccttccttaagtgtcaggggcatacgctg agtgtggaagcccaaggcagcagtacctgtccaaagatcgctttagttccaccttgctcc acaagcagcacaaccacactggttggtgagaatgtatctgaagaagaggctcagggaata aatgggaacaggccagcaaaacactcagctgcaagtccaaagccacaagtgcctccaaag ccattacacctgcagaattcaccttcgtccaatatacaccaaacccccaggcataaagct ttacctagtgcaaaaccaaggatggaggaaattaaacctgcctctgcttcttgtgtctca aaagaaaaacccagtaaggtatcagatctcatcagtcgctttgaaggaggcagctcatta tcaaattatagtgatttgaagaaagagtctgctgtgaacctaaatgctcctagaacccca ggaaggcatggattgacaaccacacctcaacaaaaactcctctcccagcacttgccacag aggcagggaaatgatacagataagactcagggtgcacagacttgtgtggccaacggtgta atggcagcacaaaaccagatggaatgtgaggaggagaaagctgccactcttagctcagat acttctattcaagcttctgaacccttgcttgatacgcacatagtgaatggagaaagagat gaaactgccacagctcctgcatcacccacaacagacagctgtgatggaaatgcttctgac agtagctacaggactccaggcataggcccagtgctccccctagaagaaagaggggcagaa acagaaaccaaggtacaagagagggaaaatggggaaagccctctggaactggagcagctg gaccagcaccatgagatgaaggagactaatgagcaaaaacttcacaaaatagccaatgaa cttttgcttactgaaagagcttatgtcaaccgacttgacctcttagatcaggtattttat tgcaaactgttggaagaagcaaaccgaggctcgtttccagcagagatggtgaataaaatc ttttctaatatttcatcaataaatgccttccatagtaaattcctcttgccagagctggag aaacgaatgcaagaatgggaaactactcctagaattggagacatccttcagaaattggca ccattccttaagatgtatggagaatatgtgaaaggatttgataatgcaatggaattggtt aaaaacatgacagaacgtattccccagttcaaatcagtggttgaagaaattcagaaacag aaaatctgtgggagcttaactttgcagcatcacatgctagaacctgttcagcggattccc cggtatgagatgctccttaaggactatctaaggaaattgcctcctgattccctggactgg aatgatgctaaaaaatcacttgaaattatatctacagcagcaagccattctaatagtgca ataaggaaaatggagaacctaaagaaactcttagagatttatgaaatgttgggagaagaa gaagacattgtaaacccttcaaatgaactaataaaagaaggacagatcctcaaactagct gctcggaacacttcagcacaagaacgctaccttttcttattcaacaacatgttgctgtac tgtgtgcccaaattcagcttggtaggctctaaattcacagttcgaaccagggttggcatt gatggaatgaaaattgtagagactcaaaatgaagaatatccacatactttccaggtgtct gggaaagagagaacactggaactgcaggccagttctgcgcaagacaaagaagaatggatc aaggcccttcaagaaaccatcgatgcttttcatcaaaggcatgaaaccttcagaaatgca attgcaaaggataatgacattcactcagaggtttctactgctgagctagggaaaagagcc ccaagatggatccgagataatgaagtgacaatgtgtatgaaatgtaaagaacctttcaat gcactgacacgaaggaggcatcattgtcgagcatgtggatatgtggtttgttggaaatgc tccgactacaaagctcaacttgaatatgatggtggtaaattgagcaaagtttgtaaagac tgttatcaaatcataagtggattcacagacagtgaagaaaagaaaagaaaaggaatttta gagattgaatcagcagaagtatctggaaacagtgtggtgtgcagctttcttcagtatatg gagaagtcaaaaccttggcagaaagcttggtgtgtgatccccaagcaagaccctcttgtg ctgtacatgtatggtgccccccaggacgtcagagcccaggccaccattccacttctgggc tatgtggtggatgaaatgccaaggagcgcagacctgccacacagtttcaaactgacccag tctaagtccgtgcacagctttgctgcagacagtgaggaactgaagcagaagtggctgaaa gtcatccttttagctgtcacagtttgtggtgtttgcctacattctagaggcttatccagt cggcccacagtgtctatgggagactggttccaggaccactcacagactgacatagtggcc tgcgatgcagttctaacaagctcccgaggggtgctgctgctcctcaaaccatcactggag taccaagcttatagtgggaaggacttagtgtccgagaccattgactag >gi568815586f:32476358_32740530|GENSCAN_predicted_peptide_5|456_aa MVNIETLTTNADKDVEQLELSLMVAHPPAVHPAPQASLRGSQRMACREGAGSRRARSNGC RLPRRGRRRGGRRRTVGPGPIHCRGRRALGPRVFRVMEALIPVINKLQDVFNTVGADIIQ LPQIVVVGTQSSGKSSVLESLVGRDLLPRGTGIVTRRPLILQLVHVSQEDKRKTTGEENG VEAEEWGKFLHTKNKLYTDFDEIRQEIENETERISGNNKGVSPEPIHLKIFSPNVVNLTL VDLPGMTKVPVGDQPKDIELQIRELILRFISNPNSIILAVTAANTDMATSEALKISREVD PDGRRTLAVITKLDLMDAGTDAMDVLMGRVIPVKLGIIGVVNRSQLDINNKKSVTDSIRD EYAFLQKKYPSLANRNGTKYLARTLNRLLMHHIRDCLPELKTRINVLAAQYQSLLNSYGE PVDDKSATLLQLITKFATEYCNTIEGTAKYIETSEL >gi568815586f:32476358_32740530|GENSCAN_predicted_CDS_5|1371_bp atggtcaacatcgagacactgaccacaaatgctgacaaggatgtggagcaactggaactc tcgctcatggtagcacaccctcctgctgtccacccagctccacaggcttcactgagaggc tctcagcgcatggcctgccgggagggggcaggtagccggcgggcccggtccaatgggtgc cggcttccgaggagagggcggaggagaggaggaaggaggcgaactgtgggccccggcccc attcattgccgtggccggcgggcactggggccccgtgttttcagagtcatggaggcgcta attcctgtcataaacaagctccaggacgtcttcaacacggtgggcgccgacatcatccag ctgcctcaaatcgtcgtagtgggaacgcagagcagcggaaagagctcagtgctagaaagc ctggtggggagggacctgcttcccagaggtactggaattgtcacccggagacctctcatt ctgcaactggtccatgtttcacaagaagataaacggaaaacaacaggagaagaaaatggg gtggaagcagaagaatggggtaaatttcttcacaccaaaaataagctttacacggatttt gatgaaattcgacaagaaattgaaaatgaaacagaaagaatttcaggaaataataaggga gtaagccctgaaccaattcatcttaagattttttcacccaacgttgtcaatttgacactt gtggatttgccaggaatgaccaaggtgcctgtaggtgatcaacctaaggatattgagctt caaatcagagagctcattcttcggttcatcagtaatcctaattccattatcctcgctgtc actgctgctaatacagatatggcaacatcagaggcacttaaaatttcaagagaggtagat ccagatggtcgcagaaccctagctgtaatcactaaacttgatctcatggatgcgggtact gatgccatggatgtattgatgggaagggttattccagtcaaacttggaataattggagta gttaacaggagccagctagatattaacaacaagaagagtgtaactgattcaatccgtgat gagtatgcttttcttcaaaagaaatatccatctctggccaatagaaatggaacaaagtat cttgctaggactctaaacaggttactgatgcatcacatcagagattgtttaccagagttg aaaacaagaataaatgttctagctgctcagtatcagtctcttctaaatagctacggtgaa cccgtggatgataaaagtgctactttactccaacttattaccaaatttgccacagaatat tgtaacactattgaaggaactgcaaaatatattgaaacttcggagctgtaa >gi568815586f:32476358_32740530|GENSCAN_predicted_peptide_6|264_aa MDVLAGSKTVQYANNMQYVMKDSKDVISFNLINLLLQIYPQCLLGHKAPKTDAARVVKRG VNALKNLQVKGAQIEAKFYEEVHDLERKYAVLYQPLFDKQSEITNTIYEPTEEECEWKPD EEDEISEELKEKAKTEDEKNDEEKEDPKGIPEFWLPVFKNVDLLSDMLQKHDEPILKHVK DTKVKFSDAGLPMSFVLEFQFEPSKYFTNEVLTKTYRMRLEPEPDDSDHFSFDGQEIMGC TGGQIDWKKGKNVILKTIKQQQKH >gi568815586f:32476358_32740530|GENSCAN_predicted_CDS_6|795_bp atggatgttcttgctggtagtaaaacagtacaatatgctaacaatatgcaatatgtaatg aaagattctaaggatgttatatcctttaatctaataaacttacttctacaaatttatcct cagtgtttacttggacacaaagccccaaagacagatgcagctagggtagttaaaagagga gtgaatgctctcaaaaacctgcaagttaaaggtgcacagatagaagccaaattctatgag gaagttcacgatcttgaaaggaagtatgctgttctctatcagcctctatttgataagcaa tctgagattactaatacaatttatgaacctacagaggaagaatgtgaatggaaaccagat gaagaagatgagatttcagaggagttgaaagaaaaggccaagactgaagatgagaaaaac gatgaagaaaaagaagaccccaaaggaattcctgaattttggttacctgtttttaagaat gttgacttgctcagtgatatgcttcagaaacatgatgaacctattctgaagcatgtgaaa gataccaaagtgaagttctcagatgctggcctgcctatgagttttgtcttagaatttcaa tttgaacccagtaaatattttacaaatgaagtgctgacaaagacatataggatgaggtta gaaccagaaccagatgattctgatcacttttcctttgatggacaagaaattatgggttgt acagggggccagatagattggaaaaaaggaaagaatgtcattttgaaaaccattaagcag cagcagaaacactag >gi568815586f:32476358_32740530|GENSCAN_predicted_peptide_7|298_aa MSLKLLERLLTCIFFPFLQCQKPYTSLPFRCGGARICYIFHETFGRTLESVDPLGGLNTI DILTAIRNATGPRPALFVPEVSFELLVKRQIKRLEEPSLRCVELVHEEMQRIIQHCSNYS TQELLRFPKLHDAIVEVVTCLLRKRLPVTNEMVHNLVAIELAYINTKHPDFADACGLMNN NIEEQRRNRLARELPSAVSRDKVASGGGGVGDGVQEPTTGNWRGMLKTSKAEELLAEEKS KPIPIMPASPQKGHAVNLLDVPVPVARKLSAREQRDCEVIERLIKSYFLIVRKNIQDS >gi568815586f:32476358_32740530|GENSCAN_predicted_CDS_7|894_bp atgagcttaaagcttctagaaagacttctaacttgtatcttctttccctttttgcaatgc cagaaaccatatacttcattgcctttcagatgcggtggtgctagaatttgttatattttc catgagacttttgggcgaaccttagaatctgttgatccacttggtggccttaacactatt gacattttgactgccattagaaatgctactggtcctcgtcctgctttatttgtgcctgag gtttcatttgagttactggtgaagcggcaaatcaaacgtctagaagagcccagcctccgc tgtgtggaactggttcatgaggaaatgcaaaggatcattcagcactgtagcaattacagt acacaggaattgttacgatttcctaaacttcatgatgccatagttgaagtggtgacttgt cttcttcgtaaaaggttgcctgttacaaatgaaatggtccataacttagtggcaattgaa ctggcttatatcaacacaaaacatccagactttgctgatgcttgtgggctaatgaacaat aatatagaggaacaaaggagaaacaggctagccagagaattaccttcagctgtatcacga gacaaggttgcatctggaggtggtggggttggagatggtgttcaagaaccaaccacaggc aactggagaggaatgctgaaaacttcaaaagctgaagagttattagcagaagaaaaatca aaacccattccaattatgccagccagtccacaaaaaggtcatgccgtgaacctgctagat gtgccagttcctgttgcacgaaaactatctgctcgggaacagcgagattgtgaggttatt gaacgactcattaaatcatattttctcattgtcagaaagaatattcaagacagn