GENSCAN 1.0    Date run: 4-Nov-116    Time: 17:21:32

Sequence gi568815575f:24072743_24311373 : 238631 bp : 42.18% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    349    521  173  0  2  100  115   108 0.884  12.52
 1.02 Term +   3980   4043   64  2  1   77   49    60 0.647  -2.62
 1.03 PlyA +   4704   4709    6                               1.05

 2.00 Prom +  45914  45953   40                              -3.25
 2.01 Init +  60541  60707  167  0  2   80   25   200 0.957  12.06
 2.02 Intr +  60767  60972  206  2  2   97   75    78 0.969   5.32
 2.03 Intr +  67627  67796  170  2  2   51  105    78 0.707   4.54
 2.04 Term +  67828  67902   75  0  0   71   48    76 0.674  -1.24
 2.05 PlyA +  68296  68301    6                               1.05

 3.00 Prom +  70922  70961   40                              -7.55
 3.01 Init +  76559  76795  237  1  0   86    2   228 0.800  10.06
 3.02 Term +  77799  77981  183  2  0   87   41   121 0.877   3.86
 3.03 PlyA +  78058  78063    6                               1.05

 4.00 Prom +  82971  83010   40                              -2.85
 4.01 Init + 100001 100058   58  1  1   78   75    56 0.450   4.82
 4.02 Intr + 106441 107028  588  2  0   58   95   825 0.435  71.87
 4.03 Intr + 134584 134733  150  2  0   99   82   224 0.995  22.11
 4.04 Intr + 134970 135113  144  1  0   63   97   186 0.999  16.33
 4.05 Intr + 135452 135628  177  0  0   41  115   323 0.731  29.27
 4.06 Intr + 136158 136298  141  1  0   39   94   167 0.998  11.80
 4.07 Term + 137451 138634 1184  1  2  105   42   976 0.999  85.63
 4.08 PlyA + 138831 138836    6                               1.05

 5.00 Prom + 158529 158568   40                              -3.95
 5.01 Init + 170178 170308  131  2  2   59   94    62 0.024   3.57
 5.02 Intr + 186023 186161  139  2  1    0   56   176 0.005   5.15
 5.03 Term + 191054 191254  201  1  0  115   38    77 0.100   1.81
 5.04 PlyA + 193435 193440    6                               1.05

 6.00 Prom + 198914 198953   40                              -5.95
 6.01 Init + 205439 205642  204  1  0   86  100   113 0.974  11.20
 6.02 Term + 206540 206638   99  1  0  111   43    57 0.851   0.65
 6.03 PlyA + 207943 207948    6                               1.05

 7.08 PlyA - 209157 209152    6                               1.05
 7.07 Term - 211220 211090  131  0  2   57   33    94 0.534  -1.84
 7.06 Intr - 214117 213927  191  0  2   89   91    97 0.287   8.41
 7.05 Intr - 219832 219710  123  0  0   60   61    82 0.118   1.48
 7.04 Intr - 222703 222590  114  0  0   72   87    37 0.112   0.64
 7.03 Intr - 237612 237451  162  2  0   76   97    53 0.213   3.17
 7.02 Intr - 238365 238188  178  1  1   85   69   211 0.212  16.96
 7.01 Intr - 238551 238425  127  0  1   49   72   105 0.277   4.33

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  80865  80793   73  0  1   71   94    68 0.825   6.98

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575f:24072743_24311373|GENSCAN_predicted_peptide_1|78_aa
VQKLSKNEVLMVNIGSLSTGGRVSAVKADLGKIVLTNPVCTEVGEKIALSRRVEKHWRLI
GWGQIRRGVTIKPTVDDD

>gi568815575f:24072743_24311373|GENSCAN_predicted_CDS_1|237_bp
gttcaaaagctgtctaagaatgaagtgctcatggtgaacataggatccctgtcaacagga
gggagagttagtgctgtcaaggccgatttgggtaaaattgttttgaccaatccagtgtgc
acagaggtaggagaaaaaattgcccttagccgaagagttgaaaaacactggcgtttaatt
ggttggggtcagataagaagaggagtgacaatcaagccaacagtagatgatgactga

>gi568815575f:24072743_24311373|GENSCAN_predicted_peptide_2|205_aa
MLLKVNLEALHPWWPVPAGTRGCGHLAPQSAPSHRPQAMAVPDRMYQVTGQALDRRDLDM
GAQHHTLAHWRRLQPASAVPQIHQVPAAPPRPAPSAQAMLHYQEDQHLLLVAGASHPSVP
HINSVICLSVHEQQNLDGAPGVLVTDFGSLTQYALLVAWLQWAAGSLRSPPKQLPTQFRL
EPAPTGFLIASEEQSLKFDNCIQIG

>gi568815575f:24072743_24311373|GENSCAN_predicted_CDS_2|618_bp
atgctccttaaagtgaacttagaggctctacatccctggtggccagtgcctgctgggacc
agaggctgtggccatctggccccccagtcagccccaagtcatcgcccccaggcaatggct
gtgcctgacagaatgtaccaggtcactggacaggcactcgataggagagacctagacatg
ggtgcccagcaccacaccctggcccactggaggagattacagccagcaagtgcagttcca
caaatccatcaagttcctgctgctccaccgcgtcctgcaccatcagcacaagccatgctt
cactaccaagaggaccaacaccttctgctagttgcaggggcctctcaccccagtgtgccc
catataaactcggtgatttgtctttctgtgcacgagcagcagaacctagatggagcccct
ggtgttttagtaacagattttggttccctgacccagtatgctttgcttgtggcttggctg
cagtgggctgctgggagtctcagaagccctcctaagcagctgcccacccaatttaggctg
gagccagccccaacagggttcctgattgcctcggaagaacagtctttgaaatttgacaac
tgcatccagataggatga

>gi568815575f:24072743_24311373|GENSCAN_predicted_peptide_3|139_aa
MVGGGRAACLSAGSADRGPARALRPRRRRQRRVCGGGGRERGGGGKKGGAVTETPAAGDP
AAARPLGSPVAGRERRRHGLSCRRRPPPPPGRRDRFTLSAPGPAPSPAPVATRGQSGQVA
FRTPGLTRAYGTPGNVIFA

>gi568815575f:24072743_24311373|GENSCAN_predicted_CDS_3|420_bp
atggtggggggcggccgggccgcctgcctctccgcggggtcggccgaccgcggccccgcg
cgggctttacggccgaggcggcggcggcagcggcgcgtgtgtggcggcggcgggcgggag
cgcggaggaggtggaaagaaggggggcgctgtcacggagactccggccgccggagacccc
gccgcagcgaggccactgggctccccggtcgcggggcgggagcggcgccgacacgggctg
agctgccgcaggcggccaccgccgccgcccggacgccgggaccgtttcaccctcagcgcc
cctggccctgcgccttcccccgcgcctgtagccacccgagggcagtcggggcaggtggca
ttccggacacctgggcttaccagggcatacgggaccccaggaaatgttatttttgcttaa

>gi568815575f:24072743_24311373|GENSCAN_predicted_peptide_4|813_aa
MDEDGLELQQEPNSFFDATGADGTHMDGDQIVVEVQETVFVSDVVDSDITVHNFVPDDPD
SVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLAS
DITSASMSMPEHVLTGDSIHVSDVGHVGHVGHVEHVVHDSVVEAEIVTDPLTTDVVSEEV
LVADCASEAVIDANGIPVDQQDDDKGNCEDYLMISLDDAGKIEHDGSSGMTMDTESEIDP
CKVDGTCPEVIKVYIFKADPGEDDLGGTVDIVESEPENDHGVELLDQNSSIRVPREKMVY
MTVNDSQPEDEDLKVFVTFYLDVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEI
KTFMPIAWAAAYGNNSDGIENRNGTASALLHIDESAGLGRLAKQKPKKRRRPDSRQYQTA
IIIGPDGHPLTVYPCMICGKKFKSRGFLKRHMKNHPEHLAKKKYRCTDCDYTTNKKISLH
NHLESHKLTSKAEKAIECDECGKHFSHAGALFTHKMVHKEKGANKMHKCKFCEYETAEQG
LLNRHLLAVHSKNFPHICVECGKGFRHPSELKKHMRIHTGEKPYQCQYCEYRSADSSNLK
THVKTKHSKEMPFKCDICLLTFSDTKEVQQHALIHQESKTHQCLHCDHKSSNSSDLKRHI
ISVHTKDYPHKCDMCDKGFHRPSELKKHVAAHKGKKMHQCRHCDFKIADPFVLSRHILSV
HTKDLPFRCKRCRKGFRQQSELKKHMKTHSGRKVYQCEYCEYSTTDASGFKRHVISIHTK
DYPHRCEYCKKGFRRPSEKNQHIMRHHKEVGLP

>gi568815575f:24072743_24311373|GENSCAN_predicted_CDS_4|2442_bp
atggatgaagatgggcttgaattacaacaagagccaaactcattttttgatgcaacagga
gctgatggtacacacatggatggtgatcaaattgttgtggaagtacaagaaactgttttt
gtttcagatgttgtggattcagacataactgtgcataactttgttcctgatgacccagat
tcagttgtaatccaagatgttattgaggacgttgttatagaagatgttcagtgcccagat
atcatggaagaagcagatgtgtctgaaacggtcatcattcctgagcaagtgctggactca
gatgtaactgaagaagtttctttagcacattgcacagtcccagatgatgttttagcttct
gacattacttcagcctcaatgtctatgccagaacacgtcttgacgggtgattctatacat
gtgtctgacgttggacatgttggacatgttggacatgttgaacatgtggttcatgatagt
gtagtggaagcagaaattgtcactgatcctctgactaccgacgtagtttcagaagaagta
ttggtagcagactgtgcctctgaagcagtcatagatgccaatgggatccctgtggaccag
caggatgatgacaaaggcaactgtgaggactaccttatgatttccttggatgatgctggc
aaaatagaacacgatggttcttctggaatgaccatggacacagagtcggaaattgatcct
tgtaaagtggatggcacttgccctgaggtcatcaaggtgtacatttttaaagctgaccct
ggagaagatgacttaggtggaactgtagacattgtggagagtgagcctgagaatgatcat
ggagttgaactgcttgatcagaacagcagtattcgtgttcccagggaaaagatggtttat
atgactgtcaatgactctcagccagaagatgaagatttaaaagtttttgtaactttctac
ttagatgttgctgaaatcgctgacgaagtttatatggaagtgatcgtaggagaggaggat
gctgcagcagcagcggcagccgccgccgtgcacgagcagcaaatggatgacaatgaaatc
aaaaccttcatgccgattgcatgggcagcagcttatggtaataattctgatggaattgaa
aaccggaatggcactgcaagtgccctcttgcacatagatgagtctgctggcctcggcaga
ctggctaaacaaaaaccaaagaaaaggagaagacctgattccaggcagtaccaaacagca
ataattattggccctgatggacatcctttgactgtctatccttgcatgatttgtgggaag
aagtttaagtcgagaggttttttgaaaaggcacatgaaaaaccatcccgaacaccttgcc
aagaagaaataccgctgtactgactgtgattacactaccaacaagaagataagtttacac
aaccacctggagagccacaagctgaccagcaaggcagagaaggccattgaatgcgatgag
tgtgggaagcatttctctcatgcaggggctttgtttactcacaaaatggtgcataaggaa
aaaggagccaacaaaatgcacaagtgtaaattctgtgaatacgagacagctgaacaaggg
ttattgaatcgccacctcttggcagtccacagcaagaactttcctcatatttgtgtggag
tgtggtaagggttttcgtcacccgtcagagctcaaaaagcacatgagaatccatactggg
gagaagccgtaccaatgccagtactgcgaatataggtctgcagactcttctaacttgaaa
acgcatgtcaaaactaagcatagtaaagagatgccattcaagtgtgacatttgtcttctg
actttctcggataccaaagaggtgcagcaacatgctcttatccaccaagaaagcaaaaca
caccagtgtttgcattgcgaccacaagagttcgaactcaagtgatttgaaacgacacata
atttcagttcacacgaaagactacccccataagtgtgacatgtgtgataaaggctttcac
aggccttcagaactcaagaaacacgtggctgcccacaagggcaaaaaaatgcaccagtgt
agacattgtgactttaagattgcagatccatttgttctaagtcgccatattctctcagtt
cacacaaaggatcttccatttaggtgcaagagatgtagaaagggatttaggcaacagagt
gagcttaaaaagcatatgaagacacacagtggcaggaaagtgtatcagtgtgagtactgt
gagtatagcactacagatgcctcaggctttaaacggcacgttatttccattcacacgaaa
gactatcctcaccggtgtgagtactgcaagaaaggcttccgaagaccttcagaaaagaac
cagcacataatgcgacatcataaagaagttggcctgccctaa

>gi568815575f:24072743_24311373|GENSCAN_predicted_peptide_5|156_aa
MQRFTWPGTENSFWPTTSKKLTPSGEPRPLVQQLVRNGIWPTTTLEYCSNRITGCPVKTT
GCGPTACTCDTENWIPEPLDSGCRCHPHQKLPTISLKPKSNSVIPLFNPSAPFSQNNTRV
VLKTCKVPHALHPGLQISIALTLGHCISQGSLEGQN

>gi568815575f:24072743_24311373|GENSCAN_predicted_CDS_5|471_bp
atgcagaggttcacatggccaggaactgagaacagcttctggccaacaaccagcaagaaa
ctgacgccctcaggggaaccaaggcccttagtccaacagcttgtgagaaacggaatctgg
ccaacaaccaccctggagtactgctcaaaccgcatcactggctgtccagtgaagaccact
ggatgtggacccacagcttgcacctgtgatactgagaactggattccagaacctctggat
tctggatgccgttgccaccctcaccagaaactgccaacaatctcgttaaaacctaagtca
aatagtgttattcctttgttcaatccctctgctccattttctcagaataacactcgagtt
gttcttaagacctgcaaggtcccacatgctctgcaccctggacttcaaatctccattgct
ctcacccttggtcactgtattagtcagggttctctagagggacagaactaa

>gi568815575f:24072743_24311373|GENSCAN_predicted_peptide_6|100_aa
MVEGERGAKAPLTWQQLRECAGELPFIKPSDLVRLIHYHENSMGKSHPHDSITSHRVPPV
IHGALQFKGASSTSAVLQELNNQLISLEFPGCKLTIGDKQ

>gi568815575f:24072743_24311373|GENSCAN_predicted_CDS_6|303_bp
atggtggaaggtgaaagaggagcaaaggcacctcttacatggcagcagttgagagagtgt
gcaggggaactgccctttataaaaccatcagatcttgtgagacttattcactatcatgag
aacagcatgggaaaatcccacccccatgattcaattacctctcaccgagtccctcccgta
atacatggggcgctacaattcaagggtgccagttctacatctgctgtcctccaggaatta
aataaccagctcataagcctggaattccctggatgtaaactgactataggggacaaacag
taa

>gi568815575f:24072743_24311373|GENSCAN_predicted_peptide_7|341_aa
QLQQPTAAHPPQPGPQGSTLGLSTQGQAFPAQQLLNVNLTGAGSSAGCPAPLCLGAAATA
AAAAAATTTATADPAPDTAFESPAAASVFGNRRCSDSAATSRSQDHLATLKLMPIEHQLP
ILPHPVPGSHCSTFFLYELTTLAISWKWNHMAFVTCLTPCNIYLTLDYTETSSLWINHSL
TKCMHLESFSSWGCWEHHGLAASRAVLHFLGPSFWPRADKLSLQDINLWDPGPHKEGLYG
FQGSDTMWSLTHPLPTGKLWTDQSSLLPPASGFSPPQGGQMELNELSSREELSALERPVR
IGKHALEIPVPHGNLIMIPGMTQKIQCTELNEKPQRGFTAM

>gi568815575f:24072743_24311373|GENSCAN_predicted_CDS_7|1026_bp
cagctccagcagcccacagctgctcaccctcctcagccagggccacagggttccacacta
ggtttgagcacgcaagggcaggccttccctgctcagcaacttcttaatgtgaacctcact
ggagcaggttcctcagcagggtgtccagctcccctttgtcttggggcagcagccacagcc
gctgctgctgctgcagccacaaccacagccacagcagatccagctccagacacagccttt
gagagtcctgcagcagccagtgtttttggcaacaggcgctgttcagatagtgcagccaca
tccagatctcaagaccatcttgcaacactgaaactcatgcccattgaacaccagctcccc
attctccctcatccagtccctggcagccattgttctactttctttctctatgagttgact
accctggccatttcatggaagtggaatcatatggcatttgtcacttgcctaaccccctgc
aacatctacttaacactggactacaccgaaacatcttccctctggatcaaccattcctta
accaagtgtatgcatttggaatccttctcctcttggggttgttgggaacatcacggccta
gctgccagcagggctgttcttcactttctcggtccttccttttggccaagagcagacaag
ctgagcctgcaggatattaatctgtgggatccaggtccacacaaggagggactatacggg
tttcaaggcagtgacacaatgtggtccctcactcaccctctacccacggggaagttatgg
accgatcaaagttctcttcttcctcctgcaagtggattctcccctcctcagggtggtcag
atggagctgaatgaactttcttctagggaggagctctctgccttggagagaccagtcaga
attgggaagcatgcattagagattcctgtgccccatggaaacttgatcatgattccaggt
atgactcaaaaaatacagtgtacagaactcaatgaaaaaccacagcgtggcttcacagcc
atgtag