GENSCAN 1.0	Date run: 4-Nov-116	Time: 10:28:18

Sequence gi568815591f:44696725_44901419 : 204695 bp : 47.57% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    190    340  151  2  1   64   97   259 0.490  23.62
 1.02 Intr +    646    773  128  1  2   99   92   128 0.999  14.62
 1.03 Intr +    880   1058  179  2  2  118   89   302 0.999  32.94
 1.04 Intr +   1468   1539   72  0  0   92   81    73 0.984   6.60
 1.05 Intr +   3417   3545  129  2  0  105   77   269 0.999  28.39
 1.06 Intr +   4819   4891   73  0  1   76   94    43 0.933   2.68
 1.07 Intr +  10501  10664  164  2  2   43   94   200 0.979  15.79
 1.08 Intr +  10858  11012  155  0  2  102   95   311 0.999  32.07
 1.09 Term +  11155  11275  121  1  1  128   37   287 0.995  25.55
 1.10 PlyA +  11339  11344    6                              -3.84

 2.05 PlyA -  12260  12255    6                               1.05
 2.04 Term -  12644  12442  203  0  2   21   55   160 0.715   3.55
 2.03 Intr -  12981  12881  101  2  2   44   47    77 0.659  -1.05
 2.02 Intr -  14325  14169  157  1  1   89   69    88 0.862   6.07
 2.01 Init -  16585  16483  103  1  1   74  105    40 0.801   3.25
 2.00 Prom -  29687  29648   40                              -1.66

 3.00 Prom +  32672  32711   40                              -6.26
 3.01 Init +  52210  52267   58  0  1   57   83   109 0.201   6.77
 3.02 Intr +  59049  59148  100  1  1  131   50   -30 0.054  -3.33
 3.03 Intr +  59701  59815  115  1  1   95  115    12 0.960   5.05
 3.04 Intr +  60223  60425  203  0  2   89  102   253 0.939  24.88
 3.05 Intr +  60654  60837  184  0  1   54   89   147 0.471  11.19
 3.06 Intr +  61124  61384  261  1  0  130  105    40 0.608   7.58
 3.07 Intr +  62557  62736  180  0  0  105   97    50 0.989   7.66
 3.08 Intr +  63427  63504   78  0  0   79   37   135 0.846   7.25
 3.09 Intr +  63701  63869  169  1  1   77  119    97 0.994  11.22
 3.10 Intr +  64725  64869  145  1  1   78   83   175 0.964  15.34
 3.11 Intr +  64971  65181  211  0  1   48   28   380 0.485  26.92
 3.12 Intr +  66157  66262  106  0  1  105   70   114 0.934  11.09
 3.13 Intr +  66532  66689  158  2  2   37   96   217 0.994  17.13
 3.14 Intr +  67695  67762   68  2  2  107   89    59 0.948   5.60
 3.15 Intr +  68217  68285   69  0  0  129   74    94 0.999  10.40
 3.16 Intr +  68611  68855  245  1  2   77  110   395 0.998  37.74
 3.17 Intr +  69440  69609  170  0  2  108   82    69 0.974   7.97
 3.18 Intr +  69697  70006  310  0  1  130   96    54 0.653   6.29
 3.19 Term +  72731  73143  413  0  2   51   42   192 0.751   6.30
 3.20 PlyA +  73547  73552    6                               1.05

 4.00 Prom +  80846  80885   40                              -4.16
 4.01 Init +  98925  98973   49  2  1   69   80    63 0.523   2.72
 4.02 Intr +  99966 100124  159  1  0   40   47   173 0.437   8.36
 4.03 Intr + 102668 102756   89  0  2   61   96    46 0.944   2.39
 4.04 Intr + 102978 103150  173  2  2   70   79   196 0.999  15.64
 4.05 Term + 104563 104698  136  1  1   82   38   226 0.997  14.49
 4.06 PlyA + 104886 104891    6                               1.05

 5.00 Prom + 112650 112689   40                              -3.36
 5.01 Init + 125279 125334   56  1  2   32   86    54 0.162   0.36
 5.02 Intr + 126548 126651  104  2  2   94   63    40 0.253   1.92
 5.03 Intr + 149467 149547   81  2  0  104  119    10 0.038   5.31
 5.04 Term + 151211 151686  476  0  2   95   38   185 0.041   9.25
 5.05 PlyA + 152496 152501    6                               1.05

 6.02 PlyA - 152718 152713    6                               1.05
 6.01 Sngl - 188624 187686  939  2  0   76   47  2062 0.959 197.41
 6.00 Prom - 190831 190792   40                              -3.06

 7.03 PlyA - 191028 191023    6                               1.05
 7.02 Term - 191284 191227   58  0  1  100   46    67 0.193   0.86
 7.01 Init - 195915 195818   98  0  2   78   84    73 0.700   5.68

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------
S.001 Sngl + 151246 151686  441  0  0   49   38   268 0.955  12.16

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591f:44696725_44901419|GENSCAN_predicted_peptide_1|390_aa
XLSRILKTRGEMVKNRTVDWALAEYMAFGSLLKEGIHIRLSGQDVERGTFSHRHHVLHDQ
NVDKRTCIPMNHLWPNQAPYTVCNSSLSEYGVLGFELGFAMASPNALVLWEAQFGDFHNT
AQCIIDQFICPGQAKWVRQNGIVLLLPHGMEGMGPEHSSARPERFLQMCNDDPDVLPDLK
EANFDINQLYDCNWVVVNCSTPGNFFHVLRRQILLPFRKPLIIFTPKSLLRHPEARSSFD
EMLPGTHFQRVIPEDGPAAQNPENVKRLLFCTGKVYYDLTRERKARDMVGQVAITRIEQL
SPFPFDLLLKEVQKYPNAELAWCQEEHKNQGYYDYVKPRLRTTISRAKPVWYAGRDPAAA
PATGNKKTHLTELQRLLDTAFDLDVFKNFS
>gi568815591f:44696725_44901419|GENSCAN_predicted_CDS_1|1173_bp
nggctgagccggatcttgaagactcgtggggaaatggtgaagaaccggactgtggactgg
gctctagcggagtacatggcgtttggctcgctcctgaaggagggcatccacattcggctg
agcggccaggacgtggagcggggcacattcagccaccgccaccatgtgctccatgaccag
aatgtggacaagagaacctgcatccccatgaaccatctctggcccaatcaggccccctat
actgtgtgcaacagctcactgtctgagtacggcgtgctgggctttgagctgggcttcgcc
atggccagtcctaatgccctggtcctctgggaagcccaatttggtgacttccacaacacg
gcccagtgtatcatcgaccagttcatctgcccgggacaagccaagtgggtgcggcagaat
ggcatcgtgttgctgctgccccatggcatggagggcatgggtccagaacattcctccgcc
cgcccagagcggttcttgcagatgtgcaacgatgacccagatgtcctgccagaccttaaa
gaagccaacttcgacatcaatcagctatatgactgcaattgggttgttgtcaactgctcc
actcctggcaacttcttccacgtgctacgacgccagatcctgctgccattccggaagccg
ttaattatcttcacccccaaatccctgttgcgccaccccgaggccagatccagctttgat
gagatgcttccaggaacccacttccagcgggtgatcccagaagatggccctgcagctcag
aacccagaaaatgtcaaaaggcttctcttctgcaccggcaaagtgtattatgacctcacc
cgggagcgcaaagcacgcgacatggtggggcaggtggccatcacaaggattgagcagctg
tcgccattcccctttgacctcctgctgaaggaggtgcagaagtaccccaatgctgagctg
gcctggtgccaggaggagcacaagaaccaaggctactatgactacgtgaagccaagactt
cggaccaccatcagccgcgccaagcccgtctggtatgccggccgggacccagcggctgct
ccagccaccggcaacaagaagacccacctgacggagctgcagcgcctcctggacacggcc
ttcgacctggacgtcttcaagaacttctcgtag
>gi568815591f:44696725_44901419|GENSCAN_predicted_peptide_2|187_aa
MKGGHLVPLQVLFVQPGAPVGREDSPLSNQTAGEAITSSSTWVRGAGSEVPSPVEQADAL
TSSLDISDPEVPFLYPEKRKDVGSDVGTHSTWHWLVVMLPSEEPWAMLQSDNRTNPGTQA
GTLAQTERHRVQSTFSRWSQGKDPEQGKLRNRTAPKQPGSPGRLCRIHNDGFAGEGLQVV
VITPPVQ
>gi568815591f:44696725_44901419|GENSCAN_predicted_CDS_2|564_bp
atgaaggggggccacctagttcctctccaggtcctctttgtgcagcctggagcaccagtg
gggagggaggacagccccctaagcaaccagactgctggggaagccatcaccagcagtagt
acttgggtgaggggggctggctctgaagttccctccccagtggagcaagctgacgctctc
acatctagcttagatatttcggatccagaagtaccttttttatatccagaaaagcgcaag
gatgttggcagtgatgttgggacccactccacttggcactggctggtggtcatgcttcca
tcagaagaaccttgggccatgctgcaaagcgacaacaggacaaatccggggacccaggca
ggcacgctggctcaaacggagaggcacagagtgcagagtactttttcaaggtggagccag
gggaaggacccagagcagggaaagctcagaaacaggactgcacccaaacagcccggcagc
ccaggccgcctctgccgcattcacaatgacggctttgccggcgagggcctgcaggtggtg
gtcatcactccccctgtccaatga
>gi568815591f:44696725_44901419|GENSCAN_predicted_peptide_3|1080_aa
MERRGPGAATARGRARPGGGPSVGLLATGSSLNPSFHGVARIVPGFIRIARPRDGSFAYE
SVPWQQSATQPAGSLSVVTTVWGVGNATQSQVLGNPMGPAGSPSGSSMMPGVAGGSSALT
SPQCLGQQAFAEGGANKGYVQQGVYSRGGYPGAPGFTTGYAGGPGGLGLPSHAARPSTDF
TQAAAAAAVAAAAATATATATATVAALQEKQSQELSQYGAMGAGQSFNSQFLQHGGPRGP
SVPAGMNPTGIGGVMGPSGLSPLAMNPTRAAGMTPLYAGQRLPQHGYPGPPQAQPLPRQG
VKRTYSEVYPGQQYLQGGQYAPSTAQFAPSPGQPPAPSPSYPGHRLPLQQGMTQSLSVPG
PTGLHYKPTEQFNGQGASFNGGSVSYSQPGLSGPTRSIPGYPSSPLPGNPTPPMTPSSSV
PYMSPNQEVKSPFLPDLKPNLNSLHSSPSGSGPCDELRLTFPVRDGVVLEPFRLQHNLAV
SNHVFQLRDSVYKTLIMRPDLELQFKCYHHEDRQMNTNWPASVQVSVNATPLTIERGDNK
TSHKPLYLKHVCQPGRNTIQITVTACCCSHLFVLQLVHRPSVRSVLQGLLKKRLLPAEHC
ITKIKRNFSSGTIPGTPGPNGEDGVEQTAIKVSLKCPITFRRIQLPARGHDCRHIQCFDL
ESYLQLNCERGTWRCPVCNKTALLEGLEVDQYMLGILIYIQNSDYEEITIDPTCSWKPVP
VKPDMHIKEEPDGPALKRCRTVSPAHVLMPSVMEMIAALGPGAAPFAPLQPPSVPAPSDY
PGQGSSFLGPGTFPESFPPTTPSTPTLAEFTPGPPPISYQSDIPSSLLTSEKSTACLPSQ
MAPAGHLDPTHNPGTPGLHTSNLGAPPGPQLHHSNPPPASRQSLGQASLGPTGELAFSPA
TGVMGPPSMSGAGEAPEPALDVSTRPHAGECVGARARGGVSVPALPPSLDEGTEVEETKE
AAARTVLSLGAETAWQSPTQPWPDAGPRPSFLGCQMTVSWTSDAPPALPVANVMLPDGVE
KLGDWDKQKAANNPEAHPQKTGEMIEECMGTVALCSITNTSQKQRGMKKQDSSYSMMPFL
>gi568815591f:44696725_44901419|GENSCAN_predicted_CDS_3|3243_bp
atggagcggcgcgggccgggggccgccacggcgaggggccgggccaggccgggcggaggg
ccttctgtggggttactggctactggctcctctctaaaccccagtttccacggggtagct
aggattgtccctggctttataaggatagccaggcctcgtgatggttcattcgcatatgag
tctgtgccttggcaacaaagcgccactcagccggctggatcgctgtctgtggtcactact
gtgtggggagttggcaacgcgacacagagccaggttttggggaaccccatgggccctgca
gggagtccctctggcagctccatgatgcctggtgtggcagggggcagctccgccttgacc
tccccacagtgcctgggacagcaggcgtttgctgaaggcggcgccaacaagggctacgtg
cagcaaggcgtgtacagccgcgggggctaccctggggcccccggcttcaccaccgggtat
gcaggcggcccggggggcctgggcctcccctcacatgctgcaagaccctccactgacttc
acgcaagcggcagctgctgcagctgtggctgctgcggcagccactgccaccgccacagcc
acagccaccgtggctgctctccaggagaagcagagccaggagctgagccagtatggagcg
atgggggccggacagtcttttaacagccagtttctgcagcatggaggtccccgggggcct
agtgtccccgctggcatgaaccctactggcataggaggggtaatgggcccctctggcctc
tcccccttggctatgaaccccacccgggcagcaggaatgacacccttgtatgcagggcag
cgtctgccccaacatgggtatcctgggcctccccaggcccagccactgccccgacagggg
gtcaagagaacctactctgaggtgtatccagggcagcagtatctgcaaggaggccagtat
gcacccagcaccgcccagtttgcgcccagccctgggcagccccctgccccctccccttcc
taccctgggcacaggctgcccctgcagcagggcatgacccagtccctgtccgtgcctggc
cccacgggactgcattataagcccacagagcagttcaacgggcagggcgccagcttcaac
gggggcagcgtcagctacagccaacctggcctgagtgggcctacccgttccatcccgggc
tatcccagttccccactgccagggaaccccacgccacccatgaccccaagcagcagcgtc
ccttacatgtcaccaaaccaagaggtcaagtctcccttcttgcctgatctcaagcccaac
ctcaactccttgcactcatcgccctctggaagcgggccttgtgacgagttgcggctgacc
ttccctgtgcgcgatggggtggtcctggagcccttccgcctgcagcacaacctggctgta
agcaaccatgtcttccagctgcgagactcagtctacaagaccctgataatgaggcctgac
ctggagctgcaattcaagtgctaccaccacgaggaccggcagatgaacaccaactggcca
gcctcggtgcaggtcagcgtcaatgccacgccgctcaccatcgagcgtggcgacaacaag
acctcgcacaagccactctacctgaagcatgtgtgccagccaggccgcaacaccatccag
atcaccgtcaccgcctgctgctgctcccacctcttcgtgctgcagctagtgcaccgccca
tccgtccgctcggtgctgcagggcctcctcaaaaagcgcctcctgcctgctgagcactgc
atcaccaagataaagcggaacttcagcagcggcaccatccctggcacccctgggcccaac
ggagaggacggggtggagcagacagctatcaaggtgtccctgaagtgccccatcaccttc
cgcaggatccagctccctgcccgaggtcatgactgtcgccacatacagtgctttgacctg
gagtcgtacctgcagctcaactgtgagcgggggacttggaggtgtcctgtgtgcaacaag
acagctttgctggagggcctggaggtggaccagtacatgctgggcatcctgatttacatt
cagaactctgactatgaggagatcaccatcgaccccacgtgcagctggaagccagtgccc
gtgaagcctgacatgcacatcaaggaggagccggatgggccagcactgaagcgctgccgc
accgtgagccccgcccacgtgctcatgcccagcgtgatggagatgatcgccgccctgggc
cccggcgctgccccctttgcccccctgcagcccccctcagtccctgcccccagcgactac
cctggccagggttccagcttcctggggcctggaactttccctgagtccttcccacccacc
acgcccagcaccccaacccttgctgagttcaccccgggaccaccccccatctcctaccag
tctgacattcccagcagcctcctgacttcagagaagtctaccgcctgcctcccaagccag
atggcaccagcaggtcacctggaccccactcacaatcctgggacaccaggactacacacc
tccaaccttggggcccctccaggtccccagctgcaccattcaaaccctcccccagcgtcc
cggcagtccttgggccaagcgagcttaggacctacgggtgaactggccttcagtcctgcc
acaggcgtgatggggccccccagcatgtctggagccggggaggccccagaaccagctctg
gacgtgagtaccaggccccatgcgggggagtgcgtgggagccagggctagaggtggtgtg
tctgttccagctcttccgccctctctggatgagggaacagaagtggaggaaacaaaagaa
gcagcagcacgcacagtcctgtcgctgggtgcggagacagcctggcaaagtcccactcag
ccatggcctgatgcaggccccaggccctcctttcttgggtgtcaaatgactgtgtcctgg
acatctgatgcaccacctgccctgcctgttgcaaacgtgatgctcccggatggagtggag
aaactaggagactgggacaagcaaaaggctgcaaacaacccagaagcccatcctcagaag
actggagaaatgattgaggaatgcatgggcaccgtggccctgtgctccatcacaaacacc
tctcagaaacaacgtgggatgaaaaagcaagacagttcatacagtatgatgccattttta
taa
>gi568815591f:44696725_44901419|GENSCAN_predicted_peptide_4|201_aa
MKRSSPALASAPPKAPDATAEENRVLLAMVNPTVFFDIAVDGEPLGRVSFEVGRAAACGN
GAQKVGRGRENFRALSTGEKGFGYKGSCFHRIIPGFMCQGGDFTRHNGTGGKSIYGEKFE
DENFILKHTGPGILSMANAGPNTNGSQFFICTAKTEWLDGKHVVFGKVKEGMNIVEAMER
FGSRNGKTSKKITIADCGQLE
>gi568815591f:44696725_44901419|GENSCAN_predicted_CDS_4|606_bp
atgaagcgatcctccccggccttggcctccgcgcctcctaaagcgccagacgccaccgcc
gaggaaaaccgtgtactattagccatggtcaaccccaccgtgttcttcgacattgccgtc
gacggcgagcccttgggccgcgtctcctttgaggtcgggcgggcggcggcgtgcgggaat
ggggcccagaaagtgggccggggtcgggaaaattttcgtgctctgagcactggagagaaa
ggatttggttataagggttcctgctttcacagaattattccagggtttatgtgtcagggt
ggtgacttcacacgccataatggcactggtggcaagtccatctatggggagaaatttgaa
gatgagaacttcatcctaaagcatacgggtcctggcatcttgtccatggcaaatgctgga
cccaacacaaatggttcccagtttttcatctgcactgccaagactgagtggttggatggc
aagcatgtggtgtttggcaaagtgaaagaaggcatgaatattgtggaggccatggagcgc
tttgggtccaggaatggcaagaccagcaagaagatcaccattgctgactgtggacaactc
gaataa
>gi568815591f:44696725_44901419|GENSCAN_predicted_peptide_5|238_aa
MHGVRKKPSYNSTKSSMDGLILHPATGLVFVLSKQCEEIHQPVVWTCEQREAESLKSIKN
DNSMSPAFLAYVNTYVFLATGPVPPAHPAALTMFSAPTPPPLGRAPSRCRPAPPPPLSQH
RPPPPEPDNTPCPPRAAVANARLSCWFEPQRLFNRAPPSSQTPPPRHRLPNGCLFLHTAR
GGGASSPNSDALSAFHQAPKPDEQKGKPMRLDASQTKANKNIAREQGRRATGKDSKIS
>gi568815591f:44696725_44901419|GENSCAN_predicted_CDS_5|717_bp
atgcatggagtaagaaagaaaccatcctacaatagcaccaaatccagcatggatggactc
atcctccaccctgctactggacttgtctttgtactctcaaagcagtgtgaggagattcat
caaccggtggtgtggacatgtgaacagcgtgaggcagagagtttaaaatctatcaagaac
gacaactcaatgtctcctgcatttttagcttatgttaatacttatgtatttttagctact
ggccccgtgcccccggcccacccggccgcccttaccatgttctcggcgccgactccgcct
ccgctcggccgcgcgccctcccgctgccgacccgcgccgccgccgccgctctcgcagcac
cgaccgccgccgccggagccggacaataccccgtgcccgcctcgcgctgctgtggccaat
gcccgcttgtcttgctggttcgaaccccagcggctgttcaatcgcgcgcctccttctagc
cagaccccgcccccccggcaccgccttcctaacggctgtttgtttttgcacacggcacgc
ggaggcggggcctccagccccaatagtgacgcgctctctgcctttcaccaggcgcccaag
cctgacgaacagaaaggcaaaccaatgagattggacgcctcgcagacgaaagccaataaa
aatatcgcccgcgagcaagggaggcgggccactgggaaggacagcaaaattagctaa
>gi568815591f:44696725_44901419|GENSCAN_predicted_peptide_6|312_aa
MADGDSGSERGGGGGPCGFQPASRGGGEQETQELASKRLDIQNKRFYLDVKQNAKGRFLK
IAEVGAGGSKSRLTLSMAVAAEFRDSLGDFIEHYAQLGPSSPEQLAAGAEEGGGPRRALK
SEFLVRENRKYYLDLKENQRGRFLRIRQTVNRGGGGFGAGPGPGGLQSGQTIALPAQGLI
EFRDALAKLIDDYGGEDDELAGGPGGGAGGPGGGLYGELPEGTSITVDSKRFFFDVGCNK
YGVFLRVSEVKPSYRNAITVPFKAWGKFGGAFCRYADEMKEIQERQRDKLYERRGGGSGG
GEESEGEEVDED
>gi568815591f:44696725_44901419|GENSCAN_predicted_CDS_6|939_bp
atggcggacggcgacagcggcagcgagcgcggcggcggcggtgggccgtgcgggttccag
cccgcgtcccgcggcggcggcgagcaagagacgcaggagctggcctcgaagcggctggac
atccagaacaagcgcttctacttagatgtgaagcagaacgccaagggccgcttcctcaag
atcgccgaggtgggcgcgggcggttccaagagccgcctcacgctgtccatggcggtggcc
gccgagttccgcgactcgctgggcgacttcatagaacactacgcgcagctgggccctagc
agccccgagcagctggcggctggcgccgaggagggcggcgggccgcggcgcgcgctcaag
agcgaattcttggtgcgtgagaaccgcaagtactacctggacctcaaggagaaccagcgc
ggccgcttcctgcgcatccgccaaacggtcaaccgcggcggtggcggcttcggcgcgggc
cccgggccgggcggcttgcagagcggccagaccatcgcgctgcctgcgcagggcctcatc
gagttccgcgacgcgctggcgaagctcatagacgactacggaggcgaggacgacgagctg
gcaggcggcccgggaggcggcgccgggggcccagggggcggcctgtatggagagctcccg
gagggcacctccatcaccgtggactccaagcgcttcttcttcgatgtgggctgcaacaaa
tacggggtgtttctgcgagtgagcgaggtgaagccgtcctaccgcaatgccatcaccgta
cccttcaaagcctggggcaagttcggaggcgccttttgccggtatgcggatgagatgaaa
gaaatccaggaacgacagagggataagctttatgagcgacgtggtgggggcagcggcggc
ggcgaagagtcagagggtgaggaggtggatgaggattga
>gi568815591f:44696725_44901419|GENSCAN_predicted_peptide_7|51_aa
MKKHKDEKRNDTNTVMLTFWGICVKNTIIAAVLKLMSNPPYLLAAPTPNYY
>gi568815591f:44696725_44901419|GENSCAN_predicted_CDS_7|156_bp
atgaagaaacataaagatgaaaagcgtaatgacacaaacaccgtgatgttaacattttgg
ggaatctgtgtgaagaatactattattgcagctgttctgaaactgatgtcaaatccaccc
tacctgctggcagctcccacacctaattactactga