GENSCAN 1.0 Date run: 3-Nov-116 Time: 19:28:58

Sequence gi568815586f:51140341_51342455 : 202115 bp : 46.09% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -    978    973    6                               1.05
 1.02 Term -  30942  30897   46  2  1   75   50    43 0.821  -4.12
 1.01 Init -  32082  31961  122  0  2  101  105   155 0.995  17.61
 1.00 Prom -  36584  36545   40                              -6.86

 2.12 PlyA -  37661  37656    6                               1.05
 2.11 Term -  50252  49907  346  1  1   85   49   632 0.999  52.97
 2.10 Intr -  51424  51256  169  2  1   77  117   224 0.999  23.20
 2.09 Intr -  52131  51990  142  0  1   98  100   122 0.971  14.33
 2.08 Intr -  55833  55630  204  0  0  116   77   136 0.988  14.70
 2.07 Intr -  56587  56432  156  1  0   84   35    77 0.786   2.21
 2.06 Intr -  57683  57430  254  0  2   84   71   257 0.864  20.75
 2.05 Intr -  58435  58210  226  1  1  121   92    29 0.652   4.16
 2.04 Intr -  59568  59407  162  0  0   58  110    84 0.618   7.77
 2.03 Intr -  62732  62710   23  0  2   66  110    31 0.576   0.36
 2.02 Intr -  64028  63833  196  2  1  125   72   236 0.450  24.69
 2.01 Init -  66496  66449   48  1  0   96   70    31 0.503   3.05
 2.00 Prom -  83564  83525   40                              -3.86

 3.00 Prom +  87803  87842   40                              -4.86
 3.01 Init +  98568  98781  214  2  1   99  -26   213 0.904   9.12
 3.02 Intr + 100003 100121  119  2  2  116   34    39 0.902   1.48
 3.03 Intr + 100531 100776  246  0  0  107   76   141 0.989  12.46
 3.04 Term + 101990 102118  129  1  0   87   55   134 0.906   8.18
 3.05 PlyA + 102431 102436    6                               1.05

 4.20 PlyA - 103242 103237    6                               1.05
 4.19 Term - 105779 105601  179  0  2   46   39   287 0.901  17.45
 4.18 Intr - 106491 106411   81  1  0   76   90    57 0.656   4.31
 4.17 Intr - 113253 113234   20  2  2  110   75    -8 0.004  -3.65
 4.16 Intr - 128976 128905   72  2  0   96   92    38 0.039   3.52
 4.15 Intr - 129625 129403  223  2  1   91    1   125 0.061   1.29
 4.14 Intr - 129839 129719  121  2  1   34   57    59 0.039  -2.53
 4.13 Intr - 144447 144376   72  0  0   97   94    45 0.826   5.60
 4.12 Intr - 147848 147768   81  2  0   94   64    50 0.812   3.03
 4.11 Intr - 151357 151251  107  2  2   47  110    97 0.964   7.73
 4.10 Intr - 151926 151523  404  2  2   42   18   213 0.437   3.77
 4.09 Intr - 155538 155456   83  0  2   54  105    79 0.936   4.64
 4.08 Intr - 156824 156749   76  1  1   68  100    66 0.955   5.32
 4.07 Intr - 158948 158863   86  2  2   53   63   105 0.967   3.12
 4.06 Intr - 159374 159267  108  2  0   29   93   200 0.945  15.08
 4.05 Intr - 161775 161680   96  0  0   69   73    94 0.952   6.21
 4.04 Intr - 162440 162346   95  0  2   88   86   153 0.997  14.78
 4.03 Intr - 162801 162747   55  0  1  108   86    92 0.998   9.55
 4.02 Intr - 173563 173483   81  1  0   76   89    86 0.964   7.33
 4.01 Init - 183849 183682  168  0  0   78  100   160 0.680  13.85
 4.00 Prom - 186366 186327   40                              -5.46

 5.05 PlyA - 187897 187892    6                               1.05
 5.04 Term - 188254 188237   18  1  0  100   47    52 0.716   0.62
 5.03 Intr - 189493 189344  150  1  0   89   82    35 0.671   3.36
 5.02 Intr - 199665 199559  107  1  2   70   66   172 0.747  13.13
 5.01 Intr - 201040 200904  137  0  2  105   89   152 0.999  17.21

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:51140341_51342455|GENSCAN_predicted_peptide_1|55_aa
MAWALKLPLADEVIESGLVQDFDASLSGIGQELGAGAYSMRCVRILYGNVETAAS

>gi568815586f:51140341_51342455|GENSCAN_predicted_CDS_1|168_bp
atggcctgggctctgaagctgcctctggccgacgaagtgattgaatccgggttggtgcag
gactttgatgctagcctgtccgggatcggccaggaactgggtgctggtgcctatagcatg
agatgtgtaagaatcctgtatggaaatgttgaaacagctgcatcttag

>gi568815586f:51140341_51342455|GENSCAN_predicted_peptide_2|641_aa
MDPGAGSETSLTVNEQVIVMSGHETIRVLEVGVDAQLPAEEESKGLEGVAAEGSQSGDPA
EASQAAGEAGPDNLGSSAEATGILVTIKQAHSLSFFLSMPCEVKSPPGIPPSPATAIATF
SQAPSQPQASQTLTPLAVQAAPQVLTQENLATVLTGVMVPAGAVTQPLLIPISIAGQVAG
QQGLAVWTIPTATVAALPGLTAASPTGGVFKPPLAGLQAAAVLNTALPAPVQAAAPVQAS
STAQPRPPAQPQTLFQTQPLLQTTPAILPQPTAATAAAPTPKPVDTPPQITVQPAGFAFS
PGIISAASLGGQTQILGSLTTAPVITSAIPSMPGISSQILTNAQGQASGQRQGWRVIGTL
PWVVNSASVAAPAPAQSLQVQAVTPQLLLNAQGQVIATLASSPLPPPVAVRKPSTPESPA
KSEVQPIQPTPTVPQPAVVIASPAPAAKPSASAPIPITCSETPTVSQLVSKPHTPSLDED
GINLEEIREFAKNFKIRRLSLGLTQTQVGQALTATEGPAYSQSAICRFEKLDITPKSAQK
LKPVLEKWLNEAELRNQEGQQNLMEFVGGEPSKKRKRRTSFTPQAIEALNAYFEKNPLPT
GQEITEIAKELNYDREVVRVWFCNRRQTLKNTSKLNVFQIP

>gi568815586f:51140341_51342455|GENSCAN_predicted_CDS_2|1926_bp
atggatcctggagccgggtcagagacatctctgactgtcaatgagcaggtcatcgtgatg
tcaggtcatgagaccatccgagtgctggaagtcggagtggatgcccaactccctgctgag
gaagagagcaaaggactggagggtgtggccgccgagggctcccagagcggagaccctgct
gaagccagtcaagctgctggtgaagctgggccagacaacctgggctcctctgcagaggca
actggaattttagtcaccatcaagcaggctcactccttgtccttctttctctcaatgcct
tgtgaagtgaagtcacccccggggatccctccgagccctgccactgccattgccaccttc
agccaagccccaagccagcctcaggcatcgcagaccctgacgccactggctgtacaagct
gccccccaggtcttgactcaggaaaacttagccacagttctgacaggagttatggttcca
gcaggggcagttactcaacctcttcttatccccatcagtattgcaggtcaagtggctggt
cagcaggggctggccgtgtggacaattcctacagcaactgtggctgccctcccaggactg
accgctgcttctcctacggggggagtgttcaagccacctttagccggtctccaagcagct
gctgtgctgaacaccgctcttccggcaccggtacaagctgccgcaccagtacaggcctcc
tcgacggcccaaccccggccaccagcccagccccagacgctgttccagacccagccgctg
ctgcagaccacacctgccatcctcccgcagcccactgctgccaccgctgctgcccctacc
cccaagccagtggacacccccccacagatcaccgtccagcctgcaggcttcgcatttagc
ccaggaatcatcagtgctgcttccctcgggggacagacccagatcctggggtccctcact
acagctccagtcattaccagcgccattcccagcatgccagggatcagcagtcagatcctc
accaatgctcagggacaggcaagtggccaaagacagggctggagggttattggaaccctt
ccatgggtagtgaactcagctagtgtggcggccccagcaccagcccaaagcctgcaggtc
caggccgtgaccccccagctgttgttgaacgcccagggccaggtgattgcgaccctggct
agcagccccctgcctccacctgtggctgtccggaagccaagcacacctgagtcccctgct
aagagtgaggtgcagcccatccagcccacaccaaccgtgccccagcctgctgtggtcatt
gccagcccagctccagccgccaagccatctgcctctgctcctatcccaattacctgctca
gagacccccaccgtcagccagttggtgtccaagccacatactccaagtctggatgaggat
gggatcaacttagaagagatccgggagtttgccaagaactttaagatccggcggctctcg
ctgggccttacacagacccaggtgggtcaggctctgactgcaacggaaggtccagcctac
agccagtcagccatctgccggttcgagaagctagacatcacacccaagagtgcccagaag
ctaaagccggtgctggaaaagtggctaaacgaagctgaactgcggaaccaggaaggccag
cagaacctgatggagtttgtgggaggcgagccctccaagaaacgcaaacgccgcacctcc
ttcaccccccaggccatagaggctctcaatgcctattttgagaagaacccactgcccaca
ggccaggagatcactgaaattgctaaggagctcaactacgaccgtgaggtagtgcgggtc
tggttctgcaatcggcgccagacgctcaagaacaccagcaagctgaacgtctttcagatc
ccttag

>gi568815586f:51140341_51342455|GENSCAN_predicted_peptide_3|235_aa
MNSKGKDRGWQRPSGGVLLAQSERIRSPGSPNARFGVGCAMLLGRLQSRALRLTPSSYPN
YGSLLPPGPFLRQYPTQPTYPVQPPGNPVYPQTLHLPQAPPYTDAPPAYSELYRPSFVHP
GAATVPTMSAAFPGASLYLPMAQSVAVGPLGSTIPMAYYPVGPIYPPGSTVLVEGGYDAG
ARFGAGATAGNIPPPPPGCPPNAAQLAVMQGANVLVTQRKGNFFMGGSDGGYTIW

>gi568815586f:51140341_51342455|GENSCAN_predicted_CDS_3|708_bp
atgaacagcaaaggcaaggaccgagggtggcagaggccgtcggggggagtactgctggcc
cagagcgagcggattcggagcccagggtcaccaaacgccaggtttggggtgggctgcgcc
atgctccttggccggctgcagtccagggcgctgcgcctgacgccttcgtcatacccaaat
tacggcagcttgctgcctccaggccctttcctccgtcaatatccaacacagccaacctac
cctgtgcagcctcctgggaatccagtataccctcagaccttgcatcttcctcaggctcca
ccctataccgatgctccacctgcctactcagagctctatcgtccgagctttgtgcaccca
ggggctgccacagtccccaccatgtcagccgcatttcctggagcctctctgtatcttccc
atggcccagtctgtggctgttgggcctttaggttccacaatccccatggcttattatcca
gtcggtcccatctatccacctggctccacagtgctggtggaaggagggtatgatgcaggt
gccagatttggagctggggctactgctggcaacattcctcctccacctcctggatgccct
cccaatgctgctcagcttgcagtcatgcagggagccaacgtcctcgtaactcagcggaag
gggaacttcttcatgggtggttcagatggtggctacaccatctggtga

>gi568815586f:51140341_51342455|GENSCAN_predicted_peptide_4|735_aa
MPGARTSSSGASENHRARGQGGGPQGVGRMAEGKAGGAAGLFAKQVQKKFSRAQEKVLQK
LGKAVETKDERFEQSASNFYQQQAEGHKLYKDLKNFLSAVKVMHESSKRVSETLQEIYSS
EWDGHEELKAIVWNNDLLWEDYEEKLADQAVRTMEIYVAQFSEIKERIAKRGRKLVDYDS
ARHHLEAVQNAKKKDEAKTAKAEEEFNKAQTVFEDLNQELLEELPILYNSRIGCYVTIFQ
NISNLRDVFYREMSKLNHNLYEVMSKLEKQHSNKVFVVKGLSSPSTLSLKSESESVSATE
DLAPDAAQGEDNSEIKELLEEEEIEKEGSEASSSEEDEPLPACNGPAQAQPSPTTERAKS
QEEVLPSSTTPSPGGALSPSGQPSSSATEVVLRTRTASEGSEQPKKRASIQRTSAPPKPP
EKPVRTPEAKENENIHNQNPEELCTSPTLMTSQVASEPGEAKKMEDKEKDNKLISANSSE
GQDQLQVSMVPENNNLTAPEPQEEEPKPGAAEAQRSAGAEGRGGWERRSPTHRPPPAQRA
PRGAGASRISGARGAGWGPVRLGRLCADRALDGESRPGGGGQSGVPASEASQKAAHGRRL
PVPARLLRCAHSALEPGRGSASSPRNSTTSDNDQPPDYSFSKRISFLFKEELMTTPILQP
TEALSPEDGASTALIAVVITVVFLTLLSVVILIFFYLYKNKGSYVTYEPTEGEPSAIVQM
ESDLAKGSEKEEYFI

>gi568815586f:51140341_51342455|GENSCAN_predicted_CDS_4|2208_bp
atgccgggagcccgcacttcctcctcgggggcctcagaaaaccacagggcgcggggccag
ggcggcggcccccagggagttggcaggatggcagagggcaaggcaggcggcgcggccggc
ctcttcgccaagcaggtgcagaagaagtttagcagggcccaggagaaggtgctgcagaaa
ttggggaaagctgtagaaaccaaagatgaacgatttgaacaaagcgctagcaacttctac
caacaacaggcagaaggccacaagctgtacaaggacctgaagaacttccttagtgcagtc
aaagtgatgcatgaaagttcaaaaagagtgtcagaaaccctgcaggagatctacagcagc
gagtgggacggtcatgaggagctgaaggccatcgtatggaataatgatctcctttgggaa
gactacgaggagaaactggctgaccaggctgtaaggaccatggaaatctatgttgcccag
ttcagtgaaattaaggagagaattgccaagcggggtcggaaactcgtggactatgacagt
gcccgacaccacctggaggcagtgcagaatgccaagaagaaagatgaggccaagactgcc
aaggcagaggaagagttcaacaaagcccagactgtgtttgaagatctgaaccaagaacta
ctagaggagctgcctattctttataatagtcgtattggctgctatgtgaccatcttccaa
aacatttccaacttgagggatgtcttctacagggaaatgagcaagctgaaccacaatctc
tacgaggtgatgagcaaactggagaagcaacattccaataaagtctttgtggtgaaggga
ctgtcaagtccctctacactttccttgaagagtgagagtgaatctgtctcagcaactgaa
gatctggcacctgatgcagcccaaggggaagacaattctgagatcaaggagctcttagaa
gaggaggaaatagagaaggaaggatctgaagcaagctcctctgaggaagatgagcctcta
ccagcctgcaatggccccgcccaggcccagccctctcctaccactgaaagggccaagtcc
caggaggaagttctccccagctccacaactccatcaccaggcggagccctgagcccttca
gggcagccttcatcatctgccacagaagtagtcctccgaacccgcaccgcaagtgaagga
tctgaacaaccaaagaagagagcctctatccagaggacctcagcaccccctaaaccacca
gagaagccagtaagaactcctgaggccaaagaaaatgaaaacatccacaatcagaaccct
gaagaactttgtacttcccccaccttaatgacatctcaggttgcttcagagcctggagag
gcaaagaagatggaagacaaggaaaaggataataagcttatctcagctaactcctcggag
ggccaagaccagcttcaagtctccatggtaccagaaaacaacaacctcacagcacctgaa
cctcaagaagaggaaccgaaacccggagcggccgaagctcagcgctccgctggggcagag
ggtcgcggcggctgggaacgccgctccccgacgcaccggccgcccccagcgcagcgcgct
ccgcggggtgctggggcgtcgaggatctccggggcgcggggcgcgggctggggcccagtg
aggcttggcaggctgtgcgcggaccgcgccctggacggcgaaagcaggcccggagggggc
ggccagtccggcgtcccagcgtccgaggcgagccagaaggcggcccacggccgtcgcctc
ccggtcccggcccggctactgcgctgcgcccactccgctctggagcctgggcgcggatct
gcctcttctccaagaaactcaaccactagtgacaatgaccagcctcctgactactccttc
tccaagagaatttccttcctctttaaggaagaactgatgaccaccccaattttacagccc
actgaggccctgtccccagaagatggagccagcacagcactcattgcagttgttatcacc
gttgtcttcctcaccctgctctcggtcgtgatcttgatcttcttttacctgtacaagaac
aaaggcagctacgtcacctatgaacctacagaaggtgagcccagtgccatcgtccagatg
gagagtgacttggccaagggcagcgagaaagaggaatatttcatctaa

>gi568815586f:51140341_51342455|GENSCAN_predicted_peptide_5|137_aa
XYDIALLRLAQSVTLNSYVQLGVLPQEGAILANNSPCYITGWGKTKTNGQLAQTLQQAYL
PSVDYAICSSSSYWGSTVKNTMGDSGGPLHCLVNGKYSVHGVTSFVSSRGCNVSRKPTVF
TQVSAYISWINNVIASN

>gi568815586f:51140341_51342455|GENSCAN_predicted_CDS_5|414_bp
nnctatgacatcgccctgctgcgcctggcccagagcgttaccctcaatagctatgtccag
ctgggtgttctgccccaggagggagccatcctggctaacaacagtccctgctacatcaca
ggctggggcaagaccaagaccaatgggcagctggcccagaccctgcagcaggcttacctg
ccctctgtggactacgccatctgctccagctcctcctactggggctccactgtgaagaac
accatgggtgactctgggggccccctccattgcttggtgaatggcaagtattctgtccat
ggagtgaccagctttgtgtccagccggggctgtaatgtctccaggaagcctacagtcttc
acccaggtctctgcttacatctcctggataaataatgtcatcgcctccaactga