GENSCAN 1.0    Date run: 4-Nov-116    Time: 20:29:10

Sequence gi568815586r:51229682_51445878 : 216197 bp : 46.09% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   9227   9440  214  1  1   99  -26   213 0.904   9.12
 1.02 Intr +  10662  10780  119  1  2  116   34    39 0.902   1.48
 1.03 Intr +  11190  11435  246  2  0  107   76   141 0.990  12.46
 1.04 Term +  12649  12777  129  0  0   87   55   134 0.906   8.18
 1.05 PlyA +  13090  13095    6                               1.05

 2.20 PlyA -  13901  13896    6                               1.05
 2.19 Term -  16438  16260  179  2  2   46   39   287 0.901  17.45
 2.18 Intr -  17150  17070   81  0  0   76   90    57 0.656   4.31
 2.17 Intr -  23912  23893   20  1  2  110   75    -8 0.004  -3.65
 2.16 Intr -  39635  39564   72  1  0   96   92    38 0.039   3.52
 2.15 Intr -  40284  40062  223  1  1   91    1   125 0.061   1.29
 2.14 Intr -  40498  40378  121  1  1   34   57    59 0.039  -2.53
 2.13 Intr -  55106  55035   72  2  0   97   94    45 0.826   5.60
 2.12 Intr -  58507  58427   81  1  0   94   64    50 0.812   3.03
 2.11 Intr -  62016  61910  107  1  2   47  110    97 0.964   7.73
 2.10 Intr -  62585  62182  404  1  2   42   18   213 0.437   3.77
 2.09 Intr -  66197  66115   83  2  2   54  105    79 0.936   4.64
 2.08 Intr -  67483  67408   76  0  1   68  100    66 0.955   5.32
 2.07 Intr -  69607  69522   86  1  2   53   63   105 0.967   3.12
 2.06 Intr -  70033  69926  108  1  0   29   93   200 0.945  15.08
 2.05 Intr -  72434  72339   96  2  0   69   73    94 0.952   6.21
 2.04 Intr -  73099  73005   95  2  2   88   86   153 0.997  14.78
 2.03 Intr -  73460  73406   55  2  1  108   86    92 0.998   9.55
 2.02 Intr -  84222  84142   81  0  0   76   89    86 0.964   7.33
 2.01 Init -  94508  94341  168  2  0   78  100   160 0.680  13.85
 2.00 Prom -  97025  96986   40                              -5.46

 3.08 PlyA -  98556  98551    6                               1.05
 3.07 Term -  98913  98896   18  0  0  100   47    52 0.716   0.62
 3.06 Intr - 100152 100003  150  0  0   89   82    35 0.671   3.36
 3.05 Intr - 110324 110218  107  0  2   70   66   172 0.747  13.13
 3.04 Intr - 111699 111563  137  2  2  105   89   152 0.999  17.21
 3.03 Intr - 113019 112894  126  2  0   73   57   192 0.999  14.39
 3.02 Intr - 117056 116942  115  0  1   84  115    41 0.778   5.91
 3.01 Init - 122424 122289  136  0  1   45   63   105 0.703   4.18
 3.00 Prom - 123866 123827   40                              -7.66

 4.11 PlyA - 124077 124072    6                              -0.45
 4.10 Term - 124811 124698  114  2  0  133   48    49 0.967   3.97
 4.09 Intr - 126277 126125  153  1  0   91  109   183 0.999  20.97
 4.08 Intr - 127769 127668  102  2  0  106   85   151 0.992  16.97
 4.07 Intr - 128580 128449  132  0  0   11   97   138 0.921   7.84
 4.06 Intr - 129651 129451  201  0  0   87   78   201 0.952  18.48
 4.05 Intr - 131157 131040  118  2  1   98   80   156 0.970  16.27
 4.04 Intr - 134674 134440  235  2  1   91   80   279 0.889  24.15
 4.03 Intr - 135898 135749  150  2  0   72   74   252 0.447  22.33
 4.02 Intr - 147686 147514  173  1  2   82   72   292 0.998  26.59
 4.01 Init - 150076 149610  467  1  2   66   80   609 0.811  51.02
 4.00 Prom - 150829 150790   40                              -5.66

 5.04 PlyA - 152501 152496    6                               1.05
 5.03 Term - 152828 152731   98  0  2  110   50    49 0.179   1.43
 5.02 Intr - 158666 158508  159  0  0   68   51    89 0.106   3.16
 5.01 Init - 161749 161674   76  1  1   57   66   112 0.263   5.15
 5.00 Prom - 171981 171942   40                              -0.86

 6.00 Prom + 175823 175862   40                              -4.86
 6.01 Init + 195307 195354   48  0  0   99   91    59 0.323   8.27
 6.02 Intr + 211027 211108   82  0  1   79  116   102 0.729  11.31
 6.03 Intr + 214362 214445   84  1  0   69   74    44 0.159   0.89

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:51229682_51445878|GENSCAN_predicted_peptide_1|235_aa
MNSKGKDRGWQRPSGGVLLAQSERIRSPGSPNARFGVGCAMLLGRLQSRALRLTPSSYPN
YGSLLPPGPFLRQYPTQPTYPVQPPGNPVYPQTLHLPQAPPYTDAPPAYSELYRPSFVHP
GAATVPTMSAAFPGASLYLPMAQSVAVGPLGSTIPMAYYPVGPIYPPGSTVLVEGGYDAG
ARFGAGATAGNIPPPPPGCPPNAAQLAVMQGANVLVTQRKGNFFMGGSDGGYTIW

>gi568815586r:51229682_51445878|GENSCAN_predicted_CDS_1|708_bp
atgaacagcaaaggcaaggaccgagggtggcagaggccgtcggggggagtactgctggcc
cagagcgagcggattcggagcccagggtcaccaaacgccaggtttggggtgggctgcgcc
atgctccttggccggctgcagtccagggcgctgcgcctgacgccttcgtcatacccaaat
tacggcagcttgctgcctccaggccctttcctccgtcaatatccaacacagccaacctac
cctgtgcagcctcctgggaatccagtataccctcagaccttgcatcttcctcaggctcca
ccctataccgatgctccacctgcctactcagagctctatcgtccgagctttgtgcaccca
ggggctgccacagtccccaccatgtcagccgcatttcctggagcctctctgtatcttccc
atggcccagtctgtggctgttgggcctttaggttccacaatccccatggcttattatcca
gtcggtcccatctatccacctggctccacagtgctggtggaaggagggtatgatgcaggt
gccagatttggagctggggctactgctggcaacattcctcctccacctcctggatgccct
cccaatgctgctcagcttgcagtcatgcagggagccaacgtcctcgtaactcagcggaag
gggaacttcttcatgggtggttcagatggtggctacaccatctggtga

>gi568815586r:51229682_51445878|GENSCAN_predicted_peptide_2|735_aa
MPGARTSSSGASENHRARGQGGGPQGVGRMAEGKAGGAAGLFAKQVQKKFSRAQEKVLQK
LGKAVETKDERFEQSASNFYQQQAEGHKLYKDLKNFLSAVKVMHESSKRVSETLQEIYSS
EWDGHEELKAIVWNNDLLWEDYEEKLADQAVRTMEIYVAQFSEIKERIAKRGRKLVDYDS
ARHHLEAVQNAKKKDEAKTAKAEEEFNKAQTVFEDLNQELLEELPILYNSRIGCYVTIFQ
NISNLRDVFYREMSKLNHNLYEVMSKLEKQHSNKVFVVKGLSSPSTLSLKSESESVSATE
DLAPDAAQGEDNSEIKELLEEEEIEKEGSEASSSEEDEPLPACNGPAQAQPSPTTERAKS
QEEVLPSSTTPSPGGALSPSGQPSSSATEVVLRTRTASEGSEQPKKRASIQRTSAPPKPP
EKPVRTPEAKENENIHNQNPEELCTSPTLMTSQVASEPGEAKKMEDKEKDNKLISANSSE
GQDQLQVSMVPENNNLTAPEPQEEEPKPGAAEAQRSAGAEGRGGWERRSPTHRPPPAQRA
PRGAGASRISGARGAGWGPVRLGRLCADRALDGESRPGGGGQSGVPASEASQKAAHGRRL
PVPARLLRCAHSALEPGRGSASSPRNSTTSDNDQPPDYSFSKRISFLFKEELMTTPILQP
TEALSPEDGASTALIAVVITVVFLTLLSVVILIFFYLYKNKGSYVTYEPTEGEPSAIVQM
ESDLAKGSEKEEYFI

>gi568815586r:51229682_51445878|GENSCAN_predicted_CDS_2|2208_bp
atgccgggagcccgcacttcctcctcgggggcctcagaaaaccacagggcgcggggccag
ggcggcggcccccagggagttggcaggatggcagagggcaaggcaggcggcgcggccggc
ctcttcgccaagcaggtgcagaagaagtttagcagggcccaggagaaggtgctgcagaaa
ttggggaaagctgtagaaaccaaagatgaacgatttgaacaaagcgctagcaacttctac
caacaacaggcagaaggccacaagctgtacaaggacctgaagaacttccttagtgcagtc
aaagtgatgcatgaaagttcaaaaagagtgtcagaaaccctgcaggagatctacagcagc
gagtgggacggtcatgaggagctgaaggccatcgtatggaataatgatctcctttgggaa
gactacgaggagaaactggctgaccaggctgtaaggaccatggaaatctatgttgcccag
ttcagtgaaattaaggagagaattgccaagcggggtcggaaactcgtggactatgacagt
gcccgacaccacctggaggcagtgcagaatgccaagaagaaagatgaggccaagactgcc
aaggcagaggaagagttcaacaaagcccagactgtgtttgaagatctgaaccaagaacta
ctagaggagctgcctattctttataatagtcgtattggctgctatgtgaccatcttccaa
aacatttccaacttgagggatgtcttctacagggaaatgagcaagctgaaccacaatctc
tacgaggtgatgagcaaactggagaagcaacattccaataaagtctttgtggtgaaggga
ctgtcaagtccctctacactttccttgaagagtgagagtgaatctgtctcagcaactgaa
gatctggcacctgatgcagcccaaggggaagacaattctgagatcaaggagctcttagaa
gaggaggaaatagagaaggaaggatctgaagcaagctcctctgaggaagatgagcctcta
ccagcctgcaatggccccgcccaggcccagccctctcctaccactgaaagggccaagtcc
caggaggaagttctccccagctccacaactccatcaccaggcggagccctgagcccttca
gggcagccttcatcatctgccacagaagtagtcctccgaacccgcaccgcaagtgaagga
tctgaacaaccaaagaagagagcctctatccagaggacctcagcaccccctaaaccacca
gagaagccagtaagaactcctgaggccaaagaaaatgaaaacatccacaatcagaaccct
gaagaactttgtacttcccccaccttaatgacatctcaggttgcttcagagcctggagag
gcaaagaagatggaagacaaggaaaaggataataagcttatctcagctaactcctcggag
ggccaagaccagcttcaagtctccatggtaccagaaaacaacaacctcacagcacctgaa
cctcaagaagaggaaccgaaacccggagcggccgaagctcagcgctccgctggggcagag
ggtcgcggcggctgggaacgccgctccccgacgcaccggccgcccccagcgcagcgcgct
ccgcggggtgctggggcgtcgaggatctccggggcgcggggcgcgggctggggcccagtg
aggcttggcaggctgtgcgcggaccgcgccctggacggcgaaagcaggcccggagggggc
ggccagtccggcgtcccagcgtccgaggcgagccagaaggcggcccacggccgtcgcctc
ccggtcccggcccggctactgcgctgcgcccactccgctctggagcctgggcgcggatct
gcctcttctccaagaaactcaaccactagtgacaatgaccagcctcctgactactccttc
tccaagagaatttccttcctctttaaggaagaactgatgaccaccccaattttacagccc
actgaggccctgtccccagaagatggagccagcacagcactcattgcagttgttatcacc
gttgtcttcctcaccctgctctcggtcgtgatcttgatcttcttttacctgtacaagaac
aaaggcagctacgtcacctatgaacctacagaaggtgagcccagtgccatcgtccagatg
gagagtgacttggccaagggcagcgagaaagaggaatatttcatctaa

>gi568815586r:51229682_51445878|GENSCAN_predicted_peptide_3|262_aa
MRVLWDHSCLCSHCNSTLREAAYIPESLLEPTEETLASQKERSGQALPVGKKTVSFAAKR
SIKRAWSKQEGSGLLHRQHAGPLCQKTFRVVAGDHNLSQNDGTEQYVSVQKIVVHPYWNS
DNVAAGYDIALLRLAQSVTLNSYVQLGVLPQEGAILANNSPCYITGWGKTKTNGQLAQTL
QQAYLPSVDYAICSSSSYWGSTVKNTMGDSGGPLHCLVNGKYSVHGVTSFVSSRGCNVSR
KPTVFTQVSAYISWINNVIASN

>gi568815586r:51229682_51445878|GENSCAN_predicted_CDS_3|789_bp
atgagggtgctatgggaccacagctgtctttgttcccattgcaactcaaccctgcgggag
gccgcctacatccctgagagccttctggagcctacagaggagacattggccagccaaaag
gaaaggagtggccaggccctgccagttggcaagaagacagtcagctttgctgctaagagg
agtataaagagggcttggtccaagcaagaaggcagtggtctactccatcggcaacatgct
ggtcctttatgccagaagactttccgcgtggtggctggagaccataacctgagccagaat
gatggcactgagcagtacgtgagtgtgcagaagatcgtggtgcatccatactggaacagc
gataacgtggctgccggctatgacatcgccctgctgcgcctggcccagagcgttaccctc
aatagctatgtccagctgggtgttctgccccaggagggagccatcctggctaacaacagt
ccctgctacatcacaggctggggcaagaccaagaccaatgggcagctggcccagaccctg
cagcaggcttacctgccctctgtggactacgccatctgctccagctcctcctactggggc
tccactgtgaagaacaccatgggtgactctgggggccccctccattgcttggtgaatggc
aagtattctgtccatggagtgaccagctttgtgtccagccggggctgtaatgtctccagg
aagcctacagtcttcacccaggtctctgcttacatctcctggataaataatgtcatcgcc
tccaactga

>gi568815586r:51229682_51445878|GENSCAN_predicted_peptide_4|614_aa
MPLRLAMVGCAFVLFLFLLHRDVSSREEATEKPWLKSLVSRKDHVLDLMLEAMNNLRDSM
PKLQIRAPEAQQTLFSINQSCLPGFYTPAELKPFWERPPQDPNAPGADGKAFQKSKWTPL
ETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPPECVDQKFRRCPPLATTSVIIVFHNE
AWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQLQVVRVVRQEERKG
LITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEF
AKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEH
IGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKSPHTFPKGTSVIARNQ
VRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPEM
FVPDLTPTFYGAIKNLGTNQCLDVGENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNI
AKQLCLHVSKGALGLGSCHFTGKNSQVPKDEEWELAQDQLIRNSGSGTCLTSQDKKPAMA
PCNPSDPHQLWLFV

>gi568815586r:51229682_51445878|GENSCAN_predicted_CDS_4|1845_bp
atgcccctgcgcctggccatggtgggctgcgcctttgtgctcttcctcttcctcctgcat
agggatgtgagcagcagagaggaggccacagagaagccgtggctgaagtccctggtgagc
cggaaggatcacgtcctggacctcatgctggaggccatgaacaaccttagagattcaatg
cccaagctccaaatcagggctccagaagcccagcagactctgttctccataaaccagtcc
tgcctccctgggttctataccccagctgaactgaagcccttctgggaacggccaccacag
gaccccaatgcccctggggcagatggaaaagcatttcagaagagcaagtggacccccctg
gagacccaggaaaaggaagaaggctataagaagcactgtttcaatgcctttgccagcgac
cggatctccctgcagaggtccctggggccagacacccgaccacctgagtgtgtggaccag
aagttccggcgctgccccccactggccaccaccagcgtgatcattgtgttccacaacgaa
gcctggtccacactgctgcgaacagtgtacagcgtcctacacaccacccctgccatcttg
ctcaaggagatcatactggtggatgatgccagcacagaggagcacctaaaggagaagctg
gagcagtacgtgaagcagctgcaggtggtgagggtggtgcggcaggaggagcggaagggg
ctgatcaccgcccggctgctgggggccagcgtggcacaggcggaggtgctcacgttcctg
gatgcccactgtgagtgcttccacggctggctggagcccctcctggctcgaatcgctgag
gacaagacagtggtggtgagcccagacatcgtcaccatcgaccttaatacttttgagttc
gccaagcccgtccagaggggcagagtccatagccgaggcaactttgactggagcctgacc
ttcggctgggaaacacttcctccacatgagaagcagaggcgcaaggatgaaacctacccc
atcaaatccccgacgtttgctggtggcctcttctccatctccaagtcctactttgagcac
atcggtacctatgataatcagatggagatctggggaggggagaacgtggaaatgtccttc
cgggtgtggcagtgtgggggccagctggagatcatcccctgctctgtcgtaggccatgtg
ttccggaccaagagcccccacaccttccccaagggcactagtgtcattgctcgcaatcaa
gtgcgcctggcagaggtctggatggacagctacaagaagattttctataggagaaatctg
caggcagcaaagatggcccaagagaaatccttcggtgacatttcggaacgactgcagctg
agggaacaactgcactgtcacaacttttcctggtacctgcacaatgtctacccagagatg
tttgttcctgacctgacgcccaccttctatggtgccatcaagaacctcggcaccaaccaa
tgcctggatgtgggtgagaacaaccgcggggggaagcccctcatcatgtactcctgccac
ggccttggcggcaaccagtactttgagtacacaactcagagggaccttcgccacaacatc
gcaaagcagctgtgtctacatgtcagcaagggtgctctgggccttgggagctgtcacttc
actggcaagaatagccaggtccccaaggacgaggaatgggaattggcccaggatcagctc
atcaggaactcaggatctggtacctgcctgacatcccaggacaaaaagccagccatggcc
ccctgcaatcccagtgacccccatcagttgtggctctttgtctag

>gi568815586r:51229682_51445878|GENSCAN_predicted_peptide_5|110_aa
MKGLGQAGPSCLRSLLVAVACSSSLVKQCSRNMNTAVLIVKEREGPEAKEAGRQLLLILQ
MLVDKGPISNEGFESRKDNHCRYCAFCLKCAHLASLTPTRQLVFGPFLPP

>gi568815586r:51229682_51445878|GENSCAN_predicted_CDS_5|333_bp
atgaaggggctgggccaggctggtccctcctgccttcgctctctgctggtcgcggtggcc
tgtagctcctccctcgtcaaacagtgctcaaggaatatgaatacggctgtcctgattgtg
aaagagagagaggggcccgaggcaaaggaggctggcaggcagctcctgctgatcctccag
atgctagttgataaaggcccaatttcaaatgaaggttttgaaagcagaaaggacaaccat
tgcaggtactgtgccttctgcctgaaatgtgctcacctcgcctcgctgactcctactcgc
cagttagtgttcggcccatttctgcccccgtga

>gi568815586r:51229682_51445878|GENSCAN_predicted_peptide_6|72_aa
MPAAGSNEPDGVLSYQRPDEEAVVDQGGTSTILNIHYEKEELEDSSKRLGLTFFGKKNDE
LDGKKMGFVRVX

>gi568815586r:51229682_51445878|GENSCAN_predicted_CDS_6|216_bp
atgccggccgccgggagtaacgagccggacggcgtcctcagctatcagagaccagatgaa
gaagctgtggtggatcagggtgggaccagtacaattctcaacattcactatgaaaaagaa
gagctggaagatagttccaaaagactaggtttgaccttttttggaaaaaagaatgacgag
ttagatgggaaaaaaatgggatttgtgcgagttgnn