GENSCAN 1.0 Date run: 6-Nov-116 Time: 00:17:58 Sequence gi568815588f:93658145_93897800 : 239656 bp : 40.49% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 757 864 108 1 0 84 53 119 0.539 6.48 1.02 Intr + 960 1023 64 0 1 87 109 40 0.917 3.80 1.03 Intr + 3915 3989 75 2 0 102 50 65 0.877 2.89 1.04 Intr + 4416 4499 84 2 0 77 115 20 0.864 2.70 1.05 Term + 4884 5039 156 2 0 55 37 156 0.283 4.15 1.06 PlyA + 5906 5911 6 1.05 2.11 PlyA - 6480 6475 6 1.05 2.10 Term - 11724 11682 43 2 1 86 53 36 0.147 -4.45 2.09 Intr - 12704 12626 79 0 1 50 98 86 0.195 3.49 2.08 Intr - 18547 18509 39 2 0 91 83 37 0.181 0.78 2.07 Intr - 27215 27102 114 0 0 107 97 16 0.418 3.90 2.06 Intr - 31852 31692 161 0 2 95 47 44 0.411 -0.29 2.05 Intr - 34585 34502 84 0 0 59 19 139 0.420 2.12 2.04 Intr - 40252 40157 96 0 0 59 52 116 0.878 3.31 2.03 Intr - 41962 41886 77 1 2 99 115 122 0.925 13.39 2.02 Intr - 44433 44231 203 1 2 -41 61 306 0.844 13.08 2.01 Init - 44652 44505 148 0 1 39 63 143 0.294 7.20 2.00 Prom - 46582 46543 40 -5.85 3.00 Prom + 51409 51448 40 -2.15 3.01 Init + 53891 53993 103 1 1 53 115 44 0.637 4.05 3.02 Intr + 62661 62841 181 1 1 19 40 138 0.108 0.10 3.03 Intr + 69955 70068 114 1 0 129 83 18 0.135 4.04 3.04 Intr + 77005 77170 166 1 1 39 92 56 0.070 0.14 3.05 Term + 85454 85711 258 1 0 38 54 184 0.370 4.57 3.06 PlyA + 85946 85951 6 1.05 4.07 PlyA - 86223 86218 6 1.05 4.06 Term - 89074 88922 153 1 0 16 55 159 0.773 2.34 4.05 Intr - 91764 91663 102 0 0 76 78 130 0.881 10.25 4.04 Intr - 99157 99054 104 2 2 97 40 79 0.048 2.97 4.03 Intr - 99868 99804 65 0 2 93 48 51 0.184 -0.96 4.02 Intr - 105032 104951 82 0 1 113 51 69 0.234 3.48 4.01 Init - 108085 107998 88 1 1 68 95 19 0.502 1.45 4.00 Prom - 109749 109710 40 -6.85 5.00 Prom + 118140 118179 40 -4.25 5.01 Init + 124660 124794 135 0 0 68 34 86 0.239 1.39 5.02 Intr + 127825 127997 173 0 2 22 72 143 0.356 3.92 5.03 Intr + 134599 134768 170 1 2 39 95 111 0.756 5.57 5.04 Intr + 135042 135206 165 1 0 82 80 80 0.772 5.61 5.05 Term + 138824 139659 836 0 2 110 53 406 0.988 31.46 5.06 PlyA + 139989 139994 6 1.05 6.06 PlyA - 141564 141559 6 1.05 6.05 Term - 144071 143870 202 1 1 43 42 142 0.424 0.98 6.04 Intr - 147800 147633 168 1 0 84 71 111 0.843 7.14 6.03 Intr - 156807 156753 55 1 1 96 76 41 0.059 0.82 6.02 Intr - 157902 157738 165 1 0 24 76 102 0.013 1.61 6.01 Init - 175455 175152 304 0 1 57 19 242 0.249 11.68 6.00 Prom - 180462 180423 40 -6.95 7.05 PlyA - 181632 181627 6 1.05 7.04 Term - 186990 186774 217 2 1 36 47 161 0.441 2.43 7.03 Intr - 187508 187292 217 0 1 47 53 190 0.557 8.04 7.02 Intr - 188133 188086 48 1 0 65 117 50 0.891 3.33 7.01 Init - 190833 190716 118 0 1 60 77 112 0.972 7.71 7.00 Prom - 200822 200783 40 -4.05 8.00 Prom + 212566 212605 40 -3.45 8.01 Init + 212682 212773 92 2 2 92 100 110 0.972 12.61 8.02 Term + 217161 217260 100 0 1 52 38 85 0.512 -3.48 8.03 PlyA + 219921 219926 6 1.05 9.00 Prom + 229054 229093 40 -6.45 9.01 Init + 235954 236067 114 0 0 74 100 146 0.906 12.66 9.02 Term + 236379 236468 90 2 0 89 49 76 0.880 0.54 9.03 PlyA + 236510 236515 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 200274 200420 147 2 0 82 33 142 0.989 5.02 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:93658145_93897800|GENSCAN_predicted_peptide_1|162_aa XKRTMFQKIVDACEQMQTEEEAIKYVTVDPTKKEIIMAMMMTACDLSAITKPWEVQSQVA LMVANEFWEQGDLERTVLQQQPIPMMDRNKRDELPKLQVGFIDFVCTFVYKEFSRFHKEI TPMLSGLQNNRVEWKSLADEYDAKMKVIEEEAKKQEGGAEKG >gi568815588f:93658145_93897800|GENSCAN_predicted_CDS_1|489_bp nngaagaggaccatgtttcaaaaaattgttgatgcctgtgaacaaatgcaaacggaagaa gaagccatcaaatatgtaactgttgatccaaccaagaaagagattatcatggcaatgatg atgacggcatgtgacttgtctgctattaccaagccctgggaggtgcaaagtcaggtagca cttatggttgcaaatgaattttgggaacaaggagatctggagagaacagtgttgcagcaa caacccattcctatgatggacagaaacaaaagagatgaattacctaaacttcaagttgga tttattgattttgtttgtacttttgtatataaggagttctcacggtttcacaaagaaatc acacctatgctgagtggtcttcagaataacagagtagaatggaaatcactagctgatgag tatgatgcaaagatgaaggtcattgaagaggaggcaaaaaagcaagaaggaggagccgaa aaaggttag >gi568815588f:93658145_93897800|GENSCAN_predicted_peptide_2|347_aa MPDSLEGKGRRPREKLGAATWRKLEASSSLSGPFLPLPLFPGAVLAVAQVGTRRERRVAG GGGGGGGGGGCGEAVRQGARDLCCSGSLRPWVVAAAAGKEGNDDSGGELGTQGGRERMHG HGGYDSDFSDDERCGESSKRKKRTVEDDLLLQKPFQKEKHGKVAHKQVAAELLDRENDKT DLDVIRENHRFLWNEEDEMDMTCWINVSVRIRSSCKDREDSPKKTTGLNELESLFFLYIK VQVDGHKVRILLVYCLRQFFCGNKYCDKKEGLKSWEVNFGYIEHGEKRNALVKLRHSSSK KSEDSLLRNSDEEESASESELWKGPLPETDEKSQEEEFDEYFQDLFL >gi568815588f:93658145_93897800|GENSCAN_predicted_CDS_2|1044_bp atgccggattccctagaggggaaaggaaggaggcccagggaaaaactgggagccgcgact tggcgaaagctggaagcctcttcgtccctttccgggccgtttctcccgctgcccttgttt cccggagctgttctggctgtagcacaagtcgggacccgtagggagaggcgggttgcgggc ggcggcggcggcggcggcggcggtggtggttgtggcgaggctgtgcggcagggcgcacgg gacctgtgctgcagcggctctctcaggccgtgggtcgtcgctgcagctgccgggaaagaa ggaaacgacgactccgggggcgaacttggcacacagggaggaagggaaaggatgcatggt catggaggctatgattctgattttagtgatgatgaacgctgtggagaatccagcaaaagg aaaaaaaggacagttgaagatgacttactgctccaaaaaccatttcagaaagaaaaacat ggaaaggtggcccataaacaagttgcagcagaattgctggatagggaaaatgacaagaca gacttggatgttatacgagaaaatcatagattcctatggaatgaggaggacgaaatggac atgacttgttggatcaatgtatcagttaggattaggtccagctgcaaggacagagaagac tccccgaaaaaaaccactggcttaaatgagctggaaagtttatttttcttatacataaaa gttcaagtcgatggtcataaagtcagaatccttcttgtttactgcttgcgtcaatttttc tgtggaaataaatattgtgataaaaaagaaggcttaaagagttgggaagttaattttggt tatattgagcatggtgagaagagaaatgcacttgttaaattaagacattcatcttcaaag aaatctgaagattctctacttagaaactctgatgaggaagaaagtgcttcagaatctgaa ctttggaagggtccactaccagagacagatgaaaaatcacaggaagaagaatttgatgag tattttcaggatttgtttctatga >gi568815588f:93658145_93897800|GENSCAN_predicted_peptide_3|273_aa MIHFPQDCAFSDGGLEAEVMTLSQMHLLGVRQAAGKAVDFGKKGVIFNFPDNFNRASLGN FWKLKRKYVGKTLKQQAKVTSSSSEKPTAGGKIFGTVPTLGTTALIRSELLMSQHLWGAD LPVYTHRATLCLSLMVTTMQKPVIDSLKIKSNKLKHTTRENLLTTKEDSKKRSKELQNNQ KISNEMTIASTGVLNYYSPLTLKIKWSCGIDFQGSNIWKSHGDQVPGTASVVRMRISRTE ESGGNQPPSYPKRLAAARWKKRGCRQLARDWDS >gi568815588f:93658145_93897800|GENSCAN_predicted_CDS_3|822_bp atgattcatttccctcaggactgtgcctttagcgatggtggtttggaagcagaggttatg acgctgtctcagatgcatctcttgggggttagacaggctgcaggaaaggccgttgatttt gggaagaagggggtcatttttaactttccagataatttcaatagagcatccttgggaaat ttctggaaattaaaaaggaaatacgtgggtaagacgctgaagcagcaggcgaaggtcaca agttcaagttcagaaaaacccacagcaggaggaaagatttttgggactgtgcccaccctt ggaacaactgcactaatcaggtctgagcttctgatgtcacagcacctgtggggtgccgac cttccagtttacacacacagagccactttgtgcctaagcctcatggtaaccacaatgcaa aaacctgtaatagattcactaaaaataaaaagcaacaaattaaaacatactaccagagaa aatctcttaaccacaaaggaagacagtaagaaaagaagcaaggagttacaaaacaatcag aaaataagcaacgaaatgacaatagcatccaccggggtcttgaactactatagccctctg acccttaaaataaaatggtcctgtgggattgacttccagggcagtaacatctggaaaagc catggagaccaagtcccaggtacagcgagtgttgtaagaatgaggatttcaaggactgag gagagtggtggaaaccaaccaccaagctaccctaaacgcctggcagctgcaagatggaag aaaagaggatgcaggcagctggctcgtgactgggattcatga >gi568815588f:93658145_93897800|GENSCAN_predicted_peptide_4|197_aa MGLTIFFQSLLHERDTETSFYHQPPQGEPGSHESTESCVSGKAALMEMGQMGTESRQDLQ LIRELLLAPAMHCSGEETLAPEAWKELPASRRRPALASWGFEKGLAQPRGCDMQRKNNVI HDDDDNDNDDNNNNQPLTHPYNVPDPANSELSIAARDLWPVLHPLVSVGMMTQEAGNVTA ENSKAMRKLVMTIGLVP >gi568815588f:93658145_93897800|GENSCAN_predicted_CDS_4|594_bp atgggcctaacaatttttttccaatccttgctccatgagagggatactgaaacatcattt tatcatcaaccaccacagggagagcctgggagtcatgaaagcacggagagctgtgtctca ggaaaggcagcgctaatggagatgggtcagatgggaactgaaagcagacaagacctgcag ctgattcgtgagcttctcctcgctccagcaatgcactgctcgggggaagagaccttagca ccagaagcctggaaggaactccctgccagcaggaggcggccagctctagcttcctggggc tttgaaaaaggcctggctcagccccgaggctgtgacatgcaaaggaaaaacaatgtaatt catgatgatgatgacaatgacaatgacgacaacaacaataaccaacctttaacacatcct tataatgtgccagaccctgcgaactcagagctttccattgctgcccgtgacttatggcct gtgttacacccattggtttctgtaggaatgatgacccaagaagctgggaatgtgaccgca gaaaatagcaaagcaatgaggaaactggttatgaccataggattagtgccctga >gi568815588f:93658145_93897800|GENSCAN_predicted_peptide_5|492_aa MAVGDAFLAAEPGGTETNAVFEPVWSALERRLSSGFHSHNRRFSWPGALSSPPLQQISLK PSDAHDDLWRQQALITAGFKLAQRTTAVASSLGCPVAPNLPKGDLRGNSFNCDCKLKWLV EWLGHTNATVEDIYCEGPPEYKKRKINSLSSKDFDCIITEFAKSQDLPYQSLSIDTFSYL NDEYVVIAQPFTGKCIFLEWDHVEKTFRNYDNITGTSTVVCKPIVIETQLYVIVAQLFGG SHIYKRDSFANKFIKIQDIEILKIRKPNDIETFKIENNWYFVVADSSKAGFTTIYKWNGN GFYSHQSLHAWYRDTDVEYLEIVRTPQTLRTPHLILSSSSQRPVIYQWNKATQLFTNQTD IPNMEDVYAVKHFSVKGDVYICLTRFIGDSKVMKWGGSSFQDIQRMPSRGSMVFQPLQIN NYQYAILGSDYSFTQVYNWDAEKAKFVKFQELNVQAPRSFTHVSINKRNFLFASSFKGNT QIYKHVIVDLSA >gi568815588f:93658145_93897800|GENSCAN_predicted_CDS_5|1479_bp atggctgtgggcgatgctttcttagcagcagagcccggtggcactgaaacaaacgcggtg tttgagccagtctggtctgccttagagagaagactttccagtgggttccacagtcacaat cggaggttttcttggcccggtgccctcagcagccctccgctgcagcaaatctccctgaag ccctcagatgcccatgatgatttgtggaggcaacaggccctcatcacagctggctttaag ctggcgcaaagaaccacagcggtggcttcaagcctcggctgccctgtagccccaaacctc cctaaaggggacctgaggggtaattcatttaattgtgactgtaaactgaaatggctagtg gaatggcttggccacaccaatgcaactgttgaagacatctactgcgaaggccccccagaa tacaagaagcgcaaaatcaatagtctctcctcgaaggattttgattgcatcattacagaa tttgcaaagtctcaagacctgccttatcaatcattgtccatagacactttttcttatttg aatgatgagtatgtagtcatcgctcagccttttactggaaaatgcattttccttgaatgg gaccatgtggaaaagaccttccggaattatgacaacattacaggcacatccactgtagta tgcaagcctatagtcattgaaactcagctctatgttattgtggcccagctgtttggtggc tctcacatctataagcgagacagttttgcaaataaattcataaaaatccaggatattgaa attctcaaaatccgaaaacccaatgacattgaaacattcaagattgaaaacaactggtac tttgttgttgctgacagttcaaaagctggttttactaccatttacaaatggaacggaaac ggattctactcccatcaatccttacacgcgtggtacagggacactgatgtggaatatcta gaaatagtcagaacacctcagacactcagaacgcctcatttaattctgtctagtagttcc cagcgtcctgtaatttatcagtggaacaaagcaacacaattattcactaaccaaactgac attcctaacatggaggatgtgtacgcagtgaagcacttctcagtgaaaggggacgtgtac atttgcttgacaagattcattggtgattccaaagtcatgaaatggggaggctcctcgttc caggatattcagaggatgccatcgcgaggatccatggtgttccagcctcttcaaataaat aattaccaatatgcaattcttggaagtgattactcctttactcaagtgtataactgggat gcagagaaagccaaatttgtgaaatttcaggaattaaatgttcaggcaccaagatcattc acacatgtgtccattaataagcgtaattttctttttgcttccagttttaagggaaataca cagatttacaaacatgtcatagttgacttaagcgcatga >gi568815588f:93658145_93897800|GENSCAN_predicted_peptide_6|297_aa MHGKAWKSRQKAVAGVEPSWRTSTRAMQRGNGLEPSHRIPVGSLLSGAVRRGPPSSRPQN GTSTSSLLPATGKAAGIQCKPLRAAMGQSLAEPRGQAAQGLASVTRPFDLGVPFAPSFSR VPVLAPSLHCALPSSAVTMVTGGPVSVVPSSIACFRAPSGLFQNTNSDTLLRPNLGPQTS SALTEKDVMGCKHFPPILISDYSSYTIMYRFYNSYQLSRMEWPKSQMQAPSSPVTITVVY AVGISPRCLYWADVLILQLLWVSVNNSSYVYPSLEDYPLPTGSFSPGDAQKVLPHMP >gi568815588f:93658145_93897800|GENSCAN_predicted_CDS_6|894_bp atgcatggaaaagcctggaagtccaggcagaaggctgttgcaggggtggagccctcatgg agaacctctactagggcaatgcagaggggaaatggattggagccttcacacagaatccca gttgggtcactgcttagtggagctgtgagaagagggccaccttcctccagaccccagaat ggtacatccaccagcagcttgctccctgcaactggaaaagctgcaggaattcaatgcaag cccttgagagcagctatggggcagagccttgcagagccacggggacaagctgctcaaggc cttgcttctgtcactcgcccctttgacctgggggtccccttcgctcccagcttttcccgt gtccctgtccttgctccctctctccactgtgcccttccttcttcagccgtcactatggtt acagggggacctgtttctgtggtgccctcttccatagcctgcttccgagctccttctggt ctctttcagaatacaaattctgacaccctcctacggcctaacctaggtccacagacatcc tcagctctcacagagaaagatgtgatggggtgtaagcattttccacctatcctgatcagt gattacagctcctatacaataatgtaccgcttctacaactcataccagcttagcagaatg gagtggccaaaatcccagatgcaggctcccagctccccagtcaccatcactgtggtatat gctgttggtatctcaccccgatgcctttactgggctgatgtcctcatcttgcagctgctg tgggtgtcagttaataacagctcatatgtgtacccttctctggaggactatcctttacca actggaagcttctctcctggagatgcccaaaaggttttgccccacatgccttag >gi568815588f:93658145_93897800|GENSCAN_predicted_peptide_7|199_aa MRGSVLVLLLTAQNANHCDNESIAGEEDFNQVLQARMTGVDSWSRWRRSEAADLPARHKG SPSPHQTQEPSWLHSVAPTLGLQVELPANPAPCACSSQPLGGRWDWAPWSRGGARRGGSG RSGAYGGGKGLPQRSGRLKGSSSATKVGAQAEERVPRVSEGCKGCQHGVTSQSDGVGGTG VSLGGLGGTGREGVGFESF >gi568815588f:93658145_93897800|GENSCAN_predicted_CDS_7|600_bp atgaggggttcagtcctggtcctgctgctcactgcacagaatgccaatcactgtgacaat gagtcaattgccggagaagaagactttaatcaggtgctgcaggctaggatgacaggagtg gattcgtggtctcgctggcgaaggagtgaagctgcagaccttcccgctagacataaaggt tctccaagtccccaccagactcaggagcccagctggcttcactcagtagctcccacactg gggctgcaggtggagctgcctgccaatcccgcgccctgtgcctgcagttctcagcccttg ggtggtcgatgggactgggcgccttggagcaggggcggtgctcgtcggggaggctcaggc cgctcaggagcctacggcggcgggaaggggctcccacagcgcagcggcaggctgaagggc tcttcaagtgccaccaaagtgggagcccaggcagaggagagagtgccgagagtgagtgag ggctgcaagggctgccagcacggtgtcacctctcagagtgatggagtaggtggcactgga gtatcccttggaggtctagggggtactggcagagaaggggttggatttgagagcttctga >gi568815588f:93658145_93897800|GENSCAN_predicted_peptide_8|63_aa MVQDSKHLIEFLWVWYDVEHQESQAPKKVPGHWENKDEKLQEVETSKRDTYVNQKMRLSM MVP >gi568815588f:93658145_93897800|GENSCAN_predicted_CDS_8|192_bp atggtgcaggacagcaaacacctcatagagttcctgtgggtctggtatgatgtggagcac caagaatcccaggctcctaagaaggtgcccgggcactgggaaaacaaagatgaaaagctt caagaagttgaaactagcaagagagatacatatgttaaccaaaagatgcgcttaagcatg atggtgccataa >gi568815588f:93658145_93897800|GENSCAN_predicted_peptide_9|67_aa MHPRAPLRSRRPPRQLGRQTAVGAGSAFPRRVAPAPSRVVGGGSAREAILQWGDVARKLT LKSDPGG >gi568815588f:93658145_93897800|GENSCAN_predicted_CDS_9|204_bp atgcacccccgggcgccactgaggagccggcggccgccgaggcagctggggcgccagacc gcggtaggtgctggctctgcctttcctcgccgtgttgctcccgcaccgagccgggtcgta ggtggggggtcagcgagagaggccatcctgcagtggggcgacgttgcgaggaagctgacc ctgaaaagtgatcccggtggttaa