GENSCAN 1.0 Date run: 3-Nov-116 Time: 20:07:14 Sequence gi568815586r:93540825_93741586 : 200762 bp : 40.78% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 8587 8803 217 0 1 70 93 39 0.275 1.55 1.02 Intr + 23693 23782 90 0 0 111 98 35 0.637 5.85 1.03 Intr + 29574 29718 145 1 1 100 21 188 0.134 11.72 1.04 Intr + 30749 30803 55 2 1 127 27 55 0.056 1.36 1.05 Intr + 31972 32212 241 0 1 81 91 83 0.007 4.10 1.06 Term + 33898 34355 458 2 2 30 36 385 0.014 21.80 1.07 PlyA + 34604 34609 6 1.05 2.05 PlyA - 35325 35320 6 1.05 2.04 Term - 50943 50675 269 1 2 56 48 180 0.009 5.47 2.03 Intr - 56378 56357 22 2 1 50 90 36 0.009 -3.80 2.02 Intr - 62166 61984 183 0 0 44 113 130 0.574 10.16 2.01 Init - 91596 91411 186 0 0 83 66 127 0.686 9.10 2.00 Prom - 96156 96117 40 -3.95 3.02 PlyA - 97877 97872 6 1.05 3.01 Sngl - 100762 99998 765 1 0 122 49 1051 0.870 100.44 3.00 Prom - 102391 102352 40 -7.45 4.00 Prom + 103276 103315 40 -1.55 4.01 Init + 117143 117288 146 1 2 77 71 99 0.527 6.74 4.02 Intr + 122656 122777 122 1 2 70 18 127 0.089 3.12 4.03 Intr + 137001 137056 56 1 2 110 65 111 0.979 8.78 4.04 Intr + 137945 138248 304 1 1 105 96 224 0.986 20.14 4.05 Term + 147151 147311 161 2 2 27 33 143 0.221 -0.18 4.06 PlyA + 150184 150189 6 1.05 5.04 PlyA - 150267 150262 6 1.05 5.03 Term - 174072 173788 285 0 0 71 41 122 0.092 0.22 5.02 Intr - 192822 192654 169 2 1 -15 85 126 0.103 1.03 5.01 Init - 196942 196875 68 1 2 110 116 61 0.545 11.70 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 31861 32063 203 2 2 123 46 158 0.875 11.67 S.002 Init - 62690 62664 27 2 0 105 81 25 0.949 3.16 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:93540825_93741586|GENSCAN_predicted_peptide_1|401_aa MANTSLLSSTQAGCHLEWRVAVVIGQGGRPSREDGVQVLRTQGGKGKRDVSQVSKPPCLL HCPTGLRKHKLEGTIVLGPGSWRLSRNFKKAKLFLVAHQSLKGPDQGDRLRSPAVGTGTS TASPRHRCVLEPESLVSEDPQSLLHAHPPQKVAAGGKPGSSPRVEVSQPGLVLGFALTSR KDANPSLTPARAATCLCRGDPSLMTLRCLEPSGNGGEGTRSQWGTAGSAEEPSPQAARLA KALRELGQTGWYWGSMTVNEAKEKLKEAPEGTFLIRDSSHSDYLLTISVKTSAGPTNLRI EYQDGKFRLDSIICVKSKLKQFDSVVHLIDYYVQMCKDKRTGPEAPRNGTVHLYLTKPLY TSAPSLQHLCRLTINKCTGAIWGLPLPTRLKDYLEEYKFQV >gi568815586r:93540825_93741586|GENSCAN_predicted_CDS_1|1206_bp atggcaaacacaagcctcctctcatcaacacaagctggctgccacctcgagtggagggtg gcagtggtcattgggcagggtgggagaccaagtagggaggatggggtgcaggtgctgaga acccagggtggaaaggggaagcgggatgtgagccaagtctccaaacccccatgcttgctc cattgtccaactggacttaggaaacacaaattagaaggtactattgttttggggccaggc tcatggagattaagcaggaacttcaagaaagccaagctcttcttggtagcacaccagagt ctgaagggacctgatcaaggggaccgcctccggtccccggccgtgggcaccgggacgagc acggcgtccccacgccatcgatgtgtcttagagccggagagtctggtttccgaggaccca cagtcgctcctgcacgcccaccccccgcaaaaggtcgcagccggagggaaacccggcagc agtccgagagtggaggtgtcccagcccggactcgttttgggattcgcactgacttcaagg aaggacgcgaacccttctctgaccccagctcgggcggccacctgtctttgccgcggtgac ccttctctcatgaccctgcggtgccttgagccctccgggaatggcggggaagggacgcgg agccagtgggggaccgcggggtcggcggaggagccatccccgcaggcggcgcgtctggcg aaggccctgcgggagctcggtcagacaggatggtactggggaagtatgactgttaatgaa gccaaagagaaattaaaagaggcaccagaaggaactttcttgattagagatagctcgcat tcagactacctactaacaatatctgttaaaacatcagctggaccaactaatcttcgaatc gaataccaagacggaaaattcagattggactctatcatatgtgtcaaatccaagcttaaa caatttgacagtgtggttcatctgatcgactactatgttcagatgtgcaaggataagcgg acaggtccagaagccccccggaacggcactgttcacctttatctgaccaaaccgctctac acgtcagcaccatctctgcagcatctctgtaggctcaccattaacaaatgtaccggtgcc atctggggactgcctttaccaacaagactaaaagattacttggaagaatataaattccag gtataa >gi568815586r:93540825_93741586|GENSCAN_predicted_peptide_2|219_aa MAEGEEEADTFFTTWQERDSKGGNATFKTIRSHENSLTIMRTARGKLPPMIQSPPTRCRP RHGSLDNPVKQAQQRSLGQALCQLLHLENYKDGKDPDDNLAVSCMSSLTSVFSLSPPSPR RVKWWLRQSTVKRLDVGQPVTGSLLPTLGYIRSWETRAAPRTRLGGAATPQLRGAPHSTL ATDKKNFCCITPWRQDHFPLERQCVHIVDARASPCRNSV >gi568815586r:93540825_93741586|GENSCAN_predicted_CDS_2|660_bp atggcagaaggtgaagaggaagcagacaccttcttcacaacgtggcaggagagagatagc aaagggggaaatgccacttttaaaaccatcagatctcatgagaactcacttactatcatg agaacagcacggggcaaactgccccccatgatccaatcacctcccaccaggtgccgccct cgacacggaagtttggacaacccagtaaagcaggcccagcagcgctctctaggccaggcc ctttgtcagctacttcacctagaaaactacaaagatggaaaggacccagatgacaacctg gctgtgagttgtatgagttcactgacttctgtgttttccctgtcaccaccctctcctcgc agagtcaagtggtggttacgccaatctacagtgaaacgccttgacgttggtcagcctgtg accggctccttgctgccgacgctgggttacataaggtcctgggagacccgagcggcgccc cggacgcgtctgggaggagccgccacgccacagctccgtggggctccgcactctaccctg gccacagacaaaaagaacttctgctgcatcaccccgtggcgccaggatcacttccctctt gagagacagtgtgtgcacatcgtagacgctagagccagcccctgcaggaactccgtttga >gi568815586r:93540825_93741586|GENSCAN_predicted_peptide_3|254_aa MAAYKLVLIRHGESAWNLENRFSGWYDADLSPAGHEEAKRGGQALRDAGYEFDICFTSVQ KRAIRTLWTVLDAIDQMWLPVVRTWRLNERHYGGLTGLNKAETAAKHGEAQVKIWRRSYD VPPPPMEPDHPFYSNISKDRRYADLTEDQLPSCESLKDTIARALPFWNEEIVPQIKEGKR VLIAAHGNSLRGIVKHLEGLSEEAIMELNLPTGIPIVYELDKNLKPIKPMQFLGDEETVR KAMEAVAAQGKAKK >gi568815586r:93540825_93741586|GENSCAN_predicted_CDS_3|765_bp atggccgcctacaaactggtgctgatccggcacggcgagagcgcatggaacctggagaac cgcttcagcggctggtacgacgccgacctgagcccggcgggccacgaggaggcgaagcgc ggcgggcaggcgctacgagatgctggctatgagtttgacatctgcttcacctcagtgcag aagagagcgatccggaccctctggacagtgctagatgccattgatcagatgtggctgcca gtggtgaggacttggcgcctcaatgagcggcactatgggggtctaaccggtctcaataaa gcagaaactgctgcaaagcatggtgaggcccaggtgaagatctggaggcgctcctatgat gtcccaccacctccgatggagcccgaccatcctttctacagcaacatcagtaaggatcgt aggtatgcagacctcacagaagatcagctaccctcctgtgagagtctgaaggatactatt gccagagctctgcccttctggaatgaagaaatagttccccagatcaaggaggggaaacgt gtactgattgcagcccatggcaacagcctccggggcattgtcaagcatctggagggtctc tctgaagaggctatcatggagctgaacctgccgactggtattcccattgtctatgaattg gacaagaacttgaagcctatcaagcccatgcagtttctgggggatgaagagacggtgcgc aaagccatggaagctgtggctgcccagggcaaggccaagaagtga >gi568815586r:93540825_93741586|GENSCAN_predicted_peptide_4|262_aa MVTAEHYTKQEALSSLGPMKPVLPKGTEKISLELNLSFWWECPSQIRIRTYISSPARLSG IWPWTESYTIGFPGSKAFGLGLSHAVNISGLRGGVWDRVSPSRGRTARGEMEARDKQVLR SLRLELGAEVLVEGLVLQYLYQEGILTENHIQEINAQTTGLRKTMLLLDILPSRGPKAFD TFLDSLQEFPWVREKLKKAREEAMTDLPAVITNYMQYLVLCRSGYVQMETESPAEERVGC SSQLKSELKMARGWKREGETNG >gi568815586r:93540825_93741586|GENSCAN_predicted_CDS_4|789_bp atggtgacagcagagcattacaccaagcaggaagccctttctagcctggggcccatgaag ccagtcctgccaaaggggaccgaaaagatttcactggagctcaacctttcattttggtgg gaatgcccaagccagatcagaattaggacttacatcagcagccccgccaggctttcaggc atttggccttggactgagagttacacaattggcttccctggttctaaggcctttggactt ggactgagccatgctgtcaacatctcagggttgcgagggggagtgtgggatcgggtgtcg cctagccgtggtcgtactgcccggggagaaatggaggccagagacaaacaagtactccgc tcacttcgcctggagctgggtgcagaggtattggtggagggactggttcttcagtacctc taccaggaaggaatcttgacggaaaaccatattcaagaaatcaatgctcaaaccacaggc ctccggaaaacaatgctcctgctggatatcctaccttccaggggccctaaagcatttgat acattcctagattccctacaggagtttccctgggtcagggagaagctgaagaaggcaagg gaagaggccatgaccgacctgcctgcagtcattacaaactacatgcagtaccttgtgctc tgtagatcaggttatgtccaaatggagacagagtctcctgctgaagaacgagttggatgc tcctctcagttaaaaagtgagctgaagatggctcgaggatggaagagagaaggagagaca aatggctag >gi568815586r:93540825_93741586|GENSCAN_predicted_peptide_5|173_aa MDNLKVYNIRGSISPSGSTEWTRQPPQELNVTASPMLAKQPGLPSVLAHWAPPAAIPDGW VMGASTASEAHDITSSVWLSLEEQTFTLQPPLYQGVAMRQFCPMNQKRNCVEDFWKDYFF SQEKVTDAVGAEQLPCLFQPYGCAVGSCDCLATTRKRKAKRIAETLALMPLSQ >gi568815586r:93540825_93741586|GENSCAN_predicted_CDS_5|522_bp atggacaatttaaaggtgtacaacatccgaggttccatttcaccctcaggcagcacggag tggacaaggcagcctccccaggaactcaatgtcacagcatcaccaatgcttgccaagcag ccaggccttccttcagtgctggcacactgggcacctccagcagctattccagacggctgg gtgatgggggcaagcacagcatcagaggcccatgacatcacctcttcagtgtggctgagt ctagaagagcaaacattcactttgcagcctcccttgtaccaaggagtggccatgagacaa ttctgcccaatgaatcagaagaggaattgtgttgaggatttctggaaagactacttcttt tcccaagaaaaagtgacagacgcagtaggggctgagcaacttccgtgcctcttccagcct tacggatgtgctgttggtagctgtgactgccttgctactacgaggaaaaggaaggccaag agaatagcagagacactcgctctgatgcctttgagccaatga