GENSCAN 1.0 Date run: 6-Nov-116 Time: 19:00:34 Sequence gi568815597f:109556711_109688168 : 131458 bp : 45.10% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5238 5295 58 2 1 67 105 8 0.387 1.87 1.02 Intr + 17027 17069 43 0 1 103 100 40 0.902 4.00 1.03 Intr + 17186 17327 142 2 1 75 92 121 0.999 11.56 1.04 Intr + 22494 22651 158 2 2 89 92 46 0.983 3.91 1.05 Intr + 25727 25855 129 2 0 16 108 86 0.678 3.21 1.06 Intr + 29506 29635 130 1 1 105 80 116 0.997 13.20 1.07 Intr + 30019 30172 154 0 1 113 110 31 0.999 7.45 1.08 Term + 35333 35523 191 0 2 63 54 190 0.999 10.71 1.09 PlyA + 38296 38301 6 1.05 2.09 PlyA - 38979 38974 6 1.05 2.08 Term - 46834 46644 191 2 2 43 36 271 0.999 15.01 2.07 Intr - 47394 47241 154 0 1 70 115 56 0.988 6.15 2.06 Intr - 49389 49260 130 2 1 97 89 127 0.995 14.40 2.05 Intr - 49726 49598 129 0 0 68 111 126 0.999 12.71 2.04 Intr - 52078 51921 158 1 2 71 46 130 0.951 5.91 2.03 Intr - 53471 53330 142 1 1 79 80 253 0.696 23.96 2.02 Intr - 53797 53755 43 2 1 53 81 57 0.957 -1.20 2.01 Init - 56160 56043 118 0 1 67 92 190 0.987 17.66 2.00 Prom - 59597 59558 40 -7.06 3.00 Prom + 62568 62607 40 -4.36 3.01 Init + 64304 64556 253 1 1 79 109 203 0.932 19.03 3.02 Intr + 68593 68723 131 2 2 99 54 121 0.979 10.21 3.03 Intr + 68952 69082 131 2 2 127 63 251 0.999 25.99 3.04 Intr + 69450 69518 69 0 0 119 51 120 0.999 9.70 3.05 Intr + 69609 69717 109 0 1 98 98 160 0.999 18.29 3.06 Intr + 70016 70202 187 1 1 73 48 378 0.999 31.56 3.07 Intr + 70465 70606 142 2 1 129 55 221 0.989 22.31 3.08 Intr + 70719 70808 90 0 0 66 84 166 0.999 13.11 3.09 Intr + 71064 71193 130 0 1 75 86 238 0.999 23.00 3.10 Intr + 71373 71567 195 2 0 93 26 438 0.743 37.71 3.11 Intr + 71654 71785 132 1 0 105 116 221 0.998 27.44 3.12 Intr + 71933 72096 164 1 2 80 70 270 0.997 23.17 3.13 Intr + 72399 72525 127 0 1 83 68 137 0.997 11.88 3.14 Intr + 72617 72780 164 1 2 42 51 293 0.998 19.77 3.15 Intr + 73086 73206 121 0 1 118 77 84 0.999 10.80 3.16 Intr + 73523 73696 174 1 0 104 77 316 0.982 32.24 3.17 Intr + 73973 74083 111 1 0 39 74 240 0.997 18.28 3.18 Intr + 74233 74413 181 0 1 115 36 382 0.812 35.14 3.19 Intr + 94654 94834 181 2 1 84 -42 154 0.089 0.93 3.20 Term + 97973 98081 109 2 1 117 49 133 0.721 10.28 3.21 PlyA + 98352 98357 6 1.05 4.00 Prom + 98847 98886 40 -9.06 4.01 Init + 99680 99715 36 1 0 74 84 61 0.942 4.32 4.02 Intr + 100002 100077 76 2 1 90 47 126 0.651 7.79 4.03 Intr + 100505 100569 65 0 2 81 51 50 0.779 -0.96 4.04 Intr + 100880 100961 82 1 1 73 56 149 0.602 9.41 4.05 Intr + 101062 101162 101 2 2 62 59 192 0.989 13.53 4.06 Intr + 102104 102199 96 1 0 80 90 66 0.855 6.21 4.07 Intr + 102290 102400 111 1 0 88 49 136 0.868 10.28 4.08 Term + 104455 104544 90 0 0 103 38 81 0.900 2.32 4.09 PlyA + 104973 104978 6 1.05 5.00 Prom + 110466 110505 40 -8.76 5.01 Init + 111406 111441 36 0 0 98 72 71 0.797 6.51 5.02 Intr + 111715 111790 76 0 1 69 47 118 0.943 4.89 5.03 Intr + 112215 112283 69 1 0 86 29 77 0.398 0.75 5.04 Intr + 112488 112661 174 1 0 10 56 237 0.674 12.61 5.05 Intr + 112761 112861 101 1 2 -21 78 199 0.993 7.83 5.06 Intr + 112926 113003 78 2 0 115 113 -1 0.897 4.85 5.07 Intr + 114577 114672 96 0 0 88 90 81 0.697 8.51 5.08 Intr + 114763 114873 111 0 0 51 49 137 0.693 6.68 5.09 Term + 118037 118126 90 1 0 103 49 113 0.907 6.62 5.10 PlyA + 120767 120772 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:109556711_109688168|GENSCAN_predicted_peptide_1|334_aa MSEQIDLIEMKGWYRKSVEGAGESGKSTIVKQMKIIHEDGYSEDECKQYKVVVYSNTIQS IIAIIRAMGRLKIDFGEAARADDARQLFVLAGSAEEGVMTPELAGVIKRLWRDGGVQACF SRSREYQLNDSASYYLNDLDRISQSNYIPTQQDVLRTRVKTTGIVETHFTFKDLYFKMFD VGGQRSERKKWIHCFEGVTAIIFCVALSDYDLVLAEDEEMNRMHESMKLFDSICNNKWFT ETSIILFLNKKDLFEEKIKRSPLTICYPEYTGSNTYEEAAAYIQCQFEDLNRRKDTKEIY THFTCATDTKNVQFVFDAVTDVIIKNNLKECGLY >gi568815597f:109556711_109688168|GENSCAN_predicted_CDS_1|1005_bp atgagtgagcaaatagatttgattgaaatgaagggttggtacaggaagagtgtagaaggt gctggagaatctggtaaaagcaccattgtgaaacagatgaaaatcattcatgaggatggc tattcagaggatgaatgtaaacaatataaagtagttgtctacagcaatactatacagtcc atcattgcaatcataagagccatgggacggctaaagattgactttggggaagctgccagg gcagatgatgcccggcaattatttgttttagctggcagtgctgaagaaggagtcatgact ccagaactagcaggagtgattaaacggttatggcgagatggtggggtacaagcttgcttc agcagatccagggaatatcagctcaatgattctgcttcatattatctaaatgatctggat agaatatcccagtctaactacattccaactcagcaagatgttcttcggacgagagtgaag accacaggcattgtagaaacacatttcaccttcaaagacctatacttcaagatgtttgat gtaggtggccaaagatcagaacgaaaaaagtggattcactgttttgagggagtgacagca attatcttctgtgtggccctcagtgattatgaccttgttctggctgaggacgaggagatg aaccgaatgcatgaaagcatgaaactgtttgacagcatttgtaataacaaatggtttaca gaaacttcaatcattctcttccttaacaagaaagacctttttgaggaaaaaataaagagg agtccgttaactatctgttatccagaatacacaggttccaatacatatgaagaggcagct gcctatattcaatgccagtttgaagatctgaacagaagaaaagataccaaggagatctat actcacttcacctgtgccacagacacgaagaatgtgcagtttgtttttgatgctgttaca gatgtcatcattaaaaacaacttaaaggaatgtggactttattga >gi568815597f:109556711_109688168|GENSCAN_predicted_peptide_2|354_aa MGSGASAEDKELAKRSKELEKKLQEDADKEAKTVKLLLLGAGESGKSTIVKQMKIIHQDG YSPEECLEFKAIIYGNVLQSILAIIRAMTTLGIDYAEPSCADDGRQLNNLADSIEEGTMP PELVEVIRRLWKDGGVQACFERAAEYQLNDSASYYLNQLERITDPEYLPSEQDVLRSRVK TTGIIETKFSVKDLNFRMFDVGGQRSERKKWIHCFEGVTCIIFCAALSAYDMVLVEDDEV NRMHESLHLFNSICNHKFFAATSIVLFLNKKDLFEEKIKKVHLSICFPEYDGNNSYDDAG NYIKSQFLDLNMRKDVKEIYSHMTCATDTQNVKFVFDAVTDIIIKENLKDCGLF >gi568815597f:109556711_109688168|GENSCAN_predicted_CDS_2|1065_bp atgggaagtggagccagtgctgaggacaaagaactggccaagaggtccaaggagctagaa aagaagctgcaggaggatgctgataaggaagccaagactgtcaagctgctactgctgggt gctggggagtcaggaaagagcaccatcgtcaaacagatgaagatcattcaccaggatggc tattcaccagaagaatgcctggagttcaaggctatcatctatggaaatgtgctgcagtcc atcctggctatcatccgggccatgaccacactgggcatcgattatgctgaaccaagctgt gcggatgacgggcgacagctcaacaacctggctgactccattgaggagggaaccatgcct cctgagctcgtggaggtcattaggaggttgtggaaggatggtggggtgcaagcctgcttc gagagagctgcagaataccagcttaatgactccgcatcttactacctgaaccaattagaa cgaattacagaccctgagtacctccctagtgagcaagatgtgctccgatccagagtcaaa accacgggcatcattgaaaccaagttttccgtcaaagacttgaatttcaggatgtttgat gtgggagggcagagatccgagagaaagaagtggatccactgcttcgagggagtcacctgc atcattttctgtgcagccctcagtgcctatgatatggtgctggtggaagatgacgaagtg aatcgtatgcatgagtctttgcatctgttcaacagcatatgtaaccacaaattctttgcg gctacttccattgtcctctttctcaacaagaaggacctctttgaggaaaaaatcaagaaa gtccatctcagcatttgttttccagagtatgatggtaacaactcctatgatgatgcgggg aattacataaagagccagttccttgacctcaatatgcgaaaagatgtcaaagaaatctac agtcacatgacctgtgctacagatacacagaatgtcaaatttgtgtttgatgcagttaca gatattatcatcaaagaaaacctcaaggactgcggcctcttctaa >gi568815597f:109556711_109688168|GENSCAN_predicted_peptide_3|966_aa MRNRGQGLFRLRSRCFLHQSLPLGAGRRKGLDVAEPGPSRCRSDSPAVAAVVPAMASYPS GSGKPKAKYPFKKRASLQASTAAPEARGGLGAPPLQSARSLPGPAPCLKHFPLDLRTSMD GKCKEIAEELFTRSLAESELRSAPYEFPEESPIEQLEERRQRLERQISQDVKLEPDILLR AKQDFLKTDSDSDLQLYKEQGEGQGDRSLRERDVLEREFQRVTISGEEKCGVPFTDLLDA AKSVVRALFIREKYMALSLQSFCPTTRRYLQQLAEKPLETRTYEQGPDTPVSADAPVHPP ALEQHPYEHCEPSTMPGDLGLGLRMVRGVVHVYTRREPDEHCSEVELPYPDLQEFVADVN VLMALIINGPIKSFCYRRLQYLSSKFQMHVLLNEMKELAAQKKVPHRDFYNIRKVDTHIH ASSCMNQKHLLRFIKRAMKRHLEEIVHVEQGREQTLREVFESMNLTAYDLSVDTLDVHAD RNTFHRFDKFNAKYNPIGESVLREIFIKTDNRVSGKYFAHIIKEVMSDLEESKYQNAELR LSIYGRSRDEWDKLARWAVMHRVHSPNVRWLVQVPRLFDVYRTKGQLANFQEMLENIFLP LFEATVHPASHPELHLFLEHVDGFDSVDDESKPENHVFNLESPLPEAWVEEDNPPYAYYL YYTFANMAMLNHLRRQRGFHTFVLRPHCGEAGPIHHLVSAFMLAENISHGLLLRKAPVLQ YLYYLAQIGIAMSPLSNNSLFLSYHRNPLPEYLSRGLMVSLSTDDPLQFHFTKEPLMEEY SIATQVWKLSSCDMCELARNSVLMSGFSHKVKSHWLGPNYTKEGPEGNDIRRTNVPDIRV GYRYETLCQELALITQAVQSEMLETIPEEAAGTIEGVKEKKKKVPAVPETLQKKKRNFAE LKIKCLRNKFAQKMLQKARRKLICEKANRQIPTSRPGVILVATVAPTLEILLVQFFPPSA EPYSSF >gi568815597f:109556711_109688168|GENSCAN_predicted_CDS_3|2901_bp atgagaaatcgtggccagggcctcttccgcctgcggagccgctgcttcctgcatcagtca ctcccgctgggggcggggcggaggaaggggttggatgtggcagagccaggccccagccgg tgccgctcagactcccccgctgtcgccgccgtggtcccagccatggcatcctatccatct ggctctggcaagcccaaggccaaatatccctttaagaagcgggccagcctgcaggcctcc actgcagctccagaggctcggggtggtctgggggcccctccgctgcagtctgcccgatcc ctgccgggccccgccccctgcctcaagcacttcccgctcgacctgcgcacgtctatggat ggcaaatgcaaggagatcgccgaggagctgttcacccgctcactggctgagagcgagctc cgtagtgccccgtatgagttccccgaggagagccccattgaacagctggaggagcggcgg cagcggctggagcggcagatcagccaggatgtcaagctggagccagacatcctgcttcgg gccaagcaagatttcctgaagacggacagtgactcggacctacagctctacaaggaacag ggtgaggggcagggtgaccggagcctgcgggagcgtgatgtgctggaacgggagtttcag cgggtcaccatctctggggaggagaagtgtggggtgccgttcacagacctgctggatgca gccaagagtgtggtgcgggcgctcttcatccgggagaagtacatggccctgtccctgcag agcttctgccccaccacccgccgctacctgcagcagctggctgaaaagcctctggagacc cggacctatgaacagggccccgacacccctgtgtctgctgatgccccggtgcacccccct gcgctggagcagcacccgtatgagcactgtgagccaagcaccatgcctggggacctgggc ttgggtctgcgcatggtgcggggtgtggtgcacgtctacacccgcagggaacccgacgag cattgctcagaggtggagctgccataccctgacctgcaggaatttgtggctgacgtcaat gtgctgatggccctgattatcaatggccccataaagtcattctgctaccgccggctgcag tacctgagctccaagttccagatgcatgtgctactcaatgagatgaaggagctggccgcc cagaagaaagtgccacaccgagatttctacaacatccgcaaggtggacacccacatccat gcctcgtcctgcatgaaccagaagcatctgctgcgcttcatcaagcgggcaatgaagcgg cacctggaggagatcgtgcacgtggagcagggccgtgaacagacgctgcgggaggtcttt gagagcatgaatctcacggcctacgacctgagtgtggacacgctggatgtgcatgcggac aggaacactttccatcgctttgacaagtttaatgccaaatacaaccctattggggagtcc gtcctccgagagatcttcatcaagacggacaacagggtatctgggaagtactttgctcac atcatcaaggaggtgatgtcagacctggaggagagcaaataccagaatgcagagctgcgg ctctccatttacgggcgctcgagggatgagtgggacaagctggcgcgctgggccgtcatg caccgcgtgcactcccccaacgtgcgctggctggtgcaggtgccccgcctctttgatgtg taccgtaccaagggccagctggccaacttccaggagatgctggagaacatcttcctgcca ctgttcgaggccactgtgcaccctgccagccacccggaactgcatctcttcttagagcac gtggatggttttgacagcgtggatgatgagtccaagcctgaaaaccatgtcttcaacctg gagagccccctgcctgaggcgtgggtggaggaggacaacccaccctatgcctactacctg tactacacctttgccaacatggccatgttgaaccacctgcgcaggcagaggggcttccac acgtttgtgctgaggccacactgtggggaggctgggcccatccaccacctggtgtcagcc ttcatgctggctgagaacatttcccacgggctccttctgcgcaaggcccccgtcctgcag tacctgtactacctggcccagatcggcatcgccatgtctccgctcagcaacaacagcctc ttcctcagctatcaccggaatccgctaccggagtacctgtcccgcggcctcatggtctcc ctgtccactgatgatcccttgcagttccacttcaccaaggagccgctgatggaggagtac agcatcgccacccaggtgtggaagctcagctcctgcgatatgtgtgagctggcccgcaac agcgtgctcatgagcggcttctcgcacaaggtaaagagccactggctgggacccaactat accaaggaaggccctgaggggaatgacatccgccggaccaatgtgccagacatccgcgtg ggctaccgctacgagaccctgtgccaggagctggcgctcatcacgcaggcagtccagagt gagatgctggagaccattccagaggaggcggctggaaccatagagggtgtcaaagagaag aagaagaaggttcctgctgtgccagaaacccttcagaaaaagaaaaggaattttgcagag ctaaagatcaagtgcctgagaaataagtttgcccaaaagatgcttcaaaaggcaaggagg aaacttatctgtgaaaaagcgaataggcagatacccacttcccgccctggtgtgatcctc gtggccactgtggcccctactttggagatcctgctggtccagttcttcccgccttccgca gagccgtactcctccttctga >gi568815597f:109556711_109688168|GENSCAN_predicted_peptide_4|218_aa MSMTLGYWDIRGLAHAIRLLLEYTDSSYEEKKYTMGDAPDYDRSQWLNEKFKLGLDFPNL PYLIDGAHKITQSNAILCYIARKHNLCGETEEEKIRVDILENQAMDVSNQLARVCYSPDF EKLKPEYLEELPTMMQHFSQFLGKRPWFVGDKITFVDFLAYDVLDLHRIFEPNCLDAFPN LKDFISRFEGLEKISAYMKSSRFLPKPLYTRVAVWGNK >gi568815597f:109556711_109688168|GENSCAN_predicted_CDS_4|657_bp atgtccatgacactggggtactgggacatccgcgggctggcccacgccatccgcctgctc ctggaatacacagactcaagctacgaggaaaagaagtatacgatgggggacgctcctgac tatgacagaagccagtggctgaatgaaaaattcaagctgggcctggactttcccaatctg ccctacttgattgatggggctcacaagatcacccagagcaacgccatcctgtgctacatt gcccgcaagcacaacctgtgtggggagacagaagaggagaagattcgtgtggacattttg gagaaccaggctatggacgtctccaatcagctggccagagtctgctacagccctgacttt gagaaactgaagccagaatacttggaggaacttcctacaatgatgcagcacttctcacag ttcctggggaagaggccatggtttgttggagacaagatcacctttgtagatttcctcgcc tatgatgtccttgacctccaccgtatatttgagcccaactgcttggacgcctttccaaat ctgaaggacttcatctcccgctttgagggcttggagaagatctctgcctacatgaagtcc agccgcttcctcccaaaacctctgtacacaagggtggctgtctggggcaacaagtaa >gi568815597f:109556711_109688168|GENSCAN_predicted_peptide_5|276_aa MPMTLGYWNIRGLAHSIRLLLEYTDSSYEEKKYTMGDAPDYDRSQWLNEKFKLGLDFPNV GPGLLFALAYGKGMLGSLVAQLSFPGFPSIQLPYLIDGTHKITQSNAILRYIARKHNLCG ESEKEQIREDILENQFMDSRMQLAKLCYDPDFSLCPKLIPSGEFLVLLTLRMKPRSYTEK LKPEYLQALPEMLKLYSQFLGKQPWFLGDKITFVDFIAYDVLERNQVFEPSCLDAFPNLK DFISRFEGLEKISAYMKSSRFLPRPVFTKMAVWGNK >gi568815597f:109556711_109688168|GENSCAN_predicted_CDS_5|831_bp atgcccatgacactggggtactggaacatccgcgggctggcccattccatccgcctgctc ctggaatacacagactcaagctacgaggaaaagaagtacacgatgggggacgctcctgat tatgacagaagccagtggctgaatgaaaaattcaagctgggcctggactttcccaatgta ggccctggtctcctctttgcccttgcatatgggaaggggatgctggggagcctggtggcc caactgagcttccccggtttcccatctatccagctgccctacttgattgatgggactcac aagatcacccagagcaacgccatcctgcggtacattgcccgcaagcacaacctgtgcggg gaatcagaaaaggagcagattcgcgaagacattttggagaaccagtttatggacagccgt atgcagctggccaaactctgctatgacccagattttagtttgtgtccaaaattgattcct tctggtgagttcttggtcttgctgactctaagaatgaagccgcggtcctacacggagaaa ctgaaaccagaatacctgcaggcactccctgaaatgctgaagctctactcacagtttctg gggaagcagccatggtttcttggggacaagatcacctttgtggatttcatcgcttatgat gtccttgagagaaaccaagtatttgagcccagctgcctggatgccttcccaaacctgaag gacttcatctcccgatttgagggcttggagaagatctctgcctacatgaagtccagccgc ttcctcccaagacctgtgttcacaaagatggctgtctggggcaacaagtag