GENSCAN 1.0 Date run: 6-Nov-116 Time: 11:04:05

Sequence gi568815581f:45707008_45934761 : 227754 bp : 50.20% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  13403  13552  150  1  0   76    1   157 0.120   6.16
 1.02 Term +  21649  21918  270  0  0   43   44   141 0.066   0.68
 1.03 PlyA +  22268  22273    6                               1.05

 2.14 PlyA -  24712  24707    6                               1.05
 2.13 Term -  32469  32345  125  1  2   79   48    88 0.823   2.45
 2.12 Intr -  33152  33080   73  2  1  104   31    81 0.399   2.98
 2.11 Intr -  37085  37028   58  1  1   47  111    -3 0.007  -3.21
 2.10 Intr -  49895  49614  282  1  0   42   72   164 0.069   6.73
 2.09 Intr -  59081  58973  109  0  1   89   75    -7 0.138  -2.56
 2.08 Intr -  59881  59829   53  0  2  128   71    37 0.904   4.55
 2.07 Intr -  61853  61729  125  2  2   93   39   114 0.685   6.38
 2.06 Intr -  63630  63542   89  1  2   42  107    18 0.337  -1.21
 2.05 Intr -  66868  66710  159  2  0  110  101    -4 0.244   3.06
 2.04 Intr -  67114  66992  123  2  0  101   76     7 0.386   1.36
 2.03 Intr -  68907  68850   58  0  1  102   93    55 0.215   5.86
 2.02 Intr -  74922  74811  112  2  1   48   71    20 0.080  -3.32
 2.01 Init -  77406  77255  152  0  2   74   34   197 0.916  10.42
 2.00 Prom -  79247  79208   40                              -5.96

 3.00 Prom +  81995  82034   40                              -7.06
 3.01 Init +  87259  87328   70  0  1   80   74    14 0.051   0.41
 3.02 Intr +  88040  88080   41  0  2   91   70    65 0.127   2.94
 3.03 Intr + 100048 100090   43  0  1  131  109    55 0.627   9.71
 3.04 Intr + 108835 108899   65  2  2  110   89     9 0.728   1.64
 3.05 Intr + 109297 109369   73  0  1   82   64    65 0.901   2.48
 3.06 Intr + 109456 109575  120  2  0  108   80   133 0.925  14.97
 3.07 Intr + 114348 114433   86  1  2  136   78   167 0.988  20.04
 3.08 Intr + 122208 122314  107  2  2  105   94   162 0.993  17.51
 3.09 Intr + 123087 123207  121  0  1   30   76   247 0.969  18.20
 3.10 Intr + 123410 123563  154  1  1    4   72   344 0.995  24.05
 3.11 Intr + 123873 123933   61  1  1  118  100   125 0.999  14.49
 3.12 Intr + 126131 126203   73  2  1  119   99   151 0.972  18.71
 3.13 Intr + 126445 126530   86  0  2   40   91   105 0.994   4.62
 3.14 Intr + 126707 126842  136  2  1   94   92   249 0.985  26.47
 3.15 Intr + 127000 127041   42  0  0  143   78    16 0.955   4.54
 3.16 Term + 127617 127757  141  2  0  128   49   157 0.996  13.73
 3.17 PlyA + 132322 132327    6                               1.05

 4.00 Prom + 135045 135084   40                              -9.75
 4.01 Sngl + 137900 139954 2055  1  0   81   43  2618 0.972 248.85
 4.02 PlyA + 143667 143672    6                               1.05

 5.00 Prom + 147616 147655   40                              -6.16
 5.01 Init + 148198 148246   49  0  1   60   74    57 0.249   2.61
 5.02 Intr + 154908 154970   63  1  0   69   73    86 0.463   3.89
 5.03 Term + 161610 161797  188  1  2  -26   55   176 0.252   0.55
 5.04 PlyA + 162049 162054    6                               1.05

 6.04 PlyA - 163930 163925    6                               1.05
 6.03 Term - 175177 175044  134  2  2   35   48   158 0.480   4.75
 6.02 Intr - 187862 187605  258  0  0   45   86   113 0.058   4.23
 6.01 Init - 188572 188473  100  1  1   81   66   202 0.127  15.74

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581f:45707008_45934761|GENSCAN_predicted_peptide_1|139_aa
LEPLANDFSHQEDTPVGRHSQTQQAHVLRKRTLCHSFSLLQCPASEPPYRVIELVEWADQ
EEGSVPSIGVMSHGIGIGVAVVTSGRNSSAGTSSLQGEDPGAPALVAWREMDNFWGALWL
LEMEGIEGGGDGTCEGMEL
>gi568815581f:45707008_45934761|GENSCAN_predicted_CDS_1|420_bp
ctagagcccctggcaaatgacttctcacaccaggaagacacccctgtgggaagacactcc
cagacccagcaggcccacgtgctccgcaaacggaccctgtgccacagtttttctctgctg
cagtgcccggcttcagagcccccgtacagggtaattgaactagtggagtgggctgaccaa
gaggaggggagcgttcccagcattggggtgatgagccatggaattggcattggagtggct
gtagtcacaagtggaaggaattcatctgctggcacatccagcctgcagggagaggaccca
ggggcgcctgcacttgtggcctggagggagatggacaacttctggggtgccctctggttg
ttggagatggagggcattgaaggaggaggggacggcacatgcgaaggcatggagttgtga
>gi568815581f:45707008_45934761|GENSCAN_predicted_peptide_2|505_aa
MPGSALPGPAARPGPARLSLAPSRGSLAAVSRRSPARGRLLPASPPPPPPRSQTFSLSEV
ASTDSSFHPVCVDFTDLTSTPPHPADGQGMAAAPGPLPDSVDLKSKSGLYPQICLPDLTQ
PSPCCVALGTSCLLDLGFLVCEIAVVLRAPCRGPHQKDWWMVNLSRVTRGHSFLGPNVSR
VFSFQWVCGAADFKNEAVDLRGWGDKFSVRTNIRRKKKTLSNCIRTRHTIEMRQPKSRDV
TGPRSHSGRELIPQAIRFPNAAPSGPFVLLLDGLQGYQFLGRFAEIVDPIMHGQPVKGAA
HPTAPPQAGTDHSAPAGSLLAAKPWGRGLNVGREVGGNCSSLADQSREAQKQSLSPLPIP
VPLLPPTDSQDSLQSGTFIMKTDVTRQGSEQGFAVSDFDGSNLELLVTRSTQLRYSPPIT
RVWVWQRSNIPHASCRERGLATGPKVQARAVPVEQERDSNFLVKGRGDSRDRKKGSESLR
YNIKYQLNKAGTRTHCLVMQNDAQF
>gi568815581f:45707008_45934761|GENSCAN_predicted_CDS_2|1518_bp
atgcccggctcggcgcttcccggccccgcggcccggcccggcccggcccggctctcgctc
gccccttcccggggaagtctggccgccgtttcccgacgcagcccggcccgcggccgcctc
ttgccggcctcgcccccgccacctcccccacggagtcagaccttctcgctttctgaagtg
gcgtctacagatagctcatttcatcctgtctgtgttgatttcactgacttaacctccacc
ccaccacatcccgctgacggacagggcatggctgctgccccagggcccctgccagactca
gtggacctgaagtctaagtctgggttgtatccccaaatctgcctgccagacctgacccag
ccatccccttgctgtgtggccttgggcacatcatgtcttctggatctcggtttcctggtc
tgtgaaatagcagtggtgctgagagcaccctgtagagggccccaccagaaggactggtgg
atggtaaatctttctagagtgaccaggggccattcattccttggccctaatgtgtccaga
gttttttccttccagtgggtttgtggtgccgctgacttcaagaatgaagctgtggacctt
cgcggatggggagataaatttagtgtcagaaccaacattcgaagaaagaaaaaaacttta
agcaactgcatcagaacacggcacacaatagagatgaggcagccgaaatccagagatgtg
actggcccaaggagtcacagtgggcgagaacttattcctcaggccattagattccctaat
gctgcaccttcgggcccctttgtgctgctgctggacgggctccaaggttatcagtttctt
ggtagatttgcagagatcgtcgatccaataatgcatggacagcctgtgaagggggctgcc
caccccacagccccgccccaggctggcaccgatcactctgcccctgcagggtccctccta
gctgctaagccctggggaagggggttgaatgttggacgagaagtcggaggaaattgctct
tcgctggcagatcaaagccgtgaagctcagaaacaaagcctctcccctctccccatccct
gtccctctgcttccacccaccgacagccaagacagcctgcaatcaggcaccttcattatg
aagactgatgttaccagacagggatctgagcaaggttttgctgtgtctgattttgatggt
agtaatttagagctcttggtaactagatcaacacagctccgttactccccaccaatcacc
agagtgtgggtttggcagaggagcaacatccctcatgccagctgcagggagagaggcttg
gccacaggacccaaagtccaggccagagcagtccctgtggaacaggagcgggacagcaat
ttcttggtgaaaggaaggggagacagcagagatagaaagaagggctctgaaagtctgagg
tacaatatcaagtaccagctgaacaaagctggaactcgcacacattgcttggtaatgcaa
aatgacgcacagttttga
>gi568815581f:45707008_45934761|GENSCAN_predicted_peptide_3|472_aa
MDSHEMLTLQFCTSQVVDQEPIVGGDLEELTQQGQDRDQHCESLSLASNISAHFSISPVL
GPCTEQSTRDVWSDELISALVIVVILSLAGVSGNEWGGLQCNASVDLIGTCWPRSPAGQL
VVRPCPAFFYGVRYNTTNNGYRECLANGSWAARVNYSECQEILNEEKKSKVHYHVAVIIN
YLGHCISLVALLVAFVLFLRLRSIRCLRNIIHWNLISAFILRNATWFVVQLTMSPEVHQS
NVGWCRLVTAAYNYFHVTNFFWMFGEGCYLHTAIVLTYSTDRLRKWMFICIGWGVPFPII
VAWAIGKLYYDNEKCWFGKRPGVYTDYIYQGPMILVLLINFIFLFNIVRILMTKLRASTT
SETIQYRKAVKATLVLLPLLGITYMLFFVNPGEDEVSRVVFIYFNSFLESFQGFFVSVFY
CFLNSEVRSAIRKRWHRWQDKHSIRARVARAMSIPTSPTRVSFHSIKQSTAV
>gi568815581f:45707008_45934761|GENSCAN_predicted_CDS_3|1419_bp
atggattcacatgaaatgcttactctccagttttgtacttctcaagttgtagaccaggaa
ccaattgttggaggagatctggaggagctgacccagcagggccaggacagagaccagcac
tgcgagagcctgtccctggccagcaacatctcagctcacttctccatcagtcctgtccta
gggccctgcacagaacaaagtacacgtgacgtatggtcagatgagctcatttctgcactg
gtcattgttgtcatcctgtccctagctggtgtgagtgggaacgagtggggaggactgcag
tgcaacgcatccgtggacctcattggcacctgctggccccgcagccctgcggggcagcta
gtggttcggccctgccctgcctttttctatggtgtccgctacaataccacaaacaatggc
taccgggagtgcctggccaatggcagctgggccgcccgcgtgaattactccgagtgccag
gagatcctcaatgaggagaaaaaaagcaaggtgcactaccatgtcgcagtcatcatcaac
tacctgggccactgtatctccctggtggccctcctggtggcctttgtcctctttctgcgg
ctcaggagcatccggtgcctgcgaaacatcatccactggaacctcatctccgccttcatc
ctgcgcaacgccacctggttcgtggtccagctaaccatgagccccgaggtccaccagagc
aacgtgggctggtgcaggttggtgacagccgcctacaactacttccatgtgaccaacttc
ttctggatgttcggcgagggctgctacctgcacacagccatcgtgctcacctactccact
gaccggctgcgcaaatggatgttcatctgcattggctggggtgtgcccttccccatcatt
gtggcctgggccattgggaagctgtactacgacaatgagaagtgctggtttggcaaaagg
cctggggtgtacaccgactacatctaccagggccccatgatcctggtcctgctgatcaat
ttcatcttccttttcaacatcgtccgcatcctcatgaccaagctccgggcatccaccacg
tctgagaccattcagtacaggaaggctgtgaaagccactctggtgctgctgcccctcctg
ggcatcacctacatgctgttcttcgtcaatcccggggaggatgaggtctcccgggtcgtc
ttcatctacttcaactccttcctggaatccttccagggcttctttgtgtctgtgttctac
tgtttcctcaatagtgaggtccgttctgccatccggaagaggtggcaccggtggcaggac
aagcactcgatccgtgcccgagtggcccgtgccatgtccatccccacctccccaacccgt
gtcagctttcacagcatcaagcagtccacagcagtctga
>gi568815581f:45707008_45934761|GENSCAN_predicted_peptide_4|684_aa
MACLGFLLPVGFLLLISTVAGGKYGVAHVVSENWSKDYCILFSSDYITLPRDLHHAPLLP
LYDGTKAPWCPGEDSPHQAQLRSPSQRPLRQTTAMVMRGNCSFHTKGWLAQGQGAHGLLI
VSRVSDQQCSDTTLAPQDPRQPLADLTIPVAMLHYADMLDILSHTRGEAVVRVAMYAPPE
PIIDYNMLVIFILAVGTVAAGGYWAGLTEANRLQRRRARRGGGSGGHHQLQEAAAAEGAQ
KEDNEDIPVDFTPAMTGVVVTLSCSLMLLLYFFYDHFVYVTIGIFGLGAGIGLYSCLSPL
VCRLSLRQYQRPPHSLWASLPLPLLLLASLCATVIIFWVAYRNEDRWAWLLQDTLGISYC
LFVLHRVRLPTLKNCSSFLLALLAFDVFFVFVTPFFTKTGESIMAQVALGPAESSSHERL
PMVLKVPRLRVSALTLCSQPFSILGFGDIVVPGFLVAYCCRFDVQVCSRQIYFVACTVAY
AVGLLVTFMAMVLMQMGQPALLYLVSSTLLTSLAVAACRQELSLFWTGQGRAKMCGLGCA
PSAGSRQKQEGAADAHTASTLERGTSRGAGDLDSNPGEDTTEIVTISENEATNPEDRSDS
SEGWSDAHLDPNELPFIPPGASEELMPLMPMAMLIPLMPLMPPPSELGHVHAQAQAHETG
LPWAGLHKRKGLKVRKSMSTQAPL
>gi568815581f:45707008_45934761|GENSCAN_predicted_CDS_4|2055_bp
atggcgtgcctgggcttcctcctccccgtgggcttcctcctcctcatcagcaccgtggcc
gggggaaagtacggcgtggcccacgtggtgtcggagaattggagcaaggactactgtatc
ctgttcagctccgactacatcaccctcccccgggacctgcaccacgccccactcctgccc
ctgtatgatggcaccaaggcaccctggtgcccgggtgaggattccccccaccaggcccag
ctccgctcccccagccagcggcccctccgccagaccactgccatggtcatgaggggtaac
tgcagcttccacacgaaaggctggctggctcagggccaaggtgcccacgggctgctcatc
gtgagccgggtcagtgaccaacagtgctcagacaccaccctggcaccccaggatccccgc
cagcccctggcagacctcaccatccctgtggctatgctccactatgctgacatgctggac
atcctcagccacactcgtggggaggccgtcgtccgcgtggccatgtacgcacccccagag
cccatcatcgactacaacatgctggtcatcttcatcctggctgtgggcacagtggctgca
ggcggctactgggccggcctgaccgaagccaaccggctacagcggcgccgtgcccgaaga
ggaggggggtctggtggtcaccatcagctgcaggaagctgcagcagctgagggagcccag
aaggaagataatgaggacatcccagtggacttcacgccggccatgacaggcgtggtggtc
accctgtcctgctcgctcatgctgctgctctacttcttctatgaccactttgtctatgtc
accattgggatctttggcctgggtgctggcattggcctctacagctgcctgtcacccctg
gtgtgccgcctgtccctgcggcaataccagaggcctccgcacagcctctgggcctctctg
ccgctgcctctgctgctgctggcgagcctgtgcgcaaccgtgatcatcttctgggtggcc
taccgcaatgaggaccgctgggcgtggctcctgcaggacacactgggcatttcctactgc
ctgttcgtcctgcaccgtgtgcggctgcccactctcaagaactgctcctccttcctgctg
gccctgctggcctttgatgtcttctttgtcttcgtcacccccttcttcaccaaaaccggt
gagagcatcatggcgcaggttgccttgggccctgcagagtcttcaagccatgagaggctg
cccatggtactcaaagtgccccggctaagagtctccgccttgaccctgtgcagccagccc
ttctccatccttggcttcggtgacattgtggtccccggcttcctggttgcttactgttgc
cgctttgatgtgcaagtctgctcccgtcagatctacttcgtggcctgcaccgtggcctat
gctgtgggcctgctggtcacattcatggccatggtcctcatgcagatgggccaacctgcc
ttgctctacctagtgtccagcaccctgctcaccagcctggctgtggctgcctgccgccaa
gagctcagcctcttctggactggccagggcagagctaagatgtgtgggctcggctgtgcc
ccttcagctggctctaggcagaagcaggagggcgcagcagatgcccacacagccagcaca
cttgagagaggcaccagccgaggagcaggggacttagacagcaaccctggagaagacacc
actgagattgtcaccatatctgagaatgaagccaccaatccagaggaccgcagtgatagc
tccgagggctggagtgacgcccacttggatcctaatgagctgcccttcatcccccctggg
gcctcggaggagctgatgccactgatgccaatggccatgctgatcccactcatgcccctg
atgcccccgccctcagagctgggccatgtccatgcccaggcccaggcccacgagactggc
ctgccctgggcgggactccacaagaggaagggtttgaaagtaagaaagagcatgtcgacc
caggctcccttgtga
>gi568815581f:45707008_45934761|GENSCAN_predicted_peptide_5|99_aa
MWPQAKDCQQPARAGRVGSFQWVRGLADFQNEASDPRSKVNPLDGYTVAQHRTQCTHNAA
RQNPTFIPGPGNSIFEESPELKQLVKGLSTNYERGPGQD
>gi568815581f:45707008_45934761|GENSCAN_predicted_CDS_5|300_bp
atgtggccacaagccaaggactgccagcagccagcacgagctggaagggtcggttccttc
cagtgggttcgtggtcttgctgacttccagaatgaagcctcggaccctcgcagcaaggtg
aacccactggacggttacactgttgctcagcacagaactcagtgtacccacaacgctgca
cgccagaatcccaccttcataccaggtccaggtaacagcatctttgaggagagcccagag
ctgaagcaattagtcaaaggtttaagtaccaattatgagagaggccctggacaggactga
>gi568815581f:45707008_45934761|GENSCAN_predicted_peptide_6|163_aa
MHLQPRAHPCVAAAVRARERGPAAAESVHIARPGPFPGRPGPSPRAGEAGRGPAVAPTDA
RRTKPQSAEREGEGRRRRGAQRRTARQISEPRRLPDSRQRRGRERTAEEEKVAVVAAAEE
MTYYGRDSGQPSRQEDESGGCQLASAMGRPNADLMDIRMKQQW
>gi568815581f:45707008_45934761|GENSCAN_predicted_CDS_6|492_bp
atgcacctgcagccccgcgcccatccgtgcgtggctgcggctgtgcgtgcccgcgaacgg
ggaccagcggccgccgagtccgtccacatcgccaggccaggtccctttccaggccgccct
ggccccagcccccgagcaggggaggcagggaggggccctgcggtggccccgaccgatgcc
cggcgcacgaagccccagtctgcggagagggagggcgaggggcggcggcgcaggggtgca
cagaggcggacggcgaggcagatttcggagccgcggcgcttacctgatagtcgacagagg
cgaggacgggagaggacagcggaggaggagaaggtggctgtggtggcggcggcagaagag
atgacctactatggaagagacagtggacaaccaagtagacaagaggacgagagtggggga
tgccagttggcctctgccatgggccgtcccaatgctgacctgatggacatacgaatgaag
cagcagtggtga