GENSCAN 1.0 Date run: 6-Nov-116 Time: 21:21:07 Sequence gi568815597r:241397831_241619722 : 221892 bp : 38.92% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 12822 12901 80 0 2 63 100 75 0.021 4.48 1.02 Intr + 21290 21441 152 0 2 84 94 31 0.004 2.26 1.03 Term + 25975 26313 339 0 0 -8 45 264 0.015 6.05 1.04 PlyA + 27602 27607 6 1.05 2.04 PlyA - 28393 28388 6 1.05 2.03 Term - 32188 31916 273 1 0 68 38 174 0.459 4.89 2.02 Intr - 32743 32533 211 0 1 54 30 127 0.420 1.49 2.01 Init - 38654 38527 128 2 2 75 109 157 0.461 16.18 2.00 Prom - 41348 41309 40 -8.75 3.04 PlyA - 41624 41619 6 1.05 3.03 Term - 46088 45982 107 0 2 92 45 82 0.118 1.89 3.02 Intr - 48465 48375 91 0 1 91 20 91 0.086 1.25 3.01 Init - 58319 58122 198 2 0 81 91 164 0.772 14.95 3.00 Prom - 59501 59462 40 -5.25 4.00 Prom + 65311 65350 40 -8.45 4.01 Sngl + 68804 69199 396 1 0 86 50 227 0.739 14.70 4.02 PlyA + 70251 70256 6 1.05 5.14 PlyA - 70821 70816 6 1.05 5.13 Term - 78578 78484 95 0 2 76 42 40 0.048 -4.79 5.12 Intr - 85885 85781 105 2 0 13 115 153 0.800 9.77 5.11 Intr - 97118 97027 92 1 2 114 63 43 0.182 3.12 5.10 Intr - 100140 100002 139 1 1 107 -6 109 0.061 2.00 5.09 Intr - 102760 102607 154 1 1 80 97 71 0.965 5.92 5.08 Intr - 104740 104613 128 2 2 21 103 122 0.947 6.48 5.07 Intr - 106415 106212 204 0 0 80 89 223 0.988 19.85 5.06 Intr - 108338 108173 166 2 1 115 92 164 0.999 18.11 5.05 Intr - 110955 110773 183 0 0 24 80 261 0.909 17.96 5.04 Intr - 114313 114137 177 1 0 94 82 122 0.931 11.39 5.03 Intr - 115883 115773 111 2 0 47 75 150 0.995 9.26 5.02 Intr - 119486 119352 135 2 0 80 100 149 0.999 15.14 5.01 Init - 121892 121761 132 2 0 80 89 188 0.999 16.29 5.00 Prom - 125116 125077 40 -6.75 6.00 Prom + 134516 134555 40 -4.45 6.01 Init + 137719 137775 57 0 0 61 93 37 0.078 2.86 6.02 Intr + 150999 151068 70 2 1 108 102 30 0.146 4.24 6.03 Intr + 151847 151944 98 0 2 54 80 127 0.168 7.31 6.04 Intr + 153125 153214 90 1 0 78 96 22 0.394 1.27 6.05 Intr + 164299 164502 204 0 0 25 95 112 0.003 4.07 6.06 Intr + 168661 168782 122 0 2 138 89 95 0.997 12.97 6.07 Intr + 168981 169170 190 0 1 6 87 174 0.975 7.77 6.08 Intr + 188849 188906 58 1 1 110 83 71 0.295 6.34 6.09 Term + 189947 189966 20 0 2 94 55 8 0.102 -4.40 6.10 PlyA + 191563 191568 6 1.05 7.07 PlyA - 191714 191709 6 1.05 7.06 Term - 196861 196598 264 1 0 72 49 217 0.652 10.72 7.05 Intr - 200167 199916 252 1 0 117 121 110 0.998 13.91 7.04 Intr - 206749 206430 320 2 2 117 101 215 0.863 20.45 7.03 Intr - 207678 207567 112 0 1 29 17 145 0.437 0.53 7.02 Intr - 218106 218024 83 1 2 70 85 88 0.099 5.14 7.01 Intr - 220875 220750 126 1 0 25 38 138 0.064 2.23 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 12400 12210 191 2 2 81 28 156 0.942 5.53 S.002 Sngl + 65667 65975 309 2 0 47 49 194 0.806 7.05 S.003 Intr + 164337 164502 166 0 1 82 95 158 0.966 14.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:241397831_241619722|GENSCAN_predicted_peptide_1|190_aa XKKTEHEARDMISCTSTQTCFYTGYQEGSFQLGKIVGGGGGGEEAGLGNITSPLPRVRGS ITTYSEARINIEVKKCVSQQAVLSGRCHRAGCSGKVQAVAAGAAVGAAVAAVGALCHASP VRRVPKAADYTSPTLLQQGGSHLAGLPTHPSRPSPLRTLATCHRCRPPPLQGGCTEGADR LRSPPPWGLQ >gi568815597r:241397831_241619722|GENSCAN_predicted_CDS_1|573_bp nntaaaaagacagaacacgaggcccgtgatatgatcagctgcacatcaactcagacatgc ttctacacaggataccaggaaggaagctttcagcttgggaaaattgtgggcgggggtggc ggaggggaagaggcaggactggggaatattacttctcctcttcctcgagtcagagggagt attactacatactctgaagccagaataaacattgaagtgaaaaaatgtgttagccagcag gcagtcctgagcggccgctgccatcgcgccggctgcagcggaaaggtacaagcggtggcg gcaggagccgctgtgggagcagcagtggcggcagtgggagccctgtgccacgcgtcccct gtgcgccgtgtccccaaggcagctgactacacctcccccaccctcctgcagcagggtgga tcccacctagcgggactccccacccacccctccaggccttctccactccggactctggcc acttgccaccgttgtcgcccgccgccgctgcagggagggtgcacggagggagcagaccgt ctccggagcccgccgccctgggggctgcagtga >gi568815597r:241397831_241619722|GENSCAN_predicted_peptide_2|203_aa MVYGASEAIGQRQSSAAKPRRSGKKSVREPWARVPGALGVDARAPGGCGHNFSRLKGSCL LALKRAADLPAQRSSSAKGQTASSSGTLTPMPPDWETTPSRSRQTPHTGELWLDYNSLPA REQNWMENEFDELTEVGFSRWVIRNSSELKEHVLTQCKEARNLEKRLEKLLTRITRIEKN INNLMELKNTARELREVCTSINS >gi568815597r:241397831_241619722|GENSCAN_predicted_CDS_2|612_bp atggtctacggggcttctgaggcaatcgggcagcgtcagtcttcagctgctaagccgaga agatctgggaagaagtcagtcagagagccttgggccagagttccaggggctctgggagtg gatgccagagcacctggcggctgtgggcacaacttcagcagacttaaaggctcctgcctg ctggctctgaagagagcagcagatctcccagcacagcgctcgagctctgctaagggacag actgcctcctcaagtgggaccctgacccccatgcctcctgactgggagacaactcccagc aggagtcgacagacacctcatacaggagagctctggctggattacaactcattgccagca agggaacaaaactggatggagaatgagtttgacgaattgacagaagtaggcttcagcagg tgggtaatcagaaactcctctgagctaaaggagcatgttctaacccaatgcaaggaagct aggaaccttgaaaaaaggttagagaaattgctaactagaataaccagaatagagaagaac ataaataacctgatggagctgaaaaacacagcacgagaacttcgtgaagtatgcacaagt atcaatagctga >gi568815597r:241397831_241619722|GENSCAN_predicted_peptide_3|131_aa MWEDPDEPGVTEPLNSDEPSLPEGDVSPAPVEEASPALLTAASPVSEVLACPPLSEGFNP ALLEKMTHWGRGWVPKTPGDPTPVDCLSSAHAEAVMEGPFEEDPTINHSTRVPISDFWET QCKTLAVVYAE >gi568815597r:241397831_241619722|GENSCAN_predicted_CDS_3|396_bp atgtgggaagaccctgatgaacctggagttactgagcccctaaattctgatgaaccttct ttgccagagggagatgtctccccagccccagtagaagaggcctccccagccctactgaca gcagcctccccagtgtcagaggtattggcctgtccacctctgtctgaggggtttaaccct gcattgcttgagaaaatgacacactggggcaggggttgggtccccaagactccaggggat cctactcccgtggattgtctgagctcagctcatgctgaagctgttatggagggtcctttt gaggaggatcccacaataaatcacagcacaagagtccccatctcagacttttgggaaaca caatgtaagacactcgctgtggtatatgcagagtga >gi568815597r:241397831_241619722|GENSCAN_predicted_peptide_4|131_aa MVESKGEAGTSYMARAEGRERGREKCYTLLNNQLSQEFTHYHRNSTKGEGVKPFRRNCPH DWIIIFHQALPPTLEITIRHEIWAGTQIQNISDLKAVMETTNKRKAPEMCVQGVHKRAKR PMWLEDSKHGI >gi568815597r:241397831_241619722|GENSCAN_predicted_CDS_4|396_bp atggtggaaagcaaaggggaagcaggcacatcttacatggccagagcagaaggaagagag agagggagggagaagtgctacacacttttaaacaaccagctttcacaggaattcactcac tatcataggaacagcaccaagggggaaggtgttaaaccattcaggagaaactgcccccat gattggataatcatcttccaccaggccctacctccaacgttggagattacaattcgacat gagatttgggcagggacacagatccaaaacatatcagacctaaaagcagtcatggagacc actaacaagcgcaaagcccctgagatgtgcgttcagggtgttcacaaaagagcaaagagg ccgatgtggctggaagatagtaagcacgggatatga >gi568815597r:241397831_241619722|GENSCAN_predicted_peptide_5|606_aa MYRALRLLARSRPLVRAPAAALASAPGLGGAAVPSFWPPNAARMASQNSFRIEYDTFGEL KVPNDKYYGAQTVRSTMNFKIGGVTERMPTPVIKAFGILKRAAAEVNQDYGLDPKIANAI MKAADEVAEGKLNDHFPLVVWQTGSGTQTNMNVNEVISNRAIEMLGGELGSKIPVHPNDH VNKSQSSNDTFPTAMHIAAAIEVHEVLLPGLQKLHDALDAKSKEFAQIIKIGRTHTQDAV PLTLGQEFSGYVQQVKYAMTRIKAAMPRIYELAAGGTAVGTGLNTRIGFAEKVAAKVAAL TGLPFVTAPNKFEALAAHDALVELSGAMNTTACSLMKIANDIRFLGSGPRSGLGELILPE NEPGSSIMPGKVNPTQCEAMTMVAAQVMGNHVAVTVGGSNGHFELNVFKPMMIKNVLHSA RLLGDASVSFTENCVVGIQANTERINKLMNESLMLVTALNPHIGYDKAAKIAKTAHKNGS TLKETAIELGYLTAEQFDEWVKPKDMLGPNVLLVFAAERNLGETNVIIEKGWYYKDEIKT DSTSQLSLKAGQSVDSCCVQTAAQEKEDRGEGVEDGPFHDTWVLQELQYEIWVGTQPNLI SVDLKY >gi568815597r:241397831_241619722|GENSCAN_predicted_CDS_5|1821_bp atgtaccgagcacttcggctcctcgcgcgctcgcgtcccctcgtgcgggctccagccgca gccttagcttcggctcccggcttgggtggcgcggccgtgccctcgttttggcctccgaac gcggctcgaatggcaagccaaaattccttccggatagaatatgatacctttggtgaacta aaggtgccaaatgataagtattatggcgcccagaccgtgagatctacgatgaactttaag attggaggtgtgacagaacgcatgccaaccccagttattaaagcttttggcatcttgaag cgagcggccgctgaagtaaaccaggattatggtcttgatccaaagattgctaatgcaata atgaaggcagcagatgaggtagctgaaggtaaattaaatgatcattttcctctcgtggta tggcagactggatcaggaactcagacaaatatgaatgtaaatgaagtcattagcaataga gcaattgaaatgttaggaggtgaacttggcagcaagatacctgtgcatcccaacgatcat gttaataaaagccagagctcaaatgatacttttcccacagcaatgcacattgctgctgca atagaagttcatgaagtactgttaccaggactacagaagttacatgatgctcttgatgca aaatccaaagagtttgcacagatcatcaagattggacgtactcatactcaggatgctgtt ccacttactcttgggcaggaatttagtggttatgttcaacaagtaaaatatgcaatgaca agaataaaagctgccatgccaagaatctatgagctcgcagctggaggcactgctgttggt acaggtttaaatactagaattggctttgcagaaaaggttgctgcaaaagtggctgcactt acaggcttgccttttgtcactgctccgaataaatttgaagctctggctgctcatgacgct ctggttgagctcagtggagccatgaacactactgcctgcagtctgatgaagatagcaaat gatattcgatttttgggttctggtcctcggtcaggtctgggagaattgatcttgcctgaa aatgaaccaggaagcagtatcatgccaggcaaggtgaaccctactcagtgtgaagcaatg accatggttgcagcccaagtcatggggaaccatgttgctgtcactgtcggaggcagcaat ggacattttgagttgaatgttttcaagccaatgatgattaaaaatgtgttacactcagcc aggctgctgggggatgcttcagtttcctttacagaaaactgcgtggtgggaatccaggcc aatacagaaaggatcaacaagctgatgaatgagtctctaatgttggtgacagctctcaat cctcatatagggtatgacaaggcagcaaagattgctaagacagcacacaaaaatggatca accttaaaggaaactgctatcgaacttggctatctcacagcagagcagtttgacgaatgg gtaaaacctaaggacatgctgggtccaaatgtccttttggtctttgcagcagaaagaaat ctgggagagacaaatgtaataatagaaaaagggtggtattataaagacgagattaaaact gactccacaagccagttatctctgaaggctggacagtctgtggacagctgctgtgtgcag acagcagcacaggaaaaggaagacagaggagagggagtagaggatggtcccttccacgac acgtgggtattacaggagctacaatatgagatttgggtggggacacagccaaaccttatc agtgtagacttaaaatattag >gi568815597r:241397831_241619722|GENSCAN_predicted_peptide_6|302_aa MFSQRDQQTSKKTPNEKQKVGSLQACFLAKRNFQIDVYEAREDTRVATFTRGRSINLALS HRGRQALKAVGLEDQIVSQGIPMRARMIHSLSGKKSAIPYGTKSQTYFSFGCFVLFFRSD KVPKDVTCDLIVGCDGAYSTVRSHLMKKPRFDYSQQYIPHGYMELTIPPKNGDNKSFTCT LFMPFEEFEKLLTSNDVVDFFQKYFPDAIPLIGDSSGKDEPPMSRGFWASYAGSSGSLGL GVPQELAELFSTGRYTNLWHSRYIRQPQILDPTLTRHGFEDCLVFDELMDKFSNDLIISV KY >gi568815597r:241397831_241619722|GENSCAN_predicted_CDS_6|909_bp atgttcagtcaaagagatcaacaaaccagtaaaaaaactcctaatgagaaacaaaaggtt ggctcattacaagcatgctttcttgcaaagaggaatttccagattgatgtatatgaagct agggaagatactcgagtggctaccttcacacgtggaagaagcattaacttagccctttct catagaggacgacaagccttgaaagctgttggcctggaagatcagattgtatcccaaggt attcccatgagagcaagaatgatccactctctttcaggaaaaaagtctgcaattccctat gggacaaagtctcagacatatttttcttttggatgttttgttctatttttcagatctgac aaagttcccaaagatgtcacttgtgacctcattgtaggatgtgatggagcctattcaact gtcagatctcacctgatgaagaaacctcgctttgattacagtcagcagtacattcctcat gggtacatggagttgactattccacctaagaacggagataacaaatcattcacatgtact ttgttcatgccctttgaagagtttgaaaaacttctaaccagtaatgatgtggtagatttc ttccagaaatactttccggatgccatccctctaattggagattcctccgggaaagatgag ccaccaatgtcccgaggcttttgggcatcttatgcaggctcctcaggctccttggggctt ggcgtgccacaggaactggctgaacttttttctactggtagatacacaaatctttggcac agcagatacatccggcagcctcagatcttggatcctacacttactagacacggctttgaa gactgcttggtatttgatgagttaatggataaattcagtaacgaccttattatctctgtc aaatactga >gi568815597r:241397831_241619722|GENSCAN_predicted_peptide_7|385_aa XTQNDGGVALLKTMEVSTDQKWAGCGIHQRQERGPPLCAAPLGWISQLQAATREARASMG PVQQGTICMQVTEKYTNSIIKADLLQNSGTREDHASPEDPLTPVSNPWIVSIATLTVLAY ERYIRVVHARVINFSWAWRAITYIWLYSLAWAGAPLLGWNRYILDVHGLGCTVDWKSKDA NDSSFVLFLFLGCLVVPLGVIAHCYGHILYSIRMLRCVEDLQTIQVIKILKYEKKLAKMC FLMIFTFLVCWMPYIVICFLVVNGHGHLVTPTISIVSYLFAKSNTVYNPVIYVFMIRKFR RSLLQLLCLRLLRCQRPAKDLPAAGSEMQIRPIVMSQKDGDRPKKKVTFNSSSIIFIITS DESLSVDDSDKTNGSKVDVIQVRPL >gi568815597r:241397831_241619722|GENSCAN_predicted_CDS_7|1158_bp nagacacagaacgatggcggtgttgctctgctgaaaacgatggaggtcagcacggaccag aaatgggctggctgcggaatccatcagagacaggagagagggccacccctctgcgctgca cctcttggttggatcagccagcttcaggcagccactagggaagccagagcctccatgggt ccagtgcagcaaggcactatctgcatgcaggtgacagagaaatatacaaacagtatcatt aaagcggacttgttgcaaaatagcggtacacgtgaagaccatgcaagcccagaagacccc ctgacccctgtcagcaatccttggattgtttccattgccaccctaaccgtgctggcctat gaacgttacattcgcgtggtccatgccagagtgatcaatttttcctgggcctggagggcc attacctacatctggctctactcactggcgtgggcaggagcacctctcctgggatggaac aggtacatcctggacgtacacggactaggctgcactgtggactggaaatccaaggatgcc aacgattcctcctttgtgcttttcttatttcttggctgcctggtggtgcccctgggtgtc atagcccattgctatggccatattctatattccattcgaatgcttcgttgtgtggaagat cttcagacaattcaagtgatcaagattttaaaatatgaaaagaaactggccaaaatgtgc tttttaatgatattcaccttcctggtctgttggatgccttatatcgtgatctgcttcttg gtggttaatggtcatggtcacctggtcactccaacaatatctattgtttcgtacctcttt gctaaatcgaacactgtatacaatccagtgatttatgtcttcatgatcagaaagtttcga agatcccttttgcagcttctgtgcctccgactgctgaggtgccagaggcctgctaaagac ctaccagcagctggaagtgaaatgcagatcagacccattgtgatgtcacagaaagatggg gacaggccaaagaaaaaagtgactttcaactcttcttccatcatttttatcatcaccagt gatgaatcactgtcagttgacgacagcgacaaaaccaatgggtccaaagttgatgtaatc caagttcgtcctttgtag