GENSCAN 1.0 Date run: 5-Nov-116 Time: 14:40:04

Sequence gi568815588r:27575232_27843415 : 268184 bp : 39.92% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Sngl +  13538  14122  585  1  0   46   42   222 0.864   9.33
 1.02 PlyA +  14368  14373    6                               1.05

 2.00 Prom +  19420  19459   40                              -2.35
 2.01 Init +  25003  25051   49  0  1   88   58     1 0.132  -1.74
 2.02 Term +  26278  26411  134  2  2   24   41   188 0.284   4.97
 2.03 PlyA +  27756  27761    6                               1.05

 3.04 PlyA -  29882  29877    6                               1.05
 3.03 Term -  31698  31452  247  2  1   69   43   188 0.248   6.68
 3.02 Intr -  33264  33042  223  1  1   48   68   118 0.171   2.06
 3.01 Init -  41290  41212   79  1  1   78   93    48 0.894   5.59
 3.00 Prom -  49379  49340   40                              -5.35

 4.00 Prom +  59035  59074   40                              -3.05
 4.01 Init +  63070  63139   70  0  1   85   89    18 0.150   2.86
 4.02 Intr +  71061  71185  125  1  2   95   66    23 0.054   0.18
 4.03 Intr +  78383  78548  166  1  1   79   42    89 0.008   2.01
 4.04 Intr +  84560  84838  279  0  0   83   89   121 0.012   8.23
 4.05 Intr +  85216  85467  252  2  0    3  115   206 0.694  11.28
 4.06 Term +  88003  88175  173  2  2   13   43   162 0.853   1.21
 4.07 PlyA +  88315  88320    6                               1.05

 5.16 PlyA -  90459  90454    6                               1.05
 5.15 Term - 100250  99998  253  1  1   83   48   213 0.436  10.93
 5.14 Intr - 100323 100290   34  1  1  107  105    29 0.200   2.86
 5.13 Intr - 103431 103317  115  0  1   83   91    24 0.111   1.30
 5.12 Intr - 112830 112757   74  1  2   87   46    80 0.060   1.91
 5.11 Intr - 126959 126771  189  0  0   54   54    90 0.001   0.84
 5.10 Intr - 159560 159225  336  0  0   63  111   317 0.800  26.07
 5.09 Intr - 160143 159990  154  0  1   39   76   118 0.992   4.42
 5.08 Intr - 165953 165906   48  2  0   60   91    48 0.084   0.26
 5.07 Intr - 166263 166114  150  0  0   41   89   125 0.635   7.34
 5.06 Intr - 166753 166614  140  2  2   60   76   148 0.740  10.06
 5.05 Intr - 167244 167050  195  1  0   -5   58   141 0.506   0.26
 5.04 Intr - 167787 167490  298  0  1   22   43   220 0.841   6.42
 5.03 Intr - 168160 167942  219  1  0   17   77   119 0.060   1.28
 5.02 Intr - 168357 168298   60  0  0   90  100    76 0.097   7.01
 5.01 Init - 169353 169186  168  0  0   81   69   245 0.123  19.48
 5.00 Prom - 174060 174021   40                              -8.45

 6.03 PlyA - 174485 174480    6                               1.05
 6.02 Term - 176546 176405  142  1  1   91   51   129 0.786   5.92
 6.01 Init - 176717 176626   92  2  2   79   43   108 0.834   5.41
 6.00 Prom - 177033 176994   40                              -5.45

 7.03 PlyA - 177194 177189    6                               1.05
 7.02 Term - 192335 192154  182  0  2   95   54   166 0.874  10.69
 7.01 Init - 196205 196052  154  2  1   76   44    70 0.901   1.59
 7.00 Prom - 196823 196784   40                              -5.25

 8.00 Prom + 199142 199181   40                              -4.65
 8.01 Init + 203360 203398   39  1  0   58   98    19 0.036   0.17
 8.02 Intr + 208815 208884   70  2  1   52  100    32 0.023  -1.36
 8.03 Intr + 210687 210850  164  1  2   88   92    85 0.027   7.67
 8.04 Term + 237307 237456  150  0  0   76   39   149 0.205   5.73
 8.05 PlyA + 238487 238492    6                               1.05

 9.04 PlyA - 238907 238902    6                               1.05
 9.03 Term - 239618 239538   81  2  0   84   49   108 0.683   3.21
 9.02 Intr - 242885 242747  139  1  1   15   84   112 0.187   3.05
 9.01 Init - 245087 244915  173  2  2   76   53   114 0.407   5.66
 9.00 Prom - 245141 245102   40                              -7.95

10.00 Prom + 246361 246400   40                              -4.75
10.01 Sngl + 250685 250981  297  1  0   56   47   203 0.695   8.49
10.02 PlyA + 253420 253425    6                               1.05

11.03 PlyA - 253443 253438    6                               1.05
11.02 Term - 259340 259226  115  1  1   84   45   126 0.033   4.96
11.01 Intr - 260809 260745   65  1  2  136   -7   126 0.009   4.50

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 228890 228981   92  1  2   48   77   110 0.915   5.91

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588r:27575232_27843415|GENSCAN_predicted_peptide_1|194_aa
MKRNEQNLQEIWDYVKRPNLHFIGVPENDGENGTKLENTLQDIIQENFPNLARQANIQIQ
EIQRTPQRYSSRRAIQRHIIARFTKVEIKEKLLRGAREKGQVTHKVKTIRLTAHLSAEIL
QARRQWGSIFNILKEKNFQPRISCPAKLSFVNEGEIKSFTDKQILRDFVTKRPALQELLK
EALNMERKNRYQPP
>gi568815588r:27575232_27843415|GENSCAN_predicted_CDS_1|585_bp
atgaaaaggaatgaacaaaacctccaagaaatatgggactatgtgaaaagaccaaaccta
cattttattggtgtacctgaaaatgatggggagaatggaaccaagttggaaaacactctt
caggatattatccaggagaacttccccaacctagcaagacaggccaacattcaaattcag
gaaatacagagaacaccacaaaggtactcatcgagaagagcaatccaaagacacataata
gccagattcaccaaggttgaaattaaggaaaaattgttaaggggagccagagagaaaggt
caggttacccacaaagtgaagaccatcagactaacagcacatctctctgcagaaatccta
caagccagaagacagtgggggtcaatattcaacattcttaaagaaaagaactttcaaccc
agaatttcatgtccagccaaactaagcttcgtaaatgaaggagaaataaaatcctttaca
gacaagcaaattctgagagattttgtcaccaaaaggcctgccttacaagagctcctgaag
gaagcactaaacatggaaaggaaaaaccgataccagccaccgtaa
>gi568815588r:27575232_27843415|GENSCAN_predicted_peptide_2|60_aa
MEFHHIGQVGFEHLTSDTQDPNCIACWIEEQPVVEELAFATALAAKTSYSSRAALIFQLK
>gi568815588r:27575232_27843415|GENSCAN_predicted_CDS_2|183_bp
atggagtttcatcacattggccaggttggttttgaacacttgacctcagatacccaggac
ccaaactgcatagcatgttggattgaggagcagcccgtggtagaagaattggcctttgca
acggctctagcagccaaaacttcatattcatcaagagcagctttgatcttccagctgaaa
taa
>gi568815588r:27575232_27843415|GENSCAN_predicted_peptide_3|182_aa
MGNLKQLFNLQASLNVTAFHQLHFEKGQVVEEKEKILTFHLNFRFKELGVFLSGLKEKQL
YQRKEWTCYNQHLQQEVSLPTFSGVSWYIQTSYSFLKSQTCLARCQHLAPSAGGRKPMST
KNQPSSVKREPTTPDDQNHQFAYVDTEIRQSVDFNFSHMGSRSPSNNQEERLCLSVKMSW
MA
>gi568815588r:27575232_27843415|GENSCAN_predicted_CDS_3|549_bp
atgggaaacctgaagcagctcttcaatctccaggcttcacttaatgttactgctttccac
cagctccactttgagaaagggcaggttgtagaagaaaaagaaaagatattaacctttcac
ctgaatttcaggtttaaagagttgggcgttttcctatctggccttaaagaaaagcagctt
tatcagaggaaagaatggacctgctacaatcagcatttacaacaggaagtaagcttgcca
actttcagtggtgtaagctggtacattcagacttcttacagcttcctgaaatcacagacc
tgccttgccagatgccagcatcttgcaccttctgcaggtggaagaaagcccatgagtaca
aagaaccaaccctcctcagtcaaacgagaacccacgactccggacgaccagaatcaccaa
tttgcttatgtagacactgaaataagacaaagtgtcgactttaattttagccatatggga
agtcgcagtccttcaaacaatcaggaagaacggctttgcctttcagtgaaaatgtcttgg
atggcttag
>gi568815588r:27575232_27843415|GENSCAN_predicted_peptide_4|354_aa
MERFGLGDERREKEVSGVSSSFCEHYTSIWRDKTCYTHETITEKCNDAIHLFTKLETDSR
NVMHISLAESLVVVALQERNREGLKLHECLLGPSVRTDYETERGHPLRGLSAIVREAQPT
GSFTCLLLLVVKLKQTVLPGLRALGLGRGQSGHQPNWWFGMLVPEIWPVSRTGSGAQAVA
DHVVQVLFCFSGRSAEWEELWVLPRRTLWQHRSEKKKSSEAAEEDLRTMEEQVRGTWGET
VQGGSKGGERNGATSESEEDNELDCDPLRESQGPQHSERMGSHSRAPGCAFTHCIPEVKE
EIDFEKQAQRGDERENKHGGENNNSELPEDKEEMEDNKQSQSKIKRHENNTMKT
>gi568815588r:27575232_27843415|GENSCAN_predicted_CDS_4|1065_bp
atggagaggtttggattaggagatgagaggagggaaaaagaggtttctggggtcagtagc
agtttctgtgaacattatacatctatttggagagacaagacatgttatacccatgagaca
attacagaaaaatgcaatgatgcaattcatctattcaccaaactagaaacagattccaga
aatgtcatgcacataagccttgcagagtccctggttgtggtggccttgcaggaaagaaat
agagagggtctcaaactccatgaatgtctgctgggtcctagtgtgaggacagattatgag
acagagaggggacaccctcttagaggactttcagccattgtaagggaggctcaacccaca
gggtcctttacctgcctcctacttctggtggtgaagctaaagcaaactgttctacctggt
ttgagggctttagggctgggccggggtcaaagtggtcaccagcccaattggtggtttggc
atgctggttccagagatttggcctgtttcccgcacaggcagtggggcccaggctgtggcc
gaccacgtagtccaagtgctgttttgtttctcaggaaggtctgcagagtgggaggagctg
tgggttctgcccagaaggactttgtggcagcacagaagtgaaaaaaaaaaaagcagcgag
gcagcggaggaggatctgagaacaatggaggaacaagtcagaggaacttggggtgagact
gttcagggagggagcaaaggaggagagaggaatggagcaacttcagagagtgaagaagat
aatgagctggactgtgatcccctgagggagagtcaagggcctcagcattctgaacgaatg
ggatcccacagcagggccccaggatgtgcatttactcactgtatcccagaagttaaagag
gaaatagattttgaaaagcaagctcaaagaggggatgaaagagaaaacaagcacggaggg
gagaataacaacagtgagcttcctgaggacaaggaagaaatggaagataataaacaaagc
caaagcaaaatcaagagacatgaaaataataccatgaaaacttga
>gi568815588r:27575232_27843415|GENSCAN_predicted_peptide_5|810_aa
MGRAKSAARLGAVTLRLRGRADLLLTPVRGRQRLRASATPEVSCWLSPTVAAVAREPGDP
AKFKALNLQGSEHSQPLSGAVLFEDGGASERERGGRPYSGVLDSPHARPEVGIPDGPPLK
DNLGLRHRRTGYAHSPGGPARLRPGGVGVLQRAWWRLRAPGAAPRPSAAWTSELGVICPS
PDCGCRCAAEGEEKGLSSVQRNRRSRAVGESRLRLRGLLFGHSSPSPEPRGSPGPRSREG
RLPPGSPGPACSAASLLSKTEIRFRQNVTTSRFGWNCISKCLKRQKPRRLRRRRASLGER
LGSLAAARDTSRTLGDLSPDHKRRKASGSRQEIPEGSFESQPHTLPGLPGQGHSRVRRQR
NGGKVRHKRQALQDMARPLKQWLYKHRDNPYPTKTEKILLALGSQMTLVQFSADAPCRKE
CRMRCQVSNWFANARRRLKNTVRQPDLSWALRIKLYNKYVQGNAERLSVSSDDSCSEDGE
NPPRTHMNEGGYNTPVHHPVIKSENSVIKAGVRPESRASEDYVAPPKYKSSLLNRYLNDS
LRHVMATNTTMMGKTRQRNHSGSFSSNEFEEELVSPSSSETEGNFVYRTVALGDAFGFVR
AWLQKRVLYSLIFLDSVLQLTLNRRNLPFVEGVEISFALKANEVLMYSLLEIGDKDEMQR
SQAAGQLGTLMLSERAEAPCIDGIDTLTGSATHNIFDSHRLSSCLCSNHSEIDSSDTLEN
GSNKGESDGIFNANCSEFDRLLSVLLNSSAANRKGPSKDDTYWKEINAAMALTNLAQGKD
KLQGTTSCIIQKSSHIAEVKTVKVPLVQQF
>gi568815588r:27575232_27843415|GENSCAN_predicted_CDS_5|2433_bp
atggggcgggcgaagtccgccgcgcgccttggggctgtaactctgcggctccgcggccgc
gcggacttgctcctgacgccggtgcgcggcaggcagaggctcagggcctccgcgacccca
gaggtgagctgctggctctccccgacagtggcagcagtggcccgggagcccggcgacccg
gccaagttcaaggccctgaacctgcaggggtctgagcactctcagccactcagcggtgcg
gtgctgtttgaggacggaggcgcctcggagcgggagcggggtggccggccctacagcggt
gtcctggacagtcctcacgcccgccccgaggtgggcattcccgacggcccgcccctcaag
gacaacctcggcctgagacaccggaggaccgggtatgcgcactcccctgggggccctgcc
cgcctgcggcccggaggggtgggggtgctccaaagagcttggtggagactaagggcgcca
ggagctgcccctcggccctccgctgcctggacctcggagttgggggttatttgcccctct
ccggactgtgggtgccggtgcgcggctgagggcgaggaaaaaggactgagctcggtgcag
agaaacaggcgaagccgagcagtcggggagtccaggctcaggctccgcggacttctcttc
ggtcactcgagcccctcgccagagccccgcggctcgcctggcccaaggagtcgggaaggt
cgcctcccgccagggagtcccgggcctgcttgttctgcggcgtcgctgcttagtaaaaca
gagattcggtttcgtcaaaatgttactaccagtcgtttcgggtggaattgcatttcaaag
tgtttaaagaggcagaaacccaggaggctccggcgcaggcgcgcaagcctcggggagcgg
ctgggctcgctggcggccgcgcgcgacacctcccggaccctcggcgacctttcccctgac
cacaagcgacggaaagcctctgggagcagacaagagatccccgagggaagtttcgagagc
cagccgcataccctgccaggcctgccaggccagggccactcgagggtgcggcgccagcgg
aatggcgggaaggtgaggcacaagcggcaggccctgcaagacatggcgcgacccctcaag
cagtggctttacaagcaccgtgacaacccgtaccccaccaagaccgagaagatactcttg
gccctcggctcgcagatgacgctagtgcagttcagtgcagatgctccgtgcaggaaagaa
tgcaggatgcgctgtcaggtgtcaaattggtttgctaatgcaagacgtcggcttaagaat
accgttcgacagccagatttaagctgggctttgagaataaagttatacaacaagtatgtt
caaggcaatgctgaacggcttagcgtaagcagtgatgactcatgttctgaagatggagaa
aatcctccaagaacccacatgaacgaagggggctataataccccagttcaccatcctgtg
attaaaagtgagaattcggtcatcaaagcgggagtgaggccagagtcacgggccagtgag
gactacgtggcaccccccaaatacaagagcagcttgttgaaccgttaccttaatgactct
ttgagacatgtcatggccacgaacactaccatgatgggaaaaacaaggcaaagaaaccac
tcgggatcttttagctccaatgaatttgaggaagaattagtgtctccatcgtcatcagaa
actgaaggcaactttgtctatcgcacagtggctttgggagatgcttttggctttgtccgt
gcctggcttcagaagagagttttgtactccctcattttcctggattcagttctacagcta
actttgaatagaaggaatttgccctttgtggaaggagtggagatcagttttgcattgaaa
gctaatgaggtcttaatgtattctctcctggagattggtgataaagatgaaatgcagaga
agtcaagcagctggccagcttggcacgctgatgctaagtgagagagcagaggctccatgc
attgatggcattgatacattgacgggttctgcaacccataacatatttgatagccacagg
ctttcttcttgtctgtgcagtaaccactctgagattgacagttcagacactctggaaaac
ggatccaataagggtgaaagcgatggcattttcaatgcaaactgcagtgagtttgatcgt
ttactttctgttttgcttaactccagcgcagctaacagaaaaggaccaagcaaggatgac
acgtattggaaggagatcaacgcagctatggccttaacaaatcttgcacagggaaaggac
aaactgcagggaactaccagctgcatcatccagaagtcgtcccatatagcagaagtaaag
actgtcaaagtgccgctggtgcagcagttttaa
>gi568815588r:27575232_27843415|GENSCAN_predicted_peptide_6|77_aa
MEFNHHRGDHGAQKAFAIRPFTEKGFWTQVRLFAARLTWVTACYGLTCVPSKFICDTLTP
NGTELGDKAFNEVNKVK
>gi568815588r:27575232_27843415|GENSCAN_predicted_CDS_6|234_bp
atggagttcaatcatcatcggggagatcacggggcccagaaagcttttgctatccggccc
tttacagaaaaaggtttttggactcaggttaggttatttgctgctcggctgacatgggtc
actgcctgttacggactgacctgtgtcccctcaaaattcatatgtgacaccctaaccccc
aatggaactgaacttggagacaaggcctttaatgaggtgaataaagttaaatga
>gi568815588r:27575232_27843415|GENSCAN_predicted_peptide_7|111_aa
MRSTCESRSPGAQAHFKTETSPTGLEEASPLDSLSSHQLGSSITQGNTTERTVHLMETFQ
VFVPELADFLVSTYKSPMGVRGEVQAISAVDGNHSPGTSSSSGFLITALPE
>gi568815588r:27575232_27843415|GENSCAN_predicted_CDS_7|336_bp
atgagaagcacttgtgaaagtcgcagcccaggggcacaggctcatttcaagactgagacc
tcccccacaggactagaggaagcctcccctctcgactccttatcttcacaccagttgggc
tccagtatcacacaggggaatacaactgaaagaacagtccatctcatggagacatttcag
gtttttgttccagaattagcagactttctggtgtccacatacaagagtcccatgggtgtc
agaggtgaggtgcaggccatcagtgctgttgatgggaatcacagtcctggcaccagcagc
agtagtggtttcttgatcacagcacttcctgaatga
>gi568815588r:27575232_27843415|GENSCAN_predicted_peptide_8|140_aa
MIFAMGLCSTSPRRSGKEFVDIFQNHHNCHQRTVNKGRSNKAFSFISMLVCHLGLLLTYL
RPQNVEGWACRSGSIWKPEALDSKRVLRKYQQEPACGYWIYNQLQLPGDPGQGTQPYPVE
AVTHKEEKKGHKNKNINLKT
>gi568815588r:27575232_27843415|GENSCAN_predicted_CDS_8|423_bp
atgatctttgccatgggtctttgcagtacctctcccaggaggagtggtaaggaatttgtg
gacatatttcagaaccaccacaattgccatcagaggacagtgaataaaggcagatctaat
aaggctttctctttcatctcaatgttggtgtgccatcttgggttgcttcttacttacctg
agacctcagaatgtggagggctgggcctgccgttctggctctatttggaagcccgaggct
ctggacagcaaaagagtgctcaggaagtaccagcaagagccagcctgcggatattggata
tacaaccagctgcagcttcctggagatcctggtcaggggacccaaccatatccagtagaa
gctgtcacacataaggaggagaagaagggacacaagaataaaaacattaatctaaagaca
taa
>gi568815588r:27575232_27843415|GENSCAN_predicted_peptide_9|130_aa
MTRSEGGWAKRRLDFDRHQVKGFLSKRAVQGGSSQWHVEPGPLVSMEVSQRQGRAWSRLA
ALTFNYLVVMWKSEISRLTVGTPYSRITKVTMVDVGYEEVSKSHDSVIGEGTGKNRVGFM
DSYGFPGGML
>gi568815588r:27575232_27843415|GENSCAN_predicted_CDS_9|393_bp
atgaccaggtctgagggaggatgggcgaaacgcaggctagattttgacaggcatcaggtc
aagggtttcctttccaagagagccgtacagggtggcagcagccagtggcatgtggagcct
gggccattggtgtccatggaggtcagccagcgacaaggcagggcctggagcaggctggca
gcgctcacattcaattatttggtagtgatgtggaagagtgaaatctccaggcttacagtt
ggcactccatactccaggataacaaaggttaccatggtggatgtgggctacgaggaggtc
agtaaaagccatgactcagttattggtgaaggcacagggaagaacagagttggatttatg
gacagttatggttttcctggagggatgttgtaa
>gi568815588r:27575232_27843415|GENSCAN_predicted_peptide_10|98_aa
MTPHPHRKQTRQKYFPPSTSPLTSDLDLLNSIDLHFNSTLINQPLLWSYPDLSNTSDCFI
SASRCCLSEIPGSGGRRMGSRTSRGTDLPVQSPCSGPP
>gi568815588r:27575232_27843415|GENSCAN_predicted_CDS_10|297_bp
atgacacctcatcctcatagaaaacaaacaagacaaaaatattttcctccaagcacctcg
cctctaacttcagaccttgacctcctcaactccattgatcttcacttcaattccacatta
attaatcaaccattattatggtcatacccggaccttagtaataccagtgactgcttcatc
tctgcctctcgttgctgtctatcagaaattccaggttctggaggacgcagaatgggatca
agaacatcaagaggcactgatctaccagtgcagtctccctgctctggtccaccatga
>gi568815588r:27575232_27843415|GENSCAN_predicted_peptide_11|59_aa
ATQREDDEDEDLYDDPLPLNVYTVPWMSTNLHPPLFDDSTWQPAREDASQLPLSLGVAV
>gi568815588r:27575232_27843415|GENSCAN_predicted_CDS_11|180_bp
gctactcaacgggaagatgacgaggatgaagacctttatgatgatccacttccacttaat
gtatacacagtcccctggatgtccaccaaccttcatcctcccctctttgatgatagcaca
tggcagccagccagagaagatgcttctcagcttcccttgagtcttggtgtggccgtgtga