GENSCAN 1.0    Date run: 6-Nov-116    Time: 13:10:34

Sequence gi568815578f:1018774_1265077 : 246304 bp : 43.35% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.10 PlyA -    103     98    6                               1.05
 1.09 Term -   1351   1340   12  1  0  120   37     8 0.093  -2.90
 1.08 Intr -   1989   1935   55  2  1   86   95    49 0.064   4.38
 1.07 Intr -   7610   7520   91  0  1   72   84    46 0.033   1.65
 1.06 Intr -  23247  23140  108  1  0   58  101    27 0.063   1.26
 1.05 Intr -  34173  33980  194  2  2   82   72   123 0.295   9.24
 1.04 Intr -  37691  37631   61  0  1   64  116    38 0.628   1.99
 1.03 Intr -  39694  39589  106  1  1   81   98     5 0.446   0.59
 1.02 Intr -  40555  40457   99  1  0  156   72    11 0.765   6.61
 1.01 Init -  49240  49190   51  1  0   85  110    15 0.248   4.56
 1.00 Prom -  54148  54109   40                              -2.26

 2.00 Prom +  58264  58303   40                              -1.86
 2.01 Init +  73749  73787   39  2  0  111   97    20 0.410   5.39
 2.02 Intr +  76392  76517  126  2  0  112  121   -55 0.132   1.08
 2.03 Intr +  94591  94636   46  0  1   91   97    47 0.464   3.88
 2.04 Intr +  97648  97772  125  2  2   53   94    -5 0.323  -3.10
 2.05 Intr +  99980 100129  150  1  0  114  101   134 0.952  17.66
 2.06 Intr + 106725 106877  153  2  0   73  105   232 0.998  23.67
 2.07 Intr + 108653 108735   83  1  2   85   80    99 0.453   7.24
 2.08 Intr + 116348 116533  186  2  0  101  105   153 0.490  17.10
 2.09 Term + 128332 128419   88  1  1  -49   55   211 0.011   1.23
 2.10 PlyA + 129704 129709    6                               1.05

 3.00 Prom + 141328 141367   40                              -2.86
 3.01 Init + 141811 142114  304  0  1   71   13   320 0.829  20.24
 3.02 Intr + 142352 142811  460  0  1   17   -6   547 0.431  30.44
 3.03 Intr + 144357 144410   54  0  0  110   92     7 0.577   1.39
 3.04 Intr + 145545 145703  159  0  0  110   87    67 0.750   7.90
 3.05 Term + 146256 146307   52  0  1  108   49    70 0.825   2.00
 3.06 PlyA + 148192 148197    6                               1.05

 4.03 PlyA - 148581 148576    6                               1.05
 4.02 Term - 162814 162075  740  2  2  125   48   695 0.851  62.83
 4.01 Init - 166257 165948  310  0  1   54   39   218 0.901   8.99
 4.00 Prom - 169382 169343   40                              -7.06

 5.06 PlyA - 169433 169428    6                               1.05
 5.05 Term - 171476 171095  382  1  1   29   49   744 0.102  58.91
 5.04 Intr - 172152 172037  116  0  2   18   94    69 0.056  -0.25
 5.03 Intr - 175449 175336  114  0  0   95   84    26 0.089   3.54
 5.02 Intr - 177181 177112   70  0  1   99   67     0 0.255  -1.82
 5.01 Init - 179003 178867  137  2  2   95   71    96 0.569   8.21
 5.00 Prom - 190234 190195   40                              -3.96

 6.00 Prom + 191881 191920   40                              -6.16
 6.01 Init + 199335 199707  373  2  1   24   58   220 0.249   9.73
 6.02 Intr + 202082 202333  252  0  0   67   35   132 0.300   3.21
 6.03 Term + 203018 203280  263  0  2   70   55   152 0.539   5.79
 6.04 PlyA + 203299 203304    6                               1.05

 7.00 Prom + 204061 204100   40                              -2.46
 7.01 Init + 206643 206733   91  2  1   69   78    32 0.063  -0.99
 7.02 Intr + 212753 212846   94  0  1   96   22    65 0.053  -0.28
 7.03 Intr + 215312 215418  107  2  2  111   77    27 0.298   3.76
 7.04 Intr + 226918 226986   69  2  0   85   62    54 0.245   1.65
 7.05 Intr + 227045 227085   41  0  2  121   75     8 0.321   0.74
 7.06 Term + 235496 235684  189  1  0   86   49   116 0.531   4.95
 7.07 PlyA + 236540 236545    6                               1.05

 8.02 PlyA - 236808 236803    6                               1.05
 8.01 Term - 242207 242142   66  2  0  130   43    79 0.872   5.54

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 128410 128246 165 1 0 37 78 179 0.870 11.96 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:1018774_1265077|GENSCAN_predicted_peptide_1|258_aa MRKNQQREQCQMLLSNQILIPNKHLAPKLCLRVCLHRTQPATQILIPGGQHSQSLWEVSP SSPSEEQPPVLSPGGVDPFPSLSYKVGSDDSTHMQARSQDSVKEKRSSRKVLGHLSLPCM GHGLLPTLLMLTALALGLGSGDRTQQCCQSLMPMLKTMRGGKTKVKLGEKERSPHSSSAA TFFTLTLITHNTWVWFFARDESPPLEWTWMELEAVILSKLMQEQKTKYHKFSLISGSKRE LLMAPSEGGEAEPAKYLT >gi568815578f:1018774_1265077|GENSCAN_predicted_CDS_1|777_bp atgaggaagaaccagcaaagagaacagtgtcaaatgcttctgagcaatcagatattgatt cctaataaacaccttgcacctaaactctgtcttcgtgtctgcctgcatagaacccaacct gcaacacaaattctgattcctggaggtcagcattcacaatccctttgggaagtatctccc tcatctccctcggaggaacagcccccagttctcagtccaggtggggttgaccccttccca tccctgagctacaaagttggaagtgatgattccacacatatgcaggctagatctcaagac agtgtcaaggaaaaaaggagcagccgcaaggtgctgggccacctgagtctgccatgcatg ggccacgggctcctaccgaccctcctgatgctcacagccctggcactggggttgggcagt ggggacaggacacaacagtgctgccaatccctgatgcccatgctcaagaccatgagagga ggaaagaccaaggtgaaactgggggaaaaggagaggtctccccactcctcttccgcggca acattcttcaccttaactcttatcactcacaacacgtgggtttggttttttgcccgtgat gagtctcctccactggaatggacatggatggagctggaggccgttatccttagcaaacta atgcaggaacagaaaaccaaataccacaagttctcactcataagtgggagcaaaagggag ctgctgatggcaccttcagagggtggtgaagctgaaccggctaagtacctcacgtaa >gi568815578f:1018774_1265077|GENSCAN_predicted_peptide_2|331_aa MAVTVLGIVFMFKKLQMRGRSLEIVHLMCPSPHFTDGDAEAQGGAVTCYITNLVKPPTAL IGQNGTLADVDLLIQNLCRRGSRIWLSQGNFRNTKFGEAQGQLTKGVMDYKETPKSRALM AGLEVLFASAAPAITCRQDALVCFLHWEVVTHGYFGLGVGDQPGPNDKKSELLPAGWNNN KDLYVLRYEYKDGSRKLLVKAITVESSMILNVLEYGSQQVADLTLNLDDYIDAEHLGDFH RTYKNSEELRSRIVSGIITPIHEQWEKANVSSPHREFPPATAREVDPLRIPPHHPHTSRQ PPWPRSIMDIVSVIIIIIIIIIIIIIITTLC >gi568815578f:1018774_1265077|GENSCAN_predicted_CDS_2|996_bp atggctgtcactgttcttggcatagtgttcatgttcaagaaacttcaaatgagggggagg agtttagaaattgtccatcttatgtgcccttctccccattttacagatggggacgctgag 
gcccagggaggagcagtgacctgctatatcacaaaccttgttaagccacccacagccctc atagggcagaatgggactcttgctgatgtagaccttctcatccagaatctctgcagaagg ggttcaagaatctggttgtctcaaggaaactttaggaacacaaaatttggagaggcacag ggccagctgacaaagggagtgatggactataaagagacgccgaagtcgcgggcgctcatg gcgggcctggaggtactgttcgcatcggcagcgccggccatcacctgcaggcaggacgcg ctcgtctgcttcttgcattgggaagtggtgacacacggttacttcggcttgggtgtcggt gaccagccgggtcccaatgataagaagtcagaactgctgccagctgggtggaacaacaat aaagacctgtatgtcctccggtatgagtataaggatgggtccagaaagctccttgtgaaa gccatcaccgtggagagcagcatgatcctcaatgtgctggaatatggctcacagcaagtg gcagacttgaccctgaacttggatgattatatcgatgcagaacacctgggtgacttccac aggacctacaagaacagtgaggagcttcggtctcgtattgtgtctggaatcatcacacct atccatgagcagtgggaaaaggctaatgtaagcagtccccaccgggagttcccccctgct accgccagagaggtggacccactccggattcctccacaccacccacacaccagtcggcag cctccctggcccaggtccattatggacattgtttccgttatcatcatcatcatcatcatc atcatcatcatcatcatcatcaccaccctgtgttga >gi568815578f:1018774_1265077|GENSCAN_predicted_peptide_3|342_aa MEEEITILVIDNGSSMCKAGFAGDDAPGTMFPSTVRCPQHQGMMVGMGQKVFYLGDKAQS KHGILTMKYPIEHRIFTSWEGMEKIWHHTLQQAACVPGGAPGPDHLMKILTEYDYSFPTT AKREIMRDIKEKQCCVAPDFEQEMATAMSSFSLEKSYKLPDSQVITIGNMQFQCLEVLFQ PFFLGVEFCSIHETTFSSIIRCDVGIHKDLYANTVLASGTTMYPSISNRMQKEITALVPA QGRSRSSCSQSASTPCDPLGPFVVGGEDLDPFGPRRGGMIVDPLRSGFPRALIDPSSGLP NRLPPGAVPPGARFDPFGPIGTSPPGPNPDHLPPPGYDDMYL >gi568815578f:1018774_1265077|GENSCAN_predicted_CDS_3|1029_bp atggaagaagagatcaccatactcgtcattgacaatggctccagcatgtgcaaagctggt tttgctggggacgacgcccctggaaccatgtttccctccactgtcaggtgcccccaacac cagggcatgatggtggggatgggccagaaggtcttctacctgggcgacaaggcccagagc aagcacggcatcctgacgatgaagtaccccatcgagcacaggattttcaccagctgggaa ggcatggagaagatctggcaccacaccctacaacaagctgcatgtgttcctggaggagca cctggacctgaccacctcatgaagattctcacagagtacgactacagcttccctaccaca gccaagcgggagatcatgcgtgacatcaaggagaagcagtgctgcgtcgccccggacttt gagcaggagatggccaccgctatgtcatccttctccctggagaaaagctacaaactgcct gacagccaggtgatcaccattggcaacatgcagttccagtgtttggaggtgttgttccag cccttctttctgggtgtggaattttgcagcatccatgagaccaccttcagctctatcata 
aggtgtgatgtgggcatccacaaggacctgtatgccaacacggtgctggccagtggcacc accatgtacccaagcatctccaacaggatgcagaaagagatcactgccctggtgccagca caaggaagatcaagatcatcgtgctcccagagcgcaagtactccgtgtgatcccctgggc ccgtttgttgtcgggggagaagacttagacccttttgggcctcggagaggtggcatgatt gtggatcccctgagatctggcttcccaagagcacttattgacccttcctcaggcctcccg aaccgacttcctccaggcgctgtgcccccaggagctcgctttgacccctttggacccatt gggaccagcccacccggacctaacccagaccatctccccccgccgggctacgatgacatg tacctgtga >gi568815578f:1018774_1265077|GENSCAN_predicted_peptide_4|349_aa MRPGDPASPGLRLVRFGLPGPASVGKGRSRGCGLGRRERARWAAGVLGLSARNFGYNSVD LRVRICAPQGFVYNSCVCLSTGFCEASPTAAVRPSLRVSVRAAAAKGPRDELGPSFPMAS PPGLELKTLSNGPQAPRRSAPLGPVAPTREGVENACFSSEEHETHFQNPGNTRLGSSPSP PGGVSSLPRSQRDDLSLHSEEGPALEPVSRPVDYGFVSALVFLVSGILLVVTAYAIPREA RVNPDTVTAREMERLEMYYARLGSHLDRCIIAGLGLLTVGGMLLSVLLMVSLCKGELYRR RTFVPGKGSRKTYGSINLRMRQLNGDGGQALVENEVVQVSETSHTLQRS >gi568815578f:1018774_1265077|GENSCAN_predicted_CDS_4|1050_bp atgcgaccaggggacccggcctccccgggcctccgcctggtccggttcggccttccgggt cccgcgtctgtgggcaagggcaggagcagggggtgtggactgggccggcgggagcgggcg cggtgggcagccggcgtgctggggctttctgcgcgcaactttgggtacaacagcgtggac ctgcgtgtgcgtatctgcgcgccccaaggctttgtgtacaactcgtgcgtgtgtctatcg actgggttttgtgaagcttcacccacagccgctgtgcgtccctctctccgggtgtcagtg cgcgcggcggctgccaaggggccaagggatgagctggggccctccttcccaatggcatct ccccctggtctggaactgaagacactgagcaatggtccccaagccccaaggagatcagct cccctgggcccagtggccccaaccagggagggtgtggagaatgcctgcttctcctcagag gagcatgagacccatttccagaaccctgggaacacgagactgggcagctcacccagtccc cctgggggtgtctcctcactgccccgatcccagcgggatgatctgtcccttcattcagag gaggggccagccctggagcccgtgagccgcccggtggattatggctttgtttccgccctc gttttcctggtgagtgggattcttctggtggtgacagcatacgccatcccccgtgaggct cgagtcaatccggacacagtgacagcgcgggagatggaacgactggagatgtactacgcc cgcctaggctcccacctggacaggtgcatcatcgcaggcctcgggctgctcacggtgggc ggcatgctcttgtcggtgctgctcatggtctccctgtgcaagggcgagctgtaccgccgg aggaccttcgtccccggcaagggctccaggaagacctacggctccattaacctgcgcatg agacagctcaatggggatgggggccaggccctggtggagaatgaagttgtccaggtctca gagactagccacaccctccagaggtcttaa 
>gi568815578f:1018774_1265077|GENSCAN_predicted_peptide_5|272_aa MASHSSPMGSQYYYGFPGPDSAMHALNTVVSEKDLTLDLSGLVARNKRCGPHSYSLNTHL LHACLRLPTQRENTTLKTFIPQGWEIHTDQVEREAECQPGRLKICVHDTAQELPLASTAR NALLGRNLCLFRQSSTTQMPDEIPISLDDRMRPPSLKKKKEWEKRKRKRNEKEKEEEEEE EKEKKKKKKKKKKKKKKKKKKKKKKKEEEEEWEKRKRKRKEKEKEEEEKKTKKNKKKKKK NRRRRRRRRKNKKKEEEKEGWRRNEYFRLGSL >gi568815578f:1018774_1265077|GENSCAN_predicted_CDS_5|819_bp atggccagccattcttcacccatgggcagccaatactactatggattccctgggccagac tctgcaatgcatgctttgaacacagtggtgagcgagaaagacctcacgcttgacctctca gggcttgtggcccgcaacaaacgctgtgggccccactcatattccctcaacactcacctt ctccatgcgtgcctacggcttcctactcagagagagaacacaacactcaagacttttatt ccccaaggctgggagatccatacagaccaggtggaaagggaggcagaatgccagccaggc cggctgaagatttgtgtgcatgatacagctcaggaacttccacttgctagcacagcacgc aatgctcttcttggcaggaatctttgcctcttccggcagagcagtaccactcagatgcca gatgaaatccccatcagcctggatgacagaatgaggcccccatctctgaagaagaagaag gagtgggagaagaggaagaggaagaggaatgagaaggagaaggaggaggaggaggaggag gagaaggagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaag aagaagaagaagaagaaggaggaggaggaggagtgggagaagaggaagaggaagaggaag gagaaggagaaggaggaggaggagaagaagacgaagaagaataagaagaagaagaagaag aacagaagaagaaggagaagaagaagaaagaacaagaagaaggaggaggagaaggaggga tggagaagaaatgaatatttcaggctgggtagtttatga >gi568815578f:1018774_1265077|GENSCAN_predicted_peptide_6|295_aa MEERVSVIEDQMNEMKREEKFREKRVKRNEQSLQEMWDYVKRPNLHLIGVPESDGENGTK LENTLQDIIQENFPNLARQANIQIQEIQRTPQRDSSRRATPRHIIVRFTKVEMKEKMLRA AREKVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIISAQNLLKLISNFS KVSEYKINVRKSQTFLYTNNRQTESQIMKLEKTTLKFTWNQKRAHIAKTILSQKNKAGGV TLPNFKLYYKATVTKTAWYWYRNRDIDQWNRTEPSEIIPHIYNHLIFDKPDKNKK >gi568815578f:1018774_1265077|GENSCAN_predicted_CDS_6|888_bp atggaagaaagggtatcagtgattgaagatcaaatgaatgaaatgaagcgagaagagaag tttagagaaaaaagagtaaaaagaaatgaacaaagcctccaagaaatgtgggactacgtg aaaagaccaaatctacatctgattggtgtacctgaaagtgacggagagaatggaaccaag ttggaaaacactcttcaggatatcatccaggagaacttccccaacctagcaaggcaggcc aacattcaaattcaggaaatacagagaacgccacaaagagactcctcgagaagagcaact 
ccaagacacataattgtcagattcaccaaagttgaaatgaaggaaaaaatgttaagggca gccagagagaaagtgttggaagttctcgccagggcaatcaggcaagagaaagaaataaag ggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgta tatttagaaaaccccatcatctcagcccaaaacctccttaagctgataagcaacttcagc aaagtctcagaatacaaaatcaacgtgcgaaaatcacaaacattcctatacaccaataac agacaaacagagagccaaatcatgaaattggaaaaaactactttaaagttcacatggaac caaaaaagagcccacattgccaagacaatcctaagccaaaagaacaaagctggaggcgtc acgctacctaacttcaaactatactacaaggctacagtaaccaaaacagcatggtactgg taccgaaacagagatatagaccaatggaacagaacagagccctcagaaataataccacac atctacaaccatctcatctttgacaaacctgacaaaaacaagaaatga >gi568815578f:1018774_1265077|GENSCAN_predicted_peptide_7|196_aa MGPRPHPCHAPAQVSQVRAPGQSRRLRTRTGLVDLPKENFEASYNAITLPEEFHDFDTQN MNAIDVSEHFTQNQSRPEEITLRENFDNDLIFQAESFDRFFFPFARDSSSSGGRNPVMVL DLFPLPPGIGMRIEESNKMGMQSFSLMKLCRNSDRKQAAAKFYSFLVLKKQLAIELSQSA PYADIIATMGPMFYNI >gi568815578f:1018774_1265077|GENSCAN_predicted_CDS_7|591_bp atggggcccaggccccatccctgtcatgcccctgctcaggtgtctcaggtccgagcgcca ggacaaagccggagactgcgcactaggaccggactggttgaccttccaaaagagaatttt gaagcatcttacaatgctatcacattgccagaagaatttcatgattttgacacccaaaat atgaatgctattgatgtttcagaacactttactcagaaccaaagcagaccagaagaaatc actcttagagaaaattttgacaatgatctaattttccaagctgagagctttgacaggttc ttcttcccttttgcccgtgacagctccagcagtggtggcaggaatcctgtcatggtcctc gatctctttcctctgccacctggtattggcatgagaatagaggaatctaacaagatggga atgcagtcctttagtctgatgaagctctgtagaaatagtgaccgaaaacaagcagctgcc aaattttatagctttcttgtcctaaagaaacagctggctattgagctgagccagagtgct ccctatgcagatattatagctacgatgggaccaatgttttataacatatga >gi568815578f:1018774_1265077|GENSCAN_predicted_peptide_8|21_aa PTQCEDHEDEDLYVDPLPLNE >gi568815578f:1018774_1265077|GENSCAN_predicted_CDS_8|66_bp cctactcagtgtgaagaccatgaagatgaagacctttatgttgatccacttccacttaat gaatag