GENSCAN 1.0 Date run: 3-Nov-116 Time: 19:43:23 Sequence gi568815597f:146198170_146398712 : 200543 bp : 40.68% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 7959 8098 140 2 2 86 115 88 0.660 10.96 1.02 Term + 14488 14572 85 1 1 70 40 81 0.356 -2.35 1.03 PlyA + 15264 15269 6 1.05 2.06 PlyA - 15876 15871 6 1.05 2.05 Term - 20601 20414 188 1 2 58 55 71 0.302 -2.53 2.04 Intr - 25778 25666 113 1 2 88 101 63 0.780 6.70 2.03 Intr - 30269 30024 246 1 0 80 65 113 0.757 3.85 2.02 Intr - 30651 30540 112 1 1 46 101 237 0.054 19.32 2.01 Init - 41166 41031 136 0 1 76 57 115 0.039 7.55 2.00 Prom - 44444 44405 40 -8.55 3.00 Prom + 46866 46905 40 -6.55 3.01 Init + 49222 49418 197 0 2 67 64 128 0.544 6.75 3.02 Intr + 56573 56679 107 2 2 107 63 40 0.162 2.34 3.03 Intr + 63578 63670 93 0 0 50 92 62 0.047 1.82 3.04 Intr + 66393 66520 128 1 2 102 61 66 0.173 4.78 3.05 Intr + 72941 73078 138 1 0 58 98 30 0.208 0.74 3.06 Intr + 73608 73769 162 2 0 83 74 80 0.105 5.35 3.07 Intr + 82055 82206 152 1 2 81 79 35 0.158 -0.06 3.08 Intr + 83442 83603 162 0 0 105 100 58 0.955 6.87 3.09 Intr + 88177 88280 104 1 2 122 60 20 0.621 1.50 3.10 Term + 99882 100546 665 1 2 41 44 573 0.832 41.24 3.11 PlyA + 103122 103127 6 1.05 4.06 PlyA - 104138 104133 6 1.05 4.05 Term - 115226 114880 347 0 2 6 42 229 0.206 3.77 4.04 Intr - 117216 117086 131 2 2 77 94 46 0.402 3.52 4.03 Intr - 123827 123698 130 0 1 115 31 132 0.634 9.03 4.02 Intr - 130042 129882 161 0 2 80 111 112 0.958 11.41 4.01 Init - 132672 132524 149 0 2 74 23 66 0.612 -1.69 4.00 Prom - 143368 143329 40 -4.85 5.02 PlyA - 143664 143659 6 1.05 5.01 Sngl - 146546 146052 495 2 0 86 39 652 0.982 55.90 5.00 Prom - 161318 161279 40 -3.65 6.00 Prom + 164547 164586 40 -3.25 6.01 Init + 169049 169094 46 1 1 109 66 29 0.611 3.75 6.02 Intr + 171187 171286 100 2 1 99 46 85 0.464 3.65 6.03 Term + 171444 171861 418 0 1 9 55 225 0.406 5.06 6.04 PlyA + 172641 172646 6 1.05 7.05 PlyA - 173417 173412 6 1.05 7.04 Term - 178609 178305 305 2 2 62 45 130 0.200 0.45 7.03 Intr - 178824 178646 179 2 2 36 111 97 0.376 5.44 7.02 Intr - 179418 179388 31 1 1 100 94 33 0.477 1.37 7.01 Init - 185335 185188 148 1 1 56 110 84 0.655 7.70 7.00 Prom - 186923 186884 40 -7.35 8.04 PlyA - 187469 187464 6 1.05 8.03 Term - 189205 189034 172 0 1 46 41 180 0.470 5.42 8.02 Intr - 190811 190629 183 1 0 70 64 100 0.536 3.78 8.01 Intr - 195951 195744 208 1 1 82 68 213 0.295 15.91 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 30667 30540 128 1 2 49 101 228 0.803 19.88 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:146198170_146398712|GENSCAN_predicted_peptide_1|74_aa MLKIYSLAPDPNQQYTNIKEKVTETKITHLMKIYDLAKENEAENIHSSEIRHRPLRMPDQ ISFPNAENQLLLTF >gi568815597f:146198170_146398712|GENSCAN_predicted_CDS_1|225_bp atgctgaaaatctacagcctggcccctgatccaaatcagcaatatactaatattaaggaa aaagtaacagaaaccaaaatcacccacctaatgaagatatatgatctagcaaaagaaaat gaggctgagaacatccacagctcagaaatccgacataggccacttcgcatgcctgatcag ataagctttcccaatgctgaaaaccaactgcttctaaccttttaa >gi568815597f:146198170_146398712|GENSCAN_predicted_peptide_2|264_aa MVRAREGFAGEVIFELGDVFSEVGSEAFCWEENGEQKEGKRLQQRGGRGGGGGGGDREDT RPAPRSAVGAAGALAVLRDPRAWREAGSKSQKLLFRSARVQGGGQFCPSGSAFPGVEREP TAGLGGAERRNARFWRGERGQGRQAKRPAPSQPASPLPGGGTWAGTCLACAPNSESTERQ VLVTASVTPAFPPLEHSIGEADGTPLVLTFVAPAELLYAEALQVSSVWFRVISGHMNAFL SIIGIYWAGCLPAITIFSMKRPGE >gi568815597f:146198170_146398712|GENSCAN_predicted_CDS_2|795_bp atggtgagggctagggaaggttttgcaggggaggtgatatttgaactaggagatgtcttt agtgaagtaggaagtgaagccttttgctgggaagaaaatggggaacagaaagaaggaaag aggctacagcagagaggcggccgcggcggcggcggcggaggaggcgaccgagaagatacc cgccctgcgccccgctctgctgtgggcgctgctggcgctctggctgtgctgcgcgacccc cgcgcatggagggaggccggcagcaagtctcagaaactcctttttcgtagtgccagggtg cagggaggtgggcagttttgcccttcaggttccgcgtttcctggggtcgagcgagagccg acggcgggcctcggaggggctgagcgaaggaatgccagattctggcgtggagagcggggg cagggccgccaagccaaacggcctgcaccttcgcagccagcctcgcctttgccagggggc ggcacatgggccggaacttgtctggcatgtgccccaaattcagagtcaactgagcggcag gtcttagtcacagcatctgtgactcctgctttcccacctttagaacattctataggagag gctgatggcaccccgctggttttgacattcgttgctccagctgaactgctctatgctgaa gccctgcaggtctcaagtgtttggttcagagtaatttcaggccatatgaatgctttctta agtatcattggaatttactgggctggctgtctgccagctattaccatatttagcatgaag agacctggggaatga >gi568815597f:146198170_146398712|GENSCAN_predicted_peptide_3|635_aa MGRLDYSDEPDVITGVLIEKGGRSVRERDVTMRDHQPKPKTNKQKQKLWLEAGRGKEQCL SSSSRRHCHAPGGVGGDASNPLGFDLAERGLSFQLLGLFLKKMRYVNTVPATPNFLFEMK VPHPEPPALRSRGIMKCGIKLLLGQGGPLHSVGSTSLNSVMLNEARFQLCRAGMQPCLNL GGGAQEIIQAPLPRAAQGRRPGLQGPPQSLSPRNMILTPPKVASSCSALDKRAGHCCWGD ISRQARESCRLLSAYLLAGKTNIGQLFSKGGMQSTEDPLTYYLCQALYSSQQCWKIGITI LKLQMRKLRSGDLSVSEIPNPRGMDRLPEGSSETCVRKEKLSGLSSLLHDHSFSPTYHCL AWNGSWKEDSRETIETDNRKDAVISELSLEHFSPVCQGFREGFPNIQAGCQGFLTLPGAP APDSPSPPLPGGVAAAGPPRRRPEQQQQQQEPASMMKFKPNQTRTYDREGFKKRAACLCF RSEQEDEVLLVSSSRYPDQWIVPGGGMEPEEEPGGAAVREVYEEAGVKGKLGRLLGIFEQ NQDRKHRTYVYVLTVTEILEDWEDSVNIGRKREWFKVEDAIKVLQCHKPVHAEYLEKLKL GCSPANGNSTVPSLPDNNALFVTAAQTSGLPSSVR >gi568815597f:146198170_146398712|GENSCAN_predicted_CDS_3|1908_bp atgggaagattagattattcagacgaacctgatgtaatcacaggggtccttatagaaaaa ggaggcaggagtgtcagagaaagagatgtgacaatgagggaccatcagccaaaaccaaaa acaaacaaacaaaaacaaaaactgtggttagaagctggaagaggcaaagaacagtgtctc tcctccagctccaggagacattgtcatgctccaggtggggtgggtggtgatgccagtaat cccctgggctttgacttagctgagcgtggactcagttttcagttgcttggcttgttcttg aagaaaatgagatatgtcaacacagtgcctgccacacccaacttcctctttgagatgaaa gtgccccacccagagccccctgctttgagaagcagagggataatgaagtgtggcataaag cttctcctgggtcagggtggcccactccacagtgttggcagcaccagtctcaacagtgtg atgctgaatgaagctaggttccagctgtgtagagctggcatgcagccctgcctgaacctc ggaggaggcgcccaggagatcatccaggcaccgctgcctagggccgcgcagggaagacgg ccagggctccagggtccgcctcagtcactgtcaccccgcaacatgattctgactccccca aaggtggccagcagctgctccgcacttgacaaacgtgctggacactgctgctggggagac atcagccggcaagcaagggaaagttgcagacttctcagtgcttacctcctagctgggaag acaaatattggacaactattctctaaaggaggcatgcagagtactgaagacccactgacc tattatttgtgtcaggcactttattcctcacaacaatgctggaagattggtataactatt ctcaaattacagatgaggaaactgaggtctggggatctgtctgtgtcggagatccccaac ccccggggcatggacaggcttcctgagggctcctcagaaacatgtgttagaaaagaaaag ctttccggtctgtccagtctcctccatgaccacagtttctcacccacataccattgtctg gcttggaatggatcctggaaggaggactcaagagaaaccatagaaacagacaacaggaaa gatgctgtcatttctgagttatctttggagcatttctctccagtgtgtcagggtttcaga gaaggctttcctaatattcaggctggctgtcagggtttcctaacgctcccgggcgctcct gcgcccgactcgccctcgcccccactccccggcggggtggcggcggccgggcccccacgg cggcggccggagcagcagcagcagcagcaggagcccgcctctatgatgaagttcaagccc aaccagacgcggacctacgaccgcgagggcttcaagaagcgggcggcgtgcctgtgcttc cggagcgagcaggaggacgaggtgctgctggtgagtagcagccggtacccagaccagtgg attgtcccaggaggaggaatggaacccgaggaggaacctggcggtgctgccgtgagggaa gtttatgaggaggctggagtcaaaggaaaactaggcagacttctgggcatatttgagcag aaccaagaccgaaagcacagaacatatgtttatgttctaacagtcactgaaatattagaa gattgggaagattctgttaatattggaaggaagagagagtggttcaaagtagaagatgct atcaaagttctccagtgtcataaacctgtacatgcagagtatctggaaaagctaaagctg ggttgttccccagccaatggaaattctacagtcccttcccttccggataataatgccttg tttgtaaccgctgcacagacctctgggttgccatctagtgtaagatag >gi568815597f:146198170_146398712|GENSCAN_predicted_peptide_4|305_aa MDEAGNHHSQQTITRTENQTPHVLTHRWELNDENAWTQGGEHHTLGPVRGYIIEQGVCDL VLCEAAFPKKLAFAYLEDLHSEFDEQHGKKVPTVSQPYSFIEFALDSKANNLSSLSKKYR QDAKYLNMRSTYAKLAAVAVFFIMLIVNEATWLSPLYATFRPPRASKKTKAEKPIQETAT SKVEATSDHTERLLKSQKAIDAGVVAEKMEHLYSANGNVNSFSRCAKHFGDFSKNRQQLP LDPAIPLLDIYLKEYRLFCHKDTCTHMFIAALFTIAKTWNQPKCPSVVVWIKKMWYTYTV EYTQP >gi568815597f:146198170_146398712|GENSCAN_predicted_CDS_4|918_bp atggatgaagctggaaaccatcattctcagcaaactatcacaaggacagaaaaccaaaca ccgcatgttctcactcacagatgggagttgaacgatgagaatgcatggacacagggtggg gaacatcacacactggggcctgtcaggggctacattattgagcagggggtgtgtgatttg gttttatgtgaagctgccttccctaagaagttggcttttgcctacctagaagatttgcac tcagaatttgatgaacagcatggaaagaaggtgcccactgtgtcccaaccctattccttt attgaatttgcattggattcaaaggctaacaatttgtccagtctgtccaagaaataccgc caggatgcgaagtacttgaacatgcgttccacttacgccaaacttgcagcagtagctgta tttttcatcatgttaatagtaaatgaggccacttggctgagcccattatatgcaacattc agacccccaagggcatcaaagaagacaaaagcagaaaaacctatccaagaaacagcaact tcaaaggttgaagcaacatcagaccacacagaacggctattaaaaagtcaaaaagcaata gatgccggcgtggttgcagagaaaatggaacacttatatagtgctaatgggaatgtaaat tcgttcagccgttgtgcaaagcactttggtgatttctcaaagaaccggcaacaattacca ctcgacccagcgatcccattattggatatatacctaaaggaatatagattgttctgccat aaagacacatgcacgcatatgttcatcgcagcactattcacgatagcaaagacatggaat caacctaaatgcccatcagtggtagtctggataaagaaaatgtggtacacgtacaccgtg gaatatacacagccataa >gi568815597f:146198170_146398712|GENSCAN_predicted_peptide_5|164_aa MVNSVVFFDITVDGKPLGRISIKLFADKIPKTAENFRALSTGEKGFRYKGSCFHRIIPGF MCQGGDFTRPNGTDDKSIYGEKFDDENLIRKHTGSGILSMVNAGPNTNGSQLFICTAKTE WLDGKHVAFGKVKERVNIVEAMEHFGYRNSKTSKKITIADCGQF >gi568815597f:146198170_146398712|GENSCAN_predicted_CDS_5|495_bp atggtcaactccgtcgtcttttttgacatcaccgtcgacggcaagcccttgggccgcatc tccatcaaactgtttgcagacaagattccaaagacagcagaaaactttcgtgctctgagc actggagagaaaggatttcgttataagggttcctgctttcacagaattattccagggttt atgtgtcagggtggtgacttcacacgccctaatggcaccgatgacaagtccatctatggg gagaaatttgatgatgagaacctcatccgaaagcatacaggttctggcatcttgtccatg gtaaatgctggacccaacacaaatggctcccagttattcatctgcactgccaagactgag tggttggatggcaagcatgtggcctttggcaaggtgaaagaacgtgtgaatattgtggaa gccatggagcactttgggtacaggaatagcaagaccagcaagaagatcaccattgctgac tgtggacaattctaa >gi568815597f:146198170_146398712|GENSCAN_predicted_peptide_6|187_aa MKMKATSSGPGPPVAGPDHRPAEPPRLHGAEGALEALPVAAPRGAKKSTGTHEPPSPRSL PWWSPRAEHRIRRWQRKLRSGPKSDWAGRAQAPSLGEGGAKNGKSHPGSHRAFSLPRAPR RLGPGSQPGRFRGFLKQARGRAGEGHSAILLAPESPNAQVSNVTSATYRSNLSGLPRPCS VVLLGGP >gi568815597f:146198170_146398712|GENSCAN_predicted_CDS_6|564_bp atgaagatgaaggccacctcttctgggccaggtcctcccgttgcaggacccgaccaccgc ccagctgagcccccgcggctccacggcgcagaaggtgcactggaggccctgcccgttgcc gccccgcggggtgccaagaagtcaacggggacccacgagccaccctcaccacgatccctg ccctggtggagcccccgtgcggaacacaggatccgaagatggcagcggaagctccgcagc ggccccaaaagcgactgggcagggagggcacaggctccctcactgggtgaaggcggcgca aagaacgggaagagccatcccgggagccaccgggcgttcagcctccctagggcccccagg cggctcgggccggggtctcaaccggggcgtttccgggggtttctgaagcaggcgaggggc agggcgggcgaaggccattcggctatccttctggctccagaatctcccaacgcgcaggtg tccaacgtgaccagcgcgacttaccgctccaatctctccggtcttccaaggccttgctca gtcgtcctgctgggagggccctga >gi568815597f:146198170_146398712|GENSCAN_predicted_peptide_7|220_aa MVGSPLPVGLGFTLKGAVRTEVSGKGPEPWGLSSGDGDGRRGKAGQESSGRIAAHFGQDR LFFKLQKSGESANAVPHYHKLCSRVSHIWGNRRGQHIRSAMDKPRPGKTTFVIMVSPLPA SHASPFTRTVTCPAHPPPPPPPTPSPPSPHTQLGLSGPTSGPEPAPTAREILRGEAAAPA LPHLHRSRPIRDVTDSAFPSPRLPFCRSAYQPAAGAGRGK >gi568815597f:146198170_146398712|GENSCAN_predicted_CDS_7|663_bp atggtaggctcccctcttcctgtgggtttgggttttaccttgaaaggggctgtcagaaca gaagtttcagggaaggggccagagccctggggactttcctcaggtgatggtgatggaaga agaggcaaggcaggacaggaaagctcagggaggatcgcagcgcatttcggccaagacaga ctgttctttaaactacaaaaatcaggggaaagcgcgaacgcagtcccccactaccacaaa ttatgcagtcgagtttcccacatttggggaaatcgcaggggtcagcacatccggagtgca atggataagcctcgccctgggaaaaccaccttcgtgatcatggtatctcccctgccagcc tcacacgcttcaccctttacacgcacggtcacttgccccgcgcaccccccccccccaccc ccccccacccccagccctcctagccctcacacacagctgggactctcaggtccgaccagc ggtcctgaacccgctcccacggcacgggaaatccttcgtggcgaagcagcagcccctgcg ctgcctcatttacatagaagtcgccctatccgtgatgtcaccgacagtgcctttcccagt ccccgtctgcctttctgccgctcagcctaccaacccgctgccggagccggcagggggaag tga >gi568815597f:146198170_146398712|GENSCAN_predicted_peptide_8|187_aa XPTRQYNCERPWKTGAMRAVNRVGLIGDPQIGRFWNQDDLAPHLQQKPPETPGPDPDLSR LGAQGKLNARGTRALAKPFPYKEQACYVYNPMAPVPSTISTWFYELSTRSPQHRNPTNSH ECYREKQGLASTTPANPQPPPALLALTHSWDSQVRPAVLNPLPRHGKSFVAKQQPLRCLI YIETPYR >gi568815597f:146198170_146398712|GENSCAN_predicted_CDS_8|564_bp nagcccacccggcagtacaactgtgaaaggccttggaaaactggagcgatgagagcggtg aatcgtgttggtctcattggagacccgcagattgggagattctggaaccaggacgacctt gcccctcacctgcagcagaagcccccggaaacgcccggccccgacccggacctgagccgc ctgggggcccaagggaagctgaacgcccggggcaccagggcactggcaaagccctttcca tacaaagagcaagcgtgttatgtctacaacccaatggcaccagttccaagtacaatttct acttggttctatgagctgagtacacgttccccccagcacagaaatcctacaaactcccat gaatgctatagggaaaagcaggggctagccagcaccactccagccaatccccaaccaccc ccagccctcctagccctcacacacagctgggactctcaggtccgaccagcggtcctgaac ccgctcccacggcacgggaaatccttcgtggcgaagcagcagcccctgcgctgcctcatc tacatagaaacgccctatcggtga