GENSCAN 1.0 Date run: 3-Nov-116 Time: 20:53:46 Sequence gi568815596f:64355343_64558425 : 203083 bp : 42.52% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 65 209 145 2 1 112 55 80 0.863 3.50 1.02 PlyA + 774 779 6 1.05 2.12 PlyA - 2162 2157 6 1.05 2.11 Term - 4730 4627 104 0 2 76 53 76 0.371 0.36 2.10 Intr - 5725 5538 188 0 2 83 107 214 0.688 21.21 2.09 Intr - 19456 19281 176 1 2 102 52 81 0.218 3.72 2.08 Intr - 20157 20024 134 1 2 50 86 52 0.534 0.64 2.07 Intr - 21734 21549 186 0 0 57 60 141 0.540 6.94 2.06 Intr - 24053 23909 145 2 1 69 41 77 0.041 0.03 2.05 Intr - 31802 31696 107 0 2 82 91 58 0.061 4.51 2.04 Intr - 39709 39609 101 0 2 69 51 90 0.015 2.23 2.03 Intr - 44898 44875 24 2 0 116 103 20 0.136 2.52 2.02 Intr - 45242 45171 72 1 0 56 94 73 0.152 2.30 2.01 Init - 47836 47601 236 1 2 73 50 140 0.183 6.36 2.00 Prom - 53353 53314 40 -4.55 3.00 Prom + 53479 53518 40 -7.25 3.01 Init + 58998 59114 117 2 0 63 47 86 0.195 2.25 3.02 Intr + 62404 62518 115 0 1 104 67 53 0.218 3.90 3.03 Intr + 75490 75515 26 2 2 123 100 -1 0.053 1.43 3.04 Term + 80511 80666 156 2 0 73 36 151 0.137 5.35 3.05 PlyA + 82472 82477 6 1.05 4.00 Prom + 96639 96678 40 -6.05 4.01 Init + 99469 99534 66 0 0 91 92 84 0.660 10.35 4.02 Intr + 100247 100335 89 1 2 111 94 99 0.999 10.65 4.03 Intr + 100946 101123 178 2 1 75 96 167 0.979 15.20 4.04 Term + 102943 103086 144 0 0 83 34 137 0.988 4.73 4.05 PlyA + 103274 103279 6 1.05 5.03 PlyA - 103305 103300 6 1.05 5.02 Term - 105566 105349 218 0 2 96 41 106 0.055 3.02 5.01 Init - 118127 117887 241 2 1 64 51 185 0.384 10.48 5.00 Prom - 120098 120059 40 -4.15 6.00 Prom + 122460 122499 40 -9.65 6.01 Init + 123948 124096 149 2 2 65 97 119 0.372 10.22 6.02 Intr + 133393 133500 108 1 0 47 92 162 0.221 10.98 6.03 Intr + 141642 141783 142 0 1 35 44 169 0.617 6.63 6.04 Intr + 145177 145350 174 0 0 92 10 164 0.110 8.21 6.05 Term + 149519 149800 282 1 0 39 49 225 0.705 8.14 6.06 PlyA + 150494 150499 6 1.05 7.00 Prom + 151455 151494 40 -0.85 7.01 Init + 156679 156767 89 0 2 52 22 127 0.487 0.66 7.02 Intr + 160555 160752 198 1 0 27 -3 200 0.416 2.34 7.03 Term + 161953 162127 175 1 1 70 49 220 0.904 12.55 7.04 PlyA + 164203 164208 6 1.05 8.02 PlyA - 165412 165407 6 1.05 8.01 Sngl - 169247 168765 483 2 0 83 50 234 0.177 15.00 8.00 Prom - 177774 177735 40 -5.25 9.00 Prom + 180818 180857 40 -6.55 9.01 Sngl + 196133 198097 1965 1 0 78 41 1644 0.991 152.40 9.02 PlyA + 199102 199107 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 9717 9805 89 2 2 73 100 91 0.832 8.96 S.002 Term + 12701 12863 163 2 1 69 50 123 0.816 3.03 S.003 Term - 52919 52719 201 2 0 50 46 197 0.918 8.11 S.004 Term + 145177 145458 282 0 0 92 42 220 0.880 12.24 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:64355343_64558425|GENSCAN_predicted_peptide_1|48_aa XALEMSLKSLMVCELPGATAHPGDNGQGKQTTCPLHFLGHLSDLNLGG >gi568815596f:64355343_64558425|GENSCAN_predicted_CDS_1|147_bp nnggccctggaaatgtcactgaaatctctcatggtgtgtgaacttcccggtgccacagca catcctggagataatggccaagggaagcagaccacttgcccattgcactttctgggccat ctgtctgatttgaacttaggaggatga >gi568815596f:64355343_64558425|GENSCAN_predicted_peptide_2|490_aa MVGNVRGLIEGGQVIPHFPNETEVDTQTNNNCSDSFSFPEPQLTAVPSGFVAFHNSFDFR TSLTLAPTPEPKEPPHLRNLSSLEASEPGVTLQGECQGQAKESLGGKKECMFAVLDVFTY ECKCAEIHSARLKSEYTTDEPLGTGDFEGFKYLKGKSGWWGKREGMVTLLSLYVAGEKEQ GTAHARPQVDTSCLECINVDDDMFEMNKHLTYLRSIITLGELFPEDPGSRRCRSLQKAQP RIELGPRHVEAEDRASEVGAVGCVAIQIDFAHCVIIDMDSVAVIKDQNNRELAHLGHQDI LEMPPRHESHLDYSKSQHRVKPELMVSTPRQYDLKDFLPPPTVKPGRAHSALLEQAHTQK SERVNPPGQLLTNGVWERVDEFSPLLSLRWTMLRLFHSTEFLCHLLFGPLSGCSQCLDVL TGLVPDGCYARSCGDELGFDAQLNHLLCILSKLLNSENGAECARHWLPAPWAAASAKGKI RLPFIHSCCQ >gi568815596f:64355343_64558425|GENSCAN_predicted_CDS_2|1473_bp atggttggtaatgtgaggggccttattgagggggggcaagtcattccacattttccaaat gagaccgaagtggatacacagactaacaataactgcagtgactcattctcctttccagaa cctcaactcacagctgtccccagtggctttgtggcatttcacaacagttttgactttaga acctctctcaccttggcccccactccagagccaaaggaacccccacatctcaggaatctg tcatctctggaggcctcagaacctggtgtcacattacagggtgaatgccagggacaagct aaagagagcttaggaggaaagaaagaatgcatgtttgctgtcctggatgtgttcacctat gaatgtaagtgtgctgagatacacagtgcaaggctgaagtcagagtacaccactgatgaa cccctgggaactggtgattttgagggcttcaaatacttaaaggggaaaagcggttggtgg ggcaagagagagggtatggtcacattactgagtctatatgttgcaggagaaaaggagcag ggcacggcccatgccaggccccaggtagacacttcatgtctggaatgcataaatgtggat gatgacatgtttgagatgaacaagcacctcacgtacttgagaagcatcatcactttgggg gaacttttccctgaagacccggggtctcggcggtgcaggagtcttcagaaagcccagcct cgcatagaattaggccctcggcatgttgaagctgaagacagagcttctgaggtgggggca gttggctgtgttgctatccagattgactttgcccactgtgtcattatagatatggattca gttgctgtaattaaggaccaaaataatagagaattagctcatttggggcaccaggacatc ctggagatgcctccccgccatgagtcccaccttgactacagcaagtcccagcacagagtt aagccagagctgatggtgtcaactcccaggcaatatgacctcaaggacttcctacctcca ccaactgtgaaaccagggagagctcactcagccttgctggaacaggcacacacccagaag tctgagagagttaaccccccggggcaactgctgaccaatggggtatgggagagggtggat gaattctcccctctcctgagtctcaggtggacaatgctaaggctttttcattctacggaa ttcctgtgtcacctgctgtttggaccactctctggctgctcacagtgccttgacgtcctc acagggttagttcctgatggttgttatgcacgcagctgtggagatgaacttggatttgat gctcagctcaaccacttactgtgtatcttgagcaagttacttaactctgagaatggggct gagtgtgcacggcactggctacccgctccatgggctgctgcctctgccaaagggaagata agacttcccttcatccacagctgctgccagtga >gi568815596f:64355343_64558425|GENSCAN_predicted_peptide_3|137_aa MCRLKPGAEKNTEKAVPESWKPISPKNRLATMMHSLKFRVRQSWSDRGHLRDLVKEKWSQ GYIWCRVTGVFASLPCTGSPYIILFEGRKILDSSVPLTIPTESQDPDCRNAGIPMSQFLA FPSFYLIPQKVQRSPEL >gi568815596f:64355343_64558425|GENSCAN_predicted_CDS_3|414_bp atgtgcagactaaagccaggtgctgagaaaaacacagaaaaggctgtccctgagagctgg aaacctatatcccctaaaaacaggctagcaaccatgatgcactctcttaagtttagggtc aggcagtcctggagtgatcgtgggcatctcagggatctggttaaggagaagtggagtcag ggatacatctggtgtcgggttacaggggtgtttgcatcacttccatgtacaggttcaccc tatatcattttattcgaagggaggaaaattcttgactcttctgttcctctgaccatcccc actgaatctcaagatcctgactgccggaatgcaggcatccccatgtctcaattcttggct tttccctctttctacctcatccctcagaaagttcagcgatctccagagctataa >gi568815596f:64355343_64558425|GENSCAN_predicted_peptide_4|158_aa MAGRDARQPPLCRATASQVCAWIVPFCGHIKGGMRPGKKVLVMGIVDLNPESFAISLTCG DSEDPPADVAIELKAVFTDRQLLRNSCISGERGEEQSAIPYFPFIPDQPFRVEILCEHPR FRVFVDGHQLFDFYHRIQTLSAIDTIKINGDLQITKLG >gi568815596f:64355343_64558425|GENSCAN_predicted_CDS_4|477_bp atggcgggtagggacgcccgacagccccctctgtgccgtgcaaccgccagccaggtgtgc gcgtggatagttccattttgtgggcacattaaaggtggcatgagaccaggcaagaaggtg ttagtgatgggcatcgtagacctcaacccagagagctttgcaatcagcttgacctgtggg gactcagaagaccctcctgccgatgtggcaatcgaactcaaagctgtgttcacagatcgg cagctactcagaaattcttgtatatctggggagaggggtgaagaacagtcagcaatccct tactttccattcattccagaccagccattcagggtggaaattctttgtgagcacccacgt ttccgagtgtttgtggatggacaccaactttttgatttttaccatcgcattcaaacgtta tctgcaattgacaccataaagataaatggagacctccagatcaccaagcttggctga >gi568815596f:64355343_64558425|GENSCAN_predicted_peptide_5|152_aa MAVKTMSCTSQNHRHPSRTSAELQNRVPFPSASVKQADAASPSSAAGSAIQPVGLDLLST REPNTCTVPPPCGLLRALVQEPDCYLYSLVLVIAELGLWSEITPYIPEYFPVEALHCIFH PEILTLDLTGPGEGVNIVFSLQLKPLTNLGYT >gi568815596f:64355343_64558425|GENSCAN_predicted_CDS_5|459_bp atggctgtgaagacaatgtcatgtacttcacagaaccacagacaccccagtagaacgtct gcagaacttcagaatcgagtgccctttccctcagcttcagtgaaacaggcagatgcagca agccccagctctgcagcaggttctgctattcagcccgttggacttgacctcctgtccacc agggagccaaatacatgcactgtccctcccccatgtggactgctcagggccctggttcag gagcctgattgttatttatactccttggttttggttattgcagaactagggctctggtca gaaataacaccttacatacccgaatactttcctgtggaagctttacattgcattttccat cctgaaattctcaccttagaccttacaggcccaggagagggtgtgaatattgttttcagt ttacagctaaagccccttactaatttgggctatacctga >gi568815596f:64355343_64558425|GENSCAN_predicted_peptide_6|284_aa MGLEDLQTLLGLTGKCGSSPQACKDFFPLGIQRLFQCKEHGYASVSDSPRRSTPQSTSSH WAVDHSSSGSEHLIPRLIAAAASRNRRGSFIKLLESMGMDDKSKVDTLREEESAGGEPWD VSAFAAHATTDEPVFVERLRKSLPGEMKAESPGKKGDGSLETGRKKVSGGDMSGADARSK GTNVDGTFFQIADGTNLLPVPRSQGSLVRTGCTGRGAVSREVDVKEAQSLLERPLEAEKW GEEGIPRLLRSPLHPLTVPPISGSSPEATWQRSLRKKGGSTEAT >gi568815596f:64355343_64558425|GENSCAN_predicted_CDS_6|855_bp atggggctggaagatttgcagactcttctgggcctcaccggaaagtgcggctctagcccc caagcatgtaaggacttttttcccctaggcattcagaggctttttcagtgtaaggaacat ggctatgcttcggtcagcgatagccccaggagatctacacctcagagcacttcctcccac tgggcagtggaccactcgagcagcggttctgaacacctgatccccagactcatcgcagca gcagcatctaggaacagaagaggcagcttcatcaagcttttagaaagcatgggaatggat gataaatcaaaggtagatacactgagagaagaggaatccgctggaggggaaccttgggat gtttctgcttttgcagctcatgcaacaacagatgagccagtttttgtggagagactgaga aaatcacttcctggagaaatgaaggctgaaagccctggcaaaaaaggagatggttctcta gagacaggcaggaaaaaagtttcaggaggagacatgtcaggtgctgatgcgagaagcaaa ggtaccaatgttgatggcaccttcttccagatcgctgatgggacaaacttattacccgtg cccagaagccagggctccctggtgagaactggctgcaccggcagaggagctgtctcacgg gaggtggatgtgaaggaggcacagtccctgttggagaggccactggaagcagagaaatgg ggagaggaaggaatacccaggcttcttcgttccccgctccatcctctcacagtgcctccc attagcggaagttctccagaagccacttggcaaaggagcctgagaaagaaaggaggaagc actgaggcaacatga >gi568815596f:64355343_64558425|GENSCAN_predicted_peptide_7|153_aa MGRLAGRGADPPTSLPDGAAGRAEGLLTSHSSTPEVQNVSREFSIENLLKASLTMRLPQV ILSFGEQQQTLEGPRISNPSRQNYLRQYQWQLAASGSSTPEVQNVIREFGIDNLWKASVT VCSGEQRQTLEGLRISSPSRQNSLRQYQWQLAA >gi568815596f:64355343_64558425|GENSCAN_predicted_CDS_7|462_bp atggggcggctggccgggcggggggccgacccccccacctccctcccggacggggcggct ggccgggcagaggggctcctcacttcccactctagcacaccagaggtccaaaatgtcagc agggagttcagcattgaaaatctcttgaaagctagtttgacaatgcggctcccccaagtg attttaagttttggagagcagcagcagaccctggagggacccagaattagcaaccccagt agacagaattacctgaggcaatatcagtggcagctggcagcttcaggctctagcacacca gaggtccaaaatgtcatcagggagttcggcattgacaatctctggaaagccagtgtgacc gtgtgttcaggggagcagcggcagacacttgagggactcagaattagcagccccagtaga cagaattctctgaggcagtatcagtggcaattggcagcttaa >gi568815596f:64355343_64558425|GENSCAN_predicted_peptide_8|160_aa MGMAQGPGEELTPRGRTVYSDAELVRAEGRLLTALSAQQHHGRPPGVSPLPRDPHPTTLR PLPPPPPPPPPPPRRPPAHSTHSPIMQRGNRCRRHVTSGRRWLGRDRALVGAVGWGRTLQ GNWCHRTLIPAQALSCRAPTAILGAGKLPFPFTSLRVIEL >gi568815596f:64355343_64558425|GENSCAN_predicted_CDS_8|483_bp atgggcatggctcaggggcccggggaggaactgactccccggggaaggaccgtctattcc gacgcggaactggtgagagcggaaggccggctgctcacagccctgtcagcgcagcagcac catggacgcccacccggagtttcaccgctgccgcgggacccccatcccactaccctccgc cctcttccgccgccgccgccgcctcctccacctcctccgcggagaccccccgcccactcc acacactctcccatcatgcagcgggggaaccgctgccgccgacacgtcacttccgggcgc aggtggctaggtcgggaccgggcgctggtgggtgctgtggggtgggggaggacactccag gggaattggtgtcacagaacgcttatccctgcgcaggcgctgtcgtgcagagcgccgacc gccattttgggtgcgggcaagctgccctttccttttacttccttgcgagtcatagagctg tag >gi568815596f:64355343_64558425|GENSCAN_predicted_peptide_9|654_aa MEPDIIRMYSSSPPPLDNGAEDDDDDEFGEFGGFSEVSPSGVGFVDFDTPDYTRPKEEFV PSNHFMPIHEFSENVDSLTSFKSIKNGNDKDITAELSAPVKGQSDVLLSTTSKEIISSEM LATSIDGMERPGNLNKVVEQRQNVGTLESFSPGDFRTNMNVVHQNKQLESCNGEKPPCLE ILTNGFAVLETVNPQGTDDLDNVADSKGRKPLSTHSTEYNLDSVPSPAEEFADFATFSKK ERIQLEEIECAVLNDREALTIRENNKINRVNELNSVKEVALGRSLDNKGDTDGEDQVCVS EISIVTNRGFSVEKQGLPTLQQDEFLQSGVQSKAWSLVDSADNSEAIRREQCKTEEKLDL LTSKCAHLCMDSVKTSDDEVGSPKEESRKFTNFQSPNIDPTEENDLDDSLSVKNGDSSND FVTCNDINEDDFGDFGDFGSASGSTPPFVTGTQDSMSDATFEESSEHFPHFSEPGDDFGE FGDINAVSCQEETILTKSDLKQTSDNLSEECQLARKSSGTGTEPVAKLKNGQEGEIGHFD SVPNIQDDCNGFQDSDDFADFSSAGPSQVVDWNAFEDEQKDSCSWAAFGDQQATESHHRK EAWQSHRTDENIDTPGTPKTHSVPSATSKGAVASGHLQESATSVQVFTNFLYFV >gi568815596f:64355343_64558425|GENSCAN_predicted_CDS_9|1965_bp atggagccagacatcattcgaatgtactcttcatccccaccaccattagacaatggagca gaggatgatgatgatgatgaatttggggaatttggtgggttttcagaagttagcccttct ggtgtagggtttgttgatttcgatacaccagattatactcgtcccaaggaagagtttgta ccttcaaaccattttatgccaattcatgaattctcagaaaatgtagatagccttacaagc tttaagtccattaaaaatggtaatgataaggacatcactgctgaactttctgctcctgtg aaaggacagtctgatgttttactttctaccaccagcaaagaaataatttcatctgaaatg ttagctacttccattgatggcatggaaagaccaggaaatttaaataaagtagtggagcag agacagaatgttggaacacttgaaagtttctctccaggagattttagaactaatatgaat gttgttcatcaaaacaagcagttagagagctgcaatggtgaaaagcctccttgtctggag attctaacaaatgggtttgcagtgttggaaactgtaaatcctcagggaacagatgatctg gacaatgtagctgattcaaagggacggaagcctcttagcactcatagcactgagtataat ttagactctgtacctagtcctgctgaggaatttgcagattttgccacattttccaaaaag gaaaggatacaattagaagaaatagaatgtgcagttttaaatgatagagaagcactaacc attcgggaaaacaataaaattaatagagtcaatgaactgaattctgtaaaagaagtggct ttgggtagaagcttggataacaaaggagacactgatggagaggatcaggtttgtgtttca gaaataagcatagtgactaacagaggtttcagtgttgaaaaacaaggccttccaacactg caacaggatgaatttttacagtcaggtgttcagtcaaaggcttggagtttggtagactca gctgataattcagaagccattaggagagaacaatgtaaaactgaagaaaaacttgactta cttacttctaaatgtgctcacctatgcatggattctgttaaaacttctgatgatgaagtt ggttctcccaaagaagaaagtagaaagtttactaatttccaaagcccaaacattgacccc acagaagaaaatgatttggatgattctttaagtgtaaaaaatggtgatagtagtaatgac tttgtgacttgcaatgatatcaatgaagatgattttggtgattttggtgactttggctct gccagtggctcaactccaccttttgttactggtactcaagattcaatgagtgatgccact tttgaagagtcttcagagcactttccacattttagtgaaccaggtgatgactttggagaa tttggggatataaatgctgtttcttgccaagaggagacaatattaacaaagtcagaccta aaacagacttctgataatttatcagaagaatgtcaattggcaagaaaatctagtggaaca ggcactgaacctgttgcaaaacttaaaaatgggcaagaaggtgagattggacattttgat tctgtgccaaatattcaggatgactgcaatggttttcaagactctgatgattttgcagac ttcagttcagctggtcctagccaagttgtagattggaatgcttttgaggatgaacaaaaa gatagttgttcttgggctgcttttggagaccagcaggctactgaatctcatcatcgaaag gaagcctggcagtcacataggacagatgaaaatattgatactccaggaacccccaaaacg cacagtgtaccttcagcaacttccaaaggagcagttgctagtggccatttacaggaatca gccacttcagttcaggtatttactaattttctctattttgtatga