GENSCAN 1.0 Date run: 6-Nov-116 Time: 05:18:25 Sequence gi568815575f:130072063_130284737 : 212675 bp : 44.45% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 Intr - 355 164 192 2 0 95 100 161 0.974 17.46 1.07 Intr - 2079 1987 93 1 0 4 84 95 0.468 0.64 1.06 Intr - 2690 2519 172 2 1 101 62 137 0.778 11.92 1.05 Intr - 9273 9194 80 1 2 9 80 120 0.018 2.57 1.04 Intr - 10079 10042 38 1 2 115 116 -8 0.018 2.61 1.03 Intr - 14471 14353 119 2 2 85 97 10 0.001 0.86 1.02 Intr - 20939 20832 108 2 0 76 35 88 0.035 2.78 1.01 Init - 26147 26004 144 2 0 63 91 88 0.335 4.74 1.00 Prom - 33975 33936 40 -3.86 2.00 Prom + 36515 36554 40 -5.66 2.01 Init + 45203 45323 121 1 1 83 68 114 0.765 9.26 2.02 Term + 48199 48356 158 2 2 81 41 91 0.272 1.80 2.03 PlyA + 50898 50903 6 1.05 3.14 PlyA - 52278 52273 6 1.05 3.13 Term - 57566 57495 72 2 0 104 48 89 0.987 4.41 3.12 Intr - 58104 57908 197 1 2 47 100 217 0.994 17.93 3.11 Intr - 59737 59613 125 0 2 104 110 125 0.999 16.53 3.10 Intr - 61393 61251 143 1 2 94 50 28 0.964 -1.25 3.09 Intr - 64123 63983 141 1 0 114 103 155 0.996 20.15 3.08 Intr - 64669 64581 89 2 2 100 116 61 0.999 9.79 3.07 Intr - 65123 65016 108 0 0 116 65 114 0.999 12.16 3.06 Intr - 68555 68471 85 2 1 55 99 45 0.759 1.69 3.05 Intr - 73507 73417 91 0 1 102 105 52 0.973 8.30 3.04 Intr - 75561 75431 131 0 2 61 93 85 0.999 5.79 3.03 Intr - 75814 75690 125 2 2 61 86 83 0.925 5.70 3.02 Intr - 77506 77407 100 1 1 71 105 38 0.943 3.48 3.01 Init - 84488 84399 90 2 0 48 92 61 0.607 2.99 3.00 Prom - 86791 86752 40 -5.66 4.00 Prom + 92217 92256 40 -5.16 4.01 Sngl + 93476 93940 465 1 0 49 47 265 0.822 14.71 4.02 PlyA + 95693 95698 6 1.05 5.00 Prom + 97525 97564 40 -7.96 5.01 Init + 100001 100258 258 1 0 98 53 667 0.441 59.34 5.02 Term + 112223 112678 456 1 0 131 47 564 0.981 51.73 5.03 PlyA + 112789 112794 6 1.05 6.00 Prom + 125823 125862 40 -4.86 6.01 Init + 126081 126239 159 2 0 71 64 138 0.615 9.52 6.02 Term + 128565 128669 105 2 0 119 41 4 0.486 -2.79 6.03 PlyA + 128877 128882 6 1.05 7.12 PlyA - 129313 129308 6 1.05 7.11 Term - 132930 132859 72 0 0 70 38 72 0.235 -1.69 7.10 Intr - 135952 135862 91 0 1 39 70 77 0.179 1.00 7.09 Intr - 143952 143729 224 0 2 84 85 40 0.444 0.23 7.08 Intr - 148418 148287 132 2 0 48 87 68 0.877 3.54 7.07 Intr - 154843 154697 147 1 0 83 94 34 0.935 3.93 7.06 Intr - 155720 155620 101 0 2 107 86 40 0.997 5.53 7.05 Intr - 157072 156915 158 0 2 54 94 82 0.790 5.05 7.04 Intr - 158665 158448 218 1 2 80 77 31 0.346 -1.50 7.03 Intr - 164258 164152 107 0 2 56 93 24 0.308 -0.37 7.02 Intr - 164577 164407 171 1 0 71 92 26 0.258 1.21 7.01 Init - 174932 174797 136 2 1 70 82 159 0.944 13.80 7.00 Prom - 181297 181258 40 -8.26 8.00 Prom + 181731 181770 40 0.24 8.01 Init + 193541 193614 74 1 2 41 84 49 0.439 0.34 8.02 Intr + 196679 196950 272 2 2 56 40 156 0.205 4.79 8.03 Intr + 198848 198999 152 0 2 78 65 43 0.181 0.88 8.04 Term + 210596 210937 342 1 0 10 44 240 0.239 6.31 8.05 PlyA + 211421 211426 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 9268 9194 75 1 0 93 80 119 0.967 12.69 S.002 Term + 181337 181865 529 2 1 66 47 214 0.858 8.83 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:130072063_130284737|GENSCAN_predicted_peptide_1|316_aa MNIQVSLWQAGPQALRACARQQSFVHTPCLPMAQGGSRLREPVVTKEQYSQDSKPEYREG HLFGFVGSTGQLRDLSVHGFWYSQAVYTVGGLWVPWRKRKKHLFAWDLPSNHSSSGTDQG LQRRCGPSKLTGPHSQDSMAITLQPSDLIFEFASNGMDDDIHQLEDPSVFPAVIVEQVPY PDLLHLYSGLELDDVHNGIITDGTLCMTQDQILEGSFLLTDDNEATSHTMSTAEVLLNME SPSDILDEKQIFSTSEMLPDSDPAPAVTLPNYLFPASEPDALNRAGDTSDQEGHSLEEKA SREESAKKTGKSKKRX >gi568815575f:130072063_130284737|GENSCAN_predicted_CDS_1|948_bp atgaatattcaggtttcgctctggcaggcagggccccaggcactgcgggcctgtgccagg cagcaatccttcgtccacaccccctgcctgcccatggctcaggggggctccaggctgcgg gagcctgtagtgacgaaagagcagtattctcaagattcgaaacctgagtacagggagggc cacctttttggatttgtgggctccacaggccaactgcgggacttgagtgtgcatggattt tggtattcccaggcagtctacactgtgggtgggctgtgggtcccctggagaaagaggaaa aagcatctctttgcctgggacctgccatctaatcactcctcctctgggacagatcaaggc ctgcaaagaagatgtggcccctcaaagctaactggcccccactcccaagacagcatggct attaccctacagcccagtgacctgatctttgagttcgcaagcaacgggatggatgatgat atccaccagctggaagacccctctgtgttcccagctgtgatcgtggagcaggtaccctac cctgatttactgcatctgtactcgggactggagttggacgacgttcacaatggcatcata acagacgggaccttgtgcatgacgcaggatcagatcctggaaggcagttttttgctgaca gatgacaatgaggccacctcgcacaccatgtcaaccgcggaagtcttactcaatatggag tctcccagcgatatcctggatgagaagcagatcttcagtacctccgaaatgcttccagac tcggaccctgcaccagctgtcactctgcccaactacctgtttcctgcctctgagcccgat gccctgaacagggcgggtgacactagtgaccaggaggggcattctctggaggagaaggcc tccagagaggaaagtgccaagaagactgggaaatcaaagaagagaann >gi568815575f:130072063_130284737|GENSCAN_predicted_peptide_2|92_aa MKAPTKLQKITGFQLSSSWTAAQEGWLPGGDDADTGGAAGSAEDRIDKHGPPNCPSRSGL KQSLYLSCRPSRVSGCAPALATVCIPCAGDAF >gi568815575f:130072063_130284737|GENSCAN_predicted_CDS_2|279_bp atgaaagcccctacaaaacttcagaagatcaccggctttcaattgtcttcgagctggaca gcagctcaggagggctggctgcctggaggggatgatgccgacacaggaggcgctgctggc tctgcagaagaccggattgacaagcacggaccacctaactgtccctctagatccggactt aagcagagcctctacctctcatgtcggccgagccgagtctcaggctgcgccccagccttg gccacagtctgcatcccctgtgctggagatgcgttctaa >gi568815575f:130072063_130284737|GENSCAN_predicted_peptide_3|498_aa MASSGASGGKIDNSVLVLIVGLSTVGAGAYAYKTMKEDEKRYNERISGLGLTPEQKQKKA ALSASEGEEVPQDKAPSHVPFLLIGGGTAAFAAARSIRARDPGARVLIVSEDPELPYMRP PLSKELWFSDDPNVTKTLRFKQWNGKERSIYFQPPSFYVSAQDLPHIENGGVAVLTGKKV VQLDVRDNMVKLNDGSQITYEKCLIATARALGTEVIQLFPEKGNMGKILPEYLSNWTMEK VRREGVKVMPNAIVQSVGVSSGKLLIKLKDGRKVETDHIVAAVGLEPNVELAKTGGLEID SDFGGFRVNAELQARSNIWVAGDAACFYDIKLGRRRVEHHDHAVVSGRLAGENMTGAAKP YWHQSMFWSDLGPDVGYEAIGLVDSSLPTVGVFAKATAQDNPKSATEQSGTGIRSESETE SEASEITIPPSTPAVPQAPVQGEDYGKGVIFYLRDKVVVGIVLWNIFNRMPIARKIIKDG EQHEDLNEVAKLFNIHED >gi568815575f:130072063_130284737|GENSCAN_predicted_CDS_3|1497_bp atggctagctctggtgcatcagggggcaaaatcgataattctgtgttagtccttattgtg ggcttatcaacagtaggagctggtgcctatgcctacaagactatgaaagaggatgaaaaa agatacaatgaaagaatttcagggttagggctgacaccagaacagaaacagaaaaaggcc gcgttatctgcttcagaaggagaggaagttcctcaagacaaggcgccaagtcatgttcct ttcctgctaattggtggaggcacagctgcttttgctgcagccagatccatccgggctcgg gatcctggggccagggtactgattgtatctgaagatcctgagctgccgtacatgcgacct cctctttcaaaagaactgtggttttcagatgacccaaatgtcacaaagacactgcgattc aaacagtggaatggaaaagagagaagcatatatttccagccaccttctttctatgtctct gctcaggacctgcctcatattgagaatggtggtgtggctgtcctcactgggaagaaggta gtacagctggatgtgagagacaacatggtgaaacttaatgatggctctcaaataacctat gaaaagtgcttgattgcaacagctcgagccttgggcacagaagtgattcaactcttcccc gagaaaggaaatatgggaaagatcctccccgaatacctcagcaactggaccatggaaaaa gtcagacgagagggggttaaggtgatgcccaatgctattgtgcaatccgttggagtcagc agtggcaagttacttatcaagctgaaagacggcaggaaggtagaaactgaccacatagtg gcagctgtgggcctggagcccaatgttgagttggccaagactggtggcctggaaatagac tcagattttggtggcttccgggtaaatgcagagctacaagcacgctctaacatctgggtg gcaggagatgctgcatgcttctacgatataaagttgggaaggaggcgggtagagcaccat gatcacgctgttgtgagtggaagattggctggagaaaatatgactggagctgctaagccg tactggcatcagtcaatgttctggagtgatttgggccccgatgttggctatgaagctatt ggtcttgtggacagtagtttgcccacagttggtgtttttgcaaaagcaactgcacaagac aaccccaaatctgccacagagcagtcaggaactggtatccgatcagagagtgagacagag tccgaggcctcagaaattactattcctcccagcaccccggcagttccacaggctcccgtc cagggggaggactacggcaaaggtgtcatcttctacctcagggacaaagtggtcgtgggg attgtgctatggaacatctttaaccgaatgccaatagcaaggaagatcattaaggacggt gagcagcatgaagatctcaatgaagtagccaaactattcaacattcatgaagactga >gi568815575f:130072063_130284737|GENSCAN_predicted_peptide_4|154_aa MGQSPGSRFLCLGLRTHTVRTKGTSFCFKAPAARPPHRNISATAIRDLLLPFPLTHDRRV KHREPRPAPPVSSHAHYADSSSQLRVGIGQRSRPARAGEGERRPEAYCAGVALAAALRLV RTARGFAHALAGRKLAKTLALGSRFSRVCGTVLD >gi568815575f:130072063_130284737|GENSCAN_predicted_CDS_4|465_bp atgggtcagtcacctgggagccggttcctctgcctcgggcttcggacgcacacggtccgc accaagggcaccagcttctgcttcaaagcacccgccgccaggcctccacaccggaacatt tcggcgaccgctattcgggacctcctccttccctttcctctcacgcacgaccgacgggtc aaacaccgtgagccccggccagctcccccagtctcttcacacgcacattacgcagactcc tcctcccagctccgggtgggcattggacagagaagccggcctgctagagccggggaaggg gaacggcgaccggaggcctactgcgcaggcgtagcactcgctgccgcattgcgattggtc cgcacggcgagaggatttgcgcatgcactagctggccgcaagctcgcaaagacgctggct ctaggtagccggttttcgcgagtttgtggcacagtgttggactga >gi568815575f:130072063_130284737|GENSCAN_predicted_peptide_5|237_aa MAQPILGHGSLQPASAAGLASLELDSSLDQYVQIRIFKIIVIGDSNVGKTCLTFRFCGGT FPDKTEATIGVDFREKTVEIEGEKIKVQVWDTAGQERFRKSMVEHYYRNVHAVVFVYDVT KMTSFTNLKMWIQECNGHAVPPLVPKVLVGNKCDLREQIQVPSNLALKFADAHNMLLFET SAKDPKESQNVESIFMCLACRLKAQKSLLYRDAERQQGKVQKLEFPQEANSKTSCPC >gi568815575f:130072063_130284737|GENSCAN_predicted_CDS_5|714_bp atggcgcagcccatcctgggccatgggagcctgcagcccgcctcggccgctggcctggcg tccctggagctcgactcgtcgctggaccagtacgtgcagattcgcatcttcaaaataatc gtgattggggactccaacgtgggcaagacctgcctgaccttccgcttctgcgggggtacc ttcccagacaagactgaagccaccatcggcgtggacttcagggagaagaccgtggaaatc gagggcgagaagatcaaggttcaggtgtgggacacagcaggtcaggaacgtttccgcaaa agcatggtcgagcattactaccgcaacgtacatgccgtggtcttcgtctatgacgtcacc aagatgacatctttcaccaacctcaaaatgtggatccaagaatgcaatgggcatgctgtg cccccactagtccccaaagtgcttgtgggcaacaagtgtgacttgagggaacagatccag gtgccctccaacttagccctgaaatttgctgatgcccacaacatgctcttgtttgagaca tcggccaaggaccccaaagagagccagaacgtggagtcgattttcatgtgcttggcttgc cgattgaaggcccagaaatccctgctgtatcgtgatgctgagaggcagcaggggaaggtg cagaaactggagttcccacaggaagctaacagtaaaacttcctgtccttgttga >gi568815575f:130072063_130284737|GENSCAN_predicted_peptide_6|87_aa MHSSLSKFVDDIKLFQVVKYQTDKNQTAEGFQNQRREVYEYEHEVNYGVAKYKKAPLTDT QVAHFLTSFRSLLKCHLLKEAQLDHQI >gi568815575f:130072063_130284737|GENSCAN_predicted_CDS_6|264_bp atgcacagcagcctctccaagtttgttgatgatattaagctttttcaggtggtcaaatac caaactgacaagaatcaaactgcagaagggtttcaaaatcagagaagagaggtatatgag tatgagcatgaggtgaactatggtgtggccaagtataagaaagctcctctgacagatacc caggtggctcacttcctcacctccttcaggtctttactcaaatgccatcttctcaaggag gcccaacttgaccatcaaatttaa >gi568815575f:130072063_130284737|GENSCAN_predicted_peptide_7|518_aa MAELFMECEEEELEPWQKKVEETQDEDDDELIFVGEISSSKPAISSMKNTSYVLKHPSTS KVNSVTPKKPKTSEDVPQINPSTSLPLIGSPPVTSSQVMLSKGTNTSSPYDAGADYLRAC PKCNVQFNLLDPLKYHMKHCCPDMITKFLGVIVKSERPCDEDKTDSETGKLIMLVNEFYY GRHEGVTEKEPKTYTTFKCFSCSKVLKNNIRFMNHMKHHLELEKQNNESWENHTTCQHCY RQYPTPFQLQCHIESTHTPHEFSTICKICELSFETEHILLQHMKDTHKPGEMPYVCQVCQ FRSSTFSDVEAHFRAAHENTKNLLCPFCLKVSKMATPYMNHYMKHQKKGVHRCPKCRLQF LTSKEKAEHKAQHRTFIKPKELEGLPPGAKVTPPTSQNTTARNPRKSNASRSKTSKLHAT TSTASKVNTSKPRGRIAKSKAKPSYKQKRQRNRKNKMSLALKNISVGSPIQMTVMKKACT NLQGMYNLLKGYGYKLLIEIDSCSMEFMQPGARQVGSS >gi568815575f:130072063_130284737|GENSCAN_predicted_CDS_7|1557_bp atggcagaactctttatggaatgtgaagaagaagagctagagccatggcagaagaaagta gaagaaactcaggatgaggatgacgatgaactgatctttgttggagagatatcaagttca aaaccagccatttcaagtatgaaaaatacttcatatgtgttgaaacatccttctacttct aaagtaaacagtgttactccaaaaaaaccaaagaccagtgaagatgttcctcagataaat ccctccacttcattgcctttaattggctctcctccagtgacatcctcccaagttatgctg tcaaaaggtacaaatacctcatctccatatgatgctggagcagattacctaagagcttgt ccaaagtgcaatgttcagttcaatcttttggatcctttgaaataccacatgaagcattgt tgtccagacatgataactaaatttttgggagtaattgttaaatcagaacgtccatgtgat gaagacaagactgattcagagacaggaaagttgatcatgttagtcaatgagttttattat ggaaggcatgaaggagtcactgagaaagagccaaagacttacacaacctttaaatgcttc agttgctcgaaagttcttaaaaataatattaggtttatgaaccacatgaaacatcacttg gaacttgagaagcagaacaatgaaagctgggaaaaccacaccacctgccagcactgttac cggcaatatcccacacccttccaactgcagtgccacattgagagtacacacactccccat gagttttctactatttgcaaaatctgtgaattatcatttgaaacagagcatattctttta caacatatgaaggacacccataaacctggtgaaatgccatatgtttgccaggtttgccag tttagatcatcaacattttctgatgtagaagctcattttagagcagcccatgaaaacact aagaacttgctatgtccattttgcctcaaagttagtaaaatggcaaccccctacatgaat cattacatgaagcatcagaaaaaaggagttcatcgttgcccaaaatgcagactacaattt ttgaccagcaaggagaaagctgaacataaggcgcagcatcgtacatttataaagcctaaa gaactagaaggattgcctcctggagcaaaagtcacacctccgacatctcaaaatacaact gctagaaatcctagaaaatctaatgccagtagatctaagacaagtaagcttcatgcaact acatccactgcaagtaaagttaatacaagtaagccaaggggacgtatagctaagtccaaa gcaaaaccctcttacaagcaaaagcgacagcgcaacagaaaaaataaaatgagccttgct ttgaagaacataagcgtaggaagtcctatacagatgaccgtgatgaagaaggcttgtacc aaccttcaaggaatgtacaatttattaaaggggtatggatataagctgctcattgaaata gattcctgctccatggagttcatgcaacctggtgcaagacaagttggaagttcctaa >gi568815575f:130072063_130284737|GENSCAN_predicted_peptide_8|279_aa MGTLDIGADNLSGNFWSTTSWLPNLRGRGPLTESRADRSDWGSDKEEEKEEEEREEREKP QATASATATPPETSSAEPASQPGAEVYNCTRCLMGRRRLHPLNIRGLLTAHGRTAVSRVS TSNLKLRGGMVKLPQQDDQKTLIHPNFNLTYSFFREKYSQEDQRALTLDISSESYKWGHK LIKISFLIPPLWCITIVTWRKEKREQNEEKWQARITAAIIGDALNAQRASKGNPKGNKDN ASKGSCFKCKKNGHWAKECSKPLPGPYCQCEGTSCGPWH >gi568815575f:130072063_130284737|GENSCAN_predicted_CDS_8|840_bp atgggaacactggacataggggctgacaacctgagtgggaatttctggtctaccacttcc tggctgccaaatctgcgggggcgggggccgcttaccgagagcagagcagatcggtctgac tggggttcggataaggaggaggagaaagaggaggaggaaagggaggagcgcgaaaaacct caggcgacagcctcagcaacagcgactcccccggagacttccagcgcggaaccggcgtct cagccgggggcagaagtatataactgcacgcgctgcctcatgggtagacgccgactccac cccctgaacatccggggccttttaaccgcccacggacggacggctgtgtcacgagtgtcc acatccaacctaaaattacgaggaggaatggtcaaactgcctcagcaggatgaccagaaa acacttatacacccaaatttcaatctaacttattctttttttcgtgaaaaatacagccaa gaagaccagcgggctctgacattagacataagctccgaaagctacaaatggggccacaaa ctaatcaaaatcagcttcctgataccacctttatggtgtataacaattgtgacctggagg aaggaaaaaagggaacagaatgaagaaaaatggcaagccagaattacggcagccatcatt ggcgatgccttgaatgctcaaagagcatctaagggaaatccaaagggcaataaggataat gccagcaaaggctcttgcttcaaatgcaagaaaaatgggcactgggcaaaggaatgtagt aagcccctgccaggcccctactgtcaatgcgaaggcaccagttgtggcccctggcactag