GENSCAN 1.0 Date run: 5-Nov-116 Time: 00:09:38 Sequence gi568815589f:89505122_89706076 : 200955 bp : 47.90% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 545 716 172 2 1 54 39 110 0.516 2.65 1.02 Intr + 7571 7738 168 1 0 108 66 78 0.986 7.74 1.03 Intr + 11010 11115 106 2 1 102 48 72 0.316 4.39 1.04 Intr + 11538 11572 35 1 2 130 42 18 0.089 -0.66 1.05 Intr + 13324 13460 137 0 2 56 94 81 0.080 4.87 1.06 Intr + 14146 14288 143 1 2 -3 94 102 0.039 1.70 1.07 Intr + 24148 24188 41 2 2 87 75 -8 0.122 -4.26 1.08 Intr + 25299 25376 78 2 0 96 93 18 0.741 2.85 1.09 Term + 25692 25823 132 2 0 84 48 108 0.618 4.49 1.10 PlyA + 28165 28170 6 1.05 2.07 PlyA - 30288 30283 6 1.05 2.06 Term - 30615 30505 111 0 0 89 55 58 0.324 1.16 2.05 Intr - 33261 33135 127 2 1 28 80 89 0.147 2.78 2.04 Intr - 47895 47789 107 0 2 112 31 40 0.049 -0.29 2.03 Intr - 51383 51300 84 2 0 82 94 29 0.732 2.92 2.02 Intr - 51909 51823 87 0 0 21 85 95 0.496 2.67 2.01 Init - 53292 53245 48 0 0 67 111 31 0.531 4.25 2.00 Prom - 53702 53663 40 -6.26 3.00 Prom + 53835 53874 40 -3.96 3.01 Init + 54898 55077 180 0 0 25 51 184 0.389 7.58 3.02 Term + 67000 67098 99 0 0 44 39 112 0.078 0.03 3.03 PlyA + 67835 67840 6 1.05 4.00 Prom + 74716 74755 40 -4.06 4.01 Init + 100001 100053 53 1 2 70 94 58 0.984 5.25 4.02 Intr + 100311 100412 102 0 0 117 86 154 0.996 17.39 4.03 Intr + 100546 100759 214 1 1 100 109 491 0.996 51.12 4.04 Term + 100848 100958 111 2 0 127 48 163 0.994 14.76 4.05 PlyA + 101415 101420 6 1.05 5.00 Prom + 102056 102095 40 -2.26 5.01 Init + 105765 105856 92 2 2 81 64 68 0.336 2.43 5.02 Intr + 107823 107933 111 0 0 142 44 4 0.281 0.89 5.03 Intr + 112020 112087 68 0 2 102 74 37 0.227 2.25 5.04 Term + 116021 116094 74 0 2 32 52 118 0.317 0.67 5.05 PlyA + 116144 116149 6 1.05 6.04 PlyA - 116484 116479 6 1.05 6.03 Term - 118526 118375 152 0 2 6 54 111 0.359 -2.43 6.02 Intr - 118744 118603 142 1 1 68 92 85 0.681 6.83 6.01 Init - 119501 119490 12 2 0 86 99 6 0.722 1.83 6.00 Prom - 120340 120301 40 -1.36 7.10 PlyA - 125447 125442 6 1.05 7.09 Term - 127727 127447 281 0 2 117 48 84 0.056 2.91 7.08 Intr - 138629 138579 51 0 0 117 85 16 0.052 3.08 7.07 Intr - 150512 150402 111 0 0 110 53 40 0.341 3.05 7.06 Intr - 151384 151254 131 0 2 64 56 58 0.301 0.54 7.05 Intr - 154632 154529 104 0 2 70 77 100 0.918 6.07 7.04 Intr - 156531 156362 170 1 2 69 85 126 0.930 10.07 7.03 Intr - 158034 157907 128 2 2 21 54 88 0.262 -0.98 7.02 Intr - 161641 161534 108 0 0 79 82 49 0.746 2.80 7.01 Init - 163340 163186 155 2 2 78 81 148 0.524 12.56 7.00 Prom - 165959 165920 40 -10.94 8.04 PlyA - 166011 166006 6 -0.45 8.03 Term - 166708 166342 367 0 1 38 37 776 0.726 61.58 8.02 Intr - 168010 167983 28 2 1 105 97 7 0.941 0.47 8.01 Init - 170115 169959 157 0 1 56 42 167 0.916 6.98 8.00 Prom - 171037 170998 40 -6.46 9.06 PlyA - 171890 171885 6 -0.45 9.05 Term - 172944 172870 75 0 0 84 49 66 0.046 0.14 9.04 Intr - 177423 177313 111 0 0 84 78 9 0.017 0.08 9.03 Intr - 180159 179941 219 0 0 45 77 106 0.100 3.70 9.02 Intr - 190572 190447 126 0 0 64 51 109 0.923 5.68 9.01 Init - 191763 191641 123 0 0 49 105 89 0.657 6.94 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 125136 125273 138 2 0 83 103 35 0.831 5.16 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:89505122_89706076|GENSCAN_predicted_peptide_1|337_aa XKGAVNLSMGELNCSDSLLIIREFGEYKGLDFSKRNIESLIAILMTPWFSPFSVYDCQDT WIIIEVGGSELRSTRCKPAHLGDVTCPHPLYTHVPFLTLPQASETVAFHRLGTAEQQPTH LAGLLSVAPVPLTPQGGCRRITMLTDCVQGTPSTGRKKAQSQKMLPVPDKHVVDLCISAA RPLENECESQLEETSRKPGDPCQEGDRSEPPNKKPEPPQPTMPWEAQAAAKGYMERIHST TPGYQLAASISVHPAPVFPKPCDYSPHHRQLPFITSSRAQVEGSYSHIVLEKQPTLANRR MNGVLRHRYAVEEQLGDFQALVAKVQHPREAGAACFY >gi568815589f:89505122_89706076|GENSCAN_predicted_CDS_1|1014_bp nngaaaggtgccgtgaacctgagcatgggagaactcaattgttctgattcccttttgatt attcgggaatttggagagtacaaaggattggatttttcaaagagaaacattgagagcctt attgctattcttatgaccccatggttttctcctttctctgtctatgactgtcaggacaca tggatcatcatcgaggttggcggctcagagctaaggagcacacgctgcaaaccagcccac cttggggatgtgacttgtcctcaccctctttatacccatgtccccttcctgacgctgccc caggcttctgagacagtagccttccacaggcttggaactgcggaacagcagcctacacac cttgcagggctgctgtctgtggccccagtgcccttgacaccccaggggggctgcagacgc atcaccatgctaacagactgtgttcaaggaaccccaagcactggaagaaagaaagcccag tcacagaaaatgctacctgttcctgacaagcacgtagttgatctttgcatatctgctgct agacctttggaaaatgagtgtgaatctcagcttgaggagacttcaagaaagcctggtgac ccctgccaggagggtgacagatctgagccaccaaataagaagcctgagcctcctcagccc accatgccatgggaagcccaggctgcagcaaaaggctacatggagaggatccattcaaca accccgggctaccagctggcagccagcatcagtgtccatccagcccctgtttttcccaag ccctgtgactactctccccatcatagacaactgccctttatcacctcctccagagcacag gttgagggcagctacagtcacatagtgcttgagaagcagccgaccctggccaatcgacgg atgaacggggtactcagacacaggtatgcagtggaagagcagctaggggacttccaggct ctagtggccaaagtacagcatcctcgagaagctggagctgcttgcttttattga >gi568815589f:89505122_89706076|GENSCAN_predicted_peptide_2|187_aa MCQLHRKKQKPQEKLLERADSNHMKVPRHTHESDRSVNAQQRSASSQNTPTGPASTFHAG KHSTSTTSQLTFQITPAENICTKAMALVHIEEHPGGIHFQNTFPKRLHRIQELEMTIAPE FAKRLERRSRLSSACKAHYPRDEEGSAAGLPVWVSIELRQPGVPGGSEGVKQGKVSLPWG TMSTSYT >gi568815589f:89505122_89706076|GENSCAN_predicted_CDS_2|564_bp atgtgccagcttcacaggaaaaagcaaaaaccacaagagaaactattggaacgtgcagat tcaaaccacatgaaggtgccacggcacacccacgagagtgaccggagtgtgaatgcgcaa cagcgctcggcatcgagtcagaacacccccaccgggccagccagtaccttccacgccgga aagcactccacctccactacctcccagctgaccttccagatcaccccagctgagaacatc tgcactaaggccatggctttagtgcacatcgaagagcacccggggggaatacatttccag aatacatttcctaaaagattgcacaggatccaagaactggaaatgacaattgctccagaa tttgccaagagactggaaaggagatcaaggctcagctctgcctgcaaagcccattacccc agggatgaggaaggcagtgctgcaggcttgccggtgtgggtgtccatagagctgagacaa ccaggggtccctgggggctcagagggtgtgaagcagggcaaggtctcgctgccatggggc accatgagcacatcctatacctga >gi568815589f:89505122_89706076|GENSCAN_predicted_peptide_3|92_aa MQLVAAKLGFIAVAQRQVTKNVDQEDDTIGSAFVTVICHSLWLHADSVRTCGDVATVVLG KKEEEKGNGKETGKKVKIIRGLTQEFHDVNKF >gi568815589f:89505122_89706076|GENSCAN_predicted_CDS_3|279_bp atgcaacttgtggctgccaagttgggcttcatagctgtagcacaacggcaggtcaccaag aatgtagaccaggaagatgacaccatcggatctgcatttgttactgttatttgtcactca ctgtggctccatgcggacagtgtaaggacctgtggagatgttgccacagtggtgctgggg aaaaaagaagaagagaaaggaaatggaaaggaaacaggaaagaaagtgaagatcattaga ggattaacccaggagtttcatgatgtgaacaaattctaa >gi568815589f:89505122_89706076|GENSCAN_predicted_peptide_4|159_aa MTLEEVRGQDTVPESTARMQGAGKALHELLLSAQRQGCLTAGVYESAKVLNVDPDNVTFC VLAAGEEDEGDIALQIHFTLIQAFCCENDIDIVRVGDVQRLAAIVGAGEEAGAPGDLHCI LISNPNEDAWKDPALEKLSLFCEESRSVNDWVPSITLPE >gi568815589f:89505122_89706076|GENSCAN_predicted_CDS_4|480_bp atgactctggaagaagtccgcggccaggacacagttccggaaagcacagccaggatgcag ggtgccgggaaagcgctgcatgagttgctgctgtcggcgcagcgtcagggctgcctcact gccggcgtctacgagtcagccaaagtcttgaacgtggaccccgacaatgtgaccttctgt gtgctggctgcgggtgaggaggacgagggcgacatcgcgctgcagatccattttacgctg atccaggctttctgctgcgagaacgacatcgacatagtgcgcgtgggcgatgtgcagcgg ctggcggctatcgtgggcgccggcgaggaggcgggtgcgccgggcgacctgcactgcatc ctcatttcgaaccccaacgaggacgcctggaaggatcccgccttggagaagctcagcctg ttttgcgaggagagccgcagcgttaacgactgggtgcccagcatcaccctccccgagtga >gi568815589f:89505122_89706076|GENSCAN_predicted_peptide_5|114_aa MHTRMLIPCPSLGAKVLIFEVGLWMDDWAMRASPSLQTYHALSYPGALDVLVQLRELHVF LNGDSACRADGDKAVSRHILHPVSAACFLGKSGEGAGEEGQHAVLNGLQQQTAL >gi568815589f:89505122_89706076|GENSCAN_predicted_CDS_5|345_bp atgcacacccgcatgcttattccatgcccttccctgggggccaaggtcctgatctttgag gtgggcttgtggatggatgactgggctatgagggcttctccctctctccagacatatcat gccctttcataccccggggccttggacgtgctggtccagctgcgtgaactgcatgttttc ttgaacggcgactctgcttgcagagctgatggggacaaggctgtgtctcggcacatcctc cacccagtctcagctgcctgcttcctgggaaagagtggcgaaggtgctggtgaggagggc cagcatgccgtgctcaatggcctgcaacagcagacagccctgtga >gi568815589f:89505122_89706076|GENSCAN_predicted_peptide_6|101_aa MGGQPMAAKSLLVFLSLPERSEMQAGVCLKCQERRQQGKPTSCVYGKPEQKAPHCRWAWG RDPNSEDSGQFTPKSHGGWPPPRSSLYNLSPSHWLPASADI >gi568815589f:89505122_89706076|GENSCAN_predicted_CDS_6|306_bp atggggggccagcccatggctgcaaagagcctgctggtgttcctgagcctgccagaaaga tctgaaatgcaggcaggtgtatgtcttaaatgccaagaaaggaggcagcaagggaagccc acaagctgtgtttatgggaaaccagaacaaaaagctcctcattgccgctgggcctggggc cgggacccaaacagcgaggacagcgggcagttcacccccaagagccatgggggctggcct cctcctcgttctagcttgtacaatttatctccctctcactggctccctgcttctgcagac atctga >gi568815589f:89505122_89706076|GENSCAN_predicted_peptide_7|412_aa MRKLKQLPKSNDYFRSVRSLVHIKSFASQTGIPTPNSAATWKGDSHAASVRSTGIESLVA PPGPLCQPLDCVAKPVQKHKTQAMHKKSPSPSNDLPLSVSSINRSTESVCFAAQGQNPGV LLSLLGQLKNVHAICGMLCAKLPGMHQGEEHLVAIPEWQMLTEDKRHAGSVLHSHEALDK AVQRAQLGLSEELSGHNLARDSLEIQTRLISPGPAKGSGHKSVEKDSEAVFRHSLSGEFV VAAQEEGSGGDLHVDALPTQVYRKSGLFSSRPHHCEAPGTGSQGQEDWGSGLQELKASLT IDKLPVWKQREVNKKSRCPGINPKSAPEVSSSVSVPGKAIHKTLLHLSSKPEYSSKVLLL RISDKSWSRIWSLSPPPSPFCDGSLHKSLNKLWWDQHGRNAFNICWRSACQS >gi568815589f:89505122_89706076|GENSCAN_predicted_CDS_7|1239_bp atgaggaaactgaagcagttgccaaagtcaaacgactacttcagatcagtccgctccctg gtgcacatcaaaagctttgcctctcaaacggggatccccacacccaactcagctgccaca tggaaaggtgacagccatgcagcctctgtgagaagcacaggcatagagtctttggtggcc cctccgggtcccctttgtcagcccctggactgtgtggccaaacctgtgcagaagcacaaa acccaggccatgcacaaaaaaagtcctagtccaagcaatgacctcccactgagcgtctca agcatcaaccgcagcactgaaagtgtgtgctttgccgcacagggacagaaccctggagtc ctgctgtcactacttggtcagctgaagaacgtccatgctatctgtggaatgctgtgcgca aagctccctggcatgcaccagggagaagagcatttggtggccataccagagtggcagatg ctcacagaagacaaaaggcatgcaggatctgttctccacagccatgaagccctggacaaa gctgtccagagggctcagctgggtctcagtgaggagctcagtgggcacaacctggcccgg gactcactggagatccagaccagactcatctcccctgggccagccaagggctctgggcac aagagtgttgagaaggattctgaggctgtgttcagacacagcctgtcaggtgagttcgtg gtggctgcccaagaggaaggaagtggtggcgacttacatgttgatgctttgccaacccag gtttataggaagtccggtttattctcatcaaggccccaccactgtgaggcaccaggcact ggcagccaagggcaggaggactggggctctggcctccaggagctcaaggcctcattgaca attgacaagcttcctgtttggaagcagagagaagtaaataagaagtccaggtgcccaggc attaatcccaagagtgccccagaagtctcctcctcagtaagtgtccctgggaaggccatc cacaagacgctgctgcatttgtcctccaagccagagtactcttcaaaagttttacttctt aggatctctgataagagctggagccggatctggagcctctcacctccacccagccctttc tgtgatggcagtttacacaaatctttaaacaaactttggtgggatcaacatggtcggaat gctttcaacatctgctggcgttccgcatgccagtcctga >gi568815589f:89505122_89706076|GENSCAN_predicted_peptide_8|183_aa MEGAWLLGRTPSCLLGGEGRRCKPRTPPISSMNNTALSPKPCAMNSPDQLTQKKQYFKKT PRLCFQISTQGVVINITTTTTITTTISITTTITTITTITTTITIITTISITTTTTITTIT IITTISITTTTIITTITIITTISITATTTTTITTTITTVSTIITTTTITTIMTTTITIII SNL >gi568815589f:89505122_89706076|GENSCAN_predicted_CDS_8|552_bp atggagggggcctggttgctgggcagaactccgagctgcctgctgggcggggaaggcagg cgctgcaagcccaggactcctcccatcagcagcatgaacaacaccgccctgtccccgaag ccctgtgcaatgaactcaccagatcagctgacccagaaaaagcagtatttcaagaaaact cctcgactctgtttccaaataagtacccaaggagttgtcatcaacatcaccaccaccacc accatcaccaccaccatctccatcaccaccaccatcaccaccataaccaccatcaccacc accatcaccatcatcaccaccatctccatcaccaccaccaccaccatcaccaccatcacc atcatcaccaccatctccatcaccaccaccaccatcatcaccaccatcaccatcatcacc accatctccatcaccgccaccaccaccaccaccatcaccaccaccatcaccaccgtctcc accatcatcaccaccaccaccatcaccactatcatgactaccaccattactatcattata tctaacctttaa >gi568815589f:89505122_89706076|GENSCAN_predicted_peptide_9|217_aa MIKHTGYSLSISLPHCEMGLHPLHLEEGEMSASFISTQEVEIFRLITLITSQFSKVDTEG DNTFDGSCRPSRAVLQQEAAAGRRSPWSAVQIFQPNKPSKGEAGADWPSRSPGSPLEPEF PTGLVEQNVSICFVWKAQEHVDPASFNGPEYLPLALDPLCGNDTLLNRLSTSVSGIGGFL VSLTSRKKRRTLAEKAAVSDTRNAQLLAGPAACSREQ >gi568815589f:89505122_89706076|GENSCAN_predicted_CDS_9|654_bp atgatcaagcatactggctattctctgagcatcagtcttccccattgtgaaatgggcctg caccctctgcacctagaggaaggggagatgtcagcatctttcatcagcacccaggaagtg gagattttcaggttgattacgctgatcacatcacagtttagcaaagtggacacggaggga gacaatacctttgacgggagctgcagaccttccagggctgtcctgcagcaggaagcagct gcaggccggagaagtccatggagtgccgtccagatattccagcccaacaaaccttctaag ggagaagcaggcgctgactggccctcacggtcccctggctctccactggagccagagttc ccgacaggacttgtggaacagaatgtcagcatttgctttgtctggaaagctcaagagcat gtggatccggcctcattcaacggcccagaatatctgcccctggcattggaccctctgtgt ggcaatgacactttgctaaatcgactctctaccagtgtgtctggaattggtgggttcttg gtctcactgacttcaagaaagaagcggcggaccctcgccgaaaaagcagctgttagtgat acacgcaatgcacagctgttggccgggcctgctgcctgctccagagaacaatga