GENSCAN 1.0	Date run:  6-Nov-116	Time: 22:03:44

Sequence gi568815581f:34861567_35062682 : 201116 bp : 43.80% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2541   2650  110  2  2   72   52    78 0.161   1.68
 1.02 Intr +  27470  27641  172  2  1   78  119    80 0.886  10.05
 1.03 Intr +  28100  28119   20  1  2   82   78    36 0.524  -2.59
 1.04 Intr +  28589  28719  131  2  2   66   56    78 0.541   2.74
 1.05 Intr +  29271  29300   30  1  0   71   93    46 0.446   1.50
 1.06 Term +  35264  35376  113  0  2   83   44    70 0.500   0.82
 1.07 PlyA +  37055  37060    6                               1.05

 2.12 PlyA -  38011  38006    6                               1.05
 2.11 Term -  38899  38891    9  1  0  110   41     0 0.109  -4.31
 2.10 Intr -  40936  40764  173  2  2   41  102    80 0.030   4.36
 2.09 Intr -  69485  69383  103  2  1   67  110    87 0.689   8.55
 2.08 Intr -  70934  70801  134  0  2   85   82    96 0.866   9.06
 2.07 Intr -  77764  77617  148  1  1   61  103   128 0.999  11.41
 2.06 Intr -  81077  80918  160  1  1   74  100    96 0.953   9.39
 2.05 Intr -  90487  90384  104  1  2  135   33    60 0.680   4.17
 2.04 Intr -  93033  92860  174  0  0   64   87    91 0.989   6.74
 2.03 Intr -  97128  96994  135  0  0   90   52    64 0.900   3.76
 2.02 Intr -  98084  98021   64  1  1   68   89   110 0.956   7.82
 2.01 Init -  98527  98517   11  1  2   87   33    -6 0.011  -6.21
 2.00 Prom -  99148  99109   40                              -7.56

 3.00 Prom +  99751  99790   40                              -8.26
 3.01 Sngl +  99953 101119 1167  1  0   67   46  1159 0.453 103.71
 3.02 PlyA + 101766 101771    6                               1.05

 4.00 Prom + 117364 117403   40                              -2.16
 4.01 Init + 121701 121986  286  2  1   62  110   287 0.796  25.54
 4.02 Intr + 124422 124565  144  1  0   59  110    78 0.978   7.35
 4.03 Intr + 127861 128097  237  2  0  109   48   231 0.964  18.79
 4.04 Intr + 129397 129548  152  2  2   77   71   149 0.698  11.98
 4.05 Intr + 130189 130271   83  0  2   77   75   171 0.619  13.14
 4.06 Intr + 130392 130469   78  0  0   91   75    56 0.904   3.27
 4.07 Intr + 130958 131126  169  2  1   97   73   302 0.999  29.55
 4.08 Intr + 132710 132865  156  1  0   43   86   173 0.966  12.81
 4.09 Intr + 134498 134629  132  1  0  123   74    70 0.989   9.94
 4.10 Intr + 135008 135087   80  1  2  101  109    54 0.999   7.15
 4.11 Intr + 136172 136259   88  2  1  120   98    87 0.999  12.87
 4.12 Intr + 136653 136730   78  2  0   97   98    51 0.990   6.75
 4.13 Intr + 137038 137161  124  0  1  114   77   184 0.994  20.06
 4.14 Intr + 137741 137883  143  0  2   88  100   204 0.999  21.67
 4.15 Intr + 138216 138290   75  2  0   69   64    91 0.958   4.51
 4.16 Intr + 139691 139837  147  1  0   89   64   158 0.999  13.93
 4.17 Intr + 140343 140538  196  2  1  117   91   166 0.990  18.79
 4.18 Intr + 141102 141223  122  1  2  100  100   144 0.887  17.01
 4.19 Term + 142707 142940  234  2  0  134   39   201 0.682  16.12
 4.20 PlyA + 143220 143225    6                               1.05

 5.08 PlyA - 144567 144562    6                               1.05
 5.07 Term - 150583 150402  182  2  2  100   54   213 0.998  16.87
 5.06 Intr - 153197 153174   24  0  0  102   92     6 0.568   0.40
 5.05 Intr - 155014 154804  211  1  1   84   68   268 0.998  22.89
 5.04 Intr - 156040 155957   84  1  0  124   75   116 0.999  13.92
 5.03 Intr - 160218 159805  414  0  0   43  110   127 0.310   4.70
 5.02 Intr - 164995 164808  188  2  2   89  113    78 0.940   9.81
 5.01 Init - 168486 168354  133  0  1   78   47    73 0.697   2.50
 5.00 Prom - 168610 168571   40                              -2.46

 6.03 PlyA - 170644 170639    6                               1.05
 6.02 Term - 189905 189709  197  0  2   42   49   118 0.339   0.87
 6.01 Intr - 198631 198559   73  1  1   58  105    33 0.120   0.98

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  15796  15978  183  0  0   76   53   127 0.923   5.54
S.002 Init -  99827  99691  137  2  2   52   45   209 0.910  10.92

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581f:34861567_35062682|GENSCAN_predicted_peptide_1|191_aa
KGLKDTFKKQQKNKVKLELNKVPTLEEQQNSTERRSWSAQVKAEDKTARPMLDASWISYS
VLKTAVDDEHYSPSLQIPILRLRGLKDMPRFAQLSHNPNQRAHYSLISAHDDVPPFIIPW
DILMAPETELLDNRAESYSAWHAQGFKLPVDTIKALIPFPINLECLLHLSSSNSAALSAG
CTAELPAEIPI

>gi568815581f:34861567_35062682|GENSCAN_predicted_CDS_1|576_bp
aaaggactgaaggacacgttcaagaagcagcagaaaaacaaggttaaattagaattaaac
aaagtgccaactttagaggagcaacagaactccacagagaggcgcagctggtcagctcaa
gtcaaagctgaagacaagactgccagacctatgctagatgcctcatggatatcttattcc
gtccttaaaacagctgtggatgatgagcattactcccccagtttgcagattccaatactg
aggctcagaggcttaaaggacatgcccaggtttgcacagctgagccacaaccccaaccag
agggcccattattccctcatcagtgcccacgacgatgtacccccttttatcattccctgg
gatatactcatggccccagagactgagctcctggacaacagagcagagtcttactcagct
tggcatgcacagggattcaaactaccagtggacaccatcaaagctcttatccccttcccc
atcaacctggaatgcctgctccacctttcttcctctaattcagcagcgctcagtgctggc
tgcacagcagaattaccagcagagattccaatttga

>gi568815581f:34861567_35062682|GENSCAN_predicted_peptide_2|404_aa
MERRLVSGAGDIKLTKDGNVLLDEMQIQHPTASLIAKVATAQDDVTGDGTTSNVLIIGEL
LKQADLYISEGLHPRIIAEGFEAAKIKALEVLEEVKVTKEMKRKILLDVARTSLQTKVHA
ELADVLTEVVVDSVLAVRRPGYPIDLFMVEIMEMKHKLGTDTKEVNSGFFYKTAEEKEKL
VKAERKFIEDRVQKIIDLKDKVCAQSNKGFVVINQKGEEKFTFIEECVNPCSVTLLVKGP
NKHTLTQVKDAIRDGLRAIKNAIEDGCMVPGAGAIEVAMAEALVTYKNSIKGRARLGVQA
FADALLIIPKVLAQNAGYDPQETLVKVQAEHVESKQLVGVDLNTGSETKFCPYTQEVVPL
RPIHIDRDLQEADAHKPEGYSVVYTEKCQVDGMPTLPGGTTKVS

>gi568815581f:34861567_35062682|GENSCAN_predicted_CDS_2|1215_bp
atggagagaaggcttgtttctggtgcaggtgacatcaaactcaccaaagatggcaatgtg
ctgctcgatgagatgcaaattcaacatccaacagcttccttgatagcaaaagtagcaaca
gctcaggatgacgtcacaggagatggtactacttcaaatgttctaattattggagagtta
ttaaaacaagctgacctgtacatttctgagggcctgcaccctagaataatagctgaagga
tttgaagctgcaaagataaaagcacttgaagttttggaggaagttaaagtgacaaaggag
atgaaaagaaaaatcctcttagatgtagctagaacatcattacaaactaaagttcatgct
gaactggctgatgtcttaacagaggttgtggtggattctgttttggctgttagaagacca
ggttaccctattgatctcttcatggtagaaataatggagatgaagcataaattaggaaca
gatacaaaagaggtgaactctggtttcttttataagactgcagaagagaaagagaaattg
gtaaaagctgaaagaaaatttattgaagatagagtacaaaaaataatagacctgaaggac
aaagtctgtgctcagtcaaataaaggatttgtcgtcattaatcaaaagggtgaagaaaag
ttcacttttattgaggagtgtgttaacccttgctctgttaccttgttggttaaaggacca
aataagcatactctcacacaagtcaaggatgccataagagatggacttcgtgctatcaaa
aatgccattgaagatggttgtatggttcctggagctggtgcaattgaagtggcaatggct
gaagctcttgttacatataagaacagtataaaaggaagagctcgtcttggagtccaagct
tttgctgatgccttactcattattcccaaggttcttgctcagaatgctggttatgaccca
caggaaacattagtaaaagttcaggctgagcatgtcgagtcaaaacaacttgtgggcgta
gatttgaatacaggctcagaaaccaaattttgtccttacactcaagaggtcgtgcctttg
aggccaattcacattgacagagacctacaagaagcagatgcacacaagccagaaggctac
tctgtggtttacacagagaaatgccaggtggatggaatgcccaccttgccaggtggcaca
acaaaggtaagttaa

>gi568815581f:34861567_35062682|GENSCAN_predicted_peptide_3|388_aa
MPRPTETRIVLGLVAKMASSASARTPAGKRVINQEELRRLMKEKQRLSTSRKRIESPFAK
YNRLGQLSCALCNTPVKSELLWQTHVLGKQHREKVAELKGAKEASQGSSASSAPHSVKRK
APDADDQDVKRAKATLVPQVQPSTSAWTTNFDKIGKEFIRATPSKPSGLSLLPDYEDEEE
EEEEEEGDGERKRGDASKPLSDAQGKEHSVSSSREVTSSVLPNDFFSTNPPKAPIIPHSG
SIEKAEIHEKVVERRENTAEALPEGFFDDPEVDARVRKVDAPKDQMDKEWDEFQKAMRQV
NTISEAIVAEEDEEGRLDRQIGEIDEQIECYRRVEKLRNRQDEIKNKLKEILTIKELQKK
EEENADSDDEGELQDLLSQDWRVKGALL

>gi568815581f:34861567_35062682|GENSCAN_predicted_CDS_3|1167_bp
atgcctcgcccgacggaaaccagaatcgttttgggtctggtcgccaagatggcgtcctcc
gcctccgcccggactccggcagggaagcgagtgataaatcaggaagaattgcggcggtta
atgaaggagaagcagcgtctgagcaccagtcggaaacggatagaatctccattcgcgaag
tacaaccgtttggggcagctgagttgtgccctgtgtaacactccggttaagagcgagctc
ctgtggcagactcacgtcctgggaaagcagcaccgagagaaagtggccgagctgaaaggc
gcgaaggaagccagccagggttcgtccgccagttcagcgcctcattccgtcaagaggaaa
gcgccggacgcagacgaccaagatgtcaagagagcgaaggccaccttggtgcctcaggta
cagccctccacatctgcgtggaccaccaactttgacaaaataggaaaggagttcattaga
gcgactcccagtaagccttcaggactcagtttactccccgattatgaagatgaggaggag
gaggaagaggaggaggaaggagatggagaaagaaaaaggggggacgccagcaagccgctc
tccgacgcacagggcaaggagcactcagtttcctcttcacgggaggtaacaagtagtgtg
ctgccaaacgatttctttagtactaatcctcccaaggcccccataattcctcattcaggg
tcaattgagaaagcagaaatacatgaaaaagtggtggaaaggagagaaaacaccgcggaa
gcgttaccggaaggtttttttgacgaccctgaggtagatgcaagagtacgaaaggttgat
gctccaaaagatcagatggacaaagagtgggacgaattccaaaaagccatgaggcaggtc
aacactatttccgaagccatagttgccgaagaggatgaggagggacggttggaccgccag
attggggagatcgatgagcagatagagtgttaccgacgggtggaaaagctacggaatcgc
caggatgaaataaaaaataaacttaaagaaatcctgaccataaaagaactgcagaaaaag
gaagaagagaatgctgacagcgatgatgagggggaactacaggatttgttgtctcaggat
tggagggtgaaaggggcattgttatag

>gi568815581f:34861567_35062682|GENSCAN_predicted_peptide_4|907_aa
MAEQRFCVDYAKRGTAGCKKCKEKIVKGVCRIGKVVPNPFSESGGDMKEWYHIKCMFEKL
ERARATTKKIEDLTELEGWEELEDNEKEQITQHIADLSSKAAGTPKKKAVVQAKLTTTGQ
VTSPVKGASFVTSTNPRKFSGFSERQNEFISVVSPTAKPNNSGEAPSSPTPKRSLSSSKC
DPRHKDCLLREFRKLCAMVADNPSYNTKTQIIQDFLRKGSAGDGFHGDVYLTVKLLLPGV
IKTVYNLNDKQIVKLFSRIFNCNPDDMARDLEQEVDEFLLRLSKLTKEDEQQQALQDIAS
RCTANDLKCIIRLIKHDLKMNSGAKHVLDALDPNAYEAFKASRNLQDVVERVLHNAQEVE
KEPGQRRALSVQASLMTPVQPMLAEACKSVEYAMKKCPNGMFSEIKYDGERVQVHKNGDH
FSYFSRSLKPVLPHKVAHFKDYIPQAFPGGHSMILDSEVLLIDNKTGKPLPFGTLGVHKK
AAFQDANVCLFVFDCIYFNDVSLMDRPLCERRKFLHDNMVEIPNRIMFSEMKRVTKALDL
ADMITRVIQEGLEGLVLKDVKGTYEPGKRHWLKVKKDYLNEGAMADTADLVVLGAFYGQG
SKGGMMSIFLMGCYDPGSQKWCTVTKCAGGHDDATLARLQNELDMVKISKDPSKIPSWLK
VNKIYYPDFIVPDPKKAAVWEITGAEFSKSEAHTADGISIRFPRCTRIRDDKDWKSATNL
PQLKELYQLSKEKADFTVVAGDEGSSTTGGSSEENKGPSGSAVSRKAPSKPSASTKKAEG
KLSNSNSKDGNMQTAKPSAMKVGEKLATKSSPVKVGEKRKAADETLCQTKVLLDIFTGVR
LYLPPSTPDFSRLRRYFVAFDGDLVQEFDMTSATHVLGSRDKNPAAQQVSPEWIWACIRK
RRLVAPC

>gi568815581f:34861567_35062682|GENSCAN_predicted_CDS_4|2724_bp
atggctgagcaacggttctgtgtggactatgccaagcgtggcacagctggctgcaaaaaa
tgcaaggaaaagattgtgaagggcgtatgccgaattggcaaagtggtgcccaatcccttc
tcagagtctgggggtgatatgaaagagtggtaccacattaaatgcatgtttgagaaacta
gagcgggcccgggccaccacaaaaaaaatcgaggacctcacagagctggaaggctgggaa
gagctggaagataatgagaaggaacagataacccagcacattgcagatctgtcttctaag
gcagcaggtacaccaaagaagaaagctgttgtccaggctaagttgacaaccactggccag
gtgacttctccagtgaaaggcgcctcatttgtcaccagtaccaatccccggaaattttct
ggcttttcagaaaggcagaatgaattcatctctgttgtttctccaacagccaagcccaac
aactctggggaagccccctcgagccccacccctaagagaagtctgtcttcaagcaaatgt
gaccccaggcataaggactgtctgctacgggagtttcgaaagttatgcgccatggtggcc
gataatcctagctacaacacgaagacccagatcatccaggacttccttcggaaaggctca
gcaggagatggtttccacggtgatgtgtacctaacagtgaagctgctgctgccaggagtc
attaagactgtttacaacttgaacgataagcagattgtgaagcttttcagtcgcattttt
aactgcaacccagatgatatggcacgggacctagagcaggaagtggatgagttccttctg
cggctgtccaagctcaccaaggaggatgagcagcaacaggccctacaggacattgcctcc
aggtgtacagccaatgaccttaaatgcatcatcaggttgatcaaacatgatctgaagatg
aactcaggtgcaaaacatgtgttagacgcccttgaccccaatgcctatgaagccttcaaa
gcctcgcgcaacctgcaggatgtggtggagcgggtccttcacaacgcgcaggaggtggag
aaggagccgggccagagacgagctctgagcgtccaggcctcgctgatgacacctgtgcag
cccatgttggcggaggcctgcaagtccgttgagtatgcaatgaagaaatgtcccaatggc
atgttctctgagatcaagtacgatggagagcgagtccaggtgcataagaatggagaccac
ttcagctacttcagccgcagtctcaagcccgtccttcctcacaaggtggcccactttaag
gactacattccccaggcttttcctgggggccacagcatgatcttggattctgaagtgctt
ctgattgacaacaagacaggcaaaccactgccctttgggactctgggagtacacaagaaa
gcagccttccaggatgctaatgtctgcctgtttgtttttgattgtatctactttaatgat
gtcagcttgatggacagacctctgtgtgagcggcggaagtttcttcatgacaacatggtt
gaaattccaaaccggatcatgttctcagaaatgaagcgagtcacaaaagctttggacttg
gctgacatgataacccgggtgatccaggagggattggaggggctggtgctgaaggatgtg
aagggtacatatgagcctgggaagcggcactggctgaaagtgaagaaagactatttgaac
gagggggccatggccgacacagctgacctggtggtccttggagccttctatgggcaaggg
agcaaaggcggcatgatgtcaatcttcctcatgggctgctacgaccctggcagccagaag
tggtgcacagtcaccaagtgtgcaggaggccatgatgatgccacgcttgcccgcctgcag
aatgaactagacatggtgaagatcagcaaggaccccagcaaaatacccagctggttgaag
gtcaacaagatctactatcctgacttcatcgtcccagacccaaagaaagctgccgtgtgg
gagatcacaggggctgaattctccaaatcggaggctcatacagctgacgggatctccatc
cgattccctcgctgcacccgaatccgagatgataaggactggaaatctgccactaacctt
ccccaactcaaggaactgtaccagttgtccaaggagaaggcagacttcactgtagtggct
ggagatgaggggagctccactacagggggtagcagtgaagagaataagggtccctcaggg
tctgctgtgtcccgcaaggcccccagcaagccctcagccagtaccaagaaagcagaaggg
aagctgagtaactccaacagcaaagatggcaacatgcagactgcaaagccttccgctatg
aaggtgggggagaagctggccacaaagtcttctccagtgaaagtaggggagaagcggaaa
gctgctgatgagacgctgtgccaaacaaaggtattgctggacatcttcactggggtgcgg
ctttacttgccaccctccacaccagacttcagccgtctcagacgctactttgtggcattc
gacggggacctggtacaggaatttgatatgacttcagccacgcacgtgctgggtagcagg
gacaagaaccctgcggcccagcaggtctccccagagtggatttgggcatgtatccggaaa
cggagactggtagctccctgctag

>gi568815581f:34861567_35062682|GENSCAN_predicted_peptide_5|411_aa
MEYYAAIKNDEFMSFVGTWMKLEIIILSKLLQGQKTKHRMFSLIDFIMWATCCNWFCLDG
QPEEVPPPQGARMQAYSNPGYSSFPSPTGLEPSCKSCGAHFANTARKQQTCLDCKKNFCM
TCSSQVGNGPRLCLLCQRFRATAFQREELMKMKVKDLRDYLSLHDISTEMCREKEELVLL
VLGQQPVISQEDRTRASTLSPDFPEQQAFLTQPHSSMVPPTSPNLPSSSAQATSVPPAQV
QENQQANGHVSQDQEEPVYLESVARVPAEDETQSIDSEDSFVPGRRASLSDLTDLEDIEG
LTVRQLKEILARNFVNYKGCCEKWELMERVTRLYKDQKGLQHLVSGAEDQNGGAVPSGLE
ENLCKICMDSPIDCVLLECGHMVTCTKCGKRMNECPICRQYVIRAVHVFRS

>gi568815581f:34861567_35062682|GENSCAN_predicted_CDS_5|1236_bp
atggaatactatgcagccataaaaaatgatgagttcatgtcctttgtagggacatggatg
aaattggaaatcatcattctcagtaaactattgcaaggacaaaaaaccaaacaccgcatg
ttctcgctcatagattttatcatgtgggcaacctgctgcaactggttctgcctggatgga
cagcctgaggaggtcccaccaccccagggagccaggatgcaggcctattccaaccctggg
tacagctccttcccttccccaacaggcttggaaccaagctgcaagtcctgtggggctcac
tttgcaaacacggccaggaagcagcagacctgcttggactgtaagaaaaatttttgcatg
acctgttcgagccaagtagggaatgggccccgcctctgccttctctgccaacggtttcga
gctacagcctttcagcgagaggagctcatgaagatgaaggtgaaggacttgagggactat
ctcagcctccatgacatctctaccgaaatgtgccgggagaaagaagagctggtgctcttg
gtccttggccagcagcctgtaatctcccaggaggacaggactcgtgcctccaccttgtcc
ccagactttcctgagcagcaggccttcctgacccagcctcactccagcatggttccacct
acctcacccaacctcccctcttcatctgcacaagccacctctgttcccccagcccaggtt
caggagaatcagcaggccaatggccatgtgtctcaggatcaagaggaacccgtctacctg
gagagcgtggccagagtacctgctgaggatgagacccagtctattgactcagaggacagc
tttgtcccaggccgaagggcctctctgtctgacctgactgacctggaggacattgaaggc
ctgacagtgcggcagctgaaagagatcttggctcgcaactttgtcaactacaagggctgc
tgtgagaagtgggagctgatggagagagtgacccggctatacaaggatcagaaaggactc
cagcacctggtcagtggtgccgaagaccaaaacgggggagcagtaccatcaggcttggag
gagaacctgtgtaagatctgcatggactcacccattgactgtgttcttctggagtgtggc
cacatggtaacctgtaccaagtgtggcaagcgcatgaatgaatgtcccatctgccggcag
tatgtaatccgagctgtgcatgtcttccggtcctga

>gi568815581f:34861567_35062682|GENSCAN_predicted_peptide_6|89_aa
VFLLYFKYLEVVKTCCPGGVSNQAVNQLAADMKGYLLVEGLGYSGGKRSDSPKQEVKCKQ
DVFVCCNSCACVLKLGAVKKSRLLVFELG

>gi568815581f:34861567_35062682|GENSCAN_predicted_CDS_6|270_bp
gtttttcttttatacttcaaatatctggaagttgtgaaaacctgctgccctggtggagta
tcaaaccaggctgtgaaccagttggcagcagatatgaaaggttacctacttgttgaaggt
ctagggtacagcggtggaaagagatcagatagcccaaaacaggaagtcaagtgcaaacag
gatgtgtttgtctgctgtaactcgtgtgcctgtgttttgaaacttggtgctgttaaaaag
tcaagacttctagtatttgagcttggctga