GENSCAN 1.0    Date run: 8-Nov-116    Time: 13:36:38

Sequence gi568815581r:34828051_35061393 : 233343 bp : 44.52% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  10808  10932  125  1  2   81   76    51 0.030   2.84
 1.02 Intr +  25248  25405  158  0  2   56   51    91 0.012   1.85
 1.03 Intr +  30122  30249  128  0  2   46   53    69 0.249  -0.40
 1.04 Intr +  36057  36166  110  2  2   72   52    78 0.130   1.68
 1.05 Intr +  60986  61157  172  2  1   78  119    80 0.886  10.05
 1.06 Intr +  61616  61635   20  1  2   82   78    36 0.524  -2.59
 1.07 Intr +  62105  62235  131  2  2   66   56    78 0.541   2.74
 1.08 Intr +  62787  62816   30  1  0   71   93    46 0.446   1.50
 1.09 Term +  68780  68892  113  0  2   83   44    70 0.500   0.82
 1.10 PlyA +  70571  70576    6                               1.05

 2.12 PlyA -  71527  71522    6                               1.05
 2.11 Term -  72415  72407    9  1  0  110   41     0 0.109  -4.31
 2.10 Intr -  74452  74280  173  2  2   41  102    80 0.030   4.36
 2.09 Intr - 103001 102899  103  2  1   67  110    87 0.689   8.55
 2.08 Intr - 104450 104317  134  0  2   85   82    96 0.866   9.06
 2.07 Intr - 111280 111133  148  1  1   61  103   128 0.999  11.41
 2.06 Intr - 114593 114434  160  1  1   74  100    96 0.953   9.39
 2.05 Intr - 124003 123900  104  1  2  135   33    60 0.680   4.17
 2.04 Intr - 126549 126376  174  0  0   64   87    91 0.989   6.74
 2.03 Intr - 130644 130510  135  0  0   90   52    64 0.900   3.76
 2.02 Intr - 131600 131537   64  1  1   68   89   110 0.956   7.82
 2.01 Init - 132043 132033   11  1  2   87   33    -6 0.011  -6.21
 2.00 Prom - 132664 132625   40                              -7.56

 3.00 Prom + 133267 133306   40                              -8.26
 3.01 Sngl + 133469 134635 1167  1  0   67   46  1159 0.453 103.71
 3.02 PlyA + 135282 135287    6                               1.05

 4.00 Prom + 150880 150919   40                              -2.16
 4.01 Init + 155217 155502  286  2  1   62  110   287 0.796  25.54
 4.02 Intr + 157938 158081  144  1  0   59  110    78 0.978   7.35
 4.03 Intr + 161377 161613  237  2  0  109   48   231 0.964  18.79
 4.04 Intr + 162913 163064  152  2  2   77   71   149 0.698  11.98
 4.05 Intr + 163705 163787   83  0  2   77   75   171 0.619  13.14
 4.06 Intr + 163908 163985   78  0  0   91   75    56 0.904   3.27
 4.07 Intr + 164474 164642  169  2  1   97   73   302 0.999  29.55
 4.08 Intr + 166226 166381  156  1  0   43   86   173 0.966  12.81
 4.09 Intr + 168014 168145  132  1  0  123   74    70 0.989   9.94
 4.10 Intr + 168524 168603   80  1  2  101  109    54 0.999   7.15
 4.11 Intr + 169688 169775   88  2  1  120   98    87 0.999  12.87
 4.12 Intr + 170169 170246   78  2  0   97   98    51 0.990   6.75
 4.13 Intr + 170554 170677  124  0  1  114   77   184 0.994  20.06
 4.14 Intr + 171257 171399  143  0  2   88  100   204 0.999  21.67
 4.15 Intr + 171732 171806   75  2  0   69   64    91 0.958   4.51
 4.16 Intr + 173207 173353  147  1  0   89   64   158 0.999  13.93
 4.17 Intr + 173859 174054  196  2  1  117   91   166 0.990  18.79
 4.18 Intr + 174618 174739  122  1  2  100  100   144 0.887  17.01
 4.19 Term + 176223 176456  234  2  0  134   39   201 0.682  16.12
 4.20 PlyA + 176736 176741    6                               1.05

 5.08 PlyA - 178083 178078    6                               1.05
 5.07 Term - 184099 183918  182  2  2  100   54   213 0.998  16.87
 5.06 Intr - 186713 186690   24  0  0  102   92     6 0.568   0.40
 5.05 Intr - 188530 188320  211  1  1   84   68   268 0.998  22.89
 5.04 Intr - 189556 189473   84  1  0  124   75   116 0.999  13.92
 5.03 Intr - 193734 193321  414  0  0   43  110   127 0.310   4.70
 5.02 Intr - 198511 198324  188  2  2   89  113    78 0.940   9.81
 5.01 Init - 202002 201870  133  0  1   78   47    73 0.697   2.50
 5.00 Prom - 202126 202087   40                              -2.46

 6.03 PlyA - 204160 204155    6                               1.05
 6.02 Term - 223421 223225  197  0  2   42   49   118 0.358   0.87
 6.01 Intr - 232147 232075   73  1  1   58  105    33 0.128   0.98

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  49312  49494  183  0  0   76   53   127 0.923   5.54
S.002 Init - 133343 133207  137  2  2   52   45   209 0.910  10.92

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581r:34828051_35061393|GENSCAN_predicted_peptide_1|328_aa
MDVRHGAVAAILCHEDKSPRLRVAERKPRKTLSPRCTLELLSEKLKTKFNEPDIYKKSEG
LVTKVENEIRGVLWHPGGNFRLVVKMGPVKEDSWVLIDQHLKDKCICLQDWPSKDSGSGR
GAFALYDHLLGQKWGFLKGLKDTFKKQQKNKVKLELNKVPTLEEQQNSTERRSWSAQVKA
EDKTARPMLDASWISYSVLKTAVDDEHYSPSLQIPILRLRGLKDMPRFAQLSHNPNQRAH
YSLISAHDDVPPFIIPWDILMAPETELLDNRAESYSAWHAQGFKLPVDTIKALIPFPINL
ECLLHLSSSNSAALSAGCTAELPAEIPI
>gi568815581r:34828051_35061393|GENSCAN_predicted_CDS_1|987_bp
atggatgtaagacatggagcagtagcagccatcttatgccacgaggataaaagccccagg
ttaagggtggcagaaaggaaacccaggaagactctgagccccagatgcacccttgagctg
ctaagtgagaagcttaaaaccaagtttaatgaaccagatatatataagaagtccgaaggg
ctggtgaccaaggtagagaatgagatacggggtgttctctggcacccaggggggaatttt
cgcctggtggtgaaaatgggtccagtgaaagaagattcatgggtcctcatagaccagcat
ttgaaggacaaatgcatctgcctgcaggactggcccagcaaggactctggttctggaaga
ggtgcctttgccctgtatgaccacctgcttgggcagaagtgggggtttctgaaaggactg
aaggacacgttcaagaagcagcagaaaaacaaggttaaattagaattaaacaaagtgcca
actttagaggagcaacagaactccacagagaggcgcagctggtcagctcaagtcaaagct
gaagacaagactgccagacctatgctagatgcctcatggatatcttattccgtccttaaa
acagctgtggatgatgagcattactcccccagtttgcagattccaatactgaggctcaga
ggcttaaaggacatgcccaggtttgcacagctgagccacaaccccaaccagagggcccat
tattccctcatcagtgcccacgacgatgtacccccttttatcattccctgggatatactc
atggccccagagactgagctcctggacaacagagcagagtcttactcagcttggcatgca
cagggattcaaactaccagtggacaccatcaaagctcttatccccttccccatcaacctg
gaatgcctgctccacctttcttcctctaattcagcagcgctcagtgctggctgcacagca
gaattaccagcagagattccaatttga
>gi568815581r:34828051_35061393|GENSCAN_predicted_peptide_2|404_aa
MERRLVSGAGDIKLTKDGNVLLDEMQIQHPTASLIAKVATAQDDVTGDGTTSNVLIIGEL
LKQADLYISEGLHPRIIAEGFEAAKIKALEVLEEVKVTKEMKRKILLDVARTSLQTKVHA
ELADVLTEVVVDSVLAVRRPGYPIDLFMVEIMEMKHKLGTDTKEVNSGFFYKTAEEKEKL
VKAERKFIEDRVQKIIDLKDKVCAQSNKGFVVINQKGEEKFTFIEECVNPCSVTLLVKGP
NKHTLTQVKDAIRDGLRAIKNAIEDGCMVPGAGAIEVAMAEALVTYKNSIKGRARLGVQA
FADALLIIPKVLAQNAGYDPQETLVKVQAEHVESKQLVGVDLNTGSETKFCPYTQEVVPL
RPIHIDRDLQEADAHKPEGYSVVYTEKCQVDGMPTLPGGTTKVS
>gi568815581r:34828051_35061393|GENSCAN_predicted_CDS_2|1215_bp
atggagagaaggcttgtttctggtgcaggtgacatcaaactcaccaaagatggcaatgtg
ctgctcgatgagatgcaaattcaacatccaacagcttccttgatagcaaaagtagcaaca
gctcaggatgacgtcacaggagatggtactacttcaaatgttctaattattggagagtta
ttaaaacaagctgacctgtacatttctgagggcctgcaccctagaataatagctgaagga
tttgaagctgcaaagataaaagcacttgaagttttggaggaagttaaagtgacaaaggag
atgaaaagaaaaatcctcttagatgtagctagaacatcattacaaactaaagttcatgct
gaactggctgatgtcttaacagaggttgtggtggattctgttttggctgttagaagacca
ggttaccctattgatctcttcatggtagaaataatggagatgaagcataaattaggaaca
gatacaaaagaggtgaactctggtttcttttataagactgcagaagagaaagagaaattg
gtaaaagctgaaagaaaatttattgaagatagagtacaaaaaataatagacctgaaggac
aaagtctgtgctcagtcaaataaaggatttgtcgtcattaatcaaaagggtgaagaaaag
ttcacttttattgaggagtgtgttaacccttgctctgttaccttgttggttaaaggacca
aataagcatactctcacacaagtcaaggatgccataagagatggacttcgtgctatcaaa
aatgccattgaagatggttgtatggttcctggagctggtgcaattgaagtggcaatggct
gaagctcttgttacatataagaacagtataaaaggaagagctcgtcttggagtccaagct
tttgctgatgccttactcattattcccaaggttcttgctcagaatgctggttatgaccca
caggaaacattagtaaaagttcaggctgagcatgtcgagtcaaaacaacttgtgggcgta
gatttgaatacaggctcagaaaccaaattttgtccttacactcaagaggtcgtgcctttg
aggccaattcacattgacagagacctacaagaagcagatgcacacaagccagaaggctac
tctgtggtttacacagagaaatgccaggtggatggaatgcccaccttgccaggtggcaca
acaaaggtaagttaa
>gi568815581r:34828051_35061393|GENSCAN_predicted_peptide_3|388_aa
MPRPTETRIVLGLVAKMASSASARTPAGKRVINQEELRRLMKEKQRLSTSRKRIESPFAK
YNRLGQLSCALCNTPVKSELLWQTHVLGKQHREKVAELKGAKEASQGSSASSAPHSVKRK
APDADDQDVKRAKATLVPQVQPSTSAWTTNFDKIGKEFIRATPSKPSGLSLLPDYEDEEE
EEEEEEGDGERKRGDASKPLSDAQGKEHSVSSSREVTSSVLPNDFFSTNPPKAPIIPHSG
SIEKAEIHEKVVERRENTAEALPEGFFDDPEVDARVRKVDAPKDQMDKEWDEFQKAMRQV
NTISEAIVAEEDEEGRLDRQIGEIDEQIECYRRVEKLRNRQDEIKNKLKEILTIKELQKK
EEENADSDDEGELQDLLSQDWRVKGALL
>gi568815581r:34828051_35061393|GENSCAN_predicted_CDS_3|1167_bp
atgcctcgcccgacggaaaccagaatcgttttgggtctggtcgccaagatggcgtcctcc
gcctccgcccggactccggcagggaagcgagtgataaatcaggaagaattgcggcggtta
atgaaggagaagcagcgtctgagcaccagtcggaaacggatagaatctccattcgcgaag
tacaaccgtttggggcagctgagttgtgccctgtgtaacactccggttaagagcgagctc
ctgtggcagactcacgtcctgggaaagcagcaccgagagaaagtggccgagctgaaaggc
gcgaaggaagccagccagggttcgtccgccagttcagcgcctcattccgtcaagaggaaa
gcgccggacgcagacgaccaagatgtcaagagagcgaaggccaccttggtgcctcaggta
cagccctccacatctgcgtggaccaccaactttgacaaaataggaaaggagttcattaga
gcgactcccagtaagccttcaggactcagtttactccccgattatgaagatgaggaggag
gaggaagaggaggaggaaggagatggagaaagaaaaaggggggacgccagcaagccgctc
tccgacgcacagggcaaggagcactcagtttcctcttcacgggaggtaacaagtagtgtg
ctgccaaacgatttctttagtactaatcctcccaaggcccccataattcctcattcaggg
tcaattgagaaagcagaaatacatgaaaaagtggtggaaaggagagaaaacaccgcggaa
gcgttaccggaaggtttttttgacgaccctgaggtagatgcaagagtacgaaaggttgat
gctccaaaagatcagatggacaaagagtgggacgaattccaaaaagccatgaggcaggtc
aacactatttccgaagccatagttgccgaagaggatgaggagggacggttggaccgccag
attggggagatcgatgagcagatagagtgttaccgacgggtggaaaagctacggaatcgc
caggatgaaataaaaaataaacttaaagaaatcctgaccataaaagaactgcagaaaaag
gaagaagagaatgctgacagcgatgatgagggggaactacaggatttgttgtctcaggat
tggagggtgaaaggggcattgttatag
>gi568815581r:34828051_35061393|GENSCAN_predicted_peptide_4|907_aa
MAEQRFCVDYAKRGTAGCKKCKEKIVKGVCRIGKVVPNPFSESGGDMKEWYHIKCMFEKL
ERARATTKKIEDLTELEGWEELEDNEKEQITQHIADLSSKAAGTPKKKAVVQAKLTTTGQ
VTSPVKGASFVTSTNPRKFSGFSERQNEFISVVSPTAKPNNSGEAPSSPTPKRSLSSSKC
DPRHKDCLLREFRKLCAMVADNPSYNTKTQIIQDFLRKGSAGDGFHGDVYLTVKLLLPGV
IKTVYNLNDKQIVKLFSRIFNCNPDDMARDLEQEVDEFLLRLSKLTKEDEQQQALQDIAS
RCTANDLKCIIRLIKHDLKMNSGAKHVLDALDPNAYEAFKASRNLQDVVERVLHNAQEVE
KEPGQRRALSVQASLMTPVQPMLAEACKSVEYAMKKCPNGMFSEIKYDGERVQVHKNGDH
FSYFSRSLKPVLPHKVAHFKDYIPQAFPGGHSMILDSEVLLIDNKTGKPLPFGTLGVHKK
AAFQDANVCLFVFDCIYFNDVSLMDRPLCERRKFLHDNMVEIPNRIMFSEMKRVTKALDL
ADMITRVIQEGLEGLVLKDVKGTYEPGKRHWLKVKKDYLNEGAMADTADLVVLGAFYGQG
SKGGMMSIFLMGCYDPGSQKWCTVTKCAGGHDDATLARLQNELDMVKISKDPSKIPSWLK
VNKIYYPDFIVPDPKKAAVWEITGAEFSKSEAHTADGISIRFPRCTRIRDDKDWKSATNL
PQLKELYQLSKEKADFTVVAGDEGSSTTGGSSEENKGPSGSAVSRKAPSKPSASTKKAEG
KLSNSNSKDGNMQTAKPSAMKVGEKLATKSSPVKVGEKRKAADETLCQTKVLLDIFTGVR
LYLPPSTPDFSRLRRYFVAFDGDLVQEFDMTSATHVLGSRDKNPAAQQVSPEWIWACIRK
RRLVAPC
>gi568815581r:34828051_35061393|GENSCAN_predicted_CDS_4|2724_bp
atggctgagcaacggttctgtgtggactatgccaagcgtggcacagctggctgcaaaaaa
tgcaaggaaaagattgtgaagggcgtatgccgaattggcaaagtggtgcccaatcccttc
tcagagtctgggggtgatatgaaagagtggtaccacattaaatgcatgtttgagaaacta
gagcgggcccgggccaccacaaaaaaaatcgaggacctcacagagctggaaggctgggaa
gagctggaagataatgagaaggaacagataacccagcacattgcagatctgtcttctaag
gcagcaggtacaccaaagaagaaagctgttgtccaggctaagttgacaaccactggccag
gtgacttctccagtgaaaggcgcctcatttgtcaccagtaccaatccccggaaattttct
ggcttttcagaaaggcagaatgaattcatctctgttgtttctccaacagccaagcccaac
aactctggggaagccccctcgagccccacccctaagagaagtctgtcttcaagcaaatgt
gaccccaggcataaggactgtctgctacgggagtttcgaaagttatgcgccatggtggcc
gataatcctagctacaacacgaagacccagatcatccaggacttccttcggaaaggctca
gcaggagatggtttccacggtgatgtgtacctaacagtgaagctgctgctgccaggagtc
attaagactgtttacaacttgaacgataagcagattgtgaagcttttcagtcgcattttt
aactgcaacccagatgatatggcacgggacctagagcaggaagtggatgagttccttctg
cggctgtccaagctcaccaaggaggatgagcagcaacaggccctacaggacattgcctcc
aggtgtacagccaatgaccttaaatgcatcatcaggttgatcaaacatgatctgaagatg
aactcaggtgcaaaacatgtgttagacgcccttgaccccaatgcctatgaagccttcaaa
gcctcgcgcaacctgcaggatgtggtggagcgggtccttcacaacgcgcaggaggtggag
aaggagccgggccagagacgagctctgagcgtccaggcctcgctgatgacacctgtgcag
cccatgttggcggaggcctgcaagtccgttgagtatgcaatgaagaaatgtcccaatggc
atgttctctgagatcaagtacgatggagagcgagtccaggtgcataagaatggagaccac
ttcagctacttcagccgcagtctcaagcccgtccttcctcacaaggtggcccactttaag
gactacattccccaggcttttcctgggggccacagcatgatcttggattctgaagtgctt
ctgattgacaacaagacaggcaaaccactgccctttgggactctgggagtacacaagaaa
gcagccttccaggatgctaatgtctgcctgtttgtttttgattgtatctactttaatgat
gtcagcttgatggacagacctctgtgtgagcggcggaagtttcttcatgacaacatggtt
gaaattccaaaccggatcatgttctcagaaatgaagcgagtcacaaaagctttggacttg
gctgacatgataacccgggtgatccaggagggattggaggggctggtgctgaaggatgtg
aagggtacatatgagcctgggaagcggcactggctgaaagtgaagaaagactatttgaac
gagggggccatggccgacacagctgacctggtggtccttggagccttctatgggcaaggg
agcaaaggcggcatgatgtcaatcttcctcatgggctgctacgaccctggcagccagaag
tggtgcacagtcaccaagtgtgcaggaggccatgatgatgccacgcttgcccgcctgcag
aatgaactagacatggtgaagatcagcaaggaccccagcaaaatacccagctggttgaag
gtcaacaagatctactatcctgacttcatcgtcccagacccaaagaaagctgccgtgtgg
gagatcacaggggctgaattctccaaatcggaggctcatacagctgacgggatctccatc
cgattccctcgctgcacccgaatccgagatgataaggactggaaatctgccactaacctt
ccccaactcaaggaactgtaccagttgtccaaggagaaggcagacttcactgtagtggct
ggagatgaggggagctccactacagggggtagcagtgaagagaataagggtccctcaggg
tctgctgtgtcccgcaaggcccccagcaagccctcagccagtaccaagaaagcagaaggg
aagctgagtaactccaacagcaaagatggcaacatgcagactgcaaagccttccgctatg
aaggtgggggagaagctggccacaaagtcttctccagtgaaagtaggggagaagcggaaa
gctgctgatgagacgctgtgccaaacaaaggtattgctggacatcttcactggggtgcgg
ctttacttgccaccctccacaccagacttcagccgtctcagacgctactttgtggcattc
gacggggacctggtacaggaatttgatatgacttcagccacgcacgtgctgggtagcagg
gacaagaaccctgcggcccagcaggtctccccagagtggatttgggcatgtatccggaaa
cggagactggtagctccctgctag
>gi568815581r:34828051_35061393|GENSCAN_predicted_peptide_5|411_aa
MEYYAAIKNDEFMSFVGTWMKLEIIILSKLLQGQKTKHRMFSLIDFIMWATCCNWFCLDG
QPEEVPPPQGARMQAYSNPGYSSFPSPTGLEPSCKSCGAHFANTARKQQTCLDCKKNFCM
TCSSQVGNGPRLCLLCQRFRATAFQREELMKMKVKDLRDYLSLHDISTEMCREKEELVLL
VLGQQPVISQEDRTRASTLSPDFPEQQAFLTQPHSSMVPPTSPNLPSSSAQATSVPPAQV
QENQQANGHVSQDQEEPVYLESVARVPAEDETQSIDSEDSFVPGRRASLSDLTDLEDIEG
LTVRQLKEILARNFVNYKGCCEKWELMERVTRLYKDQKGLQHLVSGAEDQNGGAVPSGLE
ENLCKICMDSPIDCVLLECGHMVTCTKCGKRMNECPICRQYVIRAVHVFRS
>gi568815581r:34828051_35061393|GENSCAN_predicted_CDS_5|1236_bp
atggaatactatgcagccataaaaaatgatgagttcatgtcctttgtagggacatggatg
aaattggaaatcatcattctcagtaaactattgcaaggacaaaaaaccaaacaccgcatg
ttctcgctcatagattttatcatgtgggcaacctgctgcaactggttctgcctggatgga
cagcctgaggaggtcccaccaccccagggagccaggatgcaggcctattccaaccctggg
tacagctccttcccttccccaacaggcttggaaccaagctgcaagtcctgtggggctcac
tttgcaaacacggccaggaagcagcagacctgcttggactgtaagaaaaatttttgcatg
acctgttcgagccaagtagggaatgggccccgcctctgccttctctgccaacggtttcga
gctacagcctttcagcgagaggagctcatgaagatgaaggtgaaggacttgagggactat
ctcagcctccatgacatctctaccgaaatgtgccgggagaaagaagagctggtgctcttg
gtccttggccagcagcctgtaatctcccaggaggacaggactcgtgcctccaccttgtcc
ccagactttcctgagcagcaggccttcctgacccagcctcactccagcatggttccacct
acctcacccaacctcccctcttcatctgcacaagccacctctgttcccccagcccaggtt
caggagaatcagcaggccaatggccatgtgtctcaggatcaagaggaacccgtctacctg
gagagcgtggccagagtacctgctgaggatgagacccagtctattgactcagaggacagc
tttgtcccaggccgaagggcctctctgtctgacctgactgacctggaggacattgaaggc
ctgacagtgcggcagctgaaagagatcttggctcgcaactttgtcaactacaagggctgc
tgtgagaagtgggagctgatggagagagtgacccggctatacaaggatcagaaaggactc
cagcacctggtcagtggtgccgaagaccaaaacgggggagcagtaccatcaggcttggag
gagaacctgtgtaagatctgcatggactcacccattgactgtgttcttctggagtgtggc
cacatggtaacctgtaccaagtgtggcaagcgcatgaatgaatgtcccatctgccggcag
tatgtaatccgagctgtgcatgtcttccggtcctga
>gi568815581r:34828051_35061393|GENSCAN_predicted_peptide_6|89_aa
VFLLYFKYLEVVKTCCPGGVSNQAVNQLAADMKGYLLVEGLGYSGGKRSDSPKQEVKCKQ
DVFVCCNSCACVLKLGAVKKSRLLVFELG
>gi568815581r:34828051_35061393|GENSCAN_predicted_CDS_6|270_bp
gtttttcttttatacttcaaatatctggaagttgtgaaaacctgctgccctggtggagta
tcaaaccaggctgtgaaccagttggcagcagatatgaaaggttacctacttgttgaaggt
ctagggtacagcggtggaaagagatcagatagcccaaaacaggaagtcaagtgcaaacag
gatgtgtttgtctgctgtaactcgtgtgcctgtgttttgaaacttggtgctgttaaaaag
tcaagacttctagtatttgagcttggctga