GENSCAN 1.0    Date run: 7-Nov-116    Time: 01:13:04

Sequence gi568815586r:31515771_31716439 : 200669 bp : 41.71% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   2182   2370  189  0  0  126   38   103 0.851   5.57
 1.02 PlyA +   2740   2745    6                               1.05

 2.03 PlyA -   4022   4017    6                               1.05
 2.02 Term -  12583  12416  168  1  0   66   47   112 0.059   1.80
 2.01 Init -  36228  35878  351  0  0   82   86   184 0.053  14.81
 2.00 Prom -  36708  36669   40                             -11.74

 3.02 PlyA -  37079  37074    6                               1.05
 3.01 Sngl -  37908  37168  741  0  0   49   48   293 0.907  17.35
 3.00 Prom -  38001  37962   40                              -6.15

 4.05 PlyA -  38170  38165    6                               1.05
 4.04 Term -  39240  38512  729  0  0  -15   36   475 0.850  24.48
 4.03 Intr -  39691  39502  190  0  1  -13   47   208 0.268   5.27
 4.02 Intr -  74634  74616   19  1  1   67  117    25 0.003  -1.95
 4.01 Init -  75062  74936  127  2  1   81   90   193 0.208  17.17
 4.00 Prom -  75962  75923   40                              -7.15

 5.02 PlyA -  76332  76327    6                               1.05
 5.01 Sngl -  81639  81337  303  0  0   60   38   280 0.944  15.78
 5.00 Prom -  86315  86276   40                              -5.85

 6.02 PlyA -  88960  88955    6                               1.05
 6.01 Sngl - 100669  99998  672  1  0   68   46   693 0.999  57.02
 6.00 Prom - 105811 105772   40                              -4.65

 7.00 Prom + 108414 108453   40                              -4.05
 7.01 Init + 117108 117196   89  2  2   83   12    84 0.003   0.46
 7.02 Intr + 127246 127300   55  1  1   75   94    70 0.010   4.36
 7.03 Intr + 134877 134933   57  2  0  105   88    21 0.032   1.96
 7.04 Intr + 146241 146419  179  2  2   74   48   118 0.149   4.20
 7.05 Intr + 150317 150447  131  2  2   96   66   143 0.997  12.32
 7.06 Term + 151877 152220  344  0  2   69   44   212 0.533   8.59
 7.07 PlyA + 153295 153300    6                               1.05

 8.07 PlyA - 154591 154586    6                               1.05
 8.06 Term - 156607 156534   74  2  2  114   44    54 0.340   0.69
 8.05 Intr - 173348 173237  112  2  1   91   92    24 0.379   2.13
 8.04 Intr - 181647 181591   57  0  0  100   76    74 0.838   5.56
 8.03 Intr - 182187 181970  218  1  2   86   99   130 0.995  11.20
 8.02 Intr - 186237 186093  145  0  1   45   95    84 0.953   3.73
 8.01 Init - 190233 190114  120  0  0   46   97    55 0.591   2.44
 8.00 Prom - 197249 197210   40                              -5.25

 9.02 PlyA - 197257 197252    6                               1.05
 9.01 Term - 199937 199785  153  2  0   23   54   162 0.605   3.24

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 128156 128085   72  2  0   67   80    89 0.851   7.12

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:31515771_31716439|GENSCAN_predicted_peptide_1|62_aa
VDEKVGAPSLNSALHCWNKKPEFPKVDLRAVSAMGAVQSGEAGKELYKLKLWKSETSHIQ
LL

>gi568815586r:31515771_31716439|GENSCAN_predicted_CDS_1|189_bp
gtggatgagaaagtgggggctccttccctcaactcagccttgcattgctggaataaaaag
cctgagttcccaaaggtagatctgcgggcagtctctgcaatgggagcagtgcaaagtgga
gaggcagggaaagagctttataaactgaagctctggaagtcagaaacctctcacattcag
cttctctga

>gi568815586r:31515771_31716439|GENSCAN_predicted_peptide_2|172_aa
MIVYLENPIVSAQNLLKLISNFSKVSGYKIDVQKSQAFFYTNNRQTESQIMSELPFTIAS
KRIKYLGIQLTRDVKGLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKREA
KGSLTHAGGSDVKTGRVWSDAAVSQRMRIATRTWKKQEMNFPLDRLEGVRPC

>gi568815586r:31515771_31716439|GENSCAN_predicted_CDS_2|519_bp
atgattgtctatctagaaaaccccatcgtctcagcccaaaatctccttaagctgataagc
aacttcagcaaagtctcaggatacaaaatcgatgtgcaaaaatcacaagcattcttttac
accaataacagacaaacagaaagccaaatcatgagtgaactcccattcacaattgcttca
aagagaataaaatacctaggaatccaacttacaagggatgtgaagggcctcttcaaggag
aactacaaaccactgctcaatgaaataaaagaagatacaaacaaatggaagaacattcca
tgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaagagagaggca
aagggaagtttgacacacgcaggaggaagtgatgtgaagacaggcagagtttggagtgat
gcagctgtaagccaaagaatgcgcatagcaaccagaacttggaagaaacaagaaatgaat
tttcccctagaccgtctggagggagttcggccttgctga

>gi568815586r:31515771_31716439|GENSCAN_predicted_peptide_3|246_aa
MGDFNTPLSTLDRSMTQKVNKDTQELNSALHQVDLIDIYRTLHPKSTEYTFFAAPHHTYS
KIDYIVGSKALLNKCKRTEIITNCLSDHSAIKLGLRIKRLTQKHSTTWKLNNLLLNDYWV
NNEMKAEIKMFLETKENKDTTYQNLWDTFKAVCTGKFIALNVHKRKQERSKIDTLTSQLK
ELEKQEQTHSKASRRQEITKIRAELEEIETKKTLQKINESRTWFFEKINKIDRPLARLVK
KKRRIK

>gi568815586r:31515771_31716439|GENSCAN_predicted_CDS_3|741_bp
atgggagactttaacaccccactgtcaacattagacagatcaatgacacagaaagttaac
aaggatacccaggaattgaactcagctctgcaccaagtggacctaatagacatctacaga
actctccaccccaaatcaacagaatatacattctttgcagcaccacaccacacctactcc
aaaattgactacatagttggaagtaaagcactcctcaacaaatgtaaaagaacagaaatt
ataacaaactgtctctcagaccacagtgcaatcaaactaggactcaggattaagagactc
actcaaaaacactcaactacatggaaactgaacaacctgctcctgaatgactactgggta
aataatgaaatgaaggcagaaataaagatgttccttgaaaccaaggagaacaaagacaca
acataccagaatctctgggacacattcaaagcagtgtgtacagggaaatttatagcacta
aatgtccacaagagaaagcaggaaagatctaaaattgacaccctaacatcacaattaaaa
gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaag
atcagagcagaactggaggaaatagagaccaaaaaaacccttcaaaaaattaatgaatcc
aggacctggttttttgaaaagatcaacaaaattgacagaccgctagcaagactagtaaag
aagaaaagaagaatcaaatag

>gi568815586r:31515771_31716439|GENSCAN_predicted_peptide_4|354_aa
MSGSCAAPGPGSGSSPAACRFAHYFVLCGIDADSGLEPDELAAALGGGRGGLTPHTARYS
SETKLPEERSGSSICGSPISAVLQPLLLIPRQTGSGVDLQQTPTDLQLRVLSEEVQTNGK
EVKSFEKKLDEWITRITNAEKSLKDLMELKTKARELRDECRSLSSQCDQLAERVSVMEDE
MNEVKREEKFRAKRIKRSEQSLQEIWDYVKRPNLHLIGVPESDGENGTKLENTLQDIGKL
PQSSKAGQHSNSGNTEDATKILLERATPRHIIVRFTKVEMKEKMLRAAREKGRVTHKGKP
IRLTADLLAEILQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFTDK

>gi568815586r:31515771_31716439|GENSCAN_predicted_CDS_4|1065_bp
atgagcgggagctgcgcggcgcccggcccgggctcgggctcctccccggccgcctgccgc
ttcgcgcactacttcgtgctgtgcgggatcgacgcggacagcgggctggagcctgacgag
ctggcggccgcactcggaggcggccggggtggactgacacctcacacggccaggtactcc
tctgagacaaaacttccagaagaacgatcaggcagcagcatctgcggttcaccaatatcc
gctgttctgcagccactgctgctgatacccaggcaaacagggtctggagtggacctccag
caaactccaacagacctgcagctgagggtcctgtctgaggaagttcaaaccaatggcaaa
gaagttaaaagctttgaaaaaaaattagatgaatggataactagaataaccaatgcagag
aagtccttaaaggacctgatggagctgaaaaccaaggcacgagagctacgtgacgaatgc
agaagcctcagtagccaatgtgatcaactggcagaaagggtatcagtgatggaagacgaa
atgaatgaagtgaagcgagaagagaagtttagagcaaaaagaataaaaagaagtgaacaa
agcctccaagaaatatgggactatgtgaaaagaccaaatctacatctgattggtgtacct
gaaagtgacggggagaatggaaccaagttggaaaacactctgcaggatataggaaaactt
ccccaatctagcaaggcaggccaacattcaaattcaggaaatacagaggacgcgacaaag
atactcctcgagagagcaactccaagacacataattgtcagattcaccaaagttgaaatg
aaggaaaaaatgttaagggcagccagagagaaaggtcgggttacccacaaagggaagccc
atcagactaacagcagatctcttggcagaaattctacaagccagaagagagtgggggcca
atattcaacattcttaaagaaaagaattttcaacccagaatatcatatccagccaaacta
agcttcataagtgaaggagaaataaaatactttacagacaagtaa

>gi568815586r:31515771_31716439|GENSCAN_predicted_peptide_5|100_aa
MPPAKKGGEKNYSSAISELVTREYTINIHKRIHGVGFKKSAPRALSEIWKFAIKDMGTTD
VHTDTRLNKAVLTKGLRNVPYGIPMLSKIYRQSMWMRTNH

>gi568815586r:31515771_31716439|GENSCAN_predicted_CDS_5|303_bp
atgcctcccgcaaagaagggtggtgagaagaattacagttctgccatcagtgagctggtg
accagagaatataccatcaacattcacaagcgcattcatggagttggtttcaagaagagt
gcccctcgggcactcagcgagatctggaaatttgccatcaaggacatgggaactacggat
gtgcacactgataccaggcttaacaaagctgtcttgaccaaaggattaaggaatgtccca
tatggcatccctatgctttcaaaaatctacagacagtcaatgtggatgagaactaaccac
tga

>gi568815586r:31515771_31716439|GENSCAN_predicted_peptide_6|223_aa
MASKLLRAVILGPPGSGKGTVCQRIAQNFGLQHLSSGHFLRENIKASTEVGEVAKQYIEK
SLLVPDHVITRLMMSELENRRGQHWLLDGFPRTLGQAEALDKICEVDLVISLNIPFETLK
DRLSRRWIHPPSGRVYNLDFNPPHVHGIDDVTGEPLVQQEDDKPEAVAARLRQYKDAAKP
VIELYKSRGVLHQFSGTETNKIWPYVYTLFSNKITPIQSKEAY

>gi568815586r:31515771_31716439|GENSCAN_predicted_CDS_6|672_bp
atggcttccaaactcctgcgcgcggtcatcctcgggccgcccggctcgggcaagggcacc
gtgtgccagaggatcgcccagaactttggtctccagcatctctccagcggccacttcttg
cgggagaacatcaaggccagcaccgaagttggtgaggtggcaaagcagtatatagagaaa
agtcttttggttccagaccatgtgatcacacgcctaatgatgtccgagttggagaatagg
cgtggccagcactggctccttgatggttttcctaggacattaggacaagccgaggccctg
gacaaaatctgtgaagtggatctagtgatcagtttgaatattccatttgaaacacttaaa
gatcgtctcagccgccgttggattcaccctcctagcggaagggtatataacctggacttc
aatccacctcatgtacatggtattgatgacgtcactggtgaaccattagtccagcaggag
gatgataaacccgaagcagttgctgccaggctaagacagtacaaagacgcggcaaagcca
gtcattgaattatacaagagccgaggagtgctccaccaattttccggaacggagacgaac
aaaatctggccctacgtttacacacttttctcaaacaagatcacacctattcagtccaaa
gaagcatattga

>gi568815586r:31515771_31716439|GENSCAN_predicted_peptide_7|284_aa
MDGVGSCYPQQTNTGTENQTPYILDYKRELQIHRSTERLRHLIKLTELNHSGFSVEKRPK
EARRDKEALRSSGLLLFPCGQCPWRGAGSFLDPEIKAFLEENTEVTSSGSLTPEIQLRLL
TPRCKFWYLLDNPDVVRGKSVLDLGSGCGATAIAAKMSGASRILANDIDPIAGMAITLNC
ELNRLNPFPILIQNILNLEQDKWDLVVLGDMFYDEDLADSLHQWLKKCFWTYRTRVLIGD
PGRPQFSGHSIQHHLHKVVEYSLLESTRQENSGLTTSTVWGFQP

>gi568815586r:31515771_31716439|GENSCAN_predicted_CDS_7|855_bp
atggatggagttggaagctgttatcctcagcaaactaacacaggaacagaaaaccaaaca
ccgtatattctcgattataagcgagagctacaaattcaccgaagcacagagaggttacgg
cacttaatcaagctcacagagctaaatcactctggcttttctgttgagaagagaccaaag
gaagcaaggagggataaggaggctctgcgaagcagtggtcttctcttgtttccctgtggc
cagtgtccctggagaggagctggaagctttttggaccctgagataaaggctttcctggag
gagaacactgaagtcaccagcagtggtagcctcacccctgaaatccagttgcggcttttg
acccccagatgcaaattctggtatcttttggataatcctgatgttgtcagaggaaaatct
gtattagatcttgggagtggatgtggagctacagctattgctgctaagatgagtggggca
tcaaggatcttggccaatgacatagaccctattgcaggaatggctattacactaaattgt
gaattgaacagactgaatccttttcctattttaatccaaaacattttgaatttggaacaa
gataagtgggaccttgttgttcttggcgatatgttttatgatgaagaccttgcagatagt
cttcatcagtggctgaagaagtgcttctggacctatagaactcgagtactgattggtgac
cctgggcggccccagttcagtggacacagcattcagcatcacctgcacaaagtggtagaa
tattcacttttggagtctactaggcaggaaaacagtggactgacaacaagcacagtgtgg
ggttttcagccttga

>gi568815586r:31515771_31716439|GENSCAN_predicted_peptide_8|241_aa
MKIHCKGATGRTSKRGPNCQETKACWGFYRMELGLIDSTKILHPEVQTLDLRSCDISDAA
LLHLSNCRKLKKLNLNASKGNRVSVTSEGIKAVASSCSYLHEASLKRCCNLTDEGVVALA
LNCQLLKIIDLGGCLSITDVSLHALGKNCPFLQCVDFSATQVSDSGVIALVSGPCAKKLE
EIHMGHCVNLTDGAVEAVLTYCPQIRILLFHGCPLITDHSREVLEQLVGPNKLKQVTWTV
Y

>gi568815586r:31515771_31716439|GENSCAN_predicted_CDS_8|726_bp
atgaaaatacattgcaagggagcaacgggcagaaccagcaagagaggacccaactgccag
gagacaaaggcttgctggggattttatagaatggaacttgggctgattgacagcaccaag
attttacatcctgaagtccaaactctagatctacggagctgcgatatatcagatgctgct
ctcctgcacctgtctaactgtagaaaactgaagaaattaaatttaaatgcttcaaaaggg
aaccgagtttctgtaacttcagaaggaataaaagctgtggcttcatcttgttcataccta
cacgaagcttctttgaaaagatgctgcaatctcactgacgaaggagtcgttgctcttgca
ctcaattgccagctgctaaagatcatcgatttaggtggctgcttaagtattactgatgtg
tccttacatgcattaggaaaaaactgcccatttttgcagtgtgtcgacttttcagctact
caggtatctgacagtggtgtgattgcacttgttagtggaccttgtgcgaagaaattagag
gagattcatatgggacattgtgtaaatctgactgatggggctgtcgaagctgtccttact
tactgtcctcaaatacgtatattactcttccatggatgccccttgataacagatcattcc
cgagaagtgttggagcaattagtaggcccaaacaaactaaagcaagtgacatggactgtt
tattga

>gi568815586r:31515771_31716439|GENSCAN_predicted_peptide_9|50_aa
IPSKESIPEFLISPPKKKDLSLEEIQNKSEASEERHKPCEAEALKQLAEK

>gi568815586r:31515771_31716439|GENSCAN_predicted_CDS_9|153_bp
ataccatcaaaagaatccatcccagagttccttatttcccctccaaagaagaaagatctt
tccctggaggaaattcagaataaatcagaagcttcagaagaaagacacaagccctgcgaa
gctgaggccttgaagcagcttgctgaaaaatga