GENSCAN 1.0	Date run:  3-Nov-116	Time: 21:27:08

Sequence gi568815575f:150492615_150771592 : 278978 bp : 43.14% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4157   4252   96  1  0   75   78    48 0.336   2.81
 1.02 Intr +  10741  10903  163  0  1   69  111    89 0.834   8.85
 1.03 Intr +  15237  15400  164  1  2   79   18   127 0.727   4.49
 1.04 Intr +  17348  17432   85  1  1   78  121    50 0.883   6.69
 1.05 Term +  19390  20459 1070  2  2   79   35   562 0.930  42.29
 1.06 PlyA +  20710  20715    6                               1.05

 2.07 PlyA -  20771  20766    6                               1.05
 2.06 Term -  26275  26169  107  2  2   37   45   103 0.101  -0.53
 2.05 Intr -  32948  32804  145  2  1   62   34    79 0.020  -0.24
 2.04 Intr -  34286  34182  105  2  0   84  101     7 0.038   2.01
 2.03 Intr -  41329  41265   65  2  2   87   82    29 0.030   0.64
 2.02 Intr -  52085  51979  107  1  2   70   97    46 0.147   3.56
 2.01 Init -  56237  56185   53  2  2   73   89    44 0.197   3.63
 2.00 Prom -  84507  84468   40                              -2.06

 3.00 Prom +  94854  94893   40                              -2.06
 3.01 Init +  98846  98986  141  1  0   69   93    55 0.523   3.01
 3.02 Intr + 103884 103956   73  2  1  117   95    41 0.561   6.68
 3.03 Intr + 112630 112708   79  2  1   53   57    61 0.012  -1.89
 3.04 Intr + 118672 118696   25  1  1  102   80    19 0.031   0.53
 3.05 Intr + 126424 126525  102  0  0  123   59    25 0.015   3.47
 3.06 Intr + 146329 146412   84  0  0   89  110    42 0.988   6.52
 3.07 Intr + 148655 148804  150  1  0  139   81    74 0.999  12.16
 3.08 Intr + 153069 153257  189  2  0   70  113   109 0.959  11.38
 3.09 Intr + 165207 165413  207  2  0   93   65    40 0.360   1.47
 3.10 Intr + 167757 167870  114  2  0   71  109    71 0.977   8.14
 3.11 Intr + 170819 170995  177  1  0  105  108    25 0.825   6.32
 3.12 Term + 178817 178981  165  1  0   60   55   140 0.536   5.82
 3.13 PlyA + 179013 179018    6                               1.05

 4.00 Prom + 189223 189262   40                              -3.76
 4.01 Init + 200917 201062  146  0  2  101  115   193 0.999  20.89
 4.02 Intr + 206581 206686  106  1  1   49  121    26 0.223   2.22
 4.03 Intr + 226011 226086   76  2  1   91  111    22 0.518   3.89
 4.04 Intr + 234601 234695   95  2  2   71   96    95 0.766   8.28
 4.05 Intr + 237495 237596  102  2  0   74   99    38 0.931   3.87
 4.06 Intr + 237911 237994   84  1  0   65  103    29 0.813   2.12
 4.07 Intr + 238856 239005  150  1  0   78   95   112 0.999  11.26
 4.08 Intr + 239928 240116  189  2  0   29   71   200 0.991  12.18
 4.09 Intr + 243981 244166  186  2  0   93   65   101 0.994   8.19
 4.10 Intr + 244628 244834  207  1  0   57  119   120 0.998  11.27
 4.11 Intr + 251747 251839   93  1  0   87   94    60 0.880   6.66
 4.12 Intr + 263075 263251  177  1  0  105   70     0 0.162   0.02
 4.13 Term + 269951 270115  165  1  0  118   45   178 0.827  14.42
 4.14 PlyA + 272378 272383    6                               1.05

 5.04 PlyA - 273742 273737    6                               1.05
 5.03 Term - 276487 276420   68  2  2  103   55   109 0.997   7.20
 5.02 Intr - 277755 277690   66  1  0  123  100   118 0.962  15.38
 5.01 Init - 278014 277891  124  1  1   69   23   143 0.759   6.23

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575f:150492615_150771592|GENSCAN_predicted_peptide_1|525_aa
MPHSLNLRSGKYTADSLYAKHYGISTGMCHSEALGSESFLPGSSFAHELARVTSSYSTSE
AAPWGSWDPKAWRQVPAPLLPSCDATVTLSSLHLPLPLNPYLCTTVLFDETNAATGGDGV
KDRLDLASHAGICPQKQRKHLQEEQRSGLMAMTPERQNAYISQQMSPFEAVQEQVTSKCS
RIKASPPSSKHLMPPRTGLLQNNLSPGMIPLTRHQSCEGMGVISPTLGKRQGIFTSSPQC
PILSHSGQTPLGRLDSVCQHMQSPKATPPEVPLPGFCPSSLGTQSLSPHQLRRPSVPRMP
TAFNNAAWVTAAAAVTTAVSGKTPLSQVDNSVQQHSPSGQACLQRPSDWEAQVPAAMGTQ
VPLANNPSFSLLGSQSLRQSPVQGPVPVANTTKFLQQGMASFSPLSPIQGIEPPSYVAAA
ATAAAASAVAASQFPGPFDRTDIPPELPPADFLRQPQPPLNDLISSPDCNEVDFIEALLK
GSCVSPDEDWVCNLRLIDDILEQHAAAQNATAQNSGQVTQDAGAL

>gi568815575f:150492615_150771592|GENSCAN_predicted_CDS_1|1578_bp
atgccccacagcctcaacctcagaagtgggaagtatactgcagattcactctatgccaag
cactatgggatatccacaggaatgtgccacagtgaggccctggggagtgagtccttcctg
cccggcagctcctttgctcatgagctggcccgagtcacctcctcgtacagcacctcagag
gcagcgccctggggcagctgggatccgaaggcctggaggcaggtgcccgctccactactg
cctagctgcgacgccacagtcacactctccagcctccatctgcccctgcccctgaacccc
tacctctgtacaactgtgttatttgatgaaactaatgcagcaaccggtggggatggagtt
aaggaccgtttagacctagccagccacgcaggaatctgtccccagaagcagcggaagcac
ctgcaagaggaacagagatcaggtcttatggcaatgacccctgaacggcagaatgcatat
atctcccaacagatgagtccatttgaagccgtccaagaacaagtcacctccaagtgtagc
cggatcaaggcaagccccccatctagcaagcacttgatgccacccagaactgggcttctt
cagaacaatctgagtccaggaatgatcccactcaccaggcaccagagctgcgagggcatg
ggagtgatctcaccaactctggggaagcggcaaggaattttcacctccagcccccagtgt
cccatcctctcacactcaggccagactcccctgggcagacttgactctgtctgccagcat
atgcagagtcccaaggccaccccaccagaagtgcccctgcctgggttctgtcccagctcc
ctgggcacccagtccttgagtccccaccagctcagacggcctagtgtgccaagaatgccc
actgcgttcaacaatgctgcatgggtcacagcggcagcagctgtgaccacagcagtttcg
gggaaaacacccctcagccaagtggataatagcgttcagcagcactcaccttctggccag
gcctgccttcagaggccatctgattgggaggcacaagtgcccgctgcgatgggaacacaa
gtgcccctggccaacaaccccagcttcagcctgctgggcagccagagcctcaggcagagc
ccggtacagggcccggtgcctgtagcaaacaccaccaagttcctccagcagggtatggcc
agctttagtcccctgagccccatacagggcatcgagccaccaagctatgtggctgctgct
gccaccgctgctgctgcttctgccgttgctgccagccagttcccaggtccgttcgacaga
acggatattccccctgagctgccacctgccgactttttgcgccagccccaacccccacta
aatgatctgatttcgtcacctgactgcaatgaggtagatttcattgaagctctcttgaaa
ggctcctgtgtgagcccagatgaagactgggtgtgcaacttgaggctgatcgacgacatt
ttggaacagcatgctgctgctcaaaatgccacagcccagaattctgggcaagtcacccag
gatgctggggcactttaa

>gi568815575f:150492615_150771592|GENSCAN_predicted_peptide_2|193_aa
MIPTVVSENKFELSYPSHLIFRHLPNARSPSTPLSRHLTSQGLHSGVEKGVGLDYISDGH
LPRKIFSDLPLPSGQPSQRQAMPRAGEPGAWALHSHPLVGELARHAQHIMKALCHVPHIF
SGQKISIYRKPADASAVMARLPDLLMGKPPPVALQRKREYRKGEGKTNIKDKSQVLDLET
KRIWGIREQGREV

>gi568815575f:150492615_150771592|GENSCAN_predicted_CDS_2|582_bp
atgataccaaccgtggtgtcagaaaacaagtttgagctttcctacccttctcacctcatc
ttccgccacttgcccaacgctcgctccccctccacacctctctcccgtcacctcaccagc
cagggcctgcattctggagtggagaagggagtgggcctcgattatatctcagatggccat
ctccccaggaaaatcttctcggacctccccctgccttctggccagcccagccagcgccag
gccatgcccagagctggggaaccaggtgcctgggctctgcattcacatcctctcgtgggt
gaacttgcacgacatgcccaacacataatgaaggctttgtgccacgtgcctcacatcttc
agtggccagaagatttcaatttacagaaaaccagcagatgcttcagcagtcatggcaagg
ctcccagatcttctaatgggaaagcctccacctgtggccctacagaggaagagagaatac
agaaagggtgagggaaagacaaacatcaaggacaagtctcaggttctggacttggagacc
aagaggatttggggcatccgggagcagggcagggaggtgtga

>gi568815575f:150492615_150771592|GENSCAN_predicted_peptide_3|501_aa
MAPIWPSPHLFILVAVFFCDFVTFLDYFLRTVASTVKLLNPTRFCAKTSRDGVNRDLTEA
VPRLPGETLITEWNIGNVEEAKGMGAFKASSGADPNPCYWHQARRQDMRNLRFALKQEGH
SRRDMFEILTRYAFPLAHSLPLFAFLNEEKFNVDGWTVYNPVEEYRRQGLPNHHWRITFI
NKCYELCDTYPALLVVPYRASDDDLRRVATFRSRNRIPVLSWIHPENKTVIVRCSQPLVG
MSGKRNKDDEKYLDVIRETNKQISKLTIYDARPSVNAVANKLVLTGAIQVADKVSSGKSS
VLVHCSDGWDRTAQLTSLAMLMLDSFYRSIEGFEILVQKEWISFGHKFASFPTAFEFNEQ
FLIIILDHLYSCRFGTFLFNCESARERQKVTERTVSLWSLINSNKEKFKNPFYTKEINRV
LYPVASMRHLELWVNYYIRWNPRIKQQPNPVEQRYMELLALRDEYIKRLEELQLANSAKL
SDPPTSPSSPSQMMPHVQTHF

>gi568815575f:150492615_150771592|GENSCAN_predicted_CDS_3|1506_bp
atggcccctatctggccatctccccacctcttcatcctggtggctgtctttttctgtgac
ttcgtcacctttcttgactatttcttaaggacagtagccagtactgtgaagctgttaaat
cccactagattttgtgccaagacgtctcgagatggagtcaatcgagatctcactgaggct
gttcctcgacttccaggagaaacactaatcactgaatggaatataggaaatgtggaggaa
gctaaaggaatgggggcttttaaggcttccagtggagcagatcctaacccttgctactgg
catcaggcacgacgccaggacatgagaaacctgaggttcgctttgaaacaggaaggccac
agcagaagagatatgtttgagatcctcacgagatacgcgtttcccctggctcacagtctg
ccattatttgcatttttaaatgaagaaaagtttaacgtggatggatggacagtttacaat
ccagtggaagaatacaggaggcagggcttgcccaatcaccattggagaataacttttatt
aataagtgctatgagctctgtgacacttaccctgctcttttggtggttccgtatcgtgcc
tcagatgatgacctccggagagttgcaacttttaggtcccgaaatcgaattccagtgctg
tcatggattcatccagaaaataagacggtcattgtgcgttgcagtcagcctcttgtcggt
atgagtgggaaacgaaataaagatgatgagaaatatctcgatgttatcagggagactaat
aaacaaatttctaaactcaccatttatgatgcaagacccagcgtaaatgcagtggccaac
aagctcgttttgacaggagccattcaagtagcagacaaagtttcttcagggaagagttca
gtgcttgtgcattgcagtgacggatgggacaggactgctcagctgacatccttggccatg
ctgatgttggatagcttctataggagcattgaagggttcgaaatactggtacaaaaagaa
tggataagttttggacataaatttgcatctttccctacagcttttgaattcaatgaacaa
tttttgattataattttggatcatctgtatagttgccgatttggtactttcttattcaac
tgtgaatctgctcgagaaagacagaaggttacagaaaggactgtttctttatggtcactg
ataaacagtaataaagaaaaattcaaaaaccccttctatactaaagaaatcaatcgagtt
ttatatccagttgccagtatgcgtcacttggaactctgggtgaattactacattagatgg
aaccccaggatcaagcaacaaccgaatccagtggagcagcgttacatggagctcttagcc
ttacgcgacgaatacataaagcggcttgaggaactgcagctcgccaactctgccaagctt
tctgatcccccaacttcaccttccagtccttcgcaaatgatgccccatgtgcaaactcac
ttctga

>gi568815575f:150492615_150771592|GENSCAN_predicted_peptide_4|591_aa
MDRPAAAAAAGCEGGGGPNPGPAGGRRPPRAAGGATAGSRQPSVETLDSPTGSHVEWCKQ
LIAATISSQISGSVTSENVSRDYKALRDGNKLAQMEEAPLFPGESIKAIVKDVMYICPFM
GAVSGTLTVTDFKLYFKNVERDMRNLRLAYKQEEQSKLGIFENLNKHAFPLSNGQALFAF
SYKEKFPINGWKVYDPVSEYKRQGLPNESWKISKINSNYEFCDTYPAIIVVPTSVKDDDL
SKVAAFRAKGRVPVLSWIHPESQATITRCSQPLVGPNDKRCKEDEKYLQTIMDANAQSHK
LIIFDARQNSVADTNKTKGGGYESESAYPNAELVFLEIHNIHVMRESLRKLKEIVYPSID
EARWLSNVDGTHWLEYIRMLLAGAVRIADKIESGKTSVVVHCSDGWDRTAQLTSLAMLML
DSYYRTIKGFETLVEKEWISFGHRFALRVGHGNDNHADADRSPIFLQFVDCVWQMTRQDV
YTKTISLWSYINSQLDEFSNPFFVNYENHVLYPVASLSHLELWVNYYVRWNPRMRPQMPI
HQNLKELLAVRAELQKRVEGLQREVATRAVSSSSERGSSPSHSATSVHTSV

>gi568815575f:150492615_150771592|GENSCAN_predicted_CDS_4|1776_bp
atggacaggccggcggcggcggcggcggcgggctgcgagggcggcgggggcccgaacccg
gggccggcgggcggcaggaggcctcctcgggccgcggggggcgccaccgccggctcccgg
cagcccagcgtggagaccctggacagtcccacaggatcacatgttgaatggtgtaaacag
cttatagctgctacaatttctagtcagatttcaggttcagtgacatcagaaaatgtgtcc
agagattacaaggctctaagggatggaaataagctggcacagatggaagaggctccactt
ttcccaggagaatcaattaaagccattgtgaaagatgtcatgtatatctgcccatttatg
ggagcagtgagtggaaccctgacagtgacggactttaagctgtacttcaaaaatgtcgag
agggatatgaggaacttgcggcttgcttataaacaggaagaacagagtaaactagggata
tttgaaaacctcaacaaacatgcatttcctctttctaacggacaggcactatttgcattc
agctataaagaaaaatttccaattaatggctggaaagtttatgatccagtatctgaatat
aagagacagggcttgccaaatgagagttggaaaatatccaaaataaacagtaattatgag
ttctgtgacacctaccctgccatcattgttgtgccaactagtgtaaaagatgatgacctt
tcaaaagtggcagcttttcgagcaaaaggcagagtccctgtgttgtcatggattcatccg
gaaagtcaagcaacgattacccgttgcagccagccacttgtgggtcccaatgataagcgc
tgcaaagaggatgaaaaatacttgcaaacaataatggatgctaacgcacagtcacacaag
cttatcatctttgatgctcgacaaaacagtgtcgctgataccaacaagacaaagggtgga
ggatatgaaagtgaaagtgcttacccaaatgcagaacttgtgttcttggagatccacaac
attcatgtcatgcgagagtcactacgcaaattaaaagagattgtgtacccttcgatcgat
gaggcgcggtggctctccaatgtggatgggacgcattggctggaatatataaggatgctg
cttgctggggcagtaagaattgctgataaaatagaatctgggaaaacatctgtggtggtg
cattgcagcgacggttgggaccgaacagcccagctcacatctctggctatgctaatgttg
gacagttactacaggaccattaaaggatttgaaactctcgtagaaaaggagtggataagc
tttggacacaggtttgcactgcgagtgggccatggtaatgacaaccatgcggatgctgac
cgatctcccatatttctgcagtttgttgattgtgtttggcaaatgacaaggcaggatgta
tatacaaagacgatatctttatggtcgtatatcaatagccagctagacgagttttctaat
cccttctttgtgaattatgaaaaccacgtgttatatcctgttgctagtctgagtcatttg
gaattgtgggtaaattattatgtacgatggaatccacggatgagacctcagatgcccatt
caccagaatctcaaggagctgctggccgtcagggcggagctgcagaagcgtgtggagggc
ctacagcgggaggtggccacgcgcgccgtctcatcctcatctgagcggggctcctcgccc
tcccactccgccacctccgtccacacctcggtctga

>gi568815575f:150492615_150771592|GENSCAN_predicted_peptide_5|85_aa
MECRKPSLHVVVISEDDDENSWAVNSGTNSLHPDKDVQVAQEGLNADYVKGENLEAVVCE
EPQVKYSTLHTQSAEPPPPPEPARI

>gi568815575f:150492615_150771592|GENSCAN_predicted_CDS_5|258_bp
atggagtgccggaagccttcgctgcatgtcgttgttatttctgaggacgatgatgagaac
tcctgggcagtgaattctgggacgaacagtcttcatccggataaggatgttcaggtggct
caggagggtctcaacgcagactacgtgaagggagagaacctggaagccgtggtatgtgag
gaaccccaagtgaaatactccacgttgcacacgcagtctgcagagccgccgccgccgccc
gaaccagcccggatctga