GENSCAN 1.0 Date run: 4-Nov-116 Time: 01:03:14

Sequence gi568815596f:190781085_191062983 : 281899 bp : 38.88% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   5367   5477  111  1  0   59   56    73 0.194   0.63
 1.02 Intr +  12757  12828   72  2  0   70   89    38 0.491   0.56
 1.03 Intr +  28034  28085   52  0  1  120  115    32 0.357   6.15
 1.04 Term +  29690  29909  220  2  1   40   42   128 0.323  -1.07
 1.05 PlyA +  32730  32735    6                               1.05

 2.03 PlyA -  34173  34168    6                               1.05
 2.02 Term -  37131  37024  108  0  0   92   50    42 0.526  -1.67
 2.01 Init -  40383  40306   78  0  0   90  116    44 0.854   8.51
 2.00 Prom -  49806  49767   40                              -4.05

 3.00 Prom +  63349  63388   40                              -5.95
 3.01 Init +  90637  90708   72  0  0   88   65    80 0.450   6.82
 3.02 Intr +  99733  99877  145  0  1   42    9   351 0.335  21.53
 3.03 Intr +  99978 100386  409  1  1  -47   87   278 0.135   6.60
 3.04 Intr + 119480 119609  130  2  1   79   90    59 0.623   4.98
 3.05 Intr + 123960 124083  124  2  1   61   95   106 0.675   7.84
 3.06 Intr + 129179 129237   59  0  2   77   93    10 0.803  -1.92
 3.07 Intr + 139940 139972   33  1  0  103   92    39 0.779   3.30
 3.08 Intr + 143459 143509   51  1  0   96   89    19 0.293   0.99
 3.09 Intr + 146222 146398  177  1  0   30  103   124 0.507   7.29
 3.10 Intr + 149353 149484  132  0  0   59   87    33 0.503   0.22
 3.11 Intr + 150461 150553   93  1  0   82   98    39 0.797   3.54
 3.12 Intr + 172481 172542   62  1  2  125   95    29 0.017   3.91
 3.13 Term + 181746 181902  157  0  1   82   39   179 0.968   8.82
 3.14 PlyA + 182080 182085    6                               1.05

 4.20 PlyA - 182190 182185    6                               1.05
 4.19 Term - 192736 192632  105  1  0   79   29    99 0.850   0.53
 4.18 Intr - 193848 193746  103  2  1   88  101   124 0.866  12.96
 4.17 Intr - 194803 194728   76  2  1   42   84    97 0.973   2.35
 4.16 Intr - 195941 195756  186  0  0   94  115   203 0.999  22.34
 4.15 Intr - 197917 197772  146  0  2   61  101   215 0.029  19.11
 4.14 Intr - 199585 199536   50  1  2   72  100    68 0.029   2.96
 4.13 Intr - 203309 203226   84  2  0   88   94    20 0.033   1.70
 4.12 Intr - 205863 205770   94  2  1   53   58   136 0.288   6.15
 4.11 Intr - 208119 208024   96  2  0   92   60    54 0.165   1.21
 4.10 Intr - 208590 208531   60  2  0  103   60    44 0.121   0.03
 4.09 Intr - 210236 210144   93  1  0   68   77    59 0.136   0.96
 4.08 Intr - 214135 213977  159  0  0   83   78   213 0.631  17.98
 4.07 Intr - 216923 216772  152  2  2   97   99   127 0.999  12.74
 4.06 Intr - 217224 217133   92  1  2   64   63    57 0.975  -0.41
 4.05 Intr - 218620 218542   79  1  1   54   94   103 0.975   5.71
 4.04 Intr - 220079 219990   90  2  0   92  101    82 0.543   9.17
 4.03 Intr - 226577 226479   99  2  0   45   80    68 0.592   1.09
 4.02 Intr - 228023 227879  145  1  1   35   80   109 0.941   4.16
 4.01 Init - 228919 228792  128  1  2   23   95   150 0.552   8.88
 4.00 Prom - 229076 229037   40                              -6.25

 5.00 Prom + 229656 229695   40                              -9.95
 5.01 Init + 230249 230306   58  1  1   64   93    83 0.592   7.93
 5.02 Term + 232605 232960  356  1  2  -25   48   288 0.931   7.17
 5.03 PlyA + 234343 234348    6                               1.05

 6.00 Prom + 237019 237058   40                              -7.05
 6.01 Init + 239117 239186   70  1  1   61   80    75 0.958   5.26
 6.02 Intr + 239712 239783   72  1  0   56  107    69 0.928   4.06
 6.03 Intr + 240809 240951  143  0  2  102   86    66 0.965   6.95
 6.04 Term + 242314 242514  201  0  0   62   44   104 0.817  -0.19
 6.05 PlyA + 243674 243679    6                               1.05

 7.14 PlyA - 244031 244026    6                               1.05
 7.13 Term - 247720 247518  203  2  2   25   49   179 0.560   4.27
 7.12 Intr - 252065 251874  192  0  0   60   72   169 0.933  11.04
 7.11 Intr - 252542 252406  137  1  2   90   21    73 0.654   0.09
 7.10 Intr - 252921 252827   95  0  2  109  106    14 0.667   3.14
 7.09 Intr - 253513 253464   50  2  2   82   82    12 0.537  -2.52
 7.08 Intr - 255215 255080  136  2  1   76   94    61 0.870   4.72
 7.07 Intr - 258213 258115   99  0  0   86   74    52 0.879   2.99
 7.06 Intr - 258382 258263  120  1  0   66   73    56 0.744   1.67
 7.05 Intr - 260064 259981   84  0  0  108   92    65 0.630   8.00
 7.04 Intr - 277027 276934   94  0  1   49  111   104 0.908   7.85
 7.03 Intr - 277685 277626   60  1  0   90  108    11 0.499   0.23
 7.02 Intr - 280737 280645   93  2  0   72  102    53 0.894   3.26
 7.01 Intr - 281836 281678  159  0  0   62  100    94 0.563   6.18

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 180863 180879   17  1  2   96  101    -9 0.837   0.95
S.002 Term - 204995 204727  269  0  2   58   47   193 0.832   6.87

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:190781085_191062983|GENSCAN_predicted_peptide_1|151_aa
XHCKRHMPPCRAALLPQIMPLAAASDITASRKWLTNYVLNARSPRSRCQQVWFLLKPLFL
AWHYLPKSPNDPALPQPPCATLSKKKEKERKERKGKERKGKERKGKERQKAKGMAKGKRK
GKEEKRSKIRSLRTSYHPPGLIVLQPLSPFF

>gi568815596f:190781085_191062983|GENSCAN_predicted_CDS_1|456_bp
ngacattgcaaaaggcacatgccaccctgtcgtgctgccttgctcccccagatcatgccc
ttggcagctgcctcagatattactgcttccagaaagtggcttaccaactacgttctgaat
gctagaagcccaagatcaaggtgtcagcaggtttggtttctcctgaagcctctcttcttg
gcttggcattatcttccaaagagtcctaatgacccagctctgccacagccaccatgtgca
accctctcaaaaaagaaagaaaaagagagaaaggaaaggaaaggaaaggaaaggaaagga
aaggaaaggaaaggaaaggaaaggcaaaaggcaaaaggaatggcaaaaggcaaaaggaaa
ggaaaagaagaaaagagatccaaaatccgaagcctccggacctcataccaccctcctggg
ctcattgtgctccagcctctctcaccctttttctga

>gi568815596f:190781085_191062983|GENSCAN_predicted_peptide_2|61_aa
MVVQRRRTQVRKKSFWLMVPFETALKLDVRQLERATTPQSQLKLFKITNPKPAYLASRVP
S

>gi568815596f:190781085_191062983|GENSCAN_predicted_CDS_2|186_bp
atggttgttcagcggaggagaacacaggtgagaaagaaaagtttctggctgatggtccca
tttgagacagcactgaagctagacgtcaggcaactagagagggccactacaccccagagc
caactgaaattattcaaaataaccaatcctaaacctgcttaccttgcttcacgtgttcct
tcctag

>gi568815596f:190781085_191062983|GENSCAN_predicted_peptide_3|547_aa
MSKNMVSAIDESHLMEEGITWQMSPQCGALGGAKRTGRGNPSAQQQQQQQQQQQQQQQHP
HPLRESEPEPHPTGADGPGGMMRLRGSGMLRDLLLRSPAGVSATLRRAQPLVTLCRRPRG
GGRPAAGPAAAARLHPWWGGGGWPAEPLARGLSSSPSEILQELGKGSTHPQPGVSPPAAP
AAPGPKDGPGETDAFGNSEGKELVASGEKCVQSNIVLLTQAFRRKFVIPDFMSFTSHIDE
LYESAKKQSGGKSCVKPLKYAIAVNDLGTEYVHRYVGKEPSGLRFNKLFLNEDDKPHNPM
VNAGAIVVTSLIKQGVNNAEKFDYCFPEGTDMVGILDFYFQLCSIEVTCESASVMAATLA
NGGFCPITGERVLSPEAVRNTLSLMHSCGMYDFSGQFAFHVGLPAKSGVAGGILLVVPNV
MGMMCWSPPLDKMGNSVKGIHFCHDLVSLCNFHNYDNLRHFAKKLDPRREGGDQRVKSVI
NLLFAAYTGDVSALRRWNNTPMDEALHFGHHDVFKILQEYQVQYTPQGDSDNGKENQTVH
KNLDGLL

>gi568815596f:190781085_191062983|GENSCAN_predicted_CDS_3|1644_bp
atgtccaagaatatggtgtcagcaattgatgagagtcatctcatggaggaaggcatcaca
tggcaaatgagtcctcagtgcggagccttaggcggagcgaagagaaccggtcgcggcaat
cctagcgcgcagcagcagcagcagcagcagcagcagcagcagcagcagcagcagcacccg
catccgctgcgggagtccgagccggaaccacacccaacgggcgctgacggacccggcggc
atgatgcggctgcgaggctcggggatgctgcgggacctgctcctgcggtcgcccgccggc
gtgagcgcgactctgcggcgggcacagcccttggtcaccctgtgccggcgtccccgaggc
gggggacggccggccgcgggcccggctgccgccgcgcgactccacccgtggtggggcggg
ggcggctggccggcggagcccctcgcgcggggcctgtccagctctccttcggagatcttg
caggagctgggcaaggggagcacgcatccgcagcccggggtgtcgccacccgctgccccg
gcggcgcccggccccaaggacggccccggggagacggacgcgtttggcaacagcgagggc
aaagagctggtggcctcaggtgaaaaatgtgttcagagcaacattgttttgttgacacaa
gcatttagaagaaagtttgtgattcctgactttatgtcttttacctcacacattgatgag
ttatatgaaagtgctaaaaagcagtctggaggaaagtcctgtgtaaaacctttgaaatat
gccattgctgttaatgatcttggaactgaatatgtgcatcgatatgttggaaaagagccg
agtggactaagattcaacaaactatttttgaatgaagatgataaaccacataatcctatg
gtaaatgctggagcaattgttgtgacttcactaataaagcaaggagtaaataatgctgaa
aaatttgactattgttttccagaaggcacagacatggttggtatattagacttctacttc
cagctgtgctccattgaagtgacttgtgaatcagccagtgtgatggctgcgacactggct
aatggtggtttctgcccaattactggtgaaagagtactgagccctgaagcagttcgaaat
acattgagtttgatgcattcctgtggcatgtatgacttctcagggcagtttgctttccat
gttggtcttcctgcaaaatctggagttgctgggggcattcttttagttgtccccaatgtt
atgggtatgatgtgctggtctcctcctctggataagatgggcaacagtgttaagggaatt
cacttttgtcacgatcttgtttctctgtgtaatttccataactatgataatttgagacac
tttgcaaaaaaacttgatcctcgaagagaaggtggtgatcaaagggtaaagtcagtgata
aatcttttgtttgctgcatatactggagatgtgtctgcacttcgaaggtggaataacact
cccatggatgaagcactgcactttggacaccatgatgtatttaaaattctccaagaatac
caagtccagtacacacctcaaggagattctgacaacgggaaggaaaatcaaaccgtccat
aagaatcttgatggattgttgtaa

>gi568815596f:190781085_191062983|GENSCAN_predicted_peptide_4|678_aa
MSQWYELQQLDSKFLEQVHQLYDDSFPMEIRQYLAQWLEKQDWEHAANDVSFATIRFHDL
LSQLDDQYSRFSLENNFLLQHNIRKSKRNLQDNFQEDPIQMSMIIYSCLKEERKILENAQ
RFNQAQSGNIQSTVMLDKQKELDSKVRNVKDKVMCIEHEIKSLEDLQDEYDFKCKTLQNR
EHETNGVAKSDQKQEQLLLKKMYLMLDNKRKEVVHKIIELLNVTELTQNALINDELVEWK
RRQQSACIGGPPNACLDQLQNWFTIVAESLQQVRQQLKKLEELEQKYTYEHDPITKNKQV
LWDRTFSLFQQLIQSSFVVERQPCMPTHPQRPLVLKTGVQFTVKLRLLVKLQELNYNLKV
KVLFDKSAPKERPLIQPEAERPGCHDSVVIWGFFSGARFRKFNILGTHTKVMNMEESTNG
SLAAEFRHLGPLIVTEELHSLSFETQLCQPGLVIDLEVLTPAPMVSFRGRGFVRCIMGFI
SKERERALLKDQQPGTFLLRFSESSREGAITFTWVERSQNGGEPDFHAVEPYTKKELSAV
TFPDIIRNYKVMAAENIPENPLKYLYPNIDKDHAFGKYYSRPKEAPEPMELDGPKGTGYI
KTELISVSEVHPSRLQTTDNLLPMSPEEFDEVSRIVGSVEFDSMHGLQSLEIVPSYTFLI
VVDEIGHLEKQLAIVAAK

>gi568815596f:190781085_191062983|GENSCAN_predicted_CDS_4|2037_bp
atgtctcagtggtacgaacttcagcagcttgactcaaaattcctggagcaggttcaccag
ctttatgatgacagttttcccatggaaatcagacagtacctggcacagtggttagaaaag
caagactgggagcacgctgccaatgatgtttcatttgccaccatccgttttcatgacctc
ctgtcacagctggatgatcaatatagtcgcttttctttggagaataacttcttgctacag
cataacataaggaaaagcaagcgtaatcttcaggataattttcaggaagacccaatccag
atgtctatgatcatttacagctgtctgaaggaagaaaggaaaattctggaaaacgcccag
agatttaatcaggctcagtcggggaatattcagagcacagtgatgttagacaaacagaaa
gagcttgacagtaaagtcagaaatgtgaaggacaaggttatgtgtatagagcatgaaatc
aagagcctggaagatttacaagatgaatatgacttcaaatgcaaaaccttgcagaacaga
gaacacgagaccaatggtgtggcaaagagtgatcagaaacaagaacagctgttactcaag
aagatgtatttaatgcttgacaataagagaaaggaagtagttcacaaaataatagagttg
ctgaatgtcactgaacttacccagaatgccctgattaatgatgaactagtggagtggaag
cggagacagcagagcgcctgtattggggggccgcccaatgcttgcttggatcagctgcag
aactggttcactatagttgcggagagtctgcagcaagttcggcagcagcttaaaaagttg
gaggaattggaacagaaatacacctacgaacatgaccctatcacaaaaaacaaacaagtg
ttatgggaccgcaccttcagtcttttccagcagctcattcagagctcgtttgtggtggaa
agacagccctgcatgccaacgcaccctcagaggccgctggtcttgaagacaggggtccag
ttcactgtgaagttgagactgttggtgaaattgcaagagctgaattataatttgaaagtc
aaagtcttatttgataagtcagccccaaaggagaggcccctgatccaacccgaggctgaa
cgtccaggctgccatgattctgtggtcatttgggggtttttttctggagcgagatttagg
aagttcaacattttgggcacgcacacaaaagtgatgaacatggaggagtccaccaatggc
agtctggcggctgaatttcggcacctgggtcctctcatcgttactgaagagcttcactcc
cttagttttgaaacccaattgtgccagcctggtttggtaattgacctcgaggtcctaacg
ccagccccgatggtctcattccgtggacgaggttttgtaaggtgcatcatgggcttcatc
agcaaggagcgagagcgtgccctgttgaaggaccagcagccggggaccttcctgctgcgg
ttcagtgagagctcccgggaaggggccatcacattcacatgggtggagcggtcccagaac
ggaggcgaacctgacttccatgcggttgaaccctacacgaagaaagaactttctgctgtt
actttccctgacatcattcgcaattacaaagtcatggctgctgagaatattcctgagaat
cccctgaagtatctgtatccaaatattgacaaagaccatgcctttggaaagtattactcc
aggccaaaggaagcaccagagccaatggaacttgatggccctaaaggaactggatatatc
aagactgagttgatttctgtgtctgaagttcacccttctagacttcagaccacagacaac
ctgctccccatgtctcctgaggagtttgacgaggtgtctcggatagtgggctctgtagaa
ttcgacagtatgcacgggctacaatcattagagattgtaccttcctacactttcctgatt
gttgttgacgaaataggccatttagaaaaacagttagctattgtggcagcgaaatag

>gi568815596f:190781085_191062983|GENSCAN_predicted_peptide_5|137_aa
MLDRQAMGAATPLISQANPEENVTDGKGWKREKVTVNGKVQCLLEENYISQVLFTARARL
ARTIVLSSAISENRWHCSHLEYSGRVLPLGSLFAGQHRRPVLARRPTARPEDSVPARPTP
GTRNLTATHSARRIRKG

>gi568815596f:190781085_191062983|GENSCAN_predicted_CDS_5|414_bp
atgctggaccgtcaggcaatgggagctgcaacgcctcttattagccaagcgaacccagaa
gaaaacgtgactgatggaaaggggtggaaacgggagaaagtgacggtaaatgggaaggtg
cagtgccttctagaggaaaattatatttcacaagtcctcttcactgccagagcacgactg
gcaaggacaatcgtgctgagcagtgccatctccgagaacaggtggcactgctcgcacttg
gaatactcaggacgcgttcttcctctgggatcactattcgcgggtcagcacagacggccc
gtcctcgcccggcgtccaactgcacggcccgaggactctgtccctgcgcgtcccaccccc
ggcacccggaacctcacggccactcactctgcgcgcaggatccggaagggctag

>gi568815596f:190781085_191062983|GENSCAN_predicted_peptide_6|161_aa
MSTITFAVKNRSGTLWSPSFPSAGLFYPLHFAIYTVCTCTDSDLVLSGLSSVIISSKTPS
LIASSSDCITLAILITLVKPIPVYYLSVFISPRQREGFPEPEECVNIIQVQGYMGWGWRI
ERMWQGKQREHGPSKKLKVVQSVLCADVLVDKKLLGRSEER

>gi568815596f:190781085_191062983|GENSCAN_predicted_CDS_6|486_bp
atgagcactatcacattcgcggtcaaaaaccgctcaggaacgctttggagccctagcttt
ccgagcgcaggtctcttttacccgcttcacttcgctatctacacagtttgcacttgcact
gactcagatcttgtcctgtctggtctcagttcagtcatcatctcctcaaagacaccctca
ttgattgcttcttccagtgactgtatcacacttgcaatcctcataacacttgttaaacct
atccctgtttattatttgtctgttttcatctctcctcgccagcgtgaaggctttccagag
cctgaagaatgcgtaaacattatccaggtgcagggatatatgggatggggttggagaatt
gagaggatgtggcaggggaaacagagagaacatgggccatcaaagaaactgaaagtagtt
cagtcagtcttgtgtgcagatgttttggtggataagaagctgttggggaggagtgaagag
agatga

>gi568815596f:190781085_191062983|GENSCAN_predicted_peptide_7|507_aa
XFTLLAESLFQLRRQLEKLEEQSTKMTYEGDPIPMQRTHMLERVTFLIYNLFKNSFVVER
QPCMPTHPQRPLVLKTLIQFTVKLRLLIKLPELNYQVKVKASIDNNRRFVLCGTNVKAMS
IEESSNGSLSVEFRHLGCHMVTEELHSITFETQICLYGLTIDLEKVPWPSQVISVTRAPW
TSGLEVWGWLEIGPLEARVKHKINTSSLPVVMISNVSQLPNAWASIIWYNVSTNDSQNLV
FFNNPPPATLSQLLEVMSWQFSSYVGRGLNSDQLHMLAEKLTVQSSYSDGHLTWAKFCKE
HLPGKSFTFWTWLEAILDLIKKHILPLWIDGYVMGFVSKEKERLLLKDKMPGTFLLRFSE
SHLGGITFTWVDHSESGEVRFHSVEPYNKGRLSALPFADILRDYKVIMAENIPENPLKYL
YPDIPKDKAFGKHYSSQPCEAAVLSNSMELSSAQDLESSLCLLSPGDAAESVEQVMLLRH
LPCQLQKNPEPGFKMSPDCRVRDGSPL

>gi568815596f:190781085_191062983|GENSCAN_predicted_CDS_7|1524_bp
nnctttacactattggcagaaagtcttttccaactgagaaggcaattggagaaactagag
gagcaatctaccaaaatgacatatgaaggtgatcccattccaatgcaaagaactcacatg
ctagaaagagtcaccttcttgatctacaaccttttcaagaactcatttgtggttgagcga
cagccatgtatgccaacccaccctcagaggccgttggtacttaaaaccctaattcagttc
actgtaaaactaaggctactaataaaattgccagaactaaactatcaggtaaaggttaag
gcatcaattgacaacaaccgaagatttgtactttgtggaactaatgtcaaagccatgtct
attgaagaatcttccaatgggagtctctcagtagaatttcgacatttgggctgtcacatg
gtgactgaagaacttcattccataacgtttgaaacacagatctgcctctatggcctgacc
atagatttggagaaagttccatggccttcacaagtcatttcagtgaccagggcaccatgg
acatcaggactggaggtgtggggctggcttgagattggccctttagaggcgagggtcaag
cataagatcaataccagctcattgcctgtggtgatgatttccaatgtcagtcagttacct
aatgcttgggcatccatcatttggtacaacgtgtcaaccaacgattcccagaacttggtt
ttctttaataatcctccacctgccacattgagtcaactactggaggtgatgagctggcag
ttttcatcgtacgttggtcgtggtcttaactcagatcaactccatatgctggcagagaag
cttacagtccaatctagctacagtgatggtcacctcacctgggccaagttctgcaaggaa
catttacctggtaaatcatttaccttttggacatggcttgaagcaatattggatctaatt
aagaaacacattcttcccctttggattgatgggtatgtcatgggctttgttagcaaagag
aaggaacggctgttgctaaaggataaaatgcctggcacctttttattaagattcagtgaa
agccatctcggaggaataactttcacctgggtggaccattctgaaagtggggaagtgaga
ttccactctgtagaaccctacaataaaggccggttgtctgctctgccattcgctgacatc
ctgcgagactacaaagttattatggctgaaaacattcctgaaaaccctctgaagtaccta
tatcctgacattcccaaagacaaagccttcggtaaacactacagctctcagccttgcgaa
gctgctgtccttagtaatagcatggagctctccagtgctcaagacctggagagctctcta
tgcctgctgagccctggggatgctgctgaatctgttgagcaagtaatgctactgaggcat
ctgccatgccagctgcagaaaaatccagaaccaggtttcaagatgagccctgactgcaga
gtcagagatggcagccccctctga