GENSCAN 1.0 Date run: 5-Nov-116 Time: 14:22:17 Sequence gi568815595f:53078423_53292263 : 213841 bp : 50.44% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.15 PlyA - 3352 3347 6 1.05 1.14 Term - 6208 6143 66 1 0 98 42 41 0.015 -1.56 1.13 Intr - 13839 13693 147 0 0 122 65 -5 0.603 1.03 1.12 Intr - 14196 13947 250 2 1 129 78 172 0.914 17.74 1.11 Intr - 21064 20959 106 2 1 73 94 56 0.980 3.97 1.10 Intr - 25675 25531 145 1 1 90 92 189 0.999 19.36 1.09 Intr - 27381 27251 131 1 2 109 95 32 0.996 6.41 1.08 Intr - 28447 28397 51 2 0 122 90 -20 0.547 0.48 1.07 Intr - 33486 33408 79 0 1 129 115 -31 0.910 2.82 1.06 Intr - 35938 35729 210 1 0 55 29 132 0.544 3.11 1.05 Intr - 43378 43277 102 1 0 88 110 14 0.893 3.97 1.04 Intr - 44141 43952 190 1 1 105 86 21 0.949 3.19 1.03 Intr - 45418 45302 117 0 0 110 92 103 0.998 12.48 1.02 Intr - 47572 47487 86 1 2 106 97 31 0.753 4.42 1.01 Init - 51978 51916 63 0 0 64 78 132 0.840 8.96 1.00 Prom - 59679 59640 40 -2.06 2.00 Prom + 61371 61410 40 -5.86 2.01 Init + 63825 63940 116 2 2 51 77 83 0.354 1.99 2.02 Intr + 65675 65740 66 2 0 97 89 44 0.544 3.42 2.03 Intr + 71415 71644 230 0 2 49 81 101 0.085 3.01 2.04 Intr + 78349 78533 185 2 2 23 94 61 0.104 -0.29 2.05 Intr + 82914 83006 93 2 0 59 107 31 0.330 2.26 2.06 Intr + 86682 86793 112 2 1 134 91 -2 0.273 4.65 2.07 Intr + 87640 87700 61 2 1 59 65 46 0.073 -2.81 2.08 Intr + 99982 100115 134 1 2 81 96 289 0.224 29.29 2.09 Intr + 101155 101354 200 2 2 105 82 332 0.999 33.37 2.10 Intr + 102785 102845 61 1 1 85 86 83 0.988 6.11 2.11 Intr + 103022 103184 163 0 1 68 85 215 0.957 18.23 2.12 Intr + 103279 103310 32 1 2 107 88 46 0.940 4.27 2.13 Intr + 104515 104568 54 2 0 107 60 28 0.428 0.85 2.14 Intr + 104699 104784 86 0 2 72 47 162 0.999 10.04 2.15 Intr + 105030 105159 130 2 1 62 100 195 0.570 18.37 2.16 Intr + 106452 106552 101 1 2 109 80 124 0.998 13.53 2.17 Intr + 107182 107278 97 0 1 102 67 33 0.981 2.18 2.18 Intr + 107505 107605 101 1 2 52 100 201 0.987 17.53 2.19 Intr + 107745 107918 174 2 0 109 44 306 0.736 28.44 2.20 Intr + 108182 108273 92 1 2 120 75 162 0.662 16.89 2.21 Intr + 110298 110436 139 0 1 119 83 266 0.805 29.67 2.22 Intr + 110636 110824 189 1 0 107 44 341 0.877 31.38 2.23 Intr + 111451 111579 129 0 0 65 68 85 0.971 5.09 2.24 Term + 113686 113844 159 0 0 82 51 286 0.994 22.24 2.25 PlyA + 114197 114202 6 -0.45 3.03 PlyA - 114263 114258 6 1.05 3.02 Term - 114998 114995 4 1 1 142 48 0 0.014 -2.22 3.01 Init - 120953 120832 122 2 2 103 27 133 0.169 6.36 3.00 Prom - 143019 142980 40 -1.96 4.16 PlyA - 143042 143037 6 1.05 4.15 Term - 146554 146357 198 1 0 114 53 84 0.276 4.90 4.14 Intr - 147509 147361 149 0 2 158 40 185 0.840 20.65 4.13 Intr - 148456 148334 123 2 0 129 116 233 0.999 30.76 4.12 Intr - 149727 149634 94 0 1 120 100 174 0.999 21.34 4.11 Intr - 149937 149854 84 0 0 80 86 176 0.975 16.62 4.10 Intr - 150715 150585 131 2 2 73 100 116 0.999 11.71 4.09 Intr - 151014 150858 157 0 1 123 109 302 0.999 35.38 4.08 Intr - 152199 152035 165 0 0 94 78 452 0.999 44.96 4.07 Intr - 153128 152935 194 0 2 75 82 253 0.991 22.61 4.06 Intr - 154852 154734 119 0 2 94 60 248 0.985 22.71 4.05 Intr - 156752 156561 192 1 0 90 79 244 0.924 22.31 4.04 Intr - 161926 161829 98 1 2 80 92 207 0.995 19.11 4.03 Intr - 162823 162710 114 1 0 117 131 159 0.999 23.74 4.02 Intr - 163820 163703 118 1 1 75 116 179 0.929 19.87 4.01 Init - 177520 177414 107 1 2 108 92 147 0.946 16.79 4.00 Prom - 191576 191537 40 -4.86 5.03 PlyA - 193209 193204 6 1.05 5.02 Term - 203032 202790 243 1 0 51 48 141 0.651 2.30 5.01 Intr - 212434 212369 66 1 0 34 107 79 0.529 3.50 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 195381 195473 93 2 0 85 86 80 0.806 7.88 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:53078423_53292263|GENSCAN_predicted_peptide_1|580_aa MGSQEVLGHAARLASSGLLLQVLFRLITFVLNAFILRFLSKEIVGVVNVRLTLLYSTTLF LAREAFRRACLSGGTQRDWSQTLNLLWLTVPLGVFWSLFLGWIWLQLLEVPDPNVVPHYA TGVVLFGLSAVVELLGEPFWVLAQAHMFVKLKVIAESLSVILKSVLTAFLVLWLPHWGLY IFSLAQNFVFKELNHLPKSIHNLQANRHHESLMTSSSLRFLLKGDRSCFADPWEDHHSMA RGEEQCEREKVLKPAVAFINWKEAKLTWSFFKQSFLKQILTEGERYVMTFLNVLNFGDQG VYDIVNNLGSLVARLIFQPIEESFYIFFAKVLERGKDATLQKQEDVAVAAAVLESLLKLA LLAGLTITVFGFAYSQLALDIYGGTMLSSGSGPVLLRSYCLYVLLLAINGVTECFTFAAM SKEEVDRYNFVMLALSSSFLVLSYLLTRWCGSVGFILANCFNMGIRITQSLCFIHRYYRR SPHRPLAGLHLSPVLLGTFALSGGVTAVSEASRAASSDNVTRECCQWGNNGPAVWLLCLL SDLEAGTTAVFPGWSQRKECPITGWGLGTGSGAVIAVSEQ >gi568815595f:53078423_53292263|GENSCAN_predicted_CDS_1|1743_bp atgggcagccaggaggtgctgggccacgcggcccggctggcctcctccggtctcctcctg caggtgttgtttcggttgatcacctttgtcttgaatgcatttattcttcgcttcctgtca aaggaaatcgttggcgtagtaaatgtaagactaacgctgctttactcaaccaccctcttc ctggccagagaggccttccgcagagcatgtctcagtgggggcacccagcgagactggagc cagaccctcaacctgctgtggctaacagtccccctgggtgtgttttggtccttattcctg ggctggatctggttgcagctgcttgaagtgcctgatcctaatgttgtccctcactatgca actggagtggtgctgtttggtctctcggcagtggtggagcttctaggagagcccttttgg gtcttggcacaagcacatatgtttgtgaagctcaaggtgattgcagagagcctgtcggta attcttaagagcgttctgacagcttttctcgtgctgtggttgcctcactggggattgtac attttctctttggcccagaactttgtctttaaagagctcaatcatcttcccaaatccatt cacaacctgcaggcaaacaggcatcacgagagcctcatgacaagcagcagccttcgtttc cttttaaaaggggacaggtcctgctttgctgacccctgggaggaccaccacagcatggca aggggtgaggagcagtgtgagagagagaaggtgctgaagccggctgtggcgtttataaac tggaaagaggctaaactgacttggagttttttcaaacagtctttcttgaaacagattttg acagaaggcgagcgatatgtgatgacatttttgaatgtattgaactttggtgatcagggt gtgtatgatatagtgaataatcttggctcccttgtggccagattaattttccagccaata gaggaaagtttttatatattttttgctaaggtgctggagaggggaaaggatgccacactt cagaagcaggaggacgttgctgtggctgctgcagtcttggagtccctgctcaagctggcc ctgctggccggcctgaccatcactgtttttggctttgcctattctcagctggctctggat atctacggagggaccatgcttagctcaggatccggtcctgttttgctgcgttcctactgt ctctatgttctcctgcttgccatcaatggagtgacagagtgtttcacatttgctgccatg agcaaagaggaggtcgacaggtacaattttgtgatgctggccctgtcctcctcattcctg gtgttatcctatctcttgacccgttggtgtggcagcgtgggcttcatcttggccaactgc tttaacatgggcattcggatcacgcagagcctttgcttcatccaccgctactaccgaagg agcccccacaggcccctggctggcctgcacctatcgccagtcctgctcgggacatttgcc ctcagtggtggggttactgctgtttcggaggcctctagggcagcatcttctgataatgtg acaagagaatgttgccaatggggaaacaatggacctgcagtatggctgttatgcctttta tctgacttagaagctggaaccacagctgtcttccctggctggtcccagaggaaggagtgc cccatcacaggttggggccttggaacaggctcgggagctgtcatcgcagtcagtgagcag tga >gi568815595f:53078423_53292263|GENSCAN_predicted_peptide_2|967_aa MPPFPKAVINISVLTAPTAPQEHLDLYMRDLILQENPPSVPICSVEELASLVQEGPGKRP RDFQGRACQCSWVQGAPEVQANPMQLNFWPLLYGGWGLKKPRNMQSLRLLSAVSGHSDVN TGNLVFTEYLLCTRPSQVSPSQPANLEQPPFSSGPPPTLFPMGQAGFGELSSEDVGDEGL RGGSGELRHPQARERKERQSAPRRTPSARAGERRPRRCRRDPWRLPLQREVCRELARQGG RPVSPGGWCVVAAAGARTKDKQELGAPGVLPEQTLTANPTGCYAPIGLPTAGPTMAPFLR IAFNSYELGSLQAEDEANQPFCAVKMKEALSTERGKTLVQKKPTMYPEWKSTFDAHIYEG RVIQIVLMRAAEEPVSEVTVGVSVLAERCKKNNGKAEFWLDLQPQAKVLMSVQYFLEDVD CKQSMRSEDEAKFPTMNRRGAIKQAKIHYIKNHEFIATFFGQPTFCSVCKDFVWGLNKQG YKCRRSSFEGWQEEDSSAGPLRECNAAIHKKCIDKIIGRCTGTAANSRDTIFQKERFNID MPHRFKVHNYMSPTFCDHCGSLLWGLVKQGLKCEDCGMNVHHKCREKVANLCGINQKLLA EALNQVTQRASRRSDSASSEPVGIYQGFEKKTGVAGEDMQDNSGTYGKIWEGSSKCNINN FIFHKVLGKGSFGKVLLGELKGRGEYFAIKALKKDVVLIDDDVECTMVEKRVLTLAAENP FLTHLICTFQTKDHLFFVMEFLNGGDLMYHIQDKGRFELYRATDLKLDNVLLDRDGHIKI ADFGMCKENIFGESRASTFCGTPDYIAPEILQGLKYTFSVDWWSFGVLLYEMLIGQSPFH GDDEDELFESIRVDTPHYPRWITKESKDILEKLFEREPTKRLGVTGNIKIHPFFKTINWT LLEKRRLEPPFRPKVKSPRDYSNFDQEFLNEKARLSYSDKNLIDSMDQSAFAGFSFVNPK FEHLLED >gi568815595f:53078423_53292263|GENSCAN_predicted_CDS_2|2904_bp atgcctccctttccaaaggccgtgatcaacatcagtgtcctaacagcacccactgcgccc caggagcatctggacctttacatgcgtgatctcattcttcaagagaaccctccaagtgtg cctatttgctccgtggaggagttggccagcctggtccaagagggccctggaaaaagaccc agagacttccagggcagggcctgccagtgctcctgggtgcagggagccccagaagtccag gccaaccccatgcagcttaacttttggcccttgctatatggaggctgggggctgaagaag ccaagaaacatgcagtcactaaggttgctgagtgctgtctctgggcacagcgatgtcaac acaggaaacctggtcttcactgagtacctgctgtgtacccggccctcacaagtcagtcct tcccagcctgccaacctcgagcagccgcctttcagctcagggccaccccctacactgttt cccatgggccaagctgggtttggtgagctgtcctctgaggatgtgggggatgaggggctg agaggtgggagcggggagctgaggcaccctcaggccagggagcgaaaggaaaggcagtca gcgccgcgccgaaccccgtccgcgcgcgccggggagcggcgcccccgccgctgccgccgc gacccttggcgcctgcccctgcaacgggaggtctgcagggaactggccaggcaagggggc aggcccgtttctcctggtggttggtgcgttgtagcagcagcgggagccaggactaaggac aagcaggagctgggagccccaggagtgctccctgagcagaccctcacagccaaccctact ggctgttacgcacctataggtctccccactgcaggccccaccatggcgccgttcctgcgc atcgccttcaactcctatgagctgggctccctgcaggccgaggacgaggcgaaccagccc ttctgtgccgtgaagatgaaggaggcgctcagcacagagcgtgggaaaacactggtgcag aagaagccgaccatgtatcctgagtggaagtcgacgttcgatgcccacatctatgagggg cgcgtcatccagattgtgctaatgcgggcagcagaggagccagtgtctgaggtgaccgtg ggtgtgtcggtgctggccgagcgctgcaagaagaacaatggcaaggctgagttctggctg gacctgcagcctcaggccaaggtgttgatgtctgttcagtatttcctggaggacgtggat tgcaaacagtctatgcgcagtgaggacgaggccaagttcccaacgatgaaccgccgcgga gccatcaaacaggccaaaatccactacatcaagaaccatgagtttatcgccaccttcttt gggcaacccaccttctgttctgtgtgcaaagactttgtctggggcctcaacaagcaaggc tacaaatgcaggcgctcctccttcgagggctggcaggaggaagactcaagcgctgggcct ctgcgggaatgtaacgctgccatccacaagaaatgcatcgacaagatcatcggcagatgc actggcaccgcggccaacagccgggacactatattccagaaagaacgcttcaacatcgac atgccgcaccgcttcaaggttcacaactacatgagccccaccttctgtgaccactgcggc agcctgctctggggactggtgaagcagggattaaagtgtgaagactgcggcatgaatgtg caccataaatgccgggagaaggtggccaacctctgcggcatcaaccagaagcttttggct gaggccttgaaccaagtcacccagagagcctcccggagatcagactcagcctcctcagag cctgttgggatatatcagggtttcgagaagaagaccggagttgctggggaggacatgcaa gacaacagtgggacctacggcaagatctgggagggcagcagcaagtgcaacatcaacaac ttcatcttccacaaggtcctgggcaaaggcagcttcgggaaggtgctgcttggagagctg aagggcagaggagagtactttgccatcaaggccctcaagaaggatgtggtcctgatcgac gacgacgtggagtgcaccatggttgagaagcgggtgctgacacttgccgcagagaatccc tttctcacccacctcatctgcaccttccagaccaaggaccacctgttctttgtgatggag ttcctcaacgggggggacctgatgtaccacatccaggacaaaggccgctttgaactctac cgtgccacggacctcaaactggacaatgtgctgctggaccgggatggccacatcaagatt gccgactttgggatgtgcaaagagaacatattcggggagagccgggccagcaccttctgc ggcacccctgactatatcgcccctgagatcctacagggcctgaagtacacattctctgtg gactggtggtctttcggggtccttctgtacgagatgctcattggccagtcccccttccat ggtgatgatgaggatgaactcttcgagtccatccgtgtggacacgccacattatccccgc tggatcaccaaggagtccaaggacatcctggagaagctctttgaaagggaaccaaccaag aggctgggagtgaccggaaacatcaaaatccaccccttcttcaagaccataaactggact ctgctggaaaagcggaggttggagccacctttcaggcccaaagtgaagtcacccagagac tacagtaactttgaccaggagttcctgaacgagaaggcgcgcctctcctacagcgacaag aacctcatcgactccatggaccagtctgcattcgctggcttctcctttgtgaaccccaaa ttcgagcacctcctggaagattga >gi568815595f:53078423_53292263|GENSCAN_predicted_peptide_3|41_aa MPSHCPALEPGPHPRLGRLASTTHLLEDMNSKALGFGYLPP >gi568815595f:53078423_53292263|GENSCAN_predicted_CDS_3|126_bp atgcccagccactgccctgctcttgagccgggtccccaccctcggctggggcggctggcc tccaccacccacctgctggaggacatgaactctaaggccttgggctttggttatctgcca ccgtag >gi568815595f:53078423_53292263|GENSCAN_predicted_peptide_4|680_aa MESYHKPDQQKLQALKDTANRLRISSIQATTAAGSGHPTSCCSAAEIMAVLFFHTMRYKS QDPRNPHNDRFVLSKGHAAPILYAVWAEAGFLAEAELLNLRKISSDLDGHPVPKQAFTDV ATGSLGQGLGAACGMAYTGKYFDKASYRVYCLLGDGELSEGSVWEAMAFASIYKLDNLVA ILDINRLGQSDPAPLQHQMDIYQKRCEAFGWHAIIVDGHSVEELCKAFGQAKHQPTAIIA KTFKGRGITGVEDKESWHGKPLPKNMAEQIIQEIYSQIQSKKKILATPPQEDAPSVDIAN IRMPSLPSYKVGDKIATRKAYGQALAKLGHASDRIIALDGDTKNSTFSEIFKKEHPDRFI ECYIAEQNMVSIAVGCATRNRTVPFCSTFAAFFTRAFDQIRMAAISESNINLCGSHCGVS IGEDGPSQMALEDLAMFRSVPTSTVFYPSDGVATEKAVELAANTKGICFIRTSRPENAII YNNNEDFQVGQAKVVLKSKDDQVTVIGAGVTLHEALAAAELLKKEKINIRVLDPFTIKPL DRKLILDSARATKGRILTVEDHYYEGGIGEAVSSAVVGEPGITVTHLAVNRVPRSGKPAE LLKMFGIDRDAIAQAVTALTLLTTVDFSPCCSGSAGVSPAYKVNCAKPYTGQEQTSAGGG GLGAPATGGVPRKSSELLRG >gi568815595f:53078423_53292263|GENSCAN_predicted_CDS_4|2043_bp atggagagctaccacaagcctgaccagcagaagctgcaggccttgaaggacacggccaac cgcctacgtatcagctccatccaggccaccactgcggcgggctctggccaccccacgtca tgctgcagcgccgcagagatcatggctgtcctctttttccacaccatgcgctacaagtcc caggacccccggaatccgcacaatgaccgctttgtgctctccaagggccatgcagctccc atcctctacgcggtctgggctgaagctggtttcctggccgaggcggagctgctgaacctg aggaagatcagctccgacttggacgggcacccggtcccgaaacaagctttcaccgacgtg gccactggctccctgggccagggcctcggggccgcttgtgggatggcctacaccggcaaa tacttcgacaaggccagctaccgagtctattgcttgctgggagacggggagctgtcagag ggctctgtatgggaggccatggccttcgccagcatctataagctggacaaccttgtggcc attctagacatcaatcgcctgggccagagtgacccggccccactgcagcaccagatggac atctaccagaagcggtgcgaggccttcggttggcatgccatcatcgtggatggacacagc gtggaggagctgtgcaaggcctttggccaggccaagcaccagccaacagccatcattgcc aagaccttcaagggccgagggatcacgggggtagaagataaggagtcttggcatgggaag cccctccccaaaaacatggctgagcagatcatccaggagatctacagccagatccagagc aaaaagaagatcctggcaacccctccacaggaggacgcaccctcagtggacattgccaac atccgcatgcccagcctgcccagctacaaagttggggacaagatagccacccgcaaggcc tacgggcaggcactggccaagctgggccatgccagtgaccgcatcatcgccctggatggg gacaccaaaaattccaccttctcggagatcttcaaaaaggagcacccggaccgcttcatc gagtgctacattgctgagcagaacatggtgagcatcgcggtgggctgtgccacccgcaac aggacggtgcccttctgcagcacttttgcagccttcttcacgcgggcctttgaccagatt cgcatggccgccatctccgagagcaacatcaacctctgcggctcccactgcggcgtttcc atcggggaagacgggccctcccagatggccctagaagatctggctatgtttcggtcagtc cccacatcaactgtcttttacccaagtgatggcgttgctacagagaaggcagtggaacta gccgccaatacaaagggtatctgcttcatccggaccagccgcccagaaaatgccatcatc tataacaacaatgaggacttccaggtcggacaagccaaggtggtcctgaagagcaaggat gaccaggtgaccgttatcggggctggggtgaccctgcacgaggccttggccgctgccgaa ctgctgaagaaagaaaagatcaacatccgcgtgctggaccccttcaccatcaagcccctg gacagaaaactcattctcgacagcgctcgtgccaccaagggcaggatcctcaccgtggag gaccattattatgaaggtggcattggtgaggctgtgtccagtgcagtagtgggcgagcct ggcatcactgtcacccacctggcagttaaccgggtaccaagaagtgggaagccggctgag ctgctgaagatgtttggtatcgacagggatgccattgcacaagctgtgactgccctcaca ctgctgaccacagtggatttctccccctgctgctcgggctcagctggggtcagccctgct tataaggtcaactgtgcaaaaccttatactggccaagaacaaactagtgctgggggagga gggctgggtgccccggccactggtggagtccccaggaaatcctcagagctgttgcgagga tga >gi568815595f:53078423_53292263|GENSCAN_predicted_peptide_5|102_aa SMQQNQDPEVFVQPKVLSSAIPKMEKALNLVEDMNRKRVLIDSMLHQKALSLYKDLKDLL KWGHQAIYCKKGMVTQISGIQKVNSSLMLYHNAYAITSLLIM >gi568815595f:53078423_53292263|GENSCAN_predicted_CDS_5|309_bp tctatgcagcagaaccaggatcctgaagtatttgtgcagcctaaggtgttatccagtgcc atcccgaagatggaaaaagcattaaatctcgtggaagacatgaacagaaaacgtgttctg attgacagtatgttgcaccaaaaagcattgagcctatacaaagacctaaaagatctcttg aaatggggacaccaagccatttactgcaagaaagggatggttacacagatttcaggaata cagaaggtcaacagtagcctaatgctatatcacaatgcctacgccatcacctcacttctc atcatgtag