GENSCAN 1.0 Date run: 3-Nov-116 Time: 21:13:02

Sequence gi568815591f:4957287_5165598 : 208312 bp : 47.05% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 Intr -    906    744  163  2  1   74   89    29 0.016   1.58
 1.04 Intr -   1788   1685  104  0  2   87   83   124 0.016  10.77
 1.03 Intr -  17003  16723  281  0  2   45   11   214 0.248   6.60
 1.02 Intr -  17401  17279  123  2  0   51   48    90 0.601   1.86
 1.01 Init -  19039  18991   49  1  1   85   61    32 0.593   1.31
 1.00 Prom -  23320  23281   40                              -3.26

 2.00 Prom +  23831  23870   40                              -5.56
 2.01 Init +  27706  27789   84  0  0   64   73   100 0.531   6.92
 2.02 Intr +  29281  29345   65  0  2   43   94    42 0.579  -2.18
 2.03 Intr +  36334  36527  194  1  2   59   65   285 0.711  22.44
 2.04 Intr +  39324  39512  189  1  0  139   47   124 0.488  12.96
 2.05 Intr +  52599  52669   71  1  2  -18   92   112 0.004  -0.10
 2.06 Term +  53905  53946   42  0  0  100   47    42 0.151  -1.54
 2.07 PlyA +  54656  54661    6                               1.05

 3.03 PlyA -  55695  55690    6                               1.05
 3.02 Term -  59342  59174  169  1  1   69   43   163 0.408   7.25
 3.01 Init -  63535  63501   35  1  2   56   83    29 0.251  -1.46
 3.00 Prom -  71115  71076   40                              -4.76

 4.06 PlyA -  72055  72050    6                               1.05
 4.05 Term -  73419  73315  105  0  0  110   43    77 0.994   3.81
 4.04 Intr -  74672  74504  169  1  1   75   90   192 0.663  18.05
 4.03 Intr -  76765  76680   86  1  2   84   65    13 0.358  -2.78
 4.02 Intr -  78048  77990   59  1  2   87   78    37 0.285   1.20
 4.01 Init -  80545  80437  109  1  1   70   94    88 0.482   7.98
 4.00 Prom -  81449  81410   40                              -6.66

 5.00 Prom +  82614  82653   40                              -7.16
 5.01 Init +  82979  82988   10  1  1   83   85    12 0.467   0.85
 5.02 Intr +  90747  90805   59  1  2  155  109    30 0.727  10.40
 5.03 Intr + 100009 100135  127  0  1   76   72   116 0.975   9.05
 5.04 Intr + 100398 100493   96  1  0   99  111    54 0.940   8.78
 5.05 Term + 106409 108315 1907  0  2   78   45  1372 0.050 117.26
 5.06 PlyA + 108736 108741    6                               1.05

 6.04 PlyA - 109287 109282    6                               1.05
 6.03 Term - 114889 114746  144  1  0   75   38    88 0.881   0.41
 6.02 Intr - 115228 115061  168  1  0  103   86   232 0.903  24.64
 6.01 Init - 121714 121712    3  1  0  113   81     0 0.578   1.80
 6.00 Prom - 135255 135216   40                              -5.16

 7.00 Prom + 135474 135513   40                              -2.66
 7.01 Init + 149840 149842    3  1  0   90   64     0 0.142  -2.20
 7.02 Intr + 151654 151746   93  0  0   89   98    34 0.405   4.66
 7.03 Intr + 159458 159497   40  1  1   83   92     1 0.351  -2.20
 7.04 Intr + 159833 160275  443  0  2  112  -48   712 0.999  53.17
 7.05 Term + 160355 160918  564  1  0   70   35   890 0.984  76.29
 7.06 PlyA + 162926 162931    6                               1.05

 8.05 PlyA - 163491 163486    6                              -0.45
 8.04 Term - 164065 163953  113  2  2   79   49   195 0.571  13.42
 8.03 Intr - 164937 164602  336  1  0   71    0   221 0.496   7.09
 8.02 Intr - 169639 169544   96  2  0   80   99    67 0.984   6.98
 8.01 Init - 170562 170331  232  0  1   59   68   340 0.972  27.52
 8.00 Prom - 171641 171602   40                              -7.46

 9.07 PlyA - 171775 171770    6                               1.05
 9.06 Term - 174057 173912  146  1  2   -2   44   121 0.270  -3.23
 9.05 Intr - 175203 175162   42  1  0  121  109    40 0.660   7.61
 9.04 Intr - 179403 179269  135  1  0   84  115    24 0.621   5.24
 9.03 Intr - 179601 179575   27  1  0  101   86    12 0.516   0.39
 9.02 Intr - 187034 186961   74  1  2  123   88    84 0.941  10.95
 9.01 Intr - 187258 187091  168  0  0   34   72   120 0.844   4.06

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 145767 145603 165 0 0 77 47 130 0.831 2.48 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:4957287_5165598|GENSCAN_predicted_peptide_1|240_aa MVTICFKRAEKEAKRNTPGPPSRASPQRPAERSDALAAPSRVPGDSRQVSRNNTRSRSLW CSRPAPPPPPPQLNGGVTQGGRLASQTRPEARARPAADTHSPLNSPASQAGSATFATTPL TPPKLQLRAPWQPLHSFRPCPGVTSRKRTARALPATRALSPGPDSHGSAAMFAPRLLDFQ KTKYASGLRRAQTRLLGAARGGLFGGGTRLSSRSPFPLCHLGSPAFVNRGAWCVLRLQWV >gi568815591f:4957287_5165598|GENSCAN_predicted_CDS_1|720_bp atggtgaccatctgtttcaaaagagctgaaaaggaagcaaagaggaacacgcccggaccc ccgagccgggccagcccccagcgccccgcagaaaggtctgacgcacttgcggcgccctcg cgagtgccgggtgacagtcgccaagtttcacggaacaacacgcgctccagatctctgtgg tgctcacggcccgcgcccccgcccccacccccgcagctcaacggaggcgtcacgcaggga ggccgcctcgcctcacagacccgcccagaagctcgagcgcgccccgcagccgacactcac tcgccactcaactcgccggctagccaggcgggttcggcgaccttcgctaccacgccgctt actcctccaaagctgcagctacgagctccgtggcagccgctgcattccttccggccctgc cccggcgtcacctcgcgcaagcgtacagcgcgcgctctgccagctacccgcgctctgagc ccggggccagattcccatggaagcgccgcgatgttcgccccccggctgctggatttccag aagacgaaatacgcgagtggtcttcggcgagcacagacccgtctgctgggagcagcccgc gggggcctctttggagggggcaccaggctgtcctccaggtcacctttccccctctgccac ctggggtcccctgccttcgtgaacagaggggcctggtgtgtgctgaggctccagtgggtg >gi568815591f:4957287_5165598|GENSCAN_predicted_peptide_2|214_aa MADFKVLSSQDIMWALHELKGHYAITRKDFESHQHMENLTSQDEVPRDTRCMQERGQGLL VWQQEEPSEFDLAYANFLSLDISMLRLFETLETAPQLTLVLAIMLQSGWAEYYQWVGICT SFLGISWALLDYHQALHTCLPSKPLLGLGSSVIYVLWNLLLLWPRVLAVALFSALFPNRP VEQNKERGNKAKYYSQQIIDKMCLEEGDNCLLDC >gi568815591f:4957287_5165598|GENSCAN_predicted_CDS_2|645_bp atggccgacttcaaagtgctcagtagtcaggacatcatgtgggccctgcacgagctcaaa ggacactatgcaatcacccgaaaggactttgagagtcatcagcatatggaaaatctcacc agccaggatgaggtacccagagataccaggtgcatgcaggagcgggggcaggggctgctg gtgtggcagcaggaggagccctctgagtttgacttggcctacgccaatttcctctccctg gatatcagcatgctgcggctctttgagaccttggagacggcaccacagctcacgctggtg ctggccatcatgctgcagagtggctgggctgagtactaccagtgggttggcatctgcaca 
tccttcctgggcatctcgtgggcactgctcgattaccaccaggccttgcacacctgcctc ccctccaagcccctcctgggcctgggctcctctgtgatctacgtcctgtggaacctgctg ctactgtggccccgagtcctagctgtggccctgttctcagccctcttccccaacagacca gtggaacagaataaagagcgcggaaataaagccaaatactacagccaacagatcatcgac aagatgtgcctagaagaaggagacaactgcctcttggactgttga >gi568815591f:4957287_5165598|GENSCAN_predicted_peptide_3|67_aa MRERYRERKTRCQGSTIQKNLAREDWQTDINVMPMAPGGFRYLLVLTDIFTSYTGAFPCQ TENAGDH >gi568815591f:4957287_5165598|GENSCAN_predicted_CDS_3|204_bp atgagggaaagataccgagagaggaaaaccaggtgccaaggcagtacaattcagaagaac cttgccagggaagactggcaaacggacatcaacgtgatgcctatggctcctggaggattt agatacctcctggtgctcactgatatcttcaccagttacacgggggcttttccatgccag actgaaaatgcaggagatcactga >gi568815591f:4957287_5165598|GENSCAN_predicted_peptide_4|175_aa MTSHQPHLQDEEQSPQPSASGYPLQEVVDDDMSGPSEDPVVKRLLVWDKDLRVSDKYLLA MVIAYSSPASLFSWQYQRIHFFLALYLTNDMEEDSETPKQNIFYFLYGKNCSQIALSHKL WFQFFHSVRCRAWVFPEELEENAGPRGDADFHQELYSNANGRHQEEGEEPFVQII >gi568815591f:4957287_5165598|GENSCAN_predicted_CDS_4|528_bp atgaccagccatcaaccacacctccaggatgaggagcagagcccccagcccagcgcctcg gggtaccccctccaggaggtggtggatgatgacatgtcaggaccatcagaggatcctgtc gttaaaagactcctggtctgggacaaagatctgagggtgtcggacaaatatctcctggct atggtcatagcgtattctagcccggccagcctcttctcctggcaataccaacgcattcat ttcttcctggctctctacctgaccaatgacatggaggaggacagcgagacccccaaacaa aacatcttctacttcctgtacgggaagaactgctctcagatagccttgtcccacaagctt tggttccagttcttccattccgtgcgctgcagggcttgggttttcccggaggagttggag gagaatgctgggcccaggggagatgcggattttcatcaggaactttattccaatgctaat ggcaggcaccaggaagaaggagaggagccatttgtgcagatcatctag >gi568815591f:4957287_5165598|GENSCAN_predicted_peptide_5|732_aa MAAGLPATVSARFQEQQKMNTLQGPVSFKDVAVDFTQEEWQQLDPDEKITYRDVMLENYS HLVSVGYDTTKPNVIIKLEQGEEPWIMGGEFPCQHSPEAWRVDDLIERIQENEDKHSRQA ACINSKTLTEEKENTFSQIYMETSLVPSSIIAHNCVSCGKNLESISQLISSDGSYARTKP DECNECGKTYHGEKMCEFNQNGDTYSHNEENILQKISILEKPFEYNECMEALDNEAVFIA HKRAYIGEKPYEWNDSGPDFIQMSNFNAYQRSQMEMKPFECSECGKSFCKKSKFIIHQRA HTGEKPYECNVCGKSFSQKGTLTVHRRSHLEEKPYKCNECGKTFCQKLHLTQHLRTHSGE 
KPYECSECGKTFCQKTHLTLHQRNHSGERPYPCNECGKSFSRKSALSDHQRTHTGEKLYK CNECGKSYYRKSTLITHQRTHTGEKPYQCSECGKFFSRVSYLTIHYRSHLEEKPYECNEC GKTFNLNSAFIRHRKVHTEEKSHECSECGKFSQLYLTDHHTAHLEEKPYECNECGKTFLV NSAFDGHQPLPKGEKSYECNVCGKLFNELSYYTEHYRSHSEEKPYGCSECGKTFSHNSSL FRHQRVHTGEKPYECYECGKFFSQKSYLTIHHRIHSGEKPYECSKCGKVFSRMSNLTVHY RSHSGEKPYECNECGKVFSQKSYLTVHYRTHSGEKPYECNECGKKFHHRSAFNSHQRIHR RGNMNVLDVENL >gi568815591f:4957287_5165598|GENSCAN_predicted_CDS_5|2199_bp atggctgcaggtctaccagccacagtctctgcacgtttccaagagcagcagaaaatgaac acattgcaggggccagtgtcattcaaagatgtggctgtggatttcacccaggaggagtgg cagcagctggaccctgatgagaagataacttacagggatgtgatgttggagaactatagc catctagtttctgtgggatatgataccaccaagccaaacgtcatcattaagttggagcag ggagaggagccgtggataatgggaggtgaatttccatgtcaacatagtccagaagcctgg agagttgatgacctgatagagagaatccaagaaaacgaagacaaacattcaaggcaagct gcttgtatcaatagcaaaaccctgactgaagagaaagagaatacatttagtcaaatttac atggaaacaagccttgttccttcaagcataatagctcataattgtgtctcatgtggaaag aatttagaatctatttcgcaattaattagtagtgatggaagctatgctaggacaaaacct gatgagtgtaatgaatgtgggaaaacatatcatggagagaaaatgtgtgaatttaatcaa aatggggatacctattctcacaatgaagaaaatattcttcagaaaattagtattttggag aaaccctttgaatataatgaatgcatggaagccttagacaatgaggctgtttttattgct cataagagagcttacataggggagaagccctatgagtggaatgattctggaccagacttc atacagatgtcaaattttaatgcatatcagagatcacaaatggaaatgaagccctttgaa tgcagtgaatgtggaaaatccttctgtaaaaagtcaaaattcatcatccaccagagggct cacacaggagagaaaccttatgaatgtaatgtatgtgggaaatccttcagccaaaaggga accctcactgtacatcggagatcacacttagaggagaagccctataaatgtaatgaatgt gggaaaaccttttgtcagaagttacacctcactcaacacctaagaactcattcaggagag aaaccctacgaatgtagcgaatgtgggaaaaccttctgccaaaagacacatctcaccctg caccagaggaatcattcaggagagaggccctatccatgtaacgaatgtgggaaatccttc tcccgcaagtctgctctcagtgaccatcagagaactcacacgggagagaagctttataaa tgtaatgaatgtgggaaatcctactaccgaaagtctactctgattacacatcagagaaca cacacgggagagaagccctatcagtgtagcgagtgtgggaaattcttttctcgggtgtca tacctcactatacattatagaagtcatttagaagagaaaccctatgaatgtaatgaatgt ggcaaaaccttcaatttaaattcagccttcattagacatcggaaagtacacacagaagag 
aaatcccatgaatgtagtgaatgtggaaagttctctcagttgtatctcaccgaccatcat acagctcatttagaagagaaaccctatgaatgtaatgaatgtgggaaaaccttccttgta aattcagccttcgatgggcaccagccacttccaaaaggggagaaatcctatgaatgtaat gtatgtggaaagttattcaatgagttgtcatactatactgaacattatagaagtcattca gaagagaaaccttatggatgtagcgaatgtgggaaaaccttttcccataattcatccctc ttcagacatcaaagagtacacacaggcgagaaaccctatgaatgttacgaatgtggaaaa ttcttctctcagaaatcatatctcactatacatcatcgaattcattcaggagagaaaccc tatgaatgtagtaaatgtggaaaagtcttctctcggatgtcaaacctcactgtccactac agaagccattcaggagagaaaccctatgaatgtaatgaatgtgggaaagtcttttctcag aagtcatacctcactgtacactatagaactcattcaggagagaaaccctatgaatgtaac gagtgtgggaaaaaattccaccacagatcagccttcaatagccatcagagaattcataga agaggaaatatgaacgtacttgatgtggaaaatctctga >gi568815591f:4957287_5165598|GENSCAN_predicted_peptide_6|104_aa MHGAVETLEQVLKAFLGVSIRVGSGFGGGDRGGREQQQQQQQRGPHGLRAGRLSGDKGLL DPDSSAPELPWSSDASTSLSHALEPEPRPPTVNASPKPLRMTTP >gi568815591f:4957287_5165598|GENSCAN_predicted_CDS_6|315_bp atgcatggggcggttgaaaccttggagcaggttctgaaggccttcctgggtgttagcatc cgggtggggagcggctttggcggtggggaccggggcggccgggagcagcagcagcagcag caacagcgggggccacatggcctgcgggcagggcggctgtcgggggacaagggattgtta gatccggacagttctgccccagagctgccctggagctcggacgcctccacgagcctgagc cacgccctcgagcccgagccacgcccgcccactgttaacgcctcacccaaacccctacgc atgaccacgccctag >gi568815591f:4957287_5165598|GENSCAN_predicted_peptide_7|380_aa MAKTKARLLPQGTGKFDLGALSVATALEFLIKAGNQDACLPRYGRDPNASCWPESNGTAL QEFVILGFSVWPPGLRVLLFVLFLPLYLVTLAGNLLILGLALVDPALHSPMYFFLGALSA AQAAYTLVLTPRMLAGFLLPSRGQAVDPSTCAAQMGLFVALGGSECLLLAAMALDRYLAI CHPLCYPRLMTMDLPFCRGGLVNHVFCDLPAVLVLACGCRALQERVLLVACLLLLVLPLL LILLSYTRVLVVILGVGGVAGRRKAFNTVASHLTVAVLHYGCATAMYARPLNSRSLEEDK LVSLIYINVTPLLYPAIYTLRNRDMQEALQCMVGQRTLGMATRWILPDAGCQAVSVLRFL PLRGISPFWSHLSLPNAYGV >gi568815591f:4957287_5165598|GENSCAN_predicted_CDS_7|1143_bp atggcgaagacaaaagctaggctcctgcctcagggaactggaaaatttgatcttggggca ctctcagtggccacagcccttgagttcctcatcaaggctgggaaccaggatgcctgcctt cctaggtatggacgagaccccaacgcgagctgctggcctgagtccaatgggacagccctg 
caggagttcgtgatcctgggcttctccgtgtggcccccggggctccgcgtgctgctcttc gtcctcttcttgccgctctacctggtcaccctggcggggaacctactgatcctgggcctg gccctggtggaccccgccctgcactcgccaatgtacttcttcctgggggcgctgtccgcg gcgcaggcagcctacacgctggtgctcacgccacgcatgctggccggcttcctcctgccc tccaggggccaggctgtggacccctctacctgcgccgcccagatgggcctcttcgtggcc ctggggggctccgagtgcctgctgctggctgccatggccctagaccgctacctggctatc tgccatccactctgctaccctcggctcatgaccatggacctgcccttctgccgcggcggc ctggtcaaccacgtcttctgcgacctcccggccgtgctggtgctggcctgcgggtgccgg gccctgcaggagcgcgtcctcctggtggcctgcctgctgctgctggtgctacccctgctc ctcatcctgctctcctacacccgggtgctggtggtcatcctgggtgttgggggggtcgcg ggccgccgcaaggccttcaacacggtggcgtcccacctcaccgtggctgtgctccactac ggctgtgccacggccatgtatgccaggcccctgaacagccgttcccttgaggaggacaag ctggtctcgctcatctatatcaacgtcaccccgctgctgtacccggccatctacacgctg cggaaccgggacatgcaggaggccctgcaatgcatggtcggccagaggacgctggggatg gccaccaggtggattttgcctgatgctgggtgccaggcggtgtctgtcctgagatttctc ccactgaggggaatctcaccattctggagccatctaagcctccccaacgcctacggggtg taa >gi568815591f:4957287_5165598|GENSCAN_predicted_peptide_8|258_aa MGKADVGRFKINLLYAFAMVPSANHSTLISHVLFQGSLSFGDVAVGFTRKEWQQLDLEQR TLYQDVMLEIYSHLLSVGCQVSKPAVISSLEQGKEPWMEEEEIRTWSFPEEVWQVATQPD SQQQHEDQHLSHTFLDKKDWTGNELHECNELGKKLHQNPNLLPSKQQVRTRDLCRKSLMC NLDFTPNAYLARRRFQCDGHGNFSIRNLKLHLQERIHAEVTSGLAGKKPYECSDCGKTSR KANLIRHHGIHTREKLHG >gi568815591f:4957287_5165598|GENSCAN_predicted_CDS_8|777_bp atggggaaagctgacgtcggaaggtttaagatcaatctcctctacgcctttgccatggta cccagtgccaaccactccacattaatcagccatgtgctgttccaggggtcactgtcattc ggggacgtggctgtgggcttcacccggaaagagtggcagcagctggacctggagcagagg accctgtaccaggatgtgatgctggagatctacagccacctgctctctgtggggtgtcaa gtcagcaaaccagctgtgatctccagtttggagcaggggaaggagccatggatggaggag gaagagataaggacgtggagcttcccagaagaagtttggcaagttgctacccagccagat agccaacagcaacacgaagaccaacatttgagccatacgtttctagacaagaaagactgg accggaaatgagcttcatgaatgtaacgaacttggaaaaaaactccatcagaacccaaac ctccttccatcaaaacagcaggtccgcacacgtgacttgtgcagaaagagtttgatgtgt aacctggacttcactcctaacgcctacctggcgaggaggagatttcagtgcgacggccac 
ggaaacttctctattcgaaacttgaaactccaccttcaggagcgaatccacgcggaggtc accagcggactcgcggggaagaaaccgtacgagtgctccgactgcgggaagacctcccgg aaggcgaacctcatccgccaccacgggatccacacgagggagaagctgcatgggtga >gi568815591f:4957287_5165598|GENSCAN_predicted_peptide_9|197_aa XHVWEERRLGSRAVRSASVAGTHLLQSRHCTGVLGAAPPDVSAGCRVLPSWAGGRHSSAS SDPTVPWKLAQGTCHCDHLDADWFLSISGKGFPGVFAKAQDRTLSARWVPPLSIASLFLP WSREPQRLKLESPPEGVCFSRIEKKNYGPGLQRQPCAVPSDLDVTLHYLEERMNELRVLE TACHVYRHTTLNVPDLI >gi568815591f:4957287_5165598|GENSCAN_predicted_CDS_9|594_bp nnacacgtgtgggaggaacggcgtcttggctcccgggctgtgcgcagcgccagcgtggcc ggcacccacttgctgcagagccggcactgcactggggtcctgggggcggctcctccagac gtctccgcgggttgtcgggtgctgccaagttgggctgggggacgccacagctctgcctcc tcggaccccaccgtcccctggaagctcgcgcagggcacctgccactgtgaccacttggac gcggattggtttctctctatcagcggtaaaggttttcctggtgtctttgcaaaagctcag gatcggactttgtcagcaaggtgggtccctcctctcagcatagcaagcttgtttctaccc tggtcccgagaacctcagaggctgaagcttgagtccccacctgaaggtgtctgcttttcc agaatagagaagaagaactatggcccaggtctacagagacagccttgtgcggtgccaagt gacctggatgtaacactgcattatctagaggaacgaatgaatgaactgcgtgttcttgaa acagcctgccatgtctaccgccacaccaccctgaacgtgcctgatctcatctga