GENSCAN 1.0	Date run: 3-Nov-116	Time: 11:17:49

Sequence gi568815587f:68437665_68713114 : 275450 bp : 45.58% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    782   1018  237  0  0  109  110   373 0.999  39.19
 1.02 Intr +   2113   2252  140  2  2  102   93   174 0.999  19.48
 1.03 Intr +   2969   2983   15  1  0  100  113    11 0.384   0.54
 1.04 Intr +   6953   7106  154  1  1   32   29    87 0.432  -3.15
 1.05 Intr +   7909   8038  130  2  1  114   57    84 0.504   7.65
 1.06 Intr +   8081   8187  107  2  2  129    3    41 0.203  -0.44
 1.07 Intr +  10745  10832   88  0  1   40  108    68 0.430   3.03
 1.08 Term +  11145  11406  262  0  1  107   43   399 0.997  32.30
 1.09 PlyA +  11494  11499    6                               1.05

 2.07 PlyA -  15856  15851    6                               1.05
 2.06 Term -  23126  23058   69  2  0   36   44   138 0.155   2.14
 2.05 Intr -  33916  33793  124  0  1   98   15    86 0.120   2.89
 2.04 Intr -  35834  35746   89  2  2   50  116     4 0.169  -1.83
 2.03 Intr -  51070  50933  138  1  0   13   42   137 0.003   2.26
 2.02 Intr -  77785  77625  161  2  2   97   53    58 0.188   2.91
 2.01 Init -  89032  88930  103  1  1   81   78    76 0.354   6.30
 2.00 Prom -  91324  91285   40                              -5.96

 3.00 Prom +  91978  92017   40                              -5.56
 3.01 Init +  96425  96460   36  1  0  102  113    14 0.549   5.33
 3.02 Intr +  99995 100227  233  1  2   51   71   127 0.165   3.87
 3.03 Intr + 105650 105734   85  2  1   50   62    50 0.304  -1.58
 3.04 Intr + 105816 105924  109  2  1  114   89     7 0.265   3.26
 3.05 Intr + 105983 106088  106  0  1   92   47    28 0.251  -1.63
 3.06 Intr + 107174 107360  187  2  1  101   99   -20 0.249   0.09
 3.07 Intr + 110403 110540  138  2  0  100   82    32 0.925   4.46
 3.08 Intr + 113457 113522   66  2  0   77   95    77 0.972   6.40
 3.09 Intr + 116481 116593  113  2  2   78   53    44 0.582  -0.92
 3.10 Intr + 120902 121015  114  2  0   85   85    26 0.638   1.56
 3.11 Intr + 122059 122304  246  1  0  103   64   216 0.338  17.27
 3.12 Intr + 126639 126768  130  0  1   84   97     7 0.391   1.90
 3.13 Intr + 129350 129502  153  1  0   80   77    55 0.662   3.87
 3.14 Intr + 132084 132233  150  2  0   82   77    27 0.493   1.36
 3.15 Intr + 133376 133440   65  1  2   92   98     7 0.434  -0.38
 3.16 Intr + 136445 136560  116  2  2   75  115    50 0.675   6.49
 3.17 Intr + 142115 142179   65  0  2   77   89    21 0.376  -0.46
 3.18 Intr + 145379 145465   87  1  0   51   89    45 0.506   1.07
 3.19 Intr + 150263 150360   98  1  2   32   89   163 0.536   9.61
 3.20 Intr + 152996 153050   55  2  1   93   84    55 0.990   4.58
 3.21 Intr + 153912 154042  131  2  2   73   67   102 0.996   6.09
 3.22 Intr + 158433 158554  122  0  2   84  111   146 0.877  16.64
 3.23 Intr + 162677 162830  154  0  1   74   83    54 0.847   2.63
 3.24 Intr + 164199 164305  107  0  2   66  100    63 0.772   5.16
 3.25 Intr + 165678 165828  151  1  1   99   86   191 0.598  19.22
 3.26 Term + 171589 171706  118  1  1   96   54    21 0.055  -2.49
 3.27 PlyA + 173135 173140    6                               1.05

 4.00 Prom + 181513 181552   40                              -5.36
 4.01 Init + 185626 185683   58  0  1   42  107    36 0.821   2.37
 4.02 Intr + 187222 187355  134  2  2  115   80   106 0.951  12.86
 4.03 Term + 198311 198517  207  1  0   58   48   132 0.598   3.54
 4.04 PlyA + 200928 200933    6                               1.05

 5.00 Prom + 201642 201681   40                              -5.56
 5.01 Init + 202502 202555   54  1  0   63   63    52 0.310   1.48
 5.02 Intr + 203548 203670  123  0  0   96   60    18 0.097   0.58
 5.03 Term + 212244 212771  528  2  0   87   44   295 0.080  19.25
 5.04 PlyA + 213176 213181    6                               1.05

 6.00 Prom + 222884 222923   40                              -4.76
 6.01 Init + 230342 230537  196  1  1   90   77   154 0.978  13.66
 6.02 Intr + 236506 236621  116  2  2   56   46    95 0.311   2.27
 6.03 Intr + 238292 238320   29  1  2  133   92     1 0.381   1.91
 6.04 Term + 240029 240134  106  2  1   69   55    74 0.395   0.08
 6.05 PlyA + 240532 240537    6                               1.05

 7.00 Prom + 240815 240854   40                              -1.16
 7.01 Init + 247260 247340   81  2  0   56  131   195 0.880  19.57
 7.02 Intr + 247930 247984   55  0  1  114  113    84 0.998  12.05
 7.03 Intr + 250350 250436   87  1  0   45   94   149 0.427  11.14
 7.04 Intr + 251185 251262   78  2  0   99   95    40 0.975   5.32
 7.05 Intr + 254645 254726   82  0  1  126   56    26 0.156   1.90
 7.06 Intr + 255999 256032   34  0  1  106  100    17 0.051   2.93
 7.07 Term + 271698 271841  144  2  0  126   43   119 0.805   9.11
 7.08 PlyA + 271969 271974    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:68437665_68713114|GENSCAN_predicted_peptide_1|377_aa XITKPPSDDSPAHSSAIGPVIGIILSLFVMGGVYFVCQRVVCQRYAGANGPFPHEYVSGT PHVPLNFIAPGGSQHGPFTGIACGKSMMSSVSLMGGRGGVPLYDRNHVTGASSSSSSSTK ATLYPPLYAPKGLAVVHGTSETLPRRCTGGSCQKSAQCRFGQNLKLPWDVGEKPGLVKFM GRGARIWLYAFQDGLGDPHKPYTLWEAACWVGVPSVALVEGADLCGAGCVLPGPGGHVHV VNLSIYYPRFGAGASTGPCEGGRISCILHFRNGYYFIPTFINEKLKLEEVKPYIIRGMAP PTTPCSTDVCDSDYSASRWKASKYYLDLNSDSDPYPPPPTPHSQYLSAEDSCPPSPATER SYFHLFPPPPSPCTDSS >gi568815587f:68437665_68713114|GENSCAN_predicted_CDS_1|1134_bp naaatcaccaagccgccctcagacgacagcccggcccacagcagtgccatcgggcccgtc attggcatcatcctctctctcttcgtcatgggtggtgtctattttgtgtgccagcgcgtg gtgtgccagcgctatgcgggggccaacgggcccttcccgcacgagtatgtcagcgggacc ccgcacgtgcccctcaatttcatagccccgggcggttcccagcatggccccttcacaggc atcgcatgcggaaagtccatgatgagctccgtgagcctgatggggggccggggcggggtg cccctctacgaccggaaccacgtcacaggggcctcgtccagcagctcgtccagcacgaag gccacgctgtacccgccgctgtatgcccctaagggcctggccgtggttcacgggacatct gagacattgccgaggcgctgcactggtggatcttgccagaagtctgcccagtgcagattt gggcagaatctcaaactgccttgggatgtaggagagaaaccaggcctggtcaagttcatg ggaagaggggctcggatctggctgtatgctttccaggatggccttggagacccacataag ccctacaccctttgggaagctgcatgttgggttggggtgccgtcagtggcacttgtggaa ggtgcagacctgtgtggggctggttgtgtgctgcctggacctggggggcacgttcacgtg gtgaatttgtctatttactatccccgctttggggctggtgccagcacaggcccttgtgaa gggggcagaatctcatgtatccttcactttcgaaatgggtactatttcatccccactttt atcaatgagaaactaaagctcgaagaggtcaagccctacatcattcgaggaatggcgccc ccgacgacgccctgcagcaccgacgtgtgtgacagcgactacagcgccagccgctggaag gccagcaagtactacctggatttgaactcggactcagacccctatccacccccacccacg ccccacagccagtacctgtcggcggaggacagctgcccgccctcgcccgccaccgagagg agctacttccatctcttcccgccccctccgtccccctgcacggactcatcctga >gi568815587f:68437665_68713114|GENSCAN_predicted_peptide_2|227_aa MVVEFLPVVFLRDEGNRGEARNTQLVPDARVHAGGLLPPYRLTPPAPGSLSQPSPVVLCC VCLVETLRMFWMELPSPRQAGPVPGLNKAIMINQDTPYNYPVPLLHDENMPDVSSHHQDP 
PGALGPKLGVAEKISGLQNLGKQVYQFIINNITKDTDGEIRMARLFLHHHVGTALKIVDE FHAARAMRRVLVLISGDLTAAALAEAGNALTDTISEAPKMADVSSRP >gi568815587f:68437665_68713114|GENSCAN_predicted_CDS_2|684_bp atggtagtcgagtttttgccagttgtcttcttaagagatgaaggaaaccgaggagaggca aggaacacccagctggttccagatgcacgtgtgcatgctggaggattgctgcccccctac cgcctcacccctccagctcctggttctttgtcccagccatctcctgtggtactgtgctgt gtctgccttgtggagaccctgaggatgttctggatggagttaccaagcccccgacaggct ggtccagttcctggcctcaacaaagccatcatgatcaatcaggacacaccctacaactac ccagtgccccttctacatgatgagaacatgcctgatgtgtccagccaccatcaggacccc ccaggtgctctagggcccaagcttggagtggctgagaaaattagtggcttacagaactta gggaaacaagtttaccagtttatcataaacaatattacaaaggatacagatggagagata cgcatggcaaggcttttcctgcaccaccatgtgggaacagctctcaagatcgtcgatgaa ttccacgctgccagagccatgagacgagtcttggtccttatctcaggtgacctcacggca gcagcactggcagaggcgggaaacgcgctcaccgacaccatcagcgaagcgcccaaaatg gcggacgtatcaagcaggccttga >gi568815587f:68437665_68713114|GENSCAN_predicted_peptide_3|1044_aa MEETLSCGHTYTTSMFWKFDLHSSSHIDTLLEREDVTLKELMDEEDVLQECKAQNRKLIE FLLKAECLEDLVSFIIEEPPQDMDEKIRYKGLYSGILKPMLRKLREETLLPKDPYICQAS HQSLSSAVAFWFVTSGLFVPGSCCRQHLLALDAPATFGYFAISLSCSAFELATPDDLLGR WVLAVQQGGVYPNISCELLTSDVSQMNDRLGEDESLLMKLYSFLLNDSPLNPLLASFFSK VLSILISRKPEQIVDFLKKKHDFVDLIIKHIGTSAIMDLLLRLLTCIEPPQPRQDVLNWL NEEKIIQRLVEIVHPSQEEDRHSNASQSLCEIVRLSRDQMLQIQNSTEPDPLLATLEKQE IIEQLLSNIFHKEKNESAIVSAIQILLTLLETRRPTKISTIFLPQGYSALFPPQGYSTLF PPQRYSTLFPPQRYGALFPPQRYGALFPPQRYGALFPPQRYGALFSPQRYGALFPPQRFE GHIEICPPGMSHSACSVNKSVLEAIRGRLGSFHELLLEPPKKSVMKTTWGVLDPPVGNTR LNVIRLISSLLQTNTSSINGDLMELNSIGVILNMFFKYTWNNFLHTQVEICIALILASPF ENTENATITDQDSTGDNLLLKHLFQKCQLIERILEAWEMNEKKQAEGGRRHGYMGHLTRI ANCIVHSTDKGPNSALVQQLIKDTQILITMLQLPQYSVQYCAVQVTTCHIHSSSDDEIDF KETGFSQDSSLQQAFSDYQMQQMTSNFIDQFGFNDEKFADQDDIGNVSFDRVSDINFTLN TNESGNIALFEACCKERIQQFDDGGSDEEDIWEEKHIAFTPESQRRSSSGSTDSEESTDS EEEDGAKQDLFEPSSANTEDKMEVDLSEPPNWSANFDVPMETTHGAPLDSVGSDVWSTEE PMPTKETGWASFSEFTSSLSTKDSLRSNSPVEMETSTEPMDPLTPSAAALAVQPEAAGSV AMEASSDGEEDAESTDKVTETVMNGGMKETLSLTVDAKTETAVFKRISSWPPLLPLAFVL LTHCSRATRGAVKSDAVNTVTSCD 
>gi568815587f:68437665_68713114|GENSCAN_predicted_CDS_3|3135_bp atggaggaaactctgtcatgtggccacacctacacgaccagcatgttttggaaatttgat cttcactcatcatcccacatagacacacttctagaaagagaagatgtaacactgaaggag ttaatggatgaggaagatgttttacaggaatgtaaagctcagaaccgcaaacttatagag tttctgttaaaagcagaatgtctcgaagatttagtctcattcattatagaagaaccacct caagacatggatgaaaagatcagatacaagggtttatattctggcatcttgaagcccatg ctacgtaaattgagggaggagactttacttcccaaagacccatacatctgccaggcctca caccagagcctgtcaagtgccgtggctttctggttcgtcaccagtggtctttttgtgcca gggtcttgctgcaggcagcacttgcttgccttggatgccccagccacgtttggctatttt gccatctctctgtcttgctcagcttttgagcttgccactcctgatgacttgttggggcgc tgggtcctggctgtccagcagggaggagtgtatccaaatatatcttgtgagttgctcact tctgatgtctcccagatgaatgatagactgggagaagatgaatccttgctaatgaaatta tatagcttcctcctaaacgattcccctttgaatccactacttgccagtttcttcagcaag gtgctaagtattcttatcagcagaaaaccagaacagattgtggatttcttaaagaagaag catgattttgtagaccttattataaagcacataggaacttctgctatcatggatttgttg ctcaggctcctgacgtgtatcgaacctccacagcccaggcaagatgtgctgaattggtta aatgaggagaaaattatccagaggcttgtggaaatagttcatccatcgcaagaagaagat cgacattcaaatgcatcacaatcactttgtgaaattgttcgcctgagcagagaccagatg ttacaaattcagaacagtacagagcccgaccccctgcttgccactctagaaaagcaagaa attatagagcagcttctatcaaatattttccacaaggagaaaaatgagtcagccatagtc agtgcaatccagatattgctgactttacttgagacacgacgaccaacgaaaatcagtacc atcttcctaccccagggatacagtgccctcttcccaccccaggggtacagtaccctcttc ccaccccagcggtacagtaccctcttcccaccccagcggtacggtgccctcttcccaccc cagcggtacggtgccctcttcccaccccagcggtacggtgccctcttcccaccccagcgg tacggtgccctcttctcaccccagcggtacggtgccctcttcccaccccagcgatttgaa ggccatatagagatctgcccaccaggcatgagccattcagcttgttcagtaaacaagagt gttctagaagccatcagaggaagacttggatcttttcatgaactcctgctggagccaccc aagaaaagtgtgatgaagaccacatggggtgtgctggatcctcctgtggggaatacccgg ttgaatgtcattaggttgatatccagcctgcttcaaaccaataccagcagtataaatggg gaccttatggagctgaatagcattggagtcatattgaacatgttcttcaagtatacatgg aataactttttgcatacacaagtggaaatttgtattgcactgattcttgcaagtcctttt gaaaacacagaaaatgccacaattaccgatcaagactccactggtgataatttgttatta 
aaacatcttttccaaaaatgtcaattaatagaacgaatacttgaagcctgggaaatgaat gagaagaaacaggctgagggaggaagacggcatggttacatgggacacctaacgaggata gctaactgtatcgtgcacagcactgacaagggccccaacagtgcattagtgcagcagctt atcaaagatacacaaatacttatcaccatgctccagttgcctcagtattcagtgcagtac tgtgctgtgcaggttacaacctgccatattcattcatccagtgatgatgaaattgacttt aaagaaacgggtttctcacaggattcttctttgcagcaagccttttctgattatcagatg caacaaatgacgtccaattttattgaccagtttggcttcaacgatgagaagtttgcagat caagatgacattggcaatgtttcttttgatcgagtatcagacatcaactttactctcaat acaaatgaaagtggaaatattgccttgtttgaagcatgttgtaaggaaagaatacaacag tttgatgatggtggctctgatgaggaagatatatgggaggaaaagcacatcgcattcaca ccagaatcccaaagacgatccagctcggggagtacagacagtgaggaaagtacagactct gaagaagaagatggagcaaagcaagacttgtttgaacccagcagtgccaacacggaggat aaaatggaggtggacctgagtgaaccacccaactggtcagctaactttgatgtcccaatg gaaacaacccacggtgctccattggattctgtgggatctgatgtctggagcacagaggag ccgatgccaactaaagagacgggctgggcttctttttcagagttcacgtcttccctgagc acaaaagattctttaaggagtaattctccagtggaaatggaaaccagcactgaacccatg gaccctctgactcccagtgcggctgccctggcagtgcagccagaagcggcaggcagtgtg gccatggaagccagctctgacggagaggaggatgcagaaagtacagacaaggtaactgag acagtgatgaatggcggcatgaaggaaacgctcagcctcactgtagatgccaagacagag actgcggtcttcaaaaggatcagctcttggccccctcttctccctctggcgtttgtgctt ctcacacattgttcgagggcaactcgaggagccgtgaaatccgatgcagtcaacacggtg acctcatgtgactga >gi568815587f:68437665_68713114|GENSCAN_predicted_peptide_4|132_aa MTSGPQTNQPKKHLTNFKSDERERDFLKSNLTLITAGFTSQASRKALEQFPERIPNGTTR QIPQELATSARNLATRPRNAYSPEFLLSCIPSVRDPTGNRTVQLTWQPLPELLEVWPKAD CFPDLLGLAAED >gi568815587f:68437665_68713114|GENSCAN_predicted_CDS_4|399_bp atgacctcaggtcctcagaccaaccagcccaagaaacatctcaccaatttcaaatctgat gaacgggaaagagattttctcaagtccaatctcacgctgataaccgctggcttcacgagc caggcctccaggaaggcattagagcagtttcctgagaggatccccaatggaactaccagg caaattccccaggagcttgctacaagtgccagaaatctggccaccaggccaaggaatgcc tacagcccagaattcctcctaagctgcatcccatctgtgcgggaccccactggaaatcgg actgttcaactcacctggcagccactcccagagctcctggaagtctggcccaaggctgac tgcttcccagatcttcttggcttagcggctgaagactga 
>gi568815587f:68437665_68713114|GENSCAN_predicted_peptide_5|234_aa MDSRDPENSGSGTLNDGLSWLCKVSGSGSKGGVCLPEDRTTSCSFHQGTAGSFWPSQQAR GQFHSHRTLQASCPRLCPGRCFLPGPWSAPRTPPQIPATAALFGVSDGELSFLCPRSLQG SAPATVAPLSRLLCWMLGFSPGPSGALEPSSAPRPGPVMGHFRALDGGPAPWRRISHPWQ TPAIRILPFLPAPLFLRVAQKQGWSFRWLLLHTRPWGHRGPGDIMVPASMELTL >gi568815587f:68437665_68713114|GENSCAN_predicted_CDS_5|705_bp atggactcaagagacccggagaacagcggaagtgggactcttaatgatggtctttcttgg ctctgcaaggttagcggttctggttccaaaggaggtgtttgcttgccagaggacagaact acaagctgtagcttccaccagggtactgcaggctccttttggcccagccagcaagcccga ggacagtttcactcccaccggaccctccaagcctcttgcccacggctgtgtcctggccgc tgcttcctgcctgggccctggagtgctccgaggacccctccccaaatcccagccaccgcc gccctgtttggcgtctctgatggagagctgagcttcctgtgtcctcggagcctgcaggga tcagcacccgccacggtagctcccctttcccgcctgctctgctggatgctgggtttcagt ccaggtccttcgggcgccctggaaccctcttccgctccccgacctgggcctgtcatgggc cactttagggccctggatggcggccctgctccctggcgccgcatcagccatccgtggcag actcccgccatccggattctccctttcctgccggcacctctcttcttgcgagtggcccag aagcaggggtggtcgttccgctggctcctgctgcacaccaggccctggggacatcgtggc cctggggacatcatggtccctgcctccatggagctgacgctctag >gi568815587f:68437665_68713114|GENSCAN_predicted_peptide_6|148_aa MIPKLPRRRNTCLGITLNGTTCYPPSPPCTPHPPAVPSRPLGHRGLGFRKKTADESPVMD PIVPHVFSLPGDAAPLGGQLLGTGLPFGQLARQLCSRKWRALAMRGIGELGWVGLSYPEN PQLDESPRAQVDSRTPGCDPASDQQGGP >gi568815587f:68437665_68713114|GENSCAN_predicted_CDS_6|447_bp atgatccccaaactacctcgccggcgcaacacgtgcctgggcatcacgctgaacggcacg acctgctaccctccttccccgccctgcaccccccaccccccagccgtgccatcaaggcct ctgggccaccgaggacttggattcagaaagaaaacagcagatgaaagtcctgtaatggac cccattgtgccccatgtgttcagtctccctggtgatgcagccccactcggtggccagctg ctgggcacggggctgccgttcggacagcttgcaagacagctgtgctcccggaaatggagg gccctggcgatgaggggcattggggagttgggttgggttgggctgagctacccagaaaac ccccagctggatgagtcacccagggcacaggtggattcacggactcccggctgtgaccct gcctctgaccagcaaggaggtccctga >gi568815587f:68437665_68713114|GENSCAN_predicted_peptide_7|186_aa MARGSALLLASLLLAAALSASAGLWSPAKEKRGWTLNSAGYLLGPHAVGNHRSFSDKNGL 
TSKRELRPEDDMKPGSFDRSIPENNIMRTIIEFLSFLHLKGSTLTLGGLGSPALQTTPSP EVVTSSTWYKRSVAGGSQKAHVQPEKAKVLRTRLSNLGVLLSGQPSPSMPLGAVKEVLED KLNRIT >gi568815587f:68437665_68713114|GENSCAN_predicted_CDS_7|561_bp atggcccgaggcagcgccctcctgctcgcctccctcctcctcgccgcggccctttctgcc tctgcggggctctggtcgccggccaaggaaaaacgaggctggaccctgaacagcgcgggc tacctgctgggcccacatgccgttggcaaccacaggtcattcagcgacaagaatggcctc accagcaagcgggagctgcggcccgaagatgacatgaaaccaggaagctttgacaggtcc atacctgaaaacaatatcatgcgcacaatcattgagtttctgtctttcttgcatctcaaa ggctccacgctcacgcttggtggccttgggtcgccggcgcttcagaccactcccagcccg gaggtagtgacgtcatccacctggtataaacggagtgtggcaggtggcagccagaaggcc cacgtgcagccagagaaggccaaggttctgaggactcgcctgtcaaaccttggggtcctg ctgtcaggacagccttccccatcaatgccgcttggagctgtgaaagaggttctggaagac aagttgaaccgcatcacatag