GENSCAN 1.0    Date run: 6-Nov-116    Time: 16:06:21

Sequence gi568815590f:90970295_91184865 : 214571 bp : 36.41% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 Intr -    847    772   76  1  1   70   93   110 0.012   7.87
 1.05 Intr -   8358   8285   74  1  2   88   89   -16 0.009  -3.29
 1.04 Intr -  10167  10069   99  1  0   68  109    71 0.059   6.36
 1.03 Intr -  14733  14489  245  2  2    2   10   144 0.037  -5.78
 1.02 Intr -  14933  14820  114  1  0   36  123   111 0.928   8.04
 1.01 Init -  15785  15703   83  2  2   66   58    59 0.580   1.19
 1.00 Prom -  24120  24081   40                              -5.65

 2.12 PlyA -  24329  24324    6                               1.05
 2.11 Term -  25526  25383  144  2  0   86   43   112 0.907   3.43
 2.10 Intr -  26450  26360   91  1  1   73   74    54 0.511   1.58
 2.09 Intr -  45521  45443   79  0  1  115   64    74 0.277   5.39
 2.08 Intr -  48219  48137   83  2  2   72   82    60 0.561   2.16
 2.07 Intr -  49969  49863  107  1  2   74   99    29 0.974   0.69
 2.06 Intr -  51110  50962  149  0  2   71  127   135 0.966  14.73
 2.05 Intr -  55847  55751   97  2  1   43   56    50 0.146  -4.04
 2.04 Intr -  61171  61067  105  1  0   72   73    59 0.213   2.29
 2.03 Intr -  70181  70098   84  2  0   34   66   100 0.663   1.50
 2.02 Intr -  70475  70350  126  2  0   52   94    98 0.701   6.76
 2.01 Init -  70675  70523  153  1  0   89  -51   215 0.801   5.73
 2.00 Prom -  71038  70999   40                              -5.55

 3.00 Prom +  75380  75419   40                              -6.95
 3.01 Init +  78151  78470  320  0  2   71   51   258 0.276  17.25
 3.02 Term +  84349  84460  112  1  1   56   32   116 0.115  -0.15
 3.03 PlyA +  85405  85410    6                               1.05

 4.00 Prom +  87763  87802   40                              -5.85
 4.01 Init + 100091 100172   82  1  1   98  100   185 0.650  21.88
 4.02 Intr + 100844 100995  152  0  2   70   95   197 0.999  17.56
 4.03 Intr + 103537 103617   81  0  0  111   59    52 0.917   3.52
 4.04 Intr + 108062 108374  313  1  1   85   91   282 0.989  23.03
 4.05 Intr + 110375 110436   62  0  2   46  107    45 0.359  -0.27
 4.06 Intr + 121343 121525  183  1  0   24   67   120 0.031   2.56
 4.07 Intr + 132365 132550  186  1  0   62   94    95 0.024   6.46
 4.08 Intr + 150226 150317   92  0  2   29  103    88 0.074   2.27
 4.09 Term + 158790 158916  127  0  1   46   47   127 0.242   1.17
 4.10 PlyA + 159256 159261    6                               1.05

 5.05 PlyA - 159952 159947    6                               1.05
 5.04 Term - 172424 172264  161  0  2   60   44    96 0.684  -0.48
 5.03 Intr - 178100 177931  170  1  2   93   65    71 0.706   3.97
 5.02 Intr - 179594 179172  423  1  0    3   39   225 0.132   1.16
 5.01 Init - 181020 180386  635  0  2   47   72   364 0.344  25.56
 5.00 Prom - 181334 181295   40                              -7.15

 6.00 Prom + 185621 185660   40                              -5.35
 6.01 Sngl + 187403 188230  828  1  0   92   44   669 0.539  58.38
 6.02 PlyA + 188684 188689    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  14733  14325  409  2  1    2   32   201 0.822  -0.40
S.002 Sngl + 121237 121419  183  0  0  107   49   157 0.852   8.49

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590f:90970295_91184865|GENSCAN_predicted_peptide_1|231_aa
MVNPDSAPLKTVQCLPVHFQLSFWCLGTQRTSGRSRSPTGSRALGLRVWAERGDAAAPKS
PWILSRPDPGKVEARKRSSEVGREISQGPFSTLWTLNGSGMGARASDEGKESVGGQGSPT
EGQVAETKAIAKKGAKYTAPLGSAPAPVWNLKVFVTMETKKLIGKPLQPARPVRHLTSPP
GAVFPFNFQNEYPCNTQCIQSGVSRCKTNGMQAFSQGLNEQQQQQSPVKKX

>gi568815590f:90970295_91184865|GENSCAN_predicted_CDS_1|693_bp
atggtaaatcctgatagtgccccacttaaaaccgttcagtgtttgccagtgcattttcaa
ctgtccttttggtgcttggggacccagcggaccagcggcaggagccgttccccgacgggc
agcagggcgctcggcctccgcgtgtgggctgagcgcggcgacgctgctgccccgaaatcc
ccgtggattttgagtcggcccgaccccgggaaagttgaggcgaggaagagaagctctgag
gtgggtcgggaaatctctcagggccccttcagtaccctgtggaccctgaatgggagtggg
atgggagcgagagcgtcggatgaggggaaagaaagtgtagggggccaggggagcccgacg
gaggggcaagtggccgagaccaaagccatcgccaaaaaaggagctaagtacacagctccc
cttggctctgccccagcacctgtatggaacctcaaagtctttgtgacaatggaaaccaaa
aaattaattggtaaaccgcttcaaccagcaagacctgttcgtcatctgacttctccccca
ggagcagtgttccctttcaactttcaaaacgaatatccatgcaacactcagtgcatacaa
agtggagttagcagatgtaagacgaatggaatgcaagccttttctcaaggtcttaatgag
caacagcaacagcagtctccagttaagaaagnn

>gi568815590f:90970295_91184865|GENSCAN_predicted_peptide_2|405_aa
MPPSALPFPRLLVLHPPLPSPRFLPCDTSRALPEAAAAAAAAAAAAAEAAAAPPLPQSWL
LMGWTNARLCCQHPTPEMSLPPPHRTCKKAAPELLRRVFLVVVVMVVVVVVVMVVVVVVV
MGENLEDGMLQYDILLFERLSAKTDNPEEKQYREMINISWSRLTKPKCGLQSSCSSITKE
SKGGFGTEAELPPPYTAIASPDASGIPVINCRVCQSLINLDGKLHQHVVKCTVCNEATPI
KNPPTGKKYVRCPCNCLLICKDTSRRIGCPRPNCRRIINLGPVMLISEEQPAQPALPIQP
EESSKVLANQDKYRMRSPVPANRNQTQHSSVGSALPRRRCCAYITIGMICIFIGVGLTVG
TPDFARRFRATYVSWAIAYLLGLICLIRACYWGAIRVSYPEHSFA

>gi568815590f:90970295_91184865|GENSCAN_predicted_CDS_2|1218_bp
atgcctcccagcgcgcttcccttcccccgcctcctggtcctgcaccctcccctcccctcc
ccccgcttcctgccttgtgacacatcccgggccctcccggaggcggcagcagcagcagct
gcagcggcagcagcggcagcagaggcagcagcagccccgccgctgccgcagtcatggctg
ctgatggggtggacgaacgctcgcctctgctgtcagcatcccactccggaaatgtcactc
ccaccgccccaccgtacttgcaagaaagcagccccagagcttctccgcagggtttttctg
gtggtggtggtgatggtggtggtggtggtggtggtgatggtggtggtggtggtggtggtg
atgggggagaatttggaagacggcatgttacaatatgacattttattgtttgagagatta
agtgctaagactgacaatccagaggagaaacaatatagagaaatgataaacatatcctgg
agccggctgacaaaaccaaaatgtggtttgcagagttcctgctccagcatcacgaaagag
agtaaaggtggatttggaactgaggcggagctcccacctccatatacagccattgccagt
ccagacgccagtggtattccagtaataaactgccgtgtgtgccaatcactaatcaatttg
gatggcaagcttcaccagcatgtggttaagtgcacagtttgcaatgaagctacgccaatc
aaaaaccccccaacaggcaagaaatatgttagatgcccttgtaattgtcttctcatttgt
aaggacacatctcggcgaataggatgcccaagacccaactgtagacggataattaacctt
ggcccagtaatgcttatttctgaagaacaaccagctcagcctgcattgccaatccaacca
gaagaaagttctaaggtgttagccaatcaagacaaatacagaatgcgaagtcctgttcca
gccaatagaaaccagacacagcactcctcagtgggtagtgcacttccacgaagacgctgc
tgtgcatatattaccattggaatgatatgtattttcattggagttgggttaactgttggc
accccagattttgcaaggcgatttcgagcaacctatgtttcttgggcaattgcttatctc
ctaggattgatctgccttatccgagcttgttattggggagccataagagtcagttatcca
gaacacagttttgcataa

>gi568815590f:90970295_91184865|GENSCAN_predicted_peptide_3|143_aa
MEYTSKCKGLGGLDEVKQDAVDWEGFEITHLCVSSINPVFECLPLRGHCQWTVIIGPAIN
VKHRKEDFTNGPMNLSDSSLCPLTGEARISKREEEAENQRQNILSPIGCCNRALSCKQRM
TLTGHRFGWHLDLGLYSIQNCEK

>gi568815590f:90970295_91184865|GENSCAN_predicted_CDS_3|432_bp
atggagtataccagtaaatgcaagggccttggagggttagatgaggtgaagcaggacgct
gtggattgggaaggctttgaaataactcacctttgtgtctcctctataaatccagtattt
gagtgcctgccattgagagggcattgccaatggacagtcattattggcccagcaattaat
gtcaaacacaggaaagaagacttcactaatggcccaatgaatctttctgattcatccttg
tgtccgctcactggagaagctagaatatccaagagggaagaagaagctgagaaccagagg
caaaatatattgtccccaataggatgctgtaacagggcgctatcttgcaagcagagaatg
accctcactgggcaccgatttggctggcaccttgatcttggactttacagcatccaaaac
tgtgagaaataa

>gi568815590f:90970295_91184865|GENSCAN_predicted_peptide_4|425_aa
MEAVLTEELDEEEQLLRRHRKEKKELQAKIQGMKNAVPKNDKKRRKQLTEDVAKLEKEME
QKHREELEQLKLTTKENKIDSVAVNISNLVLENQPPRISKAQKRREKKAALEKEREERIA
EAEIENLTGARHMESEKLAQILAARQLEIKQIPSDGHCMYKAIEDQLKEKDCALTVVALR
SQTAEYMQSHVEDFLPFLTNPNTGDMYTPEEFQKYCEDIVNTAAWGGQLETSFSVGTLGL
TLVICQGLLGLQLKTEGCTVGFPTFEVLGLGLASLLLILQMAYCGTSPYDRIMTERLLIK
ALSGGKNTKIITLNGKKMTKMPSALGKLPGLKTLVLQNNLIPKVCPELCNLTQHRGKGVG
LLCPGICHGVAAVWKDGAEAQPSRISVHGLGVKFGQVRVLHLNSSVMDSAAQAQEEDSMG
ILVTE

>gi568815590f:90970295_91184865|GENSCAN_predicted_CDS_4|1278_bp
atggaggcggtattgaccgaagagcttgatgaggaagagcagctgctgagaaggcatcgc
aaagagaagaaggagttgcaagccaaaattcagggcatgaagaatgctgttcccaagaat
gacaagaagaggaggaagcaactcaccgaagatgtggccaagttggaaaaagaaatggaa
cagaaacatagagaggaactggagcaattgaagctgactactaaggagaataagatagat
tctgttgctgttaacatttcaaacttggtgcttgagaatcagccacctcggatatcaaaa
gcacaaaagagacgggaaaagaaagctgcattggaaaaggagcgagaagaacggatagct
gaagctgaaattgaaaacttaacaggagccagacatatggaaagtgagaaacttgctcaa
atattggcagctagacagttagaaattaaacagattccatctgatggccactgtatgtat
aaagccattgaagatcaactgaaagaaaaggattgtgctctgactgtggttgccttgaga
agtcagaccgctgagtatatgcaaagccatgtggaagactttctgccatttttaacaaac
cctaatacaggagatatgtatactccagaagaatttcagaagtactgtgaagatattgta
aacacagctgcatggggaggtcagcttgagacttcattcagcgttgggactcttggactt
accctagtgatttgccaggggctcttgggccttcagctaaaaactgaaggctgcactgtt
ggcttccctacttttgaggttttgggactcggactggcttccttgctcctcatcttgcag
atggcctattgtgggacttcaccttatgatcgtatcatgactgagagattgttaataaaa
gcattgagtggtggtaaaaatacgaagatcattactttgaatgggaagaagatgacaaag
atgccctcagcattaggaaaactgcctggcctgaagactctagtccttcagaataaccta
atccccaaagtgtgtccagagttatgcaacttgacccagcacagaggcaaaggggttggg
ctgttatgtcctggcatctgccatggtgtagcagctgtctggaaggatggagcagaggca
caaccttccagaatatcagttcatggacttggagtcaaatttgggcaggtaagagtttta
cacctaaacagctctgtcatggacagcgctgcacaggcacaggaagaggacagcatgggc
atcctggtgacagaatga

>gi568815590f:90970295_91184865|GENSCAN_predicted_peptide_5|462_aa
MFFETTENKDTTYQNLWDTCKALCRGKFIALNAHKTKQERSKIDTLTSQLKELEKQEQTH
SKASRRQEITKIRAELKEIETQKTLQKINESRSWFFEKINKIDRPLARLIKKKREKNETD
TIKNDKEDITNDPTEIQTTIREYYKHLYANKLESLEEMDKFLDTYTLPRLNQEEVESLNR
PITGSEIEAIINSLPTKKNPGPDGFTAEFYQRQTESQIMSELPLTTASKRIKYLGIQLTR
DVKDLFKENYKPLLNEVKEDTNKWKNIPCSWVGRINIMKMAILPKVIYRFNAIPIKLTMT
FFTELEKTTLKFIWNQKRAHSAKSIQNQKNKAGGITLPDFKLYYKATVTKTAWDMDEAGN
HHSQQTITRTKNQTLHVLTHRWELNNENTWTQEGEHHTLGPDVGWGERGGQWFSKCSSWT
YSISITKEHNRNANLWDPQPTDSEILEGGPSLLGFKKASRGF

>gi568815590f:90970295_91184865|GENSCAN_predicted_CDS_5|1389_bp
atgttctttgaaaccaccgagaacaaagacacaacataccagaatctctgggacacatgc
aaagcactgtgtagagggaaatttatagcactaaatgcccacaagacaaagcaggaaaga
tccaaaattgacaccctcacatcacaattaaaagaactagagaagcaagagcaaacacat
tcaaaagctagcagaaggcaagaaataactaagatcagagcagaactgaaggaaatagag
acacaaaaaacccttcaaaaaatcaatgaatccaggagctggttttttgaaaagatcaac
aaaattgatagaccactagcaagactaataaagaaaaaaagagagaagaatgaaacagac
acaataaaaaatgataaagaggatatcaccaacgatcccacagaaatacaaactaccatc
agagaatactataaacacctctatgcaaacaaactagaaagtctagaagaaatggataaa
ttcctcgacacatacactctcccaagactaaaccaggaagaagttgaatctctgaataga
ccaataacaggctctgaaattgaggcaataattaatagcttaccaaccaaaaaaaatcca
ggaccagatggattcacagccgaattctaccagagacaaacagagagccaaatcatgagt
gaactcccattgacaactgcttcaaagagaataaaatacctaggaatccaacttacaagg
gacgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaagtaaaagaggat
acaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcatgaaaatg
gccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaacaatgact
ttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccac
agtgccaagtcgatccaaaaccaaaagaacaaagctggaggcatcacgctacctgacttc
aaactatactacaaggctacagtaaccaaaacagcatgggacatggatgaagctggaaac
catcattctcagcagactatcacaaggacaaaaaaccaaacactgcatgttcttactcat
aggtgggaattgaacaatgagaacacatggacacaggaaggggaacatcacacactgggg
cctgatgtggggtggggggagcggggaggccaatggttttccaagtgtagttcctggact
tacagcatcagcattaccaaagaacataacagaaatgcaaacctgtgggatccccagcct
actgactcagaaatcctagaaggagggcctagccttctgggttttaagaaggcatccagg
ggattctga

>gi568815590f:90970295_91184865|GENSCAN_predicted_peptide_6|275_aa
MKQKYTVNQCRWQSEDSTFYLGERTYYIAAVEVEWDYSPQREWEKELHHLQEQNVSNAFL
DKGEFYIGSKYKKVVYRQYTDCTFRIPVERKAEEEHLGILGLQLHADVGDKVKIIFKNMT
TRPYSIHAHGVQTESSTFIPALPGETLTYLWKIPERSGAGTEDSACIPWAYYSTVDQVKD
LYSGLIGPLIVCRRHYLKVFNPRKKLEFTLLFLVFDENESWYLDDNIKTNSDHPKKVNKD
DEEFIESNKMHAVNGRMFGNPQGLTMHMGDEANGR

>gi568815590f:90970295_91184865|GENSCAN_predicted_CDS_6|828_bp
atgaagcaaaaatatactgtgaaccaatgcaggtggcagtctgaggattccaccttctac
ctgggagagaggacatactatatcgcagcagtggaagtggaatgggattattccccacaa
agggagtgggaaaaggagctgcatcatttacaagagcagaatgtttcaaatgcattttta
gataagggagagttttacataggctcaaagtacaagaaagttgtgtatcggcagtatact
gattgcacgttccgtattccagtggagagaaaagctgaagaagaacatctgggaattcta
ggtctacaacttcatgcagatgttggagacaaagtcaaaattatctttaaaaacatgacc
acaaggccctactcaatacatgcccatggggtacaaacggagagttctacatttattcca
gcattaccaggtgaaactctcacttacctatggaaaatcccagaaagatctggagctgga
acagaggattctgcttgtattccatgggcttactattcaactgtggatcaagttaaggat
ctctacagtggattaattggccccctgattgtttgtcgaagacattacttgaaagtattc
aatcccagaaagaaactggaatttacccttctgtttctagtttttgatgagaatgaatct
tggtatttagatgacaacatcaaaacaaactctgatcaccccaagaaagtaaacaaagat
gatgaggaattcatagaaagcaataaaatgcatgctgttaatggaagaatgtttggaaac
ccacaaggcctcacaatgcacatgggagatgaagccaatgggcgatga