GENSCAN 1.0 Date run: 5-Nov-116 Time: 00:26:11 Sequence gi568815596r:180772870_180973414 : 200545 bp : 35.48% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 30813 30852 40 -2.45 1.01 Init + 48835 48884 50 0 2 67 93 28 0.152 1.67 1.02 Term + 54372 54555 184 0 1 26 42 205 0.967 5.73 1.03 PlyA + 54721 54726 6 1.05 2.04 PlyA - 55186 55181 6 1.05 2.03 Term - 56907 56830 78 0 0 84 40 74 0.505 -1.02 2.02 Intr - 57188 57068 121 1 1 20 66 104 0.470 0.98 2.01 Init - 64542 61451 3092 0 2 44 53 1046 0.635 88.17 2.00 Prom - 64800 64761 40 -5.25 3.03 PlyA - 64882 64877 6 -4.04 3.02 Term - 65452 64900 553 0 1 -48 43 347 0.677 9.40 3.01 Init - 65916 65531 386 0 2 88 44 342 0.915 26.16 3.00 Prom - 69395 69356 40 -5.35 4.04 PlyA - 69631 69626 6 1.05 4.03 Term - 85668 85487 182 1 2 -1 44 201 0.023 3.59 4.02 Intr - 100709 100522 188 1 2 14 85 214 0.092 12.11 4.01 Init - 121691 121591 101 2 2 95 83 27 0.451 2.68 4.00 Prom - 121993 121954 40 -2.45 5.00 Prom + 141304 141343 40 -1.75 5.01 Sngl + 191885 192421 537 1 0 20 44 377 0.401 22.33 5.02 PlyA + 192833 192838 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:180772870_180973414|GENSCAN_predicted_peptide_1|77_aa MMTNALEHNYKELQMGSNLDAGDTRSTFQEYGHDTRNHKHDENGMTGEEVPDASLVNEDP ASGDADTTSLKKLNICR >gi568815596r:180772870_180973414|GENSCAN_predicted_CDS_1|234_bp atgatgaccaatgctcttgaacataattataaggaattacaaatgggaagcaatctagat gctggggacacaaggtccaccttccaggaatatggccatgacaccagaaatcacaaacat gatgagaatggaatgactggggaagaagtgccagatgcttcacttgtaaatgaagaccca gcctctggggatgcagataccacctccctgaagaagctgaatatctgcagataa >gi568815596r:180772870_180973414|GENSCAN_predicted_peptide_2|1096_aa MVKGSIQQEELTILNIYAPNTGAPRFIKQVLSDLQRDLDSHTLIMGDFNTPLSTLDRSTR QKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYSKIDHIVGSKALLSKCK RTEIITNYLSDHSAIKLELRIKNLTQNRSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNE NKDTTYQNLWDAFKAVCRGKFIALNAYKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQ EITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIKKKREKNQIDTIKNDKG DITTDPTEIQTTIREYYKHLYANKLENLEEMDTFLDTYTLPRLNQEEVESLNRPITGSEI VAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIP KPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNICK SINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAVYDKPT ANIILNGQKLEVFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLS LFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELP FTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAIL PKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRAPITKSILSQKNKAGGITLPDFKLY YKATVTKTAWYWYQNRDIDQWNRTEPSEIMLHIYNYLIFDKPEKNKQWGKDSLFNKWCWE NWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGVGKDFMSKT PKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTTWEKIFATYSSDKGLISRIYNELK QIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTTMRYHLTPVR MAIIKKSGNNRTTLLKIVHSKVLIKLATVPASNINVILFGEEPFEGVAGTIIVATQEGAR EPSASPKSLVSESEFA >gi568815596r:180772870_180973414|GENSCAN_predicted_CDS_2|3291_bp atggtaaagggatcaattcaacaagaggagctaactatcctaaatatttatgcacccaat acaggagcacccagattcataaagcaagtcctgagtgacctacaaagagacttagactcc cacacattaataatgggagactttaacaccccactgtcaacattagacagatcaacgaga cagaaagtcaacaaggatacccaggaattgaactcagctctgcaccaagcagacctaata gacatctacagaactctccaccccaaatcaacagaatatacatttttttcagcaccacac cacacctattccaaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaa agaacagaaattataacaaactatctctcagaccacagtgcaatcaaactagaactcagg attaagaatctcactcaaaaccgctcaactacatggaaactgaacaacctgctcctgaat gactactgggtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgag aacaaagacaccacataccagaatctctgggacgcattcaaagcagtgtgtagagggaaa tttatagcactaaatgcctacaagagaaagcaggaaagatccaaaattgacaccctaaca tcacaattaaaagaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaa gaaataactaaaatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaa atcaatgaatccaggagctggttttttgaaaggatcaacaaaattgatagaccgctagca agactaataaagaaaaaaagagagaagaatcaaatagacacaataaaaaatgataaaggg gatatcaccaccgatcccacagaaatacaaactaccatcagagaatactacaaacacctc tacgcaaataaactagaaaatctagaagaaatggatacattccttgacacatacactctc ccaagactaaaccaggaagaagttgaatctctgaatagaccaataacaggctctgaaatt gtggcaataatcaatagtttaccaaccaaaaagagtccaggaccagatggattcacagcc gaattctaccagaggtacaaggaggaactggtaccattccttctgaaactattccaatca atagaaaaagagggaatcctccctaactcattttatgaggccagcatcattctgatacca aagccgggcagagacacaaccaaaaaagagaattttagaccaatatccttgatgaacatt gatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagctt atccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaacatatgcaaa tcaataaatgtaatccagcatataaacagagccaaagacaaaaaccacatgattatctca atagatgcagaaaaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaat aaattaggtattgatgggacgtatttcaaaataataagagctgtctatgacaaacccaca gccaatatcatactgaatgggcaaaaactggaagtattccctttgaaaactggcacaaga cagggatgccctctctcaccactcctattcaacatagtgttggaagttctggccagggca atcaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtcc ctgtttgcagacgacatgattgtttatctagaaaaccccatcgtctcagcccaaaatctc cttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatca caagcattcttatacaccaacaacagacaaacagagagccaaatcatgagtgaactccca ttcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagggatgtgaag gacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggacacaaacaaa tggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactg cccaaggtaatttacagattcaatgccatccccatcaagctaccaatgactttcttcaca gaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccccatcaccaag tcaatcctaagccaaaagaacaaagctggaggcatcacactacctgacttcaaactatac tacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaa tggaacagaacagagccctcagaaataatgctgcatatctacaactatctgatctttgac aaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggaa aactggctagccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaa atcaattcaagatggattaaagatttaaacgttagacctaaaaccataaaaaccctagaa gaaaacctaggcattaccattcaggacataggcgtgggcaaggacttcatgtccaaaaca ccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagc ttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctacaacgtgggagaaa attttcgcaacctactcatctgacaaagggctaatatccagaatctacaatgaactcaaa caaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgaaggacatgaacaga cacttctcaaaagaagacatttatgcagccaaaaaacacatgaagaaatgctcatcatca ctggccatcagagaaatgcaaatcaaaaccactatgagatatcatctcacaccagttaga atggcaatcattaaaaagtcaggaaacaacagaactactcttttgaagattgtgcactca aaagtattaattaaattggcaactgtgcctgccagcaacattaatgtgatactctttggg gaggagccatttgaaggtgttgctggaactataatagtagccactcaagagggtgccagg gaaccttctgcttctcctaaaagcttggtttcagaaagtgaatttgcttga >gi568815596r:180772870_180973414|GENSCAN_predicted_peptide_3|312_aa MGKKQNRKTGNSKTQSASPPPKELSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQ TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKAQELCEECRSLRSRCDQLEERVSA MEDEMNEMKPNLRLIGVPESDVENGTKLENTLQDIIQENFPNLARQANVQIQEIQRTPQR YSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRVTLKGKPIRLTADLSAETLQARREWGP IFNILKEKNFQPRISYPAKLSFISEGEIKYFIDKQMLRDFVTTRPALKELLKEALNMERN NRYQPLQNHAKM >gi568815596r:180772870_180973414|GENSCAN_predicted_CDS_3|939_bp atggggaaaaaacagaacagaaaaactggaaactctaaaacgcagagcgcctctcctcct ccaaaggaactcagctcctcaccagcaacagaacaaagctggatggagaatgattttgac gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcaagaacta tgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagca atggaagatgaaatgaatgaaatgaaaccaaatctacgtctgattggtgtacctgaaagt gatgtggagaatggaaccaagttggaaaacactctgcaggatattatccaggagaacttc cccaatctagcaaggcaggccaacgttcagattcaggaaatacagagaacgccacaaaga tactcctcgagaagagcaactccaagacacataattgtcagattcaccaaagttgaaatg aaggaaaaaatgttaagggcagccagagagaaaggtcgggttaccctcaaaggaaagccc atcagactaacagcggatctctcggcagaaaccctacaagccagaagagagtgggggcca atattcaacattcttaaagaaaagaattttcaacccagaatttcatatccagccaaacta agcttcataagtgaaggagaaataaaatactttatagacaagcaaatgctgagagatttt gtcaccaccaggcctgccctaaaagagctcctgaaggaagcgctaaacatggaaaggaac aaccggtaccagccgctgcaaaatcatgccaaaatgtaa >gi568815596r:180772870_180973414|GENSCAN_predicted_peptide_4|156_aa MGLFFHLVLPDTLKGFDSSSFISKYFKEGPPEVRVSDERRSSPRVVGVSPIASPPSPLHG ALGLPQGSRRCSSAAQPLPPPPPLLSRRNDDRVHLAGRGWNSLEGSEEEGKMWESLELPG DLESSEDRKMWESLELPRDLLNGCDHNTHNDTDNEV >gi568815596r:180772870_180973414|GENSCAN_predicted_CDS_4|471_bp atggggcttttttttcaccttgttcttccagacactctgaagggttttgattcttcaagt tttatttcaaagtacttcaaagagggacctcctgaagtgagagtctctgatgagagacgt tcttcgccgagagtcgtaggggtttcgcccatagccagccctccgtcacctcttcacggc gccctgggactgccccaaggctcccgccgctgctccagcgcggcgcagccactgccgccg ccgccgcctctccttagtcgccgcaatgacgaccgcgtccacctcgcagggagaggttgg aacagtttggagggttcagaagaagaagggaaaatgtgggaaagtttggaacttcctgga gacttggagagctcagaagacaggaagatgtgggaaagtttggaacttcctagagactta ctgaatggctgtgaccacaacactcataatgatacggacaatgaagtctag >gi568815596r:180772870_180973414|GENSCAN_predicted_peptide_5|178_aa MICTNKWVHRSQEVRFENLRLHFRSCMETPRIPRQKFAAGVGPLWRTSARAVQKGNVGSE PPHRVPTGALPSGAVRRGPLSSRHQNGRSTDSLHCAAGKATGIQCQPMKAAGSGAVPCKA TGEELPKTMGIHLLHQLDLDVRHGVKGDHFGALRFHRRAQFRTCMGPTAPLLLPISPI >gi568815596r:180772870_180973414|GENSCAN_predicted_CDS_5|537_bp atgatatgcacaaacaaatgggtgcacagaagtcaagaagtgaggtttgaaaacctccgc ctgcatttcagaagttgtatggaaacacctagaatcccccggcagaaatttgctgcaggg gtggggcccttatggagaacctctgctagggcagtgcagaaaggaaatgtggggtcggag cccccacacagagttcctactggggcactgcctagtggagctgtgagaagagggccactg tcctccagacaccagaatggtagatccactgacagcttgcactgtgcagctggaaaagcc acaggcattcaatgccagcccatgaaagcagctgggagtggggctgtaccctgcaaagcc acaggggaggagctgccaaagaccatgggaatccacctcttgcatcagcttgacctggat gtgagacatggagtcaaaggagatcattttggagctttaagatttcaccgccgtgctcaa tttcggacttgcatggggcctacagctcctttgcttttgccaatttctcccatttga