GENSCAN 1.0 Date run: 2-Nov-116 Time: 23:17:51 Sequence gi568815588f:119551676_119777279 : 225604 bp : 44.45% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.12 PlyA - 478 473 6 1.05 1.11 Term - 25075 24937 139 0 1 103 40 165 0.873 10.64 1.10 Intr - 25860 25776 85 1 1 102 92 -12 0.994 -0.52 1.09 Intr - 26061 25966 96 1 0 77 87 18 0.595 0.58 1.08 Intr - 27159 27051 109 0 1 104 69 95 0.973 9.06 1.07 Intr - 28335 28260 76 2 1 105 92 44 0.971 6.02 1.06 Intr - 30334 30247 88 2 1 55 97 44 0.978 1.03 1.05 Intr - 30548 30494 55 2 1 85 100 -7 0.976 -1.25 1.04 Intr - 30882 30784 99 0 0 98 116 55 0.817 9.61 1.03 Intr - 45260 45140 121 1 1 94 -2 129 0.001 5.00 1.02 Intr - 45414 45336 79 1 1 108 51 42 0.015 1.11 1.01 Init - 48069 48003 67 0 1 81 43 76 0.019 3.63 1.00 Prom - 51423 51384 40 -2.96 2.00 Prom + 52425 52464 40 -0.86 2.01 Init + 67005 67080 76 2 1 68 72 125 0.955 10.16 2.02 Term + 74479 74660 182 2 2 46 39 92 0.132 -2.13 2.03 PlyA + 76527 76532 6 1.05 3.00 Prom + 78031 78070 40 -3.66 3.01 Sngl + 86751 87407 657 2 0 75 35 728 0.912 62.38 3.02 PlyA + 87437 87442 6 1.05 4.00 Prom + 98805 98844 40 -3.56 4.01 Init + 100001 100180 180 1 0 84 105 339 0.522 34.38 4.02 Intr + 108572 108688 117 1 0 109 90 54 0.832 8.36 4.03 Intr + 114864 115040 177 2 0 14 -3 181 0.466 1.72 4.04 Intr + 118176 118316 141 2 0 96 55 90 0.755 7.05 4.05 Intr + 118344 118502 159 2 0 61 94 50 0.654 3.08 4.06 Intr + 120580 120981 402 0 0 104 96 224 0.882 19.42 4.07 Intr + 124789 124962 174 0 0 84 75 126 0.630 11.04 4.08 Term + 125044 125607 564 0 0 103 38 604 0.854 51.29 4.09 PlyA + 125746 125751 6 1.05 5.00 Prom + 127442 127481 40 -5.86 5.01 Init + 132132 132230 99 2 0 47 105 41 0.648 2.11 5.02 Intr + 135506 135596 91 1 1 68 76 70 0.685 3.37 5.03 Intr + 135658 135752 95 2 2 59 78 79 0.927 3.68 5.04 Intr + 136198 136261 64 0 1 75 96 65 0.858 4.29 5.05 Term + 142105 142241 137 2 2 113 42 68 0.509 2.88 5.06 PlyA + 144590 144595 6 1.05 6.00 Prom + 166040 166079 40 -4.16 6.01 Init + 174588 174684 97 2 1 109 93 178 0.958 20.87 6.02 Term + 186397 186452 56 2 2 90 48 28 0.143 -3.28 6.03 PlyA + 187669 187674 6 1.05 7.05 PlyA - 187806 187801 6 1.05 7.04 Term - 196838 196469 370 1 1 102 36 120 0.302 2.32 7.03 Intr - 197495 197388 108 1 0 105 53 51 0.218 2.70 7.02 Intr - 197874 197853 22 1 1 99 61 4 0.090 -4.60 7.01 Init - 212347 211984 364 1 1 84 69 181 0.383 13.01 7.00 Prom - 215060 215021 40 -3.26 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:119551676_119777279|GENSCAN_predicted_peptide_1|337_aa MVAARENEDDAKAETPDKTIRSPRRLPGRRTSSRMTTLPRSLLFTSQRGTTIPRDFGFTT FLFLPDLSRSWLGFGRPEAHSDDVIRRERHTSNDPYCFVEFYEHRDAAAALAAMNGRKIL GKEVKVNWATTPSSQKKDTSNHFHVFVGDLSPEITTEDIKSAFAPFGKISDARVVKDMAT GKSKGYGFVSFYNKLDAENAIVHMGGQWLGGRQIRTNWATRKPPAPKSTQENNTKQLRFE DVVNQSSPKNCTVYCGGIASGLTDQLMRQTFSPFGQIMEIRVFPEKGYSFVRLTIVNGAN GAKCMETHNSMDSIWQMGGKYRLMEYTGNHGINKDLE >gi568815588f:119551676_119777279|GENSCAN_predicted_CDS_1|1014_bp atggtggcagcaagagaaaatgaggatgatgcaaaagcggaaacccctgataaaaccatc agatctccccgccgacttcctggtcgtcgcacgtcctcacgtatgactacactacccaga agtctcctcttcacgtcccagcgcgggaccacaattcccagagacttcggcttcacgacg tttctctttttgcccgatctctcccggagctggctgggcttcggccggccagaggcccac agcgacgacgtgatccgtcgtgagcggcatacaagcaatgacccatattgctttgtggaa ttttatgaacacagagatgcagctgctgcattagctgctatgaatgggagaaaaattttg ggaaaggaggtcaaagtaaactgggcaaccacaccaagtagccagaaaaaagatacttcc aatcacttccatgtgtttgttggggatttgagtccagaaattacaacagaagatatcaaa tcagcatttgccccctttggtaaaatatcggatgcccgggtagttaaagacatggcaact ggaaaatccaaaggctatggttttgtatctttttataacaaactggatgcagaaaatgcg attgtgcatatgggcggtcagtggttgggtggtcgtcaaatccgaaccaattgggccact cgtaaaccacctgcacctaaaagtacacaagaaaacaacactaagcagttgagatttgaa gatgtagtaaaccagtcaagtccaaaaaattgtactgtgtactgtggaggaattgcgtct gggttaacagatcagcttatgagacagacattctcaccatttggacaaattatggaaata agagttttcccagaaaagggctattcatttgtcaggttgactatagtcaatggggccaat ggagccaagtgtatggaaacccacaacagtatggacagtatatggcaaatgggtggcaag taccgccttatggagtatacgggcaaccatggaatcaacaaggatttggagtag >gi568815588f:119551676_119777279|GENSCAN_predicted_peptide_2|85_aa MGPTFSSLWFSESHRMSAALQVLSTEAVIQETTAESAVSFMICLRSHNHLVWNILLVYLI QCGKIQYQKAMVMEPSWRLATTKGN >gi568815588f:119551676_119777279|GENSCAN_predicted_CDS_2|258_bp atggggcccacgttcagctccctgtggttctcagagtctcaccggatgtctgccgcgctg caggtgctcagcacagaggcagtgattcaagagaccacagcagaatctgcagtgtctttt atgatttgtcttcgaagtcacaaccatctcgtctggaatatcttattggtctaccttatt cagtgtgggaagatacaataccagaaggcgatggtcatggagccatcttggaggctggct accacaaaagggaactga >gi568815588f:119551676_119777279|GENSCAN_predicted_peptide_3|218_aa MGPLSSQRRVMGISQDNWHKRRKTGSKRKPYDKKRKYELGHLAANTKIGPHHIHTVRVRG GNNKYGALRRDMGNFSWGSECCTRKTRITDVVYDAPNSKLVRTKTLVENCFVLTDSTPYH QWYESHYALPLGCKKGAKLTPEEEKTLNKKRSKKIQKKYDEREKNAKISRLLGEQFQQGK LLACVASRLGQCGQAHVYVPGGKEMEFYLRKIKARKGK >gi568815588f:119551676_119777279|GENSCAN_predicted_CDS_3|657_bp atggggcctctttccagccagcgccgagtgatgggcatctctcaggacaactggcacaag cgccgcaagactggcagcaagagaaagccctacgacaagaagcggaagtatgagttgggg cacctggctgccaacaccaagattggcccccaccacatccacacagtccgtgtgcgggga ggtaacaataaatacggtgccctgaggcgggacatggggaatttctcctggggttcagag tgttgtactcgcaaaacaaggatcactgatgttgtctacgatgcgcccaatagcaagctg gtccgtaccaagaccctggtggagaactgcttcgtgctcactgacagcacaccgtaccac cagtggtatgagtcccactatgcgctgcccctgggctgcaagaagggagccaaactgact cctgaggaagaaaagactttaaacaaaaaacgatctaaaaaaattcagaagaaatacgat gaaagggaaaagaatgccaaaatcagccgtctcctgggggagcagttccagcagggcaag cttcttgcatgcgtcgcttcaaggctgggacagtgtggccaagcccatgtctatgtgcca gggggcaaggagatggagttctatcttaggaaaatcaaggcccggaaaggcaaataa >gi568815588f:119551676_119777279|GENSCAN_predicted_peptide_4|637_aa MSAATHSPMMQVASGNGDRDPLPPGWEIKIDPQTGWPFFVDHNSRTTTWNDPRVPSEGPK ASGRCVKEPQSVRLGALLSQEFVCRAGLFPRAAAQPVGEAQNVLSEMHEVLSEMHEVLSE MHEVLSSWLAQGLSGSRGFPQEELHLLKACLRTPSPGRETPSSANGPSREGSRLPPAREG HPVYPQLRPGYIPIPVLHEGAENRQPGMQRFRTEAAAAAPQRSQSPLRGMPETTQPDKQC GQVAAAAAAQPPASHGPERSQSPAASDCSSSSSSASLPSSGRSSLGSHQLPRGYISIPVI HEQNVTRPAAQPSFHQAQKTHYPAQQGEYQTHQPVYHKIQGDDWEPRPLRAASPFRSSVQ GASSREGSPARSSTPLHSPSPIRVHTVVDRPQQPMTHRETAPVSQPENKPESKPGPVGPE LPPGHIPIQVIRKEVDSKPVSQKPPPPSEKSVATEERAAPSTAPAEATPPKPGEAEAPPK HPGVLKVEAILEKVQGLEQAVDNFEGKKTDKKYLMIEEYLTKELLALDSVDPEGRADVRQ ARRDGVRKVQTILEKLEQKAIDVPGQVQVYELQPSNLEADQPLQAIMEMGAVAADKGKKN AGNAEDPHTETQQPEATAAATSNPSSMTDTPGNPAAP >gi568815588f:119551676_119777279|GENSCAN_predicted_CDS_4|1914_bp atgagcgccgccacccactcgcccatgatgcaggtggcgtccggcaacggtgaccgcgac cctttgccccccggatgggagatcaagatcgacccgcagaccggctggcccttcttcgtg gaccacaacagccgcaccactacgtggaacgacccgcgcgtgccctctgagggccccaag gcctcagggagatgtgtcaaggaaccacagtcagtgcggctgggcgccctgctcagccag gagtttgtctgcagagcaggcctcttccccagagcagcggcccagcctgttggagaggca caaaatgtgctcagcgagatgcacgaggtgctcagcgagatgcacgaggtgctcagcgag atgcatgaggtgctcagctcgtggctggctcaagggctttctggatccaggggcttcccc caggaggagctgcatctcctcaaggcctgtctgaggactccaagccctggaagggagact ccatcctctgccaatggcccttcccgggagggctctaggctgccgcctgctagggaaggc caccctgtgtacccccagctccgaccaggctacattcccattcctgtgctccatgaaggc gctgagaaccggcagcctgggatgcagcgattccgaactgaggcggcagcagcggctcct cagaggtcccagtcacctctgcggggcatgccagaaaccactcagccagataaacagtgt ggacaggtggcagcggcggcggcagcccagcccccagcctcccacggacctgagcggtcc cagtctccagctgcctctgactgctcatcctcatcctcctcggccagcctgccttcctcc ggcaggagcagcctgggcagtcaccagctcccgcgggggtacatctccattccggtgata cacgagcagaacgttacccggccagcagcccagccctccttccaccaagcccagaagacg cactacccagcgcagcagggggagtaccagacccaccagcctgtgtaccacaagatccag ggggatgactgggagccccggcccctgcgggcggcatccccgttcaggtcatctgtccag ggtgcatcgagccgggagggctcaccagccaggagcagcacgccactccactccccctcg cccatccgtgtgcacaccgtggtcgacaggcctcagcagcccatgacccatcgagaaact gcacctgtttcccagcctgaaaacaaaccagaaagtaagccaggcccagttggaccagaa ctccctcctggacacatcccaattcaagtgatccgcaaagaggtggattctaaacctgtt tcccagaagcccccacctccctctgagaagagtgtggctacagaagagagggcagccccc agcactgcccctgcagaagctacacctccaaaaccaggagaagccgaggctcccccaaaa catccaggagtgctgaaagtggaagccatcctggagaaggtacaggggctggagcaggct gtagacaactttgaaggcaagaagactgacaaaaagtacctgatgatcgaagagtatttg accaaagagctgctggccctggattcagtggaccccgagggacgagccgatgtgcgtcag gccaggagagacggtgtcaggaaggttcagaccatcttggaaaaacttgaacagaaagcc attgatgtcccaggtcaagtccaggtctatgaactccagcccagcaaccttgaagcagat cagccactgcaggcaatcatggagatgggtgccgtggcagcagacaagggcaagaaaaat gctggaaatgcagaagatccccacacagaaacccagcagccagaagccacagcagcagcg acttcaaaccccagcagcatgacagacacccctggtaacccagcagcaccgtag >gi568815588f:119551676_119777279|GENSCAN_predicted_peptide_5|161_aa METDPKVPDRAWSSGTLFLLNAKLIYAYRAHRKDQDRNNVNRKNMLPRMRLQYFTLYEKG KMKALPGGSNDQNREYPTSFMLGGLELPPPCRMGKERGLDPDHKRRFLDLEQEGIQESFL VPLQAICLFHMASVTYSIRIHSDFLYLHNLIPDLTQLDSPT >gi568815588f:119551676_119777279|GENSCAN_predicted_CDS_5|486_bp atggaaacggatccaaaagtgccggatcgagcttggagctctggtaccctctttctcctg aatgccaaacttatctatgcctacagggcacacagaaaggaccaggatcgaaataatgtt aacaggaaaaacatgcttcctcgtatgagactgcagtatttcaccttgtatgagaaagga aaaatgaaggctcttcctggtggcagtaatgatcagaaccgggagtacccaacttctttc atgcttgggggactagagctgccccctccatgcagaatgggcaaagaaaggggtcttgat ccagaccacaagagaaggttcttggatctggagcaagaaggaattcaggaatcctttcta gtccctctccaggccatctgcctcttccacatggcctcagtcacctactctatacgaatc cattctgactttctgtatctgcacaacctcatcccagacctcacccaactggacagcccc acctag >gi568815588f:119551676_119777279|GENSCAN_predicted_peptide_6|50_aa MELFQAKDHYILQQGERALWCSRRDGGLQLRPGRGGHMKIEAMYCQMDLK >gi568815588f:119551676_119777279|GENSCAN_predicted_CDS_6|153_bp atggagctcttccaagccaaggaccactacatcctgcagcagggcgagcgcgcgctgtgg tgcagccgccgcgacggcggcctccagctccgacccggaagaggagggcatatgaaaata gaagccatgtactgccaaatggatctgaaataa >gi568815588f:119551676_119777279|GENSCAN_predicted_peptide_7|287_aa MNREEHSDEVSDENKERVIGQYGDDPGYKVAKNLVEICACSSILWKGGLKSEIGYLPEAI SKQNVKDEVCLLLPDYSKMREESYDLKTELLSKNEVELKDLKSSSQTMEIFKKARSGENT TEIIRGTNQLGSACSHSLASLPAPSTCSGAEQVVAEPGDRHDRVGCPHTLHTSFFLDTGP LNGGTEKAVTQTGPKHAPCSPGCGQQGGKGCGPSGTPDLGALRARAVTVGLCSSWHLQAS GHHCIPQYLQQKLLVVHLVQPQPCTKPVPVPVPGAACPATAGTPGCA >gi568815588f:119551676_119777279|GENSCAN_predicted_CDS_7|864_bp atgaaccgtgaagagcattctgatgaagtctcagatgaaaataaggaacgtgttattgga caatatggagatgatcctggttataaagtggcaaagaacttggttgaaatttgtgcatgt tctagtattttgtggaagggaggacttaagagtgaaattggatatttacctgaggcaatt tctaagcaaaatgtcaaagatgaagtttgcctcctcctgcctgattatagtaaaatgcga gaagagagctatgacttaaagacagagttgttaagcaaaaatgaagtagaacttaaagat ttgaaaagttctagccagactatggaaatttttaaaaaggcccgttcaggggaaaacact acagaaattatacgtggaactaatcagctcggaagtgcctgctcccactccctggcctct ctccctgctcccagcacctgctctggtgcagagcaagttgtggccgagcctggggaccgt cacgaccgagtcggttgtccacataccttgcatacctcattcttcctggacacgggaccc ctgaatggtgggactgaaaaagctgtaacacaaacaggaccaaaacacgccccctgctca ccaggttgtgggcaacaaggaggaaagggctgcggcccttcgggaacaccagacctaggg gctctccgagccagggctgtgacagtggggctctgcagttcctggcatctccaagcttcc gggcatcactgcattccccagtacctgcagcagaaactgcttgtggtacacctggtccag ccacagccttgcacaaagccagtgcctgtaccggtgcctggagctgcctgccctgccacg gctggcacacctggctgtgcatag