GENSCAN 1.0 Date run: 4-Nov-116 Time: 03:00:42 Sequence gi568815576r:36658314_36867865 : 209552 bp : 49.19% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 21566 22171 606 1 0 27 -68 1335 0.966 107.12 1.02 Term + 22173 22610 438 2 0 49 48 639 0.997 51.28 1.03 PlyA + 24062 24067 6 1.05 2.00 Prom + 26358 26397 40 -6.36 2.01 Init + 30976 30982 7 0 1 54 99 5 0.198 -0.98 2.02 Intr + 31828 31913 86 2 2 43 76 74 0.200 1.24 2.03 Intr + 32060 32235 176 1 2 86 111 -20 0.446 -1.16 2.04 Intr + 32747 32890 144 2 0 94 75 88 0.673 7.50 2.05 Intr + 35607 35739 133 0 1 67 95 13 0.514 0.55 2.06 Term + 40177 40299 123 0 0 109 40 41 0.305 -0.22 2.07 PlyA + 42082 42087 6 1.05 3.02 PlyA - 43357 43352 6 1.05 3.01 Sngl - 47978 47745 234 2 0 69 36 347 0.709 22.70 3.00 Prom - 49524 49485 40 -3.96 4.00 Prom + 56117 56156 40 -4.36 4.01 Init + 70432 70463 32 0 2 82 94 44 0.210 1.69 4.02 Intr + 72132 72241 110 0 2 90 64 76 0.296 5.33 4.03 Term + 81523 81605 83 2 2 85 37 90 0.252 1.46 4.04 PlyA + 82152 82157 6 1.05 5.05 PlyA - 82316 82311 6 -0.45 5.04 Term - 82437 82355 83 1 2 100 37 91 0.528 3.06 5.03 Intr - 85923 85820 104 2 2 43 25 59 0.174 -5.08 5.02 Intr - 86160 85975 186 2 0 96 97 41 0.432 4.60 5.01 Init - 87480 87416 65 0 2 51 113 60 0.753 5.42 5.00 Prom - 93222 93183 40 -5.96 6.09 PlyA - 95535 95530 6 1.05 6.08 Term - 100096 99998 99 1 0 58 49 132 0.870 4.43 6.07 Intr - 104700 104591 110 1 2 39 89 60 0.808 1.20 6.06 Intr - 105550 105440 111 2 0 41 109 85 0.932 6.25 6.05 Intr - 105723 105606 118 0 1 79 96 128 0.998 12.74 6.04 Intr - 109052 108993 60 2 0 114 89 73 0.995 8.93 6.03 Intr - 109549 109470 80 2 2 89 93 86 0.974 8.47 6.02 Intr - 117481 117361 121 1 1 113 109 66 0.897 11.27 6.01 Init - 121641 121570 72 0 0 84 41 66 0.360 2.57 6.00 Prom - 121690 121651 40 -8.06 7.14 PlyA - 122151 122146 6 1.05 7.13 Term - 123970 123659 312 1 0 -3 48 333 0.563 15.20 7.12 Intr - 138122 138026 97 1 1 74 76 37 0.071 1.11 7.11 Intr - 149800 149698 103 2 1 90 25 65 0.029 -0.37 7.10 Intr - 154691 154674 18 0 0 91 94 29 0.381 0.38 7.09 Intr - 155442 155333 110 2 2 139 76 165 0.938 20.33 7.08 Intr - 156922 156790 133 2 1 134 86 268 0.999 30.90 7.07 Intr - 157699 157661 39 2 0 99 75 23 0.550 0.30 7.06 Intr - 158699 158632 68 1 2 50 89 114 0.332 6.25 7.05 Intr - 163852 163755 98 1 2 92 63 102 0.325 6.91 7.04 Intr - 169456 169407 50 2 2 112 92 11 0.821 2.30 7.03 Intr - 170953 170798 156 2 0 -18 68 144 0.411 1.78 7.02 Intr - 173530 173393 138 2 0 -19 -75 458 0.433 19.24 7.01 Init - 173641 173584 58 1 1 24 70 46 0.554 -2.13 7.00 Prom - 176289 176250 40 -4.86 8.00 Prom + 184289 184328 40 -4.26 8.01 Sngl + 184452 184877 426 2 0 63 42 217 0.774 10.89 8.02 PlyA + 186439 186444 6 1.05 9.02 PlyA - 186671 186666 6 1.05 9.01 Sngl - 188560 188156 405 1 0 81 51 154 0.676 7.30 9.00 Prom - 193694 193655 40 -3.96 10.00 Prom + 196229 196268 40 -6.06 10.01 Init + 202859 202890 32 1 2 111 105 77 0.998 10.99 10.02 Intr + 205732 205816 85 1 1 67 41 157 0.997 8.72 10.03 Intr + 206606 206759 154 1 1 124 96 321 0.999 36.15 10.04 Intr + 209079 209149 71 1 2 75 76 176 0.714 14.00 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576r:36658314_36867865|GENSCAN_predicted_peptide_1|347_aa MVVIRMITIPVIITSTISAATTFIITTCITIIIITITTTFMITTTCTMAIITTTTISITT TTIIITTITATFRITATCIMAIITTTITTITTTTITVTTITTTITTTTITVTTTTTTIIT TTITIINTTTITTITTTFMITATCIMAIITTAITIITTTIITIITITTTITATITTTIIT IITTTTITTITITTFITTIFITSLLPPSIITTFMITATCIMVTITTTTISTTFMITTTCI MAIITTTITTTNTTIIITTITTIFMITATTTTITTTTITITTTITITTTTTTTFITTIFI TTVIVTINHHHLHDYCHLHHGYNYYHHHHHHYHHHHHHHHHHHQQHI >gi568815576r:36658314_36867865|GENSCAN_predicted_CDS_1|1044_bp atggtagttattcggatgataacaatccctgttatcatcacaagcaccatctctgctgcc accaccttcatcatcaccacctgcatcacaatcattatcatcaccatcaccaccaccttc atgatcaccaccacctgcaccatggctataattactaccaccaccatcagcatcactacc accaccatcattatcaccaccatcaccgccaccttcaggatcactgccacctgcatcatg gctataattaccaccaccatcaccaccatcactaccaccaccatcaccgtcaccaccatt accaccaccatcactaccaccaccatcactgtcaccaccactaccaccaccatcatcacc accaccatcaccatcatcaacaccaccaccatcaccaccatcaccaccaccttcatgatc actgccacctgcatcatggctataattaccaccgccatcaccatcatcactaccaccatc atcaccatcatcaccattaccaccaccatcactgccaccatcactaccaccatcatcacc atcatcaccaccaccaccatcaccaccatcaccatcaccactttcatcaccaccatcttt atcacatcattattaccaccatcaatcatcaccaccttcatgatcactgccacctgcatc atggttacaattactaccaccaccatcagcaccaccttcatgatcaccaccacctgcatc atggctataattactaccaccatcaccaccactaacaccaccatcattatcaccaccatc accaccatcttcatgatcactgccaccaccaccaccatcaccactaccaccatcaccatc accactaccatcaccatcaccaccaccactaccaccactttcatcaccaccatctttatc acaaccgttattgtcaccatcaatcaccaccaccttcatgattactgccacctgcatcat ggttacaattactaccaccaccatcaccatcactatcaccaccaccaccatcaccatcat caccaccaccagcagcacatctga >gi568815576r:36658314_36867865|GENSCAN_predicted_peptide_2|222_aa MAGVEEDLQTSTPTEAGKRPRTVERSSHVSHEPKMWIFMGKFSICRTFEGQAQLACQSPF ATASVINNKHTTGNDSTRHLSSAHCVPAPRTAPGNGSNLVNFVEEIDEGTHDLEEDTGME TSNIKDIRCWDLEEADPRLNSLRAQIESFNSILESHTVPGKYSGTVVGKEQPSPFSSSSK LLNKRAMAEPSVEHCITSQAIMTWKRFGTIERDTSCSKRTRM >gi568815576r:36658314_36867865|GENSCAN_predicted_CDS_2|669_bp atggcaggcgtagaggaggatttgcaaacttcgacgcctacagaggccgggaaacggcca cgcacggtggagcggagcagccatgtaagccacgaaccaaaaatgtggatttttatggga aagttttcaatttgtagaacatttgagggccaagcacaactcgcttgccaatcacccttt gcaactgctagcgtcataaataataaacacaccactggtaatgacagcacccgacatttg tcaagtgctcactgtgtgccggcacctcggacagcacctggcaatggtagtaacttggta aattttgttgaagaaatagatgagggaactcatgatctagaggaagacacaggaatggaa acaagcaacataaaagatattagatgctgggatctggaggaagcagaccccagactgaat tccttgagggcacagattgagtctttcaactctatcctagagtcccacacagtacctggc aaatactcgggaacagttgttggaaaagagcaaccatcacccttttcctcctcaagcaag ttgctgaataagagagccatggctgagccgagcgtggagcactgtatcacctcgcaggcc atcatgacttggaaaagatttgggaccattgagagagacacttcttgctcaaagagaaca aggatgtag >gi568815576r:36658314_36867865|GENSCAN_predicted_peptide_3|77_aa MRKKRKKRKKEEEEEEEEEEEEEEKEEGEEEGGGGGGEEEEEKEEEANLDVKEGRRNERK PTLIECILCVRHFTSTI >gi568815576r:36658314_36867865|GENSCAN_predicted_CDS_3|234_bp atgaggaagaagaggaagaagaggaagaaagaagaagaagaagaagaagaagaagaagaa gaagaggaagaaaaagaagaaggagaagaagaaggaggaggaggaggaggagaggaggag gaggagaaggaggaggaggccaatttggatgtaaaagaggggaggagaaatgagagaaaa cccacacttattgagtgcatactatgtgtcaggcacttcacgtctacaatttaa >gi568815576r:36658314_36867865|GENSCAN_predicted_peptide_4|74_aa MLMVPDLGRGRGPSQEGLDSPSSPTGVMKYEDRYRSEHLLRARPFNDRLDEIPAGSLCRE AAGPISKSAGFEAK >gi568815576r:36658314_36867865|GENSCAN_predicted_CDS_4|225_bp atgctcatggtgcctgacctggggaggggcaggggccccagccaggaagggctggactcg ccatcctccccaactggtgttatgaagtatgaagatcgttatcgttctgagcacctacta cgtgcccggccctttaacgatagacttgatgaaatccctgctggctccctgtgcagagaa gcagcaggacccatcagcaagagtgcaggctttgaggccaaatag >gi568815576r:36658314_36867865|GENSCAN_predicted_peptide_5|145_aa MQDGPGTSPYRVAKAVQRPKESPASASPEPPGSVSPAPCMTGRLGHITPQFATPDCTAPL ASQAGQHLQDSGQGSGTQQRWRRRPSARQCGELRALGQTVAPAQTRARLWGLAGCWSGGL DEIPAGSMCREATGPISKSAGFEAK >gi568815576r:36658314_36867865|GENSCAN_predicted_CDS_5|438_bp atgcaagacgggcccggcaccagcccctaccgggttgcaaaagcagtgcaaaggcccaaa gaaagtcctgcctcagcttcaccagagcccccaggatccgtgtcccctgcaccctgcatg actggacgcctgggtcatatcacaccacagtttgccacacctgactgcacggcgcccctg gccagccaagctggtcaacacctccaggacagcggccaaggcagtggaacccagcagagg tggaggagaaggccatctgctcggcagtgcggcgagctcagggcactggggcagacagtg gcacccgcccagacacgtgctcgcctctggggcctggccggctgctggtcaggaggactt gatgaaatccctgctggctccatgtgcagagaagcaacaggacccatcagcaagagtgca ggctttgaggccaaatag >gi568815576r:36658314_36867865|GENSCAN_predicted_peptide_6|256_aa MSLCRSHQEALWKQIVGSTSSVQLASPGPEAHPLASLAAALVLWVRLAGVLVTMVKLAAK CILAGDPAVGKTALAQIFRSDGAHFQKSYTLTTGMDLVVKTVPVPDTGDSVWESPNVLCL VYDVTNEESFNNCSKWLEKARSQAPGISLPEHKGIAEVAVGGEGALGLDFDGLRRTLALP LSSHANLGVLVGNKTDLAGRRAVDSAEARAWALGQGLECFETSVKEMENFEAPFHCLAKQ FHQLYREKVEVFRALA >gi568815576r:36658314_36867865|GENSCAN_predicted_CDS_6|771_bp atgagcttgtgtcggagtcaccaggaggccttatggaagcagattgtgggctccacctcc agtgttcagctggccagcccagggcccgaagcccacccactcgcgtctctagcagccgct cttgtcctctgggtacggctcgcgggagtgttggttaccatggtgaagctggcagccaaa tgcatcctggcaggagacccagcagtgggcaagaccgccctggcacagatcttccgcagt gatggagcccatttccagaaaagctacaccctgacaacaggaatggatttggtggtgaag acagtgccagttcctgacacgggagacagtgtgtgggagagtcccaatgtcttatgtctc gtctatgatgtgaccaatgaagaatccttcaacaactgcagcaagtggctggagaaggct cggtcacaggctccaggcatctctctcccagagcacaagggcattgcagaggtagctgtg ggtggagaaggggccctgggcttggactttgatggcctaaggcgtacactggccctgcca cttagcagccatgccaacctcggtgttttagttgggaacaagacagacctggccggcaga cgagcagtggactcagctgaggcccgggcatgggcgctgggccagggcctggaatgtttt gaaacatccgtgaaagagatggaaaacttcgaagcccctttccactgccttgccaagcag ttccaccagctgtaccgggagaaggtggaggttttccgggccctggcatga >gi568815576r:36658314_36867865|GENSCAN_predicted_peptide_7|459_aa MSCAGSWEYSDAYDTLVTLKEEEEEEEEEEEEEEEEEKKKKKKKKKKKKKKKKKKNHINK KKKNHCQCGTEINAVSFPPCCERAPELDKLIVMLGEDSSMDNCSTAMITVNTGQYESDMS CHPLRAQKSDPQAWDLDLKVCSIAGTAILPPSGVNQFERVAQGIAENCRMSMTDLLNAED IKKAVGAFSGVLKQDPKIQKIAATDSFDHKKFFQMVGLKKKSADDVKKVFHMLDKDKSGF IEEDELGFILKGFSPDARDLSAKETKMLMAAGDKDGDGKIGVDGFNVSTEKIRMPRVHTC LHAVLVTFFLKPIPGDGSTALCFKAIALRLDRAGCQICISLLLAGHFAYTICYGPKAAAN KSKGGSPSALLQQTISRLLLPPADRIRTSRPPCPAEATAQASTLHSGRTEGESVPFPLPF IEMATAKYCISGDINNRAHVNSGKKNSKGHYREKAVHQM >gi568815576r:36658314_36867865|GENSCAN_predicted_CDS_7|1380_bp atgtcctgtgctggatcctgggagtacagcgatgcatatgacacacttgtcaccctcaaa gaagaagaagaagaagaagaagaagaagaagaagaggaagaagaagaagagaagaagaag aagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaatcatattaacaag aagaagaagaatcattgtcaatgtggcaccgaaatcaacgcagtttcctttccaccctgc tgcgaaagggcccctgagctcgacaagctcatagtcatgttgggagaagacagttccatg gataattgctccacagcaatgatcacagtaaacacaggtcaatatgagagtgacatgtcc tgccatcctttgagggcccagaagtcagatccacaggcctgggacttggacctgaaagtt tgtagcattgcagggacagccatcttgccaccctcaggggtcaaccagtttgagagagtg gcacaaggtattgctgagaattgcaggatgtcgatgacagacttgctgaacgctgaggac atcaagaaggcggtgggagcctttagcggtgttctaaaacaggaccccaagatacagaaa attgcagctaccgactccttcgaccacaaaaagttcttccaaatggtcggcctgaagaaa aagagtgcggatgatgtgaagaaggtgtttcacatgctggacaaggacaaaagtggcttc atcgaggaggatgagctgggattcatcctaaaaggcttctccccagatgccagagacctg tctgctaaagaaaccaagatgctgatggctgctggagacaaagatggggacggcaaaatt ggggttgacggcttcaatgtcagcacagagaagatccgcatgccccgtgtgcacacgtgc ctgcacgcagtcctagttacattcttcctgaaacccattcccggcgatggatccactgcc ctctgctttaaggctatagctctgcggctggacagagctggatgccaaatctgcatctcc ctccttcttgctggacactttgcatataccatctgctatggtccgaaggcggcagcaaat aagtcaaaaggcggcagcccttctgccctcctgcagcagacaatttcccggctgctgctg ccccctgctgacagaatcaggacatcgcggcctccgtgccccgccgaggccacagctcag gcctccactttgcattccgggcgcacagaaggtgagtcagttccatttcctctgcccttt attgagatggccaccgccaagtactgcatcagcggagacataaacaaccgggcccacgtg aacagtggaaaaaagaacagcaaaggtcattaccgagagaaggcagtgcaccagatgtga >gi568815576r:36658314_36867865|GENSCAN_predicted_peptide_8|141_aa MEYKEINVVFMPANTTILQPMNQGVISTFKSYYLRNTFYKATASINSDSSDGGGQNKLKT FWKRLIVLNATQHIFDSWEEVTISTLSGVWKKLIPTLIDDFEEFKTSVEEGTADVVEIAR ELELEVESEDMTALLQSHDQT >gi568815576r:36658314_36867865|GENSCAN_predicted_CDS_8|426_bp atggagtacaaggagattaatgtcgttttcatgcctgctaacacaaccattctgcagccc atgaatcaaggagtaatttcaactttcaagtcttattatttaagaaacacattttataag gctacagcttccataaacagtgattcctctgatggaggtggacaaaataaattgaaaacc ttctggaaacgactcattgttctaaatgccactcaacatatttttgattcatgggaggag gtcacaatatcaacattatcaggagtttggaagaagttgattccaaccctcatagacgac tttgaggagttcaagacttcagtggaagaaggaactgcagatgtggtagaaatagcaaga gaactagaattagaagtagagtctgaagatatgactgcattgctgcaatctcatgatcaa acctga >gi568815576r:36658314_36867865|GENSCAN_predicted_peptide_9|134_aa MDTALDQTIILTMTPGKNHGMNCHGCSSGLQSIRFSASHTSQGLRSPGWLPGTVLGPGDS DTKQVQHKVRQGSETQELVRLAQDEAGGEKQERRGQEISSGQTKEGLISQAMEFELYQAG QERCGKDLNRGDTG >gi568815576r:36658314_36867865|GENSCAN_predicted_CDS_9|405_bp atggacactgctctggatcagacaatcattttgactatgacccctggaaaaaatcatggc atgaactgccacggatgcagcagtggcttacagagcattcgcttctcggcatcacacacc agccaaggtttacggagccccggatggttgccaggcacagtgctaggccctggggattca gacacaaagcaggtacagcacaaagtcaggcaaggctcagagacccaggagctggttcgt ttggctcaagatgaagcaggaggggagaaacaggagaggagaggccaggagatcagcagc ggacagaccaaggagggccttataagtcaagccatggaatttgaactttatcaggcaggc caggagaggtgtgggaaggatctgaacagaggagatactgggtga >gi568815576r:36658314_36867865|GENSCAN_predicted_peptide_10|114_aa MAVAQQLRAESDFEQLPDDVAISANIADIEEKRGFTSHFVFVIEVKTKGGSKYLIYRRYR QFHALQSKLEERFGPDSKSSALACTLPTLPAKVYVGVKQEIAEMRIPALNAYMK >gi568815576r:36658314_36867865|GENSCAN_predicted_CDS_10|342_bp atggctgtggcccagcagctgcgggccgagagtgactttgaacagcttccggatgatgtt gccatctcggccaacattgctgacatcgaggagaagagaggcttcaccagccactttgtt ttcgtcatcgaggtgaagacaaaaggaggatccaagtacctcatctaccgccgctaccgc cagttccatgctttgcagagcaagctggaggagcgcttcgggccagacagcaagagcagt gccctggcctgtaccctgcccacactcccagccaaagtctacgtgggtgtgaaacaggag atcgccgagatgcggatacctgccctcaacgcctacatgaag