GENSCAN 1.0    Date run: 8-Nov-116    Time: 04:38:22

Sequence gi568815578r:563674_775601 : 211928 bp : 49.58% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 Intr -   3838   3731  108  1  0   80   52    81 0.072   4.18
 1.02 Intr -   5230   5105  126  1  0   37   17   118 0.078   0.48
 1.01 Init -  16452  15832  621  0  0   71   63   131 0.083   4.25
 1.00 Prom -  19778  19739   40                              -3.56

 2.11 PlyA -  21001  20996    6                               1.05
 2.10 Term -  25532  25501   32  0  2  135   38    24 0.639   0.12
 2.09 Intr -  34241  34109  133  2  1   46   96    60 0.201   2.82
 2.08 Intr -  40992  40957   36  0  0  128  100    43 0.922   7.96
 2.07 Intr -  46659  46040  620  1  2   57   84  1221 0.087 110.64
 2.06 Intr -  64063  63996   68  0  2   60   98    66 0.156   3.35
 2.05 Intr -  71376  71262  115  1  1   72   45    53 0.020  -1.09
 2.04 Intr -  72179  72062  118  2  1   54   98    60 0.036   3.64
 2.03 Intr -  82291  82097  195  1  0   58   36    94 0.009   0.81
 2.02 Intr -  85244  85077  168  2  0  112    5   261 0.073  20.34
 2.01 Init -  89512  89303  210  1  0  102   76   449 0.997  41.89
 2.00 Prom -  97706  97667   40                              -4.06

 3.03 PlyA -  97945  97940    6                              -1.95
 3.02 Term - 100788  99998  791  1  2  125   52  1429 0.938 136.48
 3.01 Init - 111928 111796  133  1  1  100  113   143 0.980  18.30
 3.00 Prom - 112166 112127   40                              -2.36

 4.00 Prom + 114970 115009   40                              -2.86
 4.01 Init + 117855 117931   77  2  2   58  119    54 0.602   6.06
 4.02 Intr + 129043 129147  105  1  0   95   95    67 0.630   7.43
 4.03 Term + 143695 143725   31  1  1  116   48    28 0.014  -1.07
 4.04 PlyA + 147766 147771    6                               1.05

 5.06 PlyA - 148723 148718    6                               1.05
 5.05 Term - 159035 158856  180  2  0   96   51    60 0.027   0.71
 5.04 Intr - 164646 164527  120  0  0   85   47    77 0.208   3.99
 5.03 Intr - 165131 164924  208  1  1   76   46   110 0.423   4.68
 5.02 Intr - 170684 170601   84  1  0   86   31    79 0.426   0.94
 5.01 Init - 172263 172256    8  0  2  114   57     0 0.374  -0.00
 5.00 Prom - 189024 188985   40                              -3.06

 6.05 PlyA - 189046 189041    6                               1.05
 6.04 Term - 197565 197353  213  0  0  137   48   250 0.997  23.13
 6.03 Intr - 198151 198013  139  0  1  138   56   120 0.891  14.27
 6.02 Intr - 200330 199825  506  2  2  132   69   539 0.987  48.18
 6.01 Init - 202101 201535  567  0  0  111   80   877 0.997  84.27

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  46564  46040  525  1  0   91   84  1203 0.830 114.45
S.002 Term -  85244  85041  204  2  0  112   43   285 0.925  23.77

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578r:563674_775601|GENSCAN_predicted_peptide_1|285_aa
MGLLHLIFDKPDKNKKWGKDSLFSKWCWENWLAICRKLKLDPFLTPNTKINSRWIKDLNV
RPKTIKTLEENLGNAIQDIGMGKDFMTKTLKAMATKAKIDKWDLIKLKSFCTAKETTIRV
NRQPTEWEKIFAIYSSDKGLISRIYKELKQIYKKKTNNPINKWAKDMNRHFSKEDIYAAN
RHMKKCSSSLAIREMQIKTTMRYHLTPGDLEVLGNHSRQIEDEGVCYAICPPGSQAAQAA
PNHGQDENENQVQYPQNEIPASSWGPLLHSLTMDEVILLDLLDCQ

>gi568815578r:563674_775601|GENSCAN_predicted_CDS_1|855_bp
atggggttgctccacctgatctttgacaaacctgacaaaaacaagaaatggggaaaggat
tccctatttagcaaatggtgctgggaaaactggctagccatatgtagaaagctgaaactg
gatcccttccttacacctaatacaaaaattaactcaagatggattaaagacttaaatgtt
agacctaaaaccataaaaaccctagaagaaaacctaggcaatgccattcaggacataggc
atgggcaaggacttcatgactaaaacactaaaagcaatggcaacaaaagccaaaattgac
aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagta
aacaggcaacctacagaatgggagaaaatttttgcaatctactcatctgacaaagggcta
atatccagaatctacaaagaactcaaacaaatttacaagaaaaaaacaaacaaccccatc
aacaagtgggcgaaggatatgaacagacacttctcaaaagaagacatttatgcagccaac
agacatatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccaca
atgagataccatctcacaccaggggatttggaagtccttggcaatcattcaaggcagatt
gaagatgaaggcgtgtgttatgccatctgtcctccaggttctcaagccgctcaggcagca
ccaaaccacgggcaggacgagaacgaaaaccaagtacagtacccgcagaatgagatccct
gcctcctcctgggggcctctcctccactctctcaccatggatgaagttatcctgttggac
cttcttgactgccaa

>gi568815578r:563674_775601|GENSCAN_predicted_peptide_2|564_aa
MGLRAGGTLGRAGAGRGAPEGPGPSGGAQGGSIHSGRIAAVHNVPLSVLIRPLPSVLDPA
KVQSLVDTIREDPDSVPPIDVLWIKGAQGGDYFYSFGGCHRYAAYQQLQRETIPAKLVQS
TLSDLRVCSEGEVPLSSVSKLLLCPLGWNESEDGRPLLKQLFLLAEKTQPDSAKGPARHC
EGGAEQIRSRGKLPGAIKKGFPGEVALEQKLDKVHSPSTHLTDEEIEAQKVDSWLHNLRA
PATKCLRASEGHKTCSFPKTLTPALAEDRKLNSAVSWACSHLKAYRVGELEGGGRRGRGS
ARERTAGRGRRWDAAHGGTRPAPMAFALLRPVGAHVLYPDVRLLSEDEENRSESDASDQS
FGCCEGPEAARRGPGPGGGRRAGGGGGAGPVVVVRQRQAANARERDRTQSVNTAFTALRT
LIPTEPVDRKLSKIETVRLASSYIAHLANVLLLGDSADDGQPCFRAAGSAKGAVPAAADG
GRQPRSICTFCLSNQRKGGGRRDLGGSCLKMRKRRHKEVKELGQGDTVAGGFESDVLAAE
PHWALNHHPHMTSPGPPVLFAFSP

>gi568815578r:563674_775601|GENSCAN_predicted_CDS_2|1695_bp
atggggctgcgtgcaggaggaacgctgggcagggccggcgcgggtcggggggcgcccgag
gggcccgggccgagcggcggcgcgcagggcggcagcatccactcgggccgcatcgccgcg
gtgcacaacgtgccgctgagcgtgctcatccggccgctgccgtccgtgttggaccccgcc
aaggtgcagagcctcgtggacacgatccgggaggacccagacagcgtgccccccatcgat
gtcctctggatcaaaggggcccagggaggtgactacttctactcctttgggggctgccac
cgctacgcggcctaccagcaactgcagcgagagaccatccccgccaagcttgtccagtcc
actctctcagacctaagggtttgtagcgaaggtgaggtgcccctgtcttctgtgagtaaa
ctcctcctgtgcccgctgggctggaatgagagtgaagatggtcgccccctcctgaaacaa
cttttccttcttgcagagaagacccaaccagacagcgctaagggccctgccaggcactgc
gaaggaggtgcagaacagattcggagtagagggaagctgcctggagcaatcaagaaaggc
ttcccaggggaggtggcccttgaacagaaacttgataaagttcattctccttccactcat
ttgacagatgaggaaatagaggcacagaaagtggacagctggctgcacaatctgagggcc
cctgcaaccaaatgtctgagggcttctgagggccacaagacctgttccttccccaagacc
cttaccccggccctggctgaggacaggaagctaaattcagctgtcagctgggcctgcagt
catctgaaggcctatcgggttggggagttggagggcgggggccggcggggccgtgggagc
gcgcgggagcgcacggcggggcgcggccgacgctgggacgcggcgcacggagggacgcgg
ccggcgcccatggcgttcgcgctgctgcggcccgtcggcgcgcacgtgctgtacccggac
gtgcggctgctgagcgaggacgaggagaaccgcagcgagagcgacgcgtcggaccagtcg
ttcggctgctgcgagggcccggaggcggcgcggcgcggcccgggccccgggggcgggcgg
cgggcgggcggcggcggcggcgcgggccccgtggtggtggtgcgacagcggcaggcggcc
aacgcgcgggagcgggaccgcactcagagcgtgaacacggccttcacggcgctgcgcacg
ctcatccccaccgagccggtggaccgcaagctgtccaagatcgagaccgtgcgcctggcg
tccagctacatcgcgcacctggccaacgtgctgctgctgggcgactcggccgacgacggg
cagccgtgcttccgtgccgcgggcagtgccaagggcgccgtccccgccgccgccgacggc
ggccgccagccgcgctccatctgcaccttctgcctcagcaaccagcgcaaggggggtggc
cgtcgtgacctggggggcagctgcttgaagatgaggaaacggaggcacaaggaggttaaa
gaacttggccaaggtgacacagtggctggtggatttgaatcagatgttctagctgcagag
ccccactgggctctcaaccaccacccccacatgacctcaccaggtcccccggtccttttt
gccttcagtccatga

>gi568815578r:563674_775601|GENSCAN_predicted_peptide_3|307_aa
MPRSFLVKKIKGDGFQCSGVPAPTYHPLETAYVLPGARGPPGDNGYAPHRLPPSSYDADQ
KPGLELAPAEPAYPPAAPEEYSDPESPQSSLSARYFRGEAAVTDSYSMDAFFISDGRSRR
RRGGGGGDAGGSGDAGGAGGRAGRAGAQAGGGHRHACAECGKTYATSSNLSRHKQTHRSL
DSQLARKCPTCGKAYVSMPALAMHLLTHNLRHKCGVCGKAFSRPWLLQGHMRSHTGEKPF
GCAHCGKAFADRSNLRAHMQTHSAFKHYRCRQCDKSFALKSYLHKHCEAACAKAAEPPPP
TPAGPAS

>gi568815578r:563674_775601|GENSCAN_predicted_CDS_3|924_bp
atgccgcgctccttcctggtaaagaagatcaaaggggacggcttccagtgcagcggggtg
ccggcccccacctaccaccccttggagacagcctacgtgctgcctggcgcccgggggcct
cccggggacaacgggtacgccccgcaccgcctgcccccgagcagctacgatgcggaccag
aagccgggcctggagctggccccggccgagcccgcgtacccgccggcggcgccggaggag
tacagcgaccccgaaagcccgcagtcgagcctgtcggcgcgctacttccgaggggaggcg
gcagtgaccgacagctactccatggacgccttcttcatctcggacgggcgctcgcggcgg
cggcggggcgggggcggcggggacgcggggggctcgggagacgcggggggcgccgggggg
cgcgcggggcgcgcgggggcgcaggcgggcggcgggcaccggcacgcgtgcgccgagtgc
ggcaagacctacgccacgtcgtcgaacctgagccgccacaagcagacgcaccgcagcctg
gacagccagctggcgcgcaaatgcccgacgtgcggcaaggcctacgtgtccatgcccgcg
ctcgccatgcacctgctcacgcacaacctgcgccacaagtgcggcgtctgcggcaaggcc
ttctcgcggccctggctgctgcagggtcacatgcgctcgcacaccggcgaaaagccgttc
ggctgcgcgcactgcggcaaggccttcgccgaccgctccaacctgcgcgcgcacatgcag
acgcactcggccttcaagcactaccgctgccgccagtgcgacaagagcttcgcgctcaag
tcctacctccacaagcactgcgaggcggcctgcgccaaggcggccgagccacccccgccg
acccccgccggcccggccagctga

>gi568815578r:563674_775601|GENSCAN_predicted_peptide_4|70_aa
MEFCMNSKEESSHKCIRSTLMGLSLRCREERRKFFDRKILEILLLSPIGLLEAGVSPEVF
SPQEAYWSIT

>gi568815578r:563674_775601|GENSCAN_predicted_CDS_4|213_bp
atggaattctgtatgaattccaaggaggagagtagccacaaatgcatccgctccacgctg
atggggctctctctccgttgcagagaggaaaggaggaaatttttcgatagaaaaatcttg
gagatcctgttgctgtcacccatcgggctgttggaagctggggttagtccagaagtcttc
agcccccaagaggcctactggagcataacttga

>gi568815578r:563674_775601|GENSCAN_predicted_peptide_5|199_aa
MPGISASEFRKAEDAAHFGAKRAVVWDALAGEGCILTVCFQVHLQREAERPFLVRLATQR
GPAPDRAFKGTARLPCSQHFTLSSYRRATRCKVEKKVNERLARSRPEPPGRDQGTPKPAG
APPVCSPPPPSAADPARPGRGQPAKDTEKRRRKECRPWAPCHDQAEDPAAATMSHSSATE
TLFIPHSGSMSTGGRDLKV

>gi568815578r:563674_775601|GENSCAN_predicted_CDS_5|600_bp
atgcccggaatttctgcttcagaattccgtaaggctgaagatgctgcacactttggagcc
aagagggcagttgtctgggatgctctggctggggagggctgcattctgacggtctgcttc
caagtccatctgcagcgcgaggctgagcgccctttcctggtgaggttggccactcagagg
ggcccagctcccgacagggcatttaaaggtacggcccgcttgccctgttctcagcacttc
accctcagctcgtaccgtagagccacccgctgtaaggttgagaaaaaggtgaacgagcga
ctggcccggagtcggcccgagccccctggacgggaccaggggacccccaagccggccggc
gccccgcccgtctgctccccacccccaccgtcagctgcggacccggcccggccgggaagg
gggcagcctgccaaggacacagagaagaggaggagaaaggaatgtaggccatgggcaccc
tgccatgaccaagctgaagaccccgcagcagccaccatgtcccacagctcagcaactgag
acactgttcattcctcactcaggttccatgtccactgggggccgggatctgaaggtctga

>gi568815578r:563674_775601|GENSCAN_predicted_peptide_6|474_aa
MAFLMHLLVCVFGMGSWVTINGLWVELPLLVMELPEGWYLPSYLTVVIQLANIGPLLVTL
LHHFRPSCLSEVPIIFTLLGVGTVTCIIFAFLWNMTSWVLDGHHSIAFLVLTFFLALVDC
TSSVTFLPFMSRLPTYYLTTFFVGEGLSGLLPALVALAQGSGLTTCVNVTEISDSVPSPV
PTRETDIAQGVPRALVSALPGMEAPLSHLESRYLPAHFSPLVFFLLLSIMMACCLVAFFV
LQRQPRCWEASVEDLLNDQVTLHSIRPREENDLGPAGTVDSSQGQGYLEEKAAPCCPAHL
AFIYTLVAFVNALTNGMLPSVQTYSCLSYGPVAYHLAATLSIVANPLASLVSMFLPNRSL
LFLGVLSVLGTCFGGYNMAMAVMSPCPLLQGHWGGEVLIVSIRPVASWVLFSGCLSYVKV
MLGVVLRDLSRSALLWCGAAVQLGSLLGALLMFPLVNVLRLFSSADFCNLHCPA

>gi568815578r:563674_775601|GENSCAN_predicted_CDS_6|1425_bp
atggccttcctgatgcacctgctggtctgcgtcttcggaatgggctcctgggtgaccatc
aatgggctctgggtagagctgcccctgctggtgatggagctgcccgagggctggtacctg
ccctcctacctcacggtggtcatccagctggccaacatcgggcccctcctggtcaccctg
ctccatcacttccggcccagctgcctttccgaagtgcccatcatcttcaccctgctgggc
gtgggaaccgtcacctgcatcatctttgccttcctctggaatatgacctcctgggtgctg
gacggccaccacagcatcgccttcttggtcctcaccttcttcctggccctggtggactgc
acctcttcagtgaccttcctgccgttcatgagccggctgcccacctactacctcaccacc
ttctttgtgggtgaaggactcagcggcctcttgcccgccctggtggctcttgcccagggc
tccggtctcactacctgcgtcaatgtcactgagatatcagacagcgtaccaagccctgta
cccacgagggagactgacatcgcacagggagttcccagagctttggtgtccgccctcccc
ggaatggaagcacccttgtcccacctggagagccgctaccttcccgcccacttctcaccc
ctggtcttcttcctcctcctatccatcatgatggcctgctgcctcgtggcgttctttgtc
ctccagcgtcaacccaggtgctgggaggcttccgtggaagacctcctcaatgaccaggtc
accctccactccatccggccgcgggaagagaatgacttgggccctgcaggcacggtggac
agcagccagggccaggggtatctagaggagaaagcagccccctgctgcccggcgcacctg
gccttcatctataccctggtggccttcgtcaacgcgctcaccaacggcatgctgccctct
gtgcagacctactcctgcctgtcctatgggccagttgcctaccacctggctgccaccctc
agcattgtggccaaccctcttgcctcgttggtctccatgttcctgcctaacaggtctctg
ctgttcctgggggtcctctccgtgcttgggacctgctttgggggctacaacatggccatg
gcggtgatgagcccctgccccctcttgcagggccactggggtggggaagtcctcattgtg
agtatccggccggtggcctcgtgggtgcttttcagcggctgcctcagttacgtcaaggtg
atgctgggcgtggtcctgcgcgacctcagccgcagcgccctcttgtggtgcggggcggcg
gtgcagctgggctcgctgctcggagcgctgctcatgttccctctggtcaacgtgctgcgg
ctcttctcgtccgcggacttctgcaatctgcactgtccagcctag