GENSCAN 1.0 Date run: 4-Nov-116 Time: 14:35:20 Sequence gi568815592f:73294906_73516223 : 221318 bp : 43.85% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 1686 1681 6 1.05 1.03 Term - 4538 4243 296 0 2 61 34 173 0.877 4.67 1.02 Intr - 14823 14647 177 1 0 78 105 104 0.911 10.99 1.01 Init - 16548 16386 163 0 1 70 72 71 0.799 3.60 1.00 Prom - 17494 17455 40 -4.96 2.00 Prom + 19195 19234 40 -2.46 2.01 Init + 27941 28048 108 1 0 52 94 168 0.970 14.02 2.02 Term + 28212 28373 162 2 0 13 42 177 0.816 3.54 2.03 PlyA + 29220 29225 6 1.05 3.06 PlyA - 32995 32990 6 1.05 3.05 Term - 46604 46506 99 2 0 68 47 53 0.003 -2.67 3.04 Intr - 58844 58741 104 0 2 94 64 39 0.403 1.99 3.03 Intr - 59127 58948 180 1 0 101 78 331 0.915 33.24 3.02 Intr - 59365 59209 157 1 1 3 109 181 0.551 11.28 3.01 Init - 59468 59409 60 2 0 78 80 55 0.571 2.95 3.00 Prom - 62871 62832 40 -4.76 4.00 Prom + 63749 63788 40 -8.26 4.01 Init + 67490 67554 65 1 2 51 61 54 0.804 -0.37 4.02 Intr + 67812 67993 182 0 2 116 109 139 0.940 18.31 4.03 Intr + 68190 68369 180 1 0 82 102 154 0.462 16.04 4.04 Term + 68651 68955 305 0 2 92 49 260 0.962 17.83 4.05 PlyA + 69101 69106 6 1.05 5.05 PlyA - 70006 70001 6 1.05 5.04 Term - 73958 73879 80 0 2 86 47 51 0.837 -1.27 5.03 Intr - 74480 74301 180 0 0 60 78 143 0.843 10.34 5.02 Intr - 74788 74698 91 1 1 53 85 103 0.532 6.07 5.01 Init - 75054 74815 240 0 0 88 35 245 0.433 15.07 5.00 Prom - 85687 85648 40 -2.46 6.00 Prom + 86775 86814 40 -2.16 6.01 Init + 100001 100243 243 1 0 73 31 99 0.034 0.16 6.02 Intr + 105329 105458 130 1 1 98 111 72 0.691 10.77 6.03 Intr + 109785 109866 82 1 1 66 90 49 0.761 1.60 6.04 Intr + 113055 113196 142 0 1 95 17 83 0.744 2.26 6.05 Intr + 114343 114443 101 0 2 107 98 68 0.563 8.61 6.06 Term + 127631 127733 103 2 1 42 50 66 0.090 -4.05 6.07 PlyA + 127737 127742 6 1.05 7.05 PlyA - 128688 128683 6 1.05 7.04 Term - 144287 144190 98 0 2 78 48 71 0.882 0.23 7.03 Intr - 145540 145304 237 2 0 70 63 70 0.493 0.29 7.02 Intr - 150842 150623 220 2 1 76 116 135 0.989 12.97 7.01 Init - 157276 156620 657 1 0 91 105 450 0.592 41.87 7.00 Prom - 159169 159130 40 -9.75 8.00 Prom + 161452 161491 40 -3.36 8.01 Init + 164607 164773 167 2 2 78 86 86 0.698 6.55 8.02 Intr + 166949 167166 218 2 2 28 85 237 0.651 15.55 8.03 Intr + 171304 171503 200 2 2 79 110 44 0.956 4.77 8.04 Intr + 171584 171701 118 1 1 81 79 17 0.665 0.14 8.05 Intr + 178460 178749 290 0 2 100 116 41 0.949 5.06 8.06 Intr + 184827 184939 113 2 2 58 99 55 0.977 2.78 8.07 Intr + 185031 185221 191 0 2 79 103 63 0.977 6.13 8.08 Intr + 185770 185900 131 2 2 80 71 93 0.998 7.21 8.09 Intr + 187135 187339 205 0 1 44 78 169 0.699 10.27 8.10 Intr + 187544 187715 172 0 1 121 85 -32 0.427 -1.20 8.11 Intr + 197329 197447 119 1 2 63 70 95 0.491 5.31 8.12 Intr + 202831 202991 161 2 2 34 107 71 0.799 3.31 8.13 Term + 205669 205830 162 0 0 61 33 136 0.966 3.34 8.14 PlyA + 206112 206117 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 48306 48376 71 1 2 110 42 54 0.830 1.10 S.002 Intr - 100266 100162 105 1 0 54 36 149 0.802 6.49 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:73294906_73516223|GENSCAN_predicted_peptide_1|211_aa MDKFLDTYTLPRLNQEKTESLNRPITGSEIEPIINSLPTKKVQDQKDSQPNSTRGQKREM LSAFQRLFRVLFVIETVSEYGVLIFIYGWPFLQTLAMLLIGTVSFHLWIRRNRAPQSAAP LEENVNRAVSNKAAHGTVRLTWLLVVGGVSPFSRGGCCAHPRGCAPENGQTPPGILLLKE KHQLSPERALDPRRPSPVLVQLRAVGGIWWL >gi568815592f:73294906_73516223|GENSCAN_predicted_CDS_1|636_bp atggataaattcctggacacatacaccctcccaagactaaaccaggaaaaaactgaatcc ctgaatagaccaataacaggctctgaaattgagccaataattaatagcctaccaaccaaa aaagtccaggaccagaaggattcacagccgaattctaccagaggccagaaacgcgaaatg ctgtcggccttccagaggctgttccgagtcttatttgtgattgaaacagtctcagaatac ggagtcctgatcttcatttatggatggccctttctccagacccttgcgatgctcctgatc gggacagtgtctttccacctttggatccgcagaaatcgagctccgcagtcagcggctccg ttggaagagaatgtgaaccgcgctgtctccaacaaagctgctcacggaacagtgcgttta acttggctgctggtcgtcggaggcgtcagtcccttcagcaggggcggctgctgtgcccac ccccggggctgtgcgccagagaacggtcagactccgccagggattcttctcctgaaagaa aagcaccaactatctcccgagcgagctctcgacccacgaaggccctcccccgtgctggtc cagctgcgtgctgtcggtggcatctggtggctctag >gi568815592f:73294906_73516223|GENSCAN_predicted_peptide_2|89_aa MWYEIKAQVHNIHLCKDKHGKTGLQLQTTNKGLFVQDRIVQWIVTMHKDSTSHGGFIIKK GKVFPVVKGSSVVLKDSSPTTMCARFKNV >gi568815592f:73294906_73516223|GENSCAN_predicted_CDS_2|270_bp atgtggtatgagatcaaggcccaggtacacaacatccacctgtgcaaagacaaacatggc aagactgggctgcagctgcagaccaccaacaaggggctctttgtgcaggacaggatagtc cagtggattgtcaccatgcacaaggacagcacaagccatggtggcttcatcatcaagaag ggaaaggtcttccctgtggtcaaagggagctctgtggtcctcaaggactcttcaccaacc accatgtgtgccaggttcaagaacgtttaa >gi568815592f:73294906_73516223|GENSCAN_predicted_peptide_3|199_aa MDSGSWPGPGAGPGRGRGHQAWFALESGLGCEGHKMGTLPARRHIPPWVKVPEDLKDPEV FQVQTRLLKAIFGPDGSRIPYIEQVSKAMLELKALESSDLTEVVVYGSYLYKLRTKWMLQ SMAEWHRQRQERAPWASPGVPLASGPSQDPFGNGGPRMPAVGLGDPGTQHHYCYAHFISE KTEVEEVKGPNTVILKVAP >gi568815592f:73294906_73516223|GENSCAN_predicted_CDS_3|600_bp atggactcagggtcctggcccgggccgggggcggggcctggccgggggcggggccaccag gcctggtttgcgctggagagtggtcttggttgtgagggtcataagatgggaactctcccg gcacgtagacatatcccgccgtgggtgaaagttcccgaagacctgaaagatccagaggtg ttccaggtccagacgcggctgctgaaagccattttcggcccggacggatctcgaatccct tacatcgagcaggtgagcaaggccatgctcgagctgaaggctctggagtcttcagacctc accgaggtcgtggtttacggctcctatttgtacaagctccggaccaagtggatgctccag tccatggctgagtggcaccgccagcgccaggagcgagcaccttgggcgtcaccaggggtg cccttggcatctgggccttcccaagacccctttgggaatgggggccccagaatgccggcc gtcggtcttggagacccagggacacaacatcactattgttatgcccattttatcagtgag aaaactgaagttgaggaagttaagggtcctaataccgtcattctcaaagtggccccttga >gi568815592f:73294906_73516223|GENSCAN_predicted_peptide_4|243_aa MVKTPAGHQRIPGAQSPVPPTGTGRSMDAPRRFPTLVQLMQPKAMPVEVLGHLPKRFSWF HSEFLKNPKVVRLEVWLVEKIFGRGGERIPHVQGMSQILIHVNRLDPNGEAEILVFGRPS YQEDTIKMIMNLADYHRQLQAKGSGKALAQDVATQKAETQRSSIEVREAGTQRSVEVREA GTQRSVEVQEVGTQGSPVEVQEAGTQQSLQAANKSGTQRSPEAASKAVTQRFREDARDPV TRL >gi568815592f:73294906_73516223|GENSCAN_predicted_CDS_4|732_bp atggttaagactcccgctgggcaccagcggattcctggtgcccagagcccggttcctcct accgggaccggccgcagcatggacgctcccaggcggtttccgacgctcgtgcaactgatg cagccaaaagcaatgccagtggaggtgctcggtcacctccctaagcggttctcctggttc cactctgagttcctgaagaatccgaaggtagttcgccttgaggtttggctggtggaaaag atcttcggccggggcggagaacgcatcccgcacgtccagggtatgtcccaaatcttgatt cacgtgaatcgattggaccctaacggcgaggctgagatcttggtatttgggaggccttct taccaggaggacacaatcaagatgatcatgaacctggctgactatcaccgccagctccag gcgaaaggctcaggaaaggccctcgcccaggatgtcgccactcagaaggccgagacccag cggtcttcaatagaagtccgggaggccgggacgcagcgttcggtggaggtccgggaggcc gggacccagcgttcggtggaagtccaggaggtcgggacacagggttctccggtggaggtg caggaggccgggacccagcagtctctccaggctgccaacaagtcggggacccagcgatcc cccgaagctgccagcaaggcagtgacccagcggtttcgcgaggatgcccgggacccagtt actagattatga >gi568815592f:73294906_73516223|GENSCAN_predicted_peptide_5|196_aa MPGRRLHPLTPTRCEAPPEGAERVEYGKREPELRAWVWKRSCESRLERAPRLLVPVWSMM LVPLSPSGANRLRPTPWSSCIRIRPWWFPVQELRDPLVFYLEAWLADELFGPDRAIIPEM EWTSQALLTVDIVDSGNLVEITVFGRPRVQNRVKSMLLCLAWFHREHRARAEKMKHLEKN LKAHASDPHSPQDPVA >gi568815592f:73294906_73516223|GENSCAN_predicted_CDS_5|591_bp atgccggggcgtcgcttgcacccgctcaccccgacgcggtgcgaggcgccgccggaaggg gcggagcgggtcgaatatggtaaaagagagcccgagcttcgcgcctgggtctggaagagg tcttgcgaaagccgcctcgagcgcgctccgcggctgctggtcccagtatggtcgatgatg ctggtgccgctgagtcccagcggggcaaacagactccggcccactccctggagcagctgc attcgcatccggccctggtggtttccggtgcaggaactgagagaccctttggtgttctac ctagaggcatggctggcagacgagctctttggcccagaccgagccataattccagaaatg gagtggacgagccaggccctgctgacagtggacatagtggactcagggaacctagtcgaa atcaccgttttcgggcggccccgtgtacagaatcgggtgaagagcatgctcctgtgcctg gcatggtttcaccgagaacatcgtgcccgagctgagaagatgaaacaccttgagaagaac ttgaaggcccatgcatcagacccccactctccccaggatcctgttgcttaa >gi568815592f:73294906_73516223|GENSCAN_predicted_peptide_6|266_aa MSHHGGAPKASTWVVASRRSSTVSRAPERRPAEELNRTGPEGYSVGRGGRWRGTSRPPEA VAAGHEELPLCFALKSHFVGAIIQEQPESLVKIFGSKAMQTKAKAVIDNFVKKLEENYNS ECGIDLPPIKKNFYKESTATSAMSKVEADSWSVCVYGGGNRDEQIEELKKGVDIIIATPG RLNDLQMSNFVNLKNITYLVLDEADKMLDMGFEPQIMKILLDVRPDRQTVMTSQASDEAT ALADVLIIALRDSEQSHTQNFGPLKM >gi568815592f:73294906_73516223|GENSCAN_predicted_CDS_6|801_bp atgtcccaccacggaggagctcccaaggcctctacgtgggtcgttgctagtcggcgaagc tcgacagtgtcccgagcgccagagaggaggccggcggaggagttgaatcgaacaggtcct gagggatatagtgtcggcagaggtggtcgctggagaggcacctctaggcccccggaggcc gtggccgctggtcacgaggaactgccgctgtgttttgctttgaagagccactttgttggc gcgataatacaagaacaaccagaatcattagtcaaaatttttggcagcaaggcaatgcaa acgaaagcaaaagcagtgatagacaattttgttaaaaagctagaagaaaattacaattca gaatgcggaattgatttaccaccaattaagaaaaacttttataaagagtccactgccaca agtgccatgtcaaaagtagaagcagatagttggagtgtttgtgtatatggtggtggaaat agagatgaacaaatagaagagcttaaaaaaggtgtagatatcataattgcaactcccgga agattgaatgatctgcaaatgagtaacttcgtcaatctgaagaatataacctacttggtt ttagatgaagcagacaagatgttggacatgggatttgaaccccagataatgaagattttg ttagatgtgcgcccagataggcagacagttatgaccagtcaagcctcagatgaggctaca gcgctggctgacgtcttaattatagccttgagagactctgagcagagccacacccagaat tttggcccactgaaaatgtga >gi568815592f:73294906_73516223|GENSCAN_predicted_peptide_7|403_aa MQPWHGKAMQRASEAGATAPKASARNARGAPMDPTESPAAPEAALPKAGKFGPARKSGSR QKKSAPDTQERPPVRATGARAKKAPQRAQDTQPSDATSAPGAEGLEPPAAREPALSRAGS CRQRGARCSTKPRPPPGPWDVPSPGLPVSAPILVRRDAAPGASKLRAVLEKLKLSRDDIS TAAGMVKGVVDHLLLRLKCDSAFRGVGLLNTGSYYEHVKISAPNEFDVMFKLEVPRIQLE EYSNTRAYYFVKFKRNPKENPLSQFLEGEILSASKMLSKFRKIIKEEINDIKDTDVIMKR KRGGSPAVTLLISEKISVDITLALESKSSWPASTQEGLRIQNWLSAKVRKQLRLKPFYLV PKHAKEGNGFQDLYRTFGTYYYWIPGKLKETIKQMNQVINEYC >gi568815592f:73294906_73516223|GENSCAN_predicted_CDS_7|1212_bp atgcagccttggcacggaaaggccatgcagagagcttccgaggccggagccactgccccc aaggcttccgcacggaatgccaggggcgccccgatggatcccaccgagtctccggctgcc cccgaggccgccctgcctaaggcgggaaagttcggccccgccaggaagtcgggatcccgg cagaaaaagagcgccccggacacccaggagaggccgcccgtccgcgcaactggggcccgc gccaaaaaggcccctcagcgcgcccaggacacgcagccgtctgacgccaccagcgcccct ggggcagaggggctggagcctcctgcggctcgggagccggctctttccagggctggttct tgccgccagaggggcgcgcgctgctccacgaagccaagacctccgcccgggccctgggac gtgcccagccccggcctgccggtctcggcccccattctcgtacggagggatgcggcgcct ggggcctcgaagctccgggcggttttggagaagttgaagctcagccgcgatgatatctcc acggcggcggggatggtgaaaggggttgtggaccacctgctgctcagactgaagtgcgac tccgcgttcagaggcgtcgggctgctgaacaccgggagctactatgagcacgtgaagatt tctgcacctaatgaatttgatgtcatgtttaaactggaagtccccagaattcaactagaa gaatattccaacactcgtgcatattactttgtgaaatttaaaagaaatccgaaagaaaat cctctgagtcagtttttagaaggtgaaatattatcagcttctaagatgctgtcaaagttt aggaaaatcattaaggaagaaattaacgacattaaagatacagatgtcatcatgaagagg aaaagaggagggagccctgctgtaacacttcttattagtgaaaaaatatctgtggatata accctggctttggaatcaaaaagtagctggcctgctagcacccaagaaggcctgcgcatt caaaactggctttcagcaaaagttaggaagcaactacgactaaagccattttaccttgta cccaagcatgcaaaggaaggaaatggtttccaagatttatataggacttttggcacctac tactattggattccagggaaattgaaggagaccataaaacaaatgaaccaagttataaat gagtattgctga >gi568815592f:73294906_73516223|GENSCAN_predicted_peptide_8|748_aa MSQQRVMCFPLQRAYERTCITKIPIPEKQMFIALGMECNPCGKPLNGRMRGACPCGMFYF RGCGRWVAVSFTKQQFPLARLSSDSAAPRTPHFDVIVIGGGHAGTEAATAAARCGSRTLL LTHRVDTIGQMSCNPSFGGIGKGHLMREVDALDGLCSRICDQSGVHYKVLNRRKGPAVWG LRAQIDRKLYKQNMQKEILNTPLLTVQEGAVEDLILTEPEPEHTGKCRVSGVVLVDGSTV YAESVILTTGTFLRGMIVIGLETHPAGRLGDQPSIGLAQTLEKLGFVVGRLKTGTPPRIA KESINFSILNKHIPDNPSIPFSFTNETVWIKPEDQLPCYLTHTNPRVDEIVLKNLHLNSH VKETTRGPRYCPSIESKVLRFPNRLHQVWLEPEGMDSDLIYPQGLSMTLPAELQEKMITC IRGLEKAKVIQPGYGVQYDYLDPRQITPSLETHLVQRLFFAGQINGTTGYEEAAAQGVIA GINASLRVSRKPPFVVSRTEGYIGVLIDDLTTLGTSEPYRMFTSRVEFRLSLRPDNADSR LTLRGYKDAGCVSQQRYERACWMKSSLEEGISVLKSIEFLSSKWKKLIPEASISTSRSLP VRALDVLKYEEVDMDSLAKAVPEPLKKYTKCRELAERLKIEATYESVLFHQLQEIKGVQQ DEALQLPKDLDYLTIRDVSLSHEVREKLHFSRPQTIGAASRIPGVTPAAIINLLRFVKTT QRRQSAMNESSKTDQYLCDADRLQEREL >gi568815592f:73294906_73516223|GENSCAN_predicted_CDS_8|2247_bp atgagccaacagcgggtgatgtgcttccccctgcagagagcctatgaacggacgtgcatc accaagattcctatcccagaaaagcagatgttcatagctctgggaatggaatgcaaccct tgtggaaagcctctaaacggacgcatgaggggcgcctgtccctgtggcatgttctacttc cgaggctgtggccgttgggtcgcggtttccttcaccaagcagcaatttccgttggcacgg ttgagcagtgacagcgcggcgccccggactccgcacttcgacgtgatagtcattggtgga ggacatgccgggactgaggcagccaccgccgccgctcggtgcggctctcggactctgctc ctcactcaccgcgtggacacgatcggtcagatgtcatgtaatccttcctttggtggcatc ggaaagggacatttaatgagggaagtagatgccttggatggcctgtgttctcgcatctgt gaccagtctggtgtacattataaagtattaaaccggcgtaagggaccagctgtgtggggt ctgagagctcagattgataggaaactctataaacagaacatgcagaaagaaatcttgaat acaccactgcttactgttcaggagggagctgtagaagatcttattcttacagaaccagag cctgaacacactgggaaatgccgtgtcagtggggttgttttggtggatggaagcacagta tatgcagagagtgtgattctgactactgggacatttctgagaggcatgattgtaattgga ttggagacgcatccagcaggacgtttaggggatcagccttctataggattggctcagaca ctggagaagttagggtttgtggtgggaaggttgaagactgggactccaccccgaattgcc aaagagtccattaatttcagtattctaaacaagcatataccggacaatccatccatacca ttcagctttaccaatgagacagtatggattaagccagaagatcagctgccatgttacttg actcacaccaaccctagagtggatgagattgtccttaagaaccttcaccttaatagtcat gttaaagaaacgacaagaggacctcgatactgtccctccattgaatcaaaagttttgcgt tttccaaaccgtctacatcaggtttggttggaacctgaaggaatggattctgaccttatc tacccacaggggttatctatgacgctaccagctgagttacaagagaaaatgatcacatgc atcagaggcttggagaaagctaaagtgattcagccaggctacggtgttcagtatgattac ttagatccccgtcagatcaccccttccttggagactcatttggttcaacgactcttcttt gctggacagatcaatggcaccactggttatgaggaagctgcagctcaaggtgtgatagcc ggaatcaacgccagtcttcgggtcagtcgcaagcctccctttgtggttagccgaacagaa ggttacataggagtcttgattgatgacctcactactctgggcaccagtgaaccataccgc atgtttaccagccgagtagagttccgtttgtcactgcgccctgataatgctgacagccgg ctcacactgcgagggtataaagacgctggctgtgtgtcccaacaacgatatgaaagagct tgttggatgaagtcttctttagaagaaggcatttctgtgttgaaatctattgagtttttg agctctaaatggaaaaaattaatcccagaggcttctataagtactagtagaagtctgcct gtcagagctctcgatgttctgaagtatgaggaagttgacatggattcattagccaaggct gttccagagcccttgaagaagtatactaaatgtagagagctggctgaaagactgaaaata gaagccacttatgaatcagtgttgttccatcaactacaagaaataaagggagttcagcaa gatgaagctctccaactgccaaaagacctagattatttgactatcagggatgtgtctttg tcccatgaagttcgagagaaactacattttagtcgtccacagacgatcggggctgctagt cgcatacccggagtaacacctgccgccatcatcaatctgctgagatttgtgaagaccact caacgaagacagtcggctatgaatgaatcatccaagactgatcaatacttatgtgatgca gacagacttcaagagagagagttatag