GENSCAN 1.0 Date run: 4-Nov-116 Time: 22:32:06

Sequence gi568815593r:178612063_178827128 : 215066 bp : 45.23% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.12 Intr -    483    354  130  0  1   63   52   133 0.529   7.47
 1.11 Intr -    828    734   95  1  2  101  115     3 0.997   3.98
 1.10 Intr -   1577   1411  167  1  2   82   87    54 0.999   4.30
 1.09 Intr -   1781   1665  117  1  0   67   89    81 0.942   5.68
 1.08 Intr -   4886   4820   67  0  1  109  103    78 0.945   9.36
 1.07 Intr -  11354  11194  161  1  2   71   27   105 0.117   2.33
 1.06 Intr -  15793  15701   93  0  0   94   99    -4 0.051   0.38
 1.05 Intr -  20928  20890   39  2  0   24   91    95 0.031   0.74
 1.04 Intr -  41134  41063   72  0  0   45   80   136 0.046   7.02
 1.03 Intr -  47180  47051  130  0  1   21  -16   217 0.027   4.35
 1.02 Intr -  50374  50280   95  0  2   80   67    23 0.158  -0.99
 1.01 Init -  50601  50558   44  0  2  119   52    68 0.357   4.68
 1.00 Prom -  59017  58978   40                              -3.26

 2.09 PlyA -  62693  62688    6                               1.05
 2.08 Term -  72180  72056  125  1  2   76   41    94 0.944   2.05
 2.07 Intr -  73420  73316  105  2  0   73  110    50 0.858   5.89
 2.06 Intr -  77512  77431   82  1  1   -8  113    50 0.287  -2.89
 2.05 Intr -  82885  82758  128  2  2  131   80    87 0.877  12.60
 2.04 Intr -  83022  82939   84  1  0   75   92    35 0.748   2.39
 2.03 Intr -  83536  83225  312  2  0   72   72   165 0.942   9.46
 2.02 Intr -  84119  84035   85  2  1   77   92    63 0.353   4.99
 2.01 Init -  97595  97593    3  2  0   64  103     0 0.014  -0.90
 2.00 Prom -  99853  99814   40                              -5.76

 3.07 PlyA -  99928  99923    6                               1.05
 3.06 Term - 101559  99998 1562  1  2  114   43   571 0.893  45.76
 3.05 Intr - 113409 113314   96  1  0   72   97    41 0.529   3.38
 3.04 Intr - 115063 114937  127  1  1   80   82   125 0.981  11.35
 3.03 Intr - 117011 116928   84  2  0   93   99    46 0.560   6.22
 3.02 Intr - 117302 117205   98  0  2   23   68    98 0.435   1.03
 3.01 Init - 118707 118626   82  0  1   43   72   108 0.501   3.83
 3.00 Prom - 123628 123589   40                              -1.26

 4.00 Prom + 135762 135801   40                              -2.46
 4.01 Init + 143943 144014   72  2  0   71   40   140 0.238   6.60
 4.02 Term + 145048 145134   87  0  0   90   43    55 0.589  -1.14
 4.03 PlyA + 146960 146965    6                               1.05

 5.08 PlyA - 149346 149341    6                               1.05
 5.07 Term - 155273 155136  138  2  0   87   47   226 0.681  16.36
 5.06 Intr - 160569 160367  203  1  2  105   60   309 0.697  28.80
 5.05 Intr - 162523 162436   88  1  1   64   77    49 0.462   0.94
 5.04 Intr - 163274 163205   70  1  1  119   94    53 0.612   8.18
 5.03 Intr - 163653 163511  143  0  2  102   11    68 0.566  -0.35
 5.02 Intr - 164214 164080  135  0  0   73   75   112 0.956   9.16
 5.01 Init - 166741 166736    6  1  0   92   60    17 0.367  -0.82
 5.00 Prom - 167114 167075   40                              -6.36

 6.05 PlyA - 167247 167242    6                               1.05
 6.04 Term - 170306 170164  143  0  2  -21   38   190 0.143   1.19
 6.03 Intr - 175064 175006   59  1  2   52  100   102 0.648   6.33
 6.02 Intr - 176847 176727  121  1  1   70   99    73 0.905   6.15
 6.01 Init - 178635 178560   76  0  1   72   64    71 0.897   2.54
 6.00 Prom - 180435 180396   40                              -4.26

 7.00 Prom + 180561 180600   40                              -5.66
 7.01 Init + 180737 180825   89  1  2   95   88    39 0.945   4.71
 7.02 Term + 181070 181469  400  2  1   82   44   290 0.781  18.59
 7.03 PlyA + 181758 181763    6                              -1.75

 8.03 PlyA - 182264 182259    6                               1.05
 8.02 Term - 185549 185427  123  2  0  132   49   173 0.703  16.18
 8.01 Init - 185834 185607  228  2  0   82   80    86 0.809   4.34
 8.00 Prom - 190520 190481   40                              -2.46

 9.02 PlyA - 191280 191275    6                               1.05
 9.01 Term - 206939 206700  240  2  0   68   49   192 0.525   9.33

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  30344  30123  222  2  0  100   55   142 0.870   7.33
S.002 Init + 138031 138159  129  0  0   88   33    94 0.813   4.06
S.003 Term + 139625 139708   84  1  0  115   42    34 0.824  -0.85
S.004 Sngl - 191634 191086  549  0  0   86   41   164 0.889   7.62

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:178612063_178827128|GENSCAN_predicted_peptide_1|404_aa
METSLTPMALPLVRRLMCPGSDKGLRLGREEGSLATPIEHHVDQGAVHVEDPAEALAGIK
HQQLAAAIIVTSIIISTVTIVTKAACRWVCVIHNSQDMETTDMSIDGGMDEEACGSDNPD
GYYVTVRAPTLTLIFLMLHTYTGELPEGPQNCRIPYLRCGIPKELTVLIGIAEKAGDMKA
IVEVTSGRGDLIVAHKRTGIVNHITSLKNLIDEIVDTLGEGAFGKVVECIDHGMDGMHVA
VKIVKNVGRYREAARSEIQVLEHLNSTDPNSVFRCVQMLEWFDHHGHVCIVFELLGLSTY
DFIKENSFLPFQIDHIRQMAYQICQSINFLHHNKLTHTDLKPENILFVKSDYVVKYNSKM
KRDERTLKNTDIKVVDFGSATYDDEHHSTLVSTRHYRAPEVILX

>gi568815593r:178612063_178827128|GENSCAN_predicted_CDS_1|1212_bp
atggagacctccctgacccccatggccttgcccctggtgcgcaggctgatgtgtccaggc
tcagataagggcctgcgtctggggagagaggagggttcactagccacccccatagagcat
cacgtggaccagggagcagtgcatgtagaagacccagcagaagctctggcaggtattaaa
caccagcagctggcggccgccatcattgtcacatccatcatcatctccaccgtgaccatc
gtcacaaaggctgcctgcagatgggtgtgtgttattcacaatagccaagatatggaaaca
accgacatgtccatcgacggaggaatggatgaagaagcatgtggatctgacaacccagat
ggctactacgtgactgtcagggctccgactttgaccctaatatttctcatgttacatact
tacacgggggaactgcctgagggtcctcagaactgcagaataccttatttgagatgcggc
attccaaaagaactcactgtcctgattgggatagcagagaaagctggggacatgaaagct
atcgtggaagtcacaagcggaagaggagatctcatagtagcacacaagagaacaggcatt
gtaaaccacatcaccagtttaaagaatctgattgatgaaatcgtggacactttgggtgaa
ggagcctttggcaaagttgtagagtgcattgatcatggcatggatggcatgcatgtagca
gtgaaaatcgtaaaaaatgtaggccgttaccgtgaagcagctcgttcagaaatccaagta
ttagagcacttaaatagtactgatcccaatagtgtcttccgatgtgtccagatgctagaa
tggtttgatcatcatggtcatgtttgtattgtgtttgaactactgggacttagtacttac
gatttcattaaagaaaacagctttctgccatttcaaattgaccacatcaggcagatggcg
tatcagatctgccagtcaataaattttttacatcataataaattaacccatacagatctg
aagcctgaaaatattttgtttgtgaagtctgactatgtagtcaaatataattctaaaatg
aaacgtgatgaacgcacactgaaaaacacagatatcaaagttgttgactttggaagtgca
acgtatgatgatgaacatcacagtactttggtgtctacccggcactacagagctcccgag
gtcattttggnn

>gi568815593r:178612063_178827128|GENSCAN_predicted_peptide_2|307_aa
MIVILPTETTINIQKMEQENTAQGSEKPSVQSVKPWSDQEIRSFLQEWEFLEREVYRVKK
KYHIVSKAIAQRLKQRGINKSWKECLQMLISLQDLYFTIQEANQRPRCQPLPCPYGEALH
RILGYRWKISVFSVIILRTLFVDLFRMDMALGLGEVTSKDSGPPCADVVNLAPPEHPPQA
YGVPIVFQEPMWAPTPVIYVENPQLLNTSVPTTHLDPGNDQRMGSGTRWAHENDSLLIQR
RYWNTSDWGKEDMCVTQERSCSSQVAVKKGRNEKRTKQNFYSSYYHMDTSTHILILPVTT
DLCSSVQ

>gi568815593r:178612063_178827128|GENSCAN_predicted_CDS_2|924_bp
atgatagtaatacttcccactgaaactaccataaacatccagaaaatggagcaggaaaac
acagcccagggatcagaaaagccctcagtccagtcagttaaaccttggagtgaccaggaa
atccggagtttcctgcaagaatgggaatttcttgaacgtgaggtgtacagggtgaagaag
aagtatcacatagtatcaaaagcaattgctcagcgtctcaagcagaggggtatcaacaag
agctggaaggaatgtctccagatgctaataagcttgcaggacttatacttcactattcag
gaggccaaccagaggccaaggtgccaacccttgccatgtccttatggtgaggccctgcac
aggattctggggtacagatggaagatcagcgtcttctcagtaataattctgaggactttg
ttcgtggacttgttcagaatggacatggccctggggctgggtgaggtcacaagcaaggat
tcaggtcctccctgtgcagatgtggttaacctcgcacctcccgagcacccgccccaggcc
tatggcgttcccatagtctttcaggagccgatgtgggccccaacacctgtgatctatgtg
gaaaatcctcagctgctgaacacgtctgttccaactacacatctggacccgggaaatgac
cagagaatgggttccgggacccgctgggcccacgagaatgacagtttgctcattcagcgc
aggtactggaatacctcagactggggtaaagaagacatgtgtgttacacaggaacggagc
tgcagctcacaagtagcagtgaaaaagggaagaaatgaaaagagaacaaaacagaatttc
tacagctcctattaccacatggacacaagtacccacatcctcatccttcctgttaccaca
gatctgtgctcttctgtacagtaa

>gi568815593r:178612063_178827128|GENSCAN_predicted_peptide_3|682_aa
MQVSSRGLHFPAASAAAPPLLVPGAPRVPAGAEELSGVGRALALVGSAGRCCHCGSSVPS
ICLLETAPSSRESQKEDMAAGQREARPQVSLTFEDVAVLFTRDEWRKLAPSQRNLYRDVM
LENYRNLVSLGLPFTKPKVISLLQQGEDPWEVEKDGSGVSSLGSKSSHKTTKSTQTQDSS
FQGLILKRSNRNVPWDLKLEKPYIYEGRLEKKQDKKGSFQIVSATHKKIPTIERSHKNTE
LSQNFSPKSVLIRQQILPREKTPPKCEIQGNSLKQNSQLLNQPKITADKRYKCSLCEKTF
INTSSLRKHEKNHSGEKLFKCKECSKAFSQSSALIQHQITHTGEKPYICKECGKAFTLST
SLYKHLRTHTVEKSYRCKECGKSFSRRSGLFIHQKIHAEENPCKYNPGRKASSCSTSLSG
CQRIHSRKKSYLCNECGNTFKSSSSLRYHQRIHTGEKPFKCSECGRAFSQSASLIQHERI
HTGEKPYRCNECGKGFTSISRLNRHRIIHTGEKFYNCNECGKALSSHSTLIIHERIHTGE
KPCKCKVCGKAFRQSSALIQHQRMHTGERPYKCNECGKTFRCNSSLSNHQRIHTGEKPYR
CEECGISFGQSSALIQHRRIHTGEKPFKCNTCGKTFRQSSSRIAHQRIHTGEKPYECNTC
GKLFNHRSSLTNHYKIHIEEDP

>gi568815593r:178612063_178827128|GENSCAN_predicted_CDS_3|2049_bp
atgcaggtgtcctctcgtggactccatttcccagcggcctcagcggctgccccgcccctc
ctggtccccggcgcgccgcgggtgccagctggcgccgaggaactcagcggcgtggggcga
gccctggccctggtgggctcagcgggtcgctgctgccactgcggctccagcgtcccctcc
atctgccttctggagactgcgccgtcctcccgggagagccagaaagaggacatggctgct
gggcagcgggaagcgaggccccaggtgtcactgacgtttgaggatgtggctgtgctgttt
acccgagatgagtggagaaagctggccccttctcagagaaacttgtaccgggatgtgatg
ctggagaactataggaacctggtctcactggggctcccatttaccaaaccaaaagtgatc
tccctgttgcagcaaggagaagatccctgggaggtggagaaagacggttctggcgtctcc
tctctaggatcgaagagcagtcataaaaccacaaagtcaacgcaaacacaagactcttca
tttcagggactgatactgaaaagatccaacaggaatgtaccttgggatttgaaattagaa
aagccttacatatatgaaggcagattagagaaaaagcaggataaaaagggaagttttcag
atagtttcagccacccacaaaaaaatccccactatagaaagaagccataaaaatactgaa
ttgagccaaaacttcagcccaaagtcagtgcttattaggcaacagatacttcccagagaa
aaaacaccaccaaaatgtgaaatacaaggaaacagcctcaaacagaattcacaattactt
aatcaaccaaaaattacagcagataaacgctataaatgtagtctgtgtgaaaaaaccttc
attaacacttcatcccttcgtaaacatgagaaaaaccatagtggagagaaactatttaag
tgtaaagaatgttcaaaagcctttagccaaagttcagctcttattcaacatcaaataacg
catactggagagaaaccctacatatgtaaagaatgtgggaaagcctttactctcagtaca
tccctttataaacatctaagaacccatactgtggagaaatcctacagatgtaaagaatgt
ggtaaatccttcagccgaaggtcaggcctttttatacatcaaaaaattcatgctgaagaa
aacccttgtaagtataatccgggtaggaaggcatctagttgcagcacatccctttctgga
tgtcaaagaattcattctagaaagaagtcctacttatgtaatgaatgtggcaacaccttt
aagtctagctcatcccttcgttatcatcagagaattcacactggagagaagccttttaaa
tgtagtgaatgtgggagagccttcagccagagtgcctctcttattcaacatgaaagaatt
cacaccggagaaaagccctatagatgcaatgaatgtgggaaaggctttacttctatttca
cgacttaatagacaccgaatcattcatactggagagaagttttataattgtaatgaatgt
ggtaaagccttaagctcccactcaacacttattattcacgagcgaattcatactggagaa
aaaccatgtaaatgtaaagtatgtggaaaagccttcagacagagttcagctctcattcaa
catcagagaatgcatactggagaaagaccctataaatgtaacgagtgtgggaaaacattc
aggtgtaactcatcacttagtaatcaccagagaattcatactggagagaaaccatatcga
tgtgaggaatgtgggatatcttttggccaaagttcagctcttattcagcatcgaaggatt
catacaggagaaaaaccctttaaatgtaatacatgtggaaaaacttttagacaaagctca
tcacgtattgcacatcagagaattcatactggagagaaaccctatgaatgtaatacatgt
gggaaacttttcaaccataggtcatcccttactaatcattataaaattcatatcgaagag
gacccctag

>gi568815593r:178612063_178827128|GENSCAN_predicted_peptide_4|52_aa
MKVMVMVVVMVMMVVVVMVGVVVMRELTFLFDGYAFSLLSSVTTRGEDLLTK

>gi568815593r:178612063_178827128|GENSCAN_predicted_CDS_4|159_bp
atgaaggtgatggtgatggtggtggtgatggtgatgatggtggtggtggtgatggtgggg
gtggtggtgatgagagagctaacctttctctttgatggttacgctttcagcctgctttcc
agtgtcaccacacggggtgaggatcttctcacaaagtag

>gi568815593r:178612063_178827128|GENSCAN_predicted_peptide_5|260_aa
MLEIFGALLQEESLLLRQMTSRVAKRQRCAFDDGQPVPLSVDDQGLSLPLATLASQDVAR
SFLRSTQRCLQMLLLTPSALAALGPALCEFLGPCWYQGSWRLLQHQPEDRGRHHARLESC
GVALTFSRFPVTARSTPRFGSLETCNIVDSFEEVEDSLYVPQYNKYGEERVIVFLKTASG
HAFQPDLVKRICDAIRVGLSVRHVPSLILETKGIAYTLSGNKVEVAVKQIIARKAMEQRG
AFSNPEALHLYWDIPELNGF

>gi568815593r:178612063_178827128|GENSCAN_predicted_CDS_5|783_bp
atgctggaaatctttggggccttacttcaggaggagagcctgctgctaaggcagatgact
tctcgtgttgcaaagcggcagagatgcgcttttgatgatggccagccagttcccctcagt
gtggatgatcaagggctgagcctgcccctggccactcttgcttcccaagacgtggctcgt
tctttcctgcgctccactcagaggtgtctccagatgctcctgttgacccccagtgccctg
gccgcacttggtcccgccctctgtgaattcctgggtccttgctggtatcagggctcatgg
cgactgctgcagcatcaacccgaagaccgggggcgtcatcatgctcggctggagagctgc
ggtgtagccctgactttttctcgctttccagtgacggcacgctcaaccccgcggttcggc
agcctggaaacctgtaacatagtggactcctttgaggaggtggaggacagcctgtatgtc
ccccagtataacaagtacggggaagagagggtgatcgtcttcctgaagacagcctctggg
cacgccttccagcctgacttggtgaagaggatctgtgacgccatccgcgtgggcttgtct
gtgcggcatgtgcccagcctcatcctggaaaccaagggcatcgcgtacacgctcagtggc
aataaagtggaagttgccgtcaaacagatcatcgctcgaaaagccatggagcaacgaggt
gctttctcgaaccccgaggccctgcatctgtactgggacatccctgagctgaatggcttc
tga

>gi568815593r:178612063_178827128|GENSCAN_predicted_peptide_6|132_aa
MLSQGSLWMWPLPCLGERLVQFGEAGLQAVAGTGSQWFRALTLSHASVQRPLQYSLEIRR
GMSARSILVTRAKWLSVLEEKAVKPGTDITSCFMGQNISLPVYKGEIQAWNLGMAVDAWN
EEGDGSANCFFL

>gi568815593r:178612063_178827128|GENSCAN_predicted_CDS_6|399_bp
atgctgtcgcagggctccctgtggatgtggcctttaccctgcctgggagagcgtcttgtg
cagtttggcgaggcagggctccaggcagttgcaggcaccgggtcccaatggtttagggcc
ctgacgctgtctcatgcttcggtgcagcgacccctacagtactcgctagaaataagacgt
ggcatgagtgcccgcagcatcctggtaaccagggcgaagtggctgtcggtgctggaagag
aaggccgtgaagccgggcaccgacatcacctcctgcttcatgggccagaatatttctctt
cctgtgtataaaggggagattcaggcctggaacctgggcatggccgtggatgcgtggaat
gaggaaggtgatggctccgccaactgtttcttcctgtga

>gi568815593r:178612063_178827128|GENSCAN_predicted_peptide_7|162_aa
MAAVDNAILCKCWNFAESRFQALSLLNTKSFTFRFWHLPHPPLREGTSTKAENSGDAPSS
VEGSHGSASDPKEPSWCCLSLVLGTASWYLPYPVNQVPEHVGHGHQGEPSYKTTATPVAR
MDTIPFRIIQPTCKDRRGVASAGTRQLPPQLPLSFLCPWKTT

>gi568815593r:178612063_178827128|GENSCAN_predicted_CDS_7|489_bp
atggctgcagttgacaacgccatactgtgcaagtgctggaactttgctgagagtcgattt
caggcactttcattgctaaacacaaaaagcttcaccttccgcttctggcacctcccacat
cctcctcttagagaggggacatcaacaaaagcagagaattccggagacgccccgagctcg
gtagaaggcagccacggctctgcatccgatcccaaggagccttcctggtgctgcctatcc
cttgttctgggcacagcatcttggtacctaccctatcctgtcaaccaggtcccagagcat
gttgggcacgggcaccagggggagccatcgtacaagaccacggccactcctgtggccaga
atggacaccatcccgttccgcatcatccagcccacctgcaaagacaggaggggtgtggct
tcagcaggcaccaggcagctccccccgcagctgcccctcagcttcctgtgtccctggaaa
accacctaa

>gi568815593r:178612063_178827128|GENSCAN_predicted_peptide_8|116_aa
MVPSMARVLRTQRQAGVSGFFSPGYKPRSAFGWQRDVGAWLITREKGEPPKSASARKEIG
DLLGLLLVASEWHVDQGTLIQHRTEHLLHSNMTSGDILLCYITVKASTGTSLCYIQ

>gi568815593r:178612063_178827128|GENSCAN_predicted_CDS_8|351_bp
atggtgccctcgatggccagggtactgagaactcaaaggcaggcaggagtgtcagggttt
ttctctccaggctacaagccacgctcagcttttgggtggcagagggacgtgggggcatgg
ctcatcaccagggagaagggggagcctccaaagtctgcttctgccaggaaagaaattggg
gatctgctggggctgcttctggtagcgtctgagtggcatgtggaccagggcaccctcatc
cagcatcggacggagcacctgctgcacagcaacatgaccagcggtgacatccttctgtgc
tacatcacggtaaaggcttccacaggcaccagcctgtgctacatccagtga

>gi568815593r:178612063_178827128|GENSCAN_predicted_peptide_9|79_aa
CILPHSLYPDLNVKPKTIKTPDNSIGNTIQDIGMDEDFMRKTPKATATKAKFDEWDLIKL
KSFCVAEETISTVKRQPIE

>gi568815593r:178612063_178827128|GENSCAN_predicted_CDS_9|240_bp
tgtatattaccacatagtctttatccagatttaaatgtaaaacccaaaaccataaaaacc
cctgacaacagcataggcaataccattcaggacataggcatggacgaagatttcatgaga
aagacaccaaaagcaacggcaacaaaagccaaatttgacgaatgggatctaattaaactg
aagagcttctgcgtagcagaagaaactatcagcacagtaaagaggcaacctatagaatag