GENSCAN 1.0 Date run: 4-Nov-116 Time: 06:53:20

Sequence gi568815575r:48196823_48404847 : 208025 bp : 43.95% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 Intr -   2023   1855  169  1  1   84  110    87 0.872  10.02
 1.01 Init -   3388   3275  114  1  0   91   93    67 0.923   7.71
 1.00 Prom -  15531  15492   40                              -1.86

 2.06 PlyA -  16252  16247    6                               1.05
 2.05 Term -  18830  18552  279  2  0  -12   48   248 0.144   6.25
 2.04 Intr -  20071  20003   69  1  0   94   69    42 0.164   2.28
 2.03 Intr -  20257  20163   95  2  2   63   84    40 0.197   0.78
 2.02 Intr -  20524  20396  129  2  0  109   70     1 0.098   1.07
 2.01 Init -  20674  20575  100  1  1  103   73    49 0.206   5.63
 2.00 Prom -  35987  35948   40                              -2.46

 3.05 PlyA -  36755  36750    6                               1.05
 3.04 Term -  41093  40989  105  2  0   68   41    92 0.777   0.91
 3.03 Intr -  46853  46765   89  0  2   92  100    93 0.911  10.59
 3.02 Intr -  48319  48029  291  2  0   90  110   148 0.526  14.31
 3.01 Init -  50395  50248  148  1  1   90   -4    98 0.271   1.05
 3.00 Prom -  53530  53491   40                              -4.76

 4.00 Prom +  55935  55974   40                              -4.26
 4.01 Init +  60264  60271    8  2  2   56  115     4 0.303   0.11
 4.02 Intr +  60353  60488  136  2  1  109   61   106 0.898  10.57
 4.03 Intr +  60924  61038  115  2  1   96   60   120 0.987  10.02
 4.04 Intr +  61714  61809   96  2  0   98   96    61 0.750   7.88
 4.05 Intr +  64944  64993   50  1  2   49  113    22 0.218  -0.80
 4.06 Term +  66960  67109  150  2  0   28   34   142 0.253   0.71
 4.07 PlyA +  67350  67355    6                               1.05

 5.13 PlyA -  67676  67671    6                               1.05
 5.12 Term -  67813  67707  107  2  2   57   49   145 0.606   6.07
 5.11 Intr -  69947  69819  129  0  0  100   70    23 0.509   2.37
 5.10 Intr -  71625  71533   93  1  0   52   66    77 0.010   1.84
 5.09 Intr -  79981  79886   96  2  0   94   96    23 0.411   3.68
 5.08 Intr -  80254  80140  115  1  1  103   60   153 0.690  14.02
 5.07 Intr -  80823  80688  136  2  1  117  100    55 0.845  10.17
 5.06 Intr -  99223  99134   90  0  0   95   57    81 0.001   4.81
 5.05 Intr - 100098 100002   97  1  1   92   -1   130 0.002   3.57
 5.04 Intr - 106877 106782   96  0  0   79   96    41 0.657   3.98
 5.03 Intr - 107519 107405  115  2  1  117   60   161 0.994  16.22
 5.02 Intr - 108091 107957  135  1  0   59  100    96 0.409   8.66
 5.01 Init - 114105 113992  114  0  0   91   93    81 0.817   9.11
 5.00 Prom - 125002 124963   40                              -3.46

 6.05 PlyA - 125203 125198    6                               1.05
 6.04 Term - 128093 127987  107  0  2   63   49   132 0.925   5.37
 6.03 Intr - 130539 130389  151  0  1   91   45    67 0.405   2.44
 6.02 Intr - 131831 131721  111  2  0   29   80    63 0.006   0.18
 6.01 Init - 143640 143599   42  0  0   95  116    25 0.757   6.42
 6.00 Prom - 146759 146720   40                              -6.26

 7.09 PlyA - 149625 149620    6                               1.05
 7.08 Term - 150782 150682  101  0  2   81   35   104 0.631   2.69
 7.07 Intr - 153300 153165  136  0  1   24   99   128 0.488   7.64
 7.06 Intr - 155313 155216   98  1  2   52   99    80 0.921   5.23
 7.05 Intr - 157272 157177   96  1  0   94   96    92 0.977  10.58
 7.04 Intr - 157924 157810  115  1  1  133   60   169 0.999  18.62
 7.03 Intr - 158447 158359   89  0  2  108  100    75 0.687  10.39
 7.02 Intr - 161951 161783  169  2  1   50  110    72 0.991   5.12
 7.01 Init - 164863 164750  114  1  0   91   93    82 0.926   9.21
 7.00 Prom - 177986 177947   40                              -6.26

 8.00 Prom + 180583 180622   40                              -6.56
 8.01 Init + 181110 181223  114  2  0   91   93    67 0.851   7.71
 8.02 Intr + 184057 184225  169  0  1   74  110    59 0.904   6.22
 8.03 Intr + 187212 187300   89  1  2   83  100   105 0.952  10.89
 8.04 Intr + 187739 187853  115  1  1  142   60   162 0.998  18.82
 8.05 Intr + 188530 188625   96  2  0   96   96    83 0.854   9.88
 8.06 Intr + 190489 190538   50  2  2   45  113    33 0.346  -0.10
 8.07 Intr + 192524 192659  136  1  1   33   99   141 0.238   9.84
 8.08 Term + 195047 195147  101  0  2   77   35   140 0.453   5.89
 8.09 PlyA + 196501 196506    6                               1.05

 9.05 PlyA - 196634 196629    6                               1.05
 9.04 Term - 201066 200527  540  0  0   25   50   174 0.397   1.56
 9.03 Intr - 204698 204458  241  1  1   57   56   113 0.557   2.55
 9.02 Intr - 205847 205758   90  1  0   95   57    90 0.624   5.71
 9.01 Intr - 206734 206638   97  2  1   77   -1   120 0.299   1.07

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -   1582   1569   14  2  2   93   53     1 0.905  -4.54
S.002 Term - 102945 102796  150  0  0   32   28   166 0.922   2.91

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575r:48196823_48404847|GENSCAN_predicted_peptide_1|95_aa
MRYHYTLITIAKIKDIVTTPNGDKDAEKMDTSLSAAGKRHRHRVSAPRKGLSKEDYIRSD
IFMINSQPSTSSGQSYVYFISNEQNQAGSTRWDEX

>gi568815575r:48196823_48404847|GENSCAN_predicted_CDS_1|285_bp
atgaggtatcactacacacttattacaatagctaaaataaaagacatagtgacaacacca
aatggtgacaaggatgcagagaaaatggacacctcattaagtgctgctgggaagcgccac
cgacacagagtgtcagcccccagaaagggcctttccaaggaggactatatcaggtctgac
attttcatgatcaacagccagccatctaccagttctggccaatcctatgtgtatttcatc
agcaatgaacagaaccaagctgggagcacgagatgggatgaggnn

>gi568815575r:48196823_48404847|GENSCAN_predicted_peptide_2|223_aa
MPMFVKGHHILLLIMGMCHIPEAEARREKEISGNPAHSDSVLPQDLHMASTVPRSEPSAH
ATVTSLPGLSIPRLEEENYEKGMIWGISDKIGYQYHNRIASTSAGMWRSVDPGMFPSFGS
YDGEELEDKSWSKTRTLFKSMKAERGEEAAGETCEASRGWFMRFKERSCLHNIKVQGEAA
NPDGEAAASYPEDLAKVTDEGGYTKEQIFNTDKIAFYIEEDAI

>gi568815575r:48196823_48404847|GENSCAN_predicted_CDS_2|672_bp
atgcccatgttcgtgaaaggtcaccacattctgcttctcatcatgggcatgtgtcatatc
cccgaggctgaggcaagaagagagaaggaaataagtggcaaccctgcacactctgattct
gtcctaccccaggacctgcacatggcttccacggttcctcgaagtgaaccatctgctcat
gccacagtgacttccttgcctgggttatctattcctaggctagaggaagaaaattatgag
aagggaatgatttggggaataagtgacaagattggataccagtaccataacagaatagct
agcacatctgcagggatgtggaggtctgtggatccaggcatgtttccctcttttggttcc
tatgatggagaagagttggaagataagagttggagcaagacccgaactctcttcaagtcc
atgaaagctgagagaggtgaagaagctgcaggagaaacgtgtgaagctagcagaggttgg
ttcatgaggtttaaggaaagaagctgtctccataacataaaagttcaaggtgaagcagca
aaccctgatggagaagctgcagcaagttatccagaagatctagctaaggtcactgatgaa
ggtggctacactaaagaacagattttcaatacggataaaatagccttctatattgaagaa
gatgccatctag

>gi568815575r:48196823_48404847|GENSCAN_predicted_peptide_3|210_aa
MRYHYTLITIDEIKDIVTTPNGDKDAEKLDTSLSAAGKVKSYSHSGKQFARFSSSKTSRA
VMVTQQPSGLPPYVDPRSPGLMVSIMSSSHSIADRVSAPRKGLSKEDYIRTDIFMINSQP
STSSGQSYVYFISNEQNQAGSMRWDEGQTTPGAMNGDNNCAKRAGDDAQIPEKIQKAKWA
EFLDERQSPRLGPSQKQDSEEPDSSLVKGE

>gi568815575r:48196823_48404847|GENSCAN_predicted_CDS_3|633_bp
atgaggtatcactacacacttattacaatagatgaaataaaagacatagtgacaacacca
aatggtgacaaggatgcagagaaactggacacctcattaagtgctgctgggaaggtaaaa
tcttacagccactctggaaagcagtttgcaaggttttcttcttctaaaacctcaagggct
gtgatggtcactcagcagcctagtggattgccaccgtatgtggacccacgttcccctggc
ttaatggtcagcattatgtcatcgtcccacagcatcgcagacagagtatcagcccccaga
aagggcctttccaaggaggactatatcaggactgacattttcatgatcaacagccagccc
tctaccagttctggccaatcctatgtgtatttcatcagcaatgaacagaaccaagctggg
agcatgagatgggacgagggtcagactactcccggtgccatgaacggagacaacaactgt
gcaaagagagctggggatgatgctcaaataccagagaagatacaaaaggctaaatgggca
gagttcctggatgaaagacagagcccgaggttgggacctagtcagaaacaggactcagag
gagcccgactcaagtttggtcaaaggagagtga

>gi568815575r:48196823_48404847|GENSCAN_predicted_peptide_4|184_aa
MDRTKGPVARKSQAVSLAGETAPGAMNGDDTFAKRPRDDAKASEKRSKAFDDIATYFSKK
EWKKMKYSEKISYVYMKRNYKAMTKLGFKVTLPPFMCNKQATDFQGNDFDNDHNRRIQVE
HPQMTFGRLHRIIPKIMPKKPAEDENDSKGVSEASGPQNDGKQLHPPGKANISEKINKRS
GKRK

>gi568815575r:48196823_48404847|GENSCAN_predicted_CDS_4|555_bp
atggacaggaccaagggtcctgtagctagaaagtctcaggctgtttctcttgcaggtgag
actgctcctggtgccatgaacggagacgacacctttgcaaagagacccagggatgatgct
aaagcatcagagaagagaagcaaggcctttgatgatattgccacatacttctctaagaaa
gagtggaaaaagatgaaatactcggagaaaatcagctatgtgtatatgaagagaaactat
aaggccatgactaaactaggtttcaaagtcaccctcccacctttcatgtgtaataaacag
gccacagacttccaggggaatgattttgataatgaccataaccgcaggattcaggttgaa
catcctcagatgactttcggcaggctccacagaatcatcccgaagatcatgcccaagaag
ccagcagaggacgaaaatgattcgaagggagtgtcagaagcatctggcccacaaaacgat
gggaaacaactgcaccccccaggaaaagcaaatatttctgagaagattaataagagatct
ggtaagaggaaatga

>gi568815575r:48196823_48404847|GENSCAN_predicted_peptide_5|440_aa
MRHHYTLITIAKIKDIVTTPNGDEDAEKLDTSLSAAGKTKAAVAGKTRSVSLAGQTAPGA
MNGDDAFARRPRAGSQIPEKIQKAFDDIAKYFSKKEWEKMKSSEKIIYVYMKRKYEAMTK
LGFKATLPPFMCNTGATDLQGNDFDNDRNHRNQGPKRGKHAWTHRLRERKQLVIYEEISD
PEEDDDLGDTTHAHDEKQNVVTFHERGHGCGPLVIRTKGLEPGKSQAVSLAGQTAPGAMN
GDDAFARRPRVDAQIPEKIQKAFDDIAKYFSKEEWEKMKASEKILYVYMKRKYEAMTKLG
FKATLPPFMCNKRTADFQGNDFDNDYNHGHQGWENQEAKVMGYGRSKQLQSCLHLGLVFQ
VHDPAHSDSALPQDLDTPSMVPRSEPSAHATVTSSPGLCIPRLEEETATATPPFSNHHLD
QPAAINTEARPSTNKKSVTH

>gi568815575r:48196823_48404847|GENSCAN_predicted_CDS_5|1323_bp
atgaggcatcactacacacttattacaatagctaaaataaaagacatagtgacaacacca
aatggtgatgaggatgcagagaaactggacacctcattaagtgctgctgggaagaccaaa
gctgctgtggctggaaagactcggtctgtttctcttgcaggacagactgctcccggtgcc
atgaacggagacgatgcctttgcaaggagacctagggctggttctcaaataccagagaag
atccaaaaggccttcgatgatattgccaaatacttctctaagaaagagtgggaaaagatg
aaatcctcggagaaaatcatctatgtgtatatgaagagaaagtatgaggccatgactaaa
ctaggtttcaaggctaccctcccacctttcatgtgtaatacaggggccacagacctccag
gggaatgattttgataatgaccgtaaccacaggaatcagggacccaaaagggggaaacat
gcctggacccacagactgcgtgagagaaagcagctggtgatttatgaagagatcagcgac
cctgaagaagacgacgacctcggggatacgacacatgcccatgatgagaagcagaatgtg
gtgacctttcacgaacgtgggcatggctgcggacccctcgtcatcaggaccaaaggtctt
gagcctggaaagtctcaggcagtttctcttgcaggtcagactgctcccggtgccatgaat
ggagacgacgcctttgcaaggagacccagggttgatgctcaaataccagagaagatacaa
aaggccttcgatgatattgccaaatacttctctaaggaagagtgggaaaagatgaaagcc
tcagagaaaatcctctatgtgtatatgaagagaaagtatgaggccatgactaaactaggt
ttcaaggcaaccctcccacctttcatgtgtaataaacggaccgcagacttccaggggaat
gattttgataatgactataaccatgggcatcagggctgggagaaccaggaggccaaggtg
atgggctatggacggtcaaaacagctccaatcctgcctccacctggggctggtgtttcaa
gtccatgaccctgcacactctgattctgccctaccccaggacctggacacgccttccatg
gttcctcgaagtgaaccatctgctcatgccacagtgacttcctcgcctggtttatgcatt
cctaggctagaagaagaaactgccacagccactccacccttcagcaaccaccaccttgat
cagccagcagccatcaacaccgaggcaagaccctccaccaacaagaagagtgtaactcac
tga

>gi568815575r:48196823_48404847|GENSCAN_predicted_peptide_6|136_aa
MDLCRLSLSHVKGKNPKMVQRAKAKMPNHLLQNPAPGLKCIVSIKIKTEGQGLHTPSTVP
RSEPSAHATVTSSPGLSIPRLEEGVARISGLTWGLGTHSILETATATPIFSNHHLDQPAA
INTEARPSTSKKSVTH

>gi568815575r:48196823_48404847|GENSCAN_predicted_CDS_6|411_bp
atggacctgtgccggttgtctttatcccatgtcaagggaaagaacccgaaaatggttcag
agagcgaaggccaagatgcccaaccacctgctgcagaatcctgctccaggactgaagtgt
atagtctctatcaaaataaaaactgaaggccagggcctgcacacgccttccacggttcct
cgaagtgaaccatctgctcatgctacagtgacttcctcacctggtttatcgattcctagg
ctagaggaaggtgtggcccgcatatcagggctgacctggggtttgggaacccacagcatc
ctggaaactgccacagccactccaatcttcagcaaccaccaccttgatcagccagcagcc
atcaacaccgaggcaagaccctccaccagcaaaaagagtgtgactcactga

>gi568815575r:48196823_48404847|GENSCAN_predicted_peptide_7|305_aa
MRYHYTLITIAKIKDIVTTPNGDKDAEKLDTLLSAAGKHCRHRVSTSRKGLSKEDYIRSD
IFMINSQPSTSSGQSYLYFISNEQNQAGSTRWDKGQTTPCAMNGDDTFARRPTVGAQIPE
KIQKAFDDIAKYFSKEEWEKMKVSEKIVYVYMKRKYEAMTKLGFKAILPSFMRNKRVTDF
QGNDFDNDPNRGNQDDFRQAPGNLPEGEYLSDLKDQRTFVPPRMRTLIMPKKPAEEGNVS
KEVPEASGPQNDGKQLCPPGKPTTSEKINMISGPKRGEHAWTHRLRERKQLVIYEEISDP
EEDDE

>gi568815575r:48196823_48404847|GENSCAN_predicted_CDS_7|918_bp
atgaggtatcactacacacttattacaatagctaaaataaaagacatagtgacaacacca
aatggtgacaaggatgcagagaaactggacaccttattaagtgctgctgggaagcactgc
agacacagagtatcaacctccagaaagggcctttccaaggaggactatatcaggtctgac
attttcatgatcaacagccagccatctaccagttctggccaatcctatctgtatttcatc
agcaatgaacagaaccaagctgggagcacgagatgggacaagggtcagactactccctgt
gccatgaacggagatgacacctttgcaaggagacccacggttggtgctcaaataccagag
aagatacaaaaggccttcgatgatattgccaaatacttctctaaggaagagtgggaaaag
atgaaagtctcggagaaaatcgtctatgtgtatatgaagagaaagtatgaggccatgact
aaactaggtttcaaggccatcctcccatctttcatgcgtaataaacgggtcacagacttc
caggggaatgattttgataatgaccctaaccgtgggaatcaggatgactttcggcaggct
ccagggaatcttcccgaaggtgagtatctttcagatctaaaggaccagagaacctttgtc
cctccaaggatgcgaacactgatcatgcccaagaagccagcagaggaaggaaatgtttcg
aaggaagtgccagaagcatctggcccacaaaacgatgggaaacagctgtgccccccggga
aaaccaactacctctgagaagattaacatgatatctggacccaaaaggggggaacatgcc
tggacccacagactgcgtgagagaaagcagctggtgatttatgaagagatcagcgatcct
gaggaagatgatgagtaa

>gi568815575r:48196823_48404847|GENSCAN_predicted_peptide_8|289_aa
MRYHYTLITIAKIKDIVTTPNGDKDAEKMDTSLSAAGKHRRHRVSAPRKGLSKEDYIRSD
IFMINSQPSTSSGHSCLYFINNKQNQAGSTRWDEGETAPSAMNGDDAFARRPRDDAQISE
KLRKAFDDIAKYFSKKEWEKMKSSEKIVYVYMKLNYEVMTKLGFKVTLPPFMRSKRAADF
HGNDFGNDRNHRNQVERPQMTFGSLQRIFPKIMPKKPAEEENGLKEVPEASGPQNDGKQL
CPPGNPSTLEKINKTSGPKRGKHAWTHRLRERKQLVVYEEISDPEEDDE

>gi568815575r:48196823_48404847|GENSCAN_predicted_CDS_8|870_bp
atgaggtatcactacacacttattacaatagctaaaataaaagacatagtgacaacacca
aatggtgacaaggatgcagagaaaatggacacctcattaagtgctgctgggaagcaccgc
agacacagagtatcagcccccagaaagggcctttccaaggaggactatatcaggtctgac
attttcatgatcaacagccagccatctaccagttctggccactcctgtctgtatttcatc
aacaataaacagaaccaagctgggagcacgagatgggacgagggtgagactgctcccagt
gccatgaacggagacgacgcctttgcaaggagacccagggatgatgctcaaatatcagag
aagttacgaaaggccttcgatgatattgccaaatacttctctaagaaagagtgggaaaag
atgaaatcctcggagaaaatcgtctatgtgtatatgaagctaaactatgaggtcatgact
aaactaggtttcaaggtcaccctcccacctttcatgcgtagtaaacgggctgcagacttc
cacgggaatgattttggtaacgatcgaaaccacaggaatcaggttgaacgtcctcagatg
actttcggcagcctccagagaatcttcccgaagatcatgcccaagaagccagcagaggaa
gaaaatggtttgaaggaagtgccagaggcatctggcccacaaaatgatgggaaacagctg
tgccccccgggaaatccaagtaccttggagaagattaacaagacatctggacccaaaagg
gggaaacatgcctggacccacagactgcgtgagagaaagcagctggtggtttatgaagag
atcagcgaccctgaggaagatgacgagtaa

>gi568815575r:48196823_48404847|GENSCAN_predicted_peptide_9|322_aa
XPKRGKHAWTHRLRERKQLVVYEEISDPEEDDDLGDMTHAHDEKQNVVTFHEHGHGCGPL
VIRDYTLQSWSRILQQVVGHLGLCSLNDFQVQGLGRSIWEYVEGDTDEIVIWGTWRNEED
ACTVDRDGQGIEESTQSPCRGATLPMTFLTELEKTTLKFIWNQKRAHIAKSILSQKNKAG
GITLPDFKLYYKATVTKTACYWYQNRDIDQRNRTEPSEIMPHIYNHLIFDKPDKNKQWGK
DSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDFNIRPKTIKTLEENLGNTIQDI
GMGKDFMTKTPKAMATKAKIDK

>gi568815575r:48196823_48404847|GENSCAN_predicted_CDS_9|969_bp
ngacccaaaagggggaaacatgcctggacccacagactgcgtgagagaaagcagctggtg
gtttatgaagagatcagcgaccctgaggaagatgacgacctcggggatatgacacatgcc
catgatgagaagcagaacgtggtgacctttcacgaacatgggcatggctgcggacccctc
gtcatcagagactatacacttcagtcctggagcaggattctgcagcaggtggttgggcat
ctgggcctttgctctctgaatgattttcaagttcaagggctgggacggtccatttgggag
tatgtggaaggagacacagatgaaatcgtcatctggggaacgtggaggaatgaggaagat
gcgtgcactgtagaccgtgatggacagggaatagaagagtccactcagtctccatgcagg
ggagcaacgctaccaatgactttcctcacagaattggaaaaaactactttaaagttcata
tggaaccaaaaaagagcccacattgccaagtcaatcctaagccaaaagaacaaagccgga
ggcatcacgctacctgacttcaaactatactacaaggctacagtaaccaaaacagcttgt
tactggtaccaaaacagagatatagatcaaaggaacagaacagagccctcagaaataatg
ccacatatctacaaccatctgatctttgacaaacctgacaaaaacaagcaatggggaaag
gattccctatttaataaatggtgctgggaaaactggctagccatatgtagaaagctgaaa
ctggatcccttccttacaccttatacaaaaattaattcaagatggattaaagacttcaac
attagacctaaaaccataaaaaccctagaagaaaacttaggcaataccattcaggacata
ggcatgggcaaggacttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaatt
gacaaatga