GENSCAN 1.0 Date run: 8-Nov-116 Time: 02:40:05 Sequence gi568815593r:178503620_178723416 : 219797 bp : 45.89% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 3524 3563 40 -2.36 1.01 Init + 11668 11712 45 0 0 79 101 18 0.069 1.00 1.02 Intr + 15594 15737 144 2 0 110 37 75 0.052 5.08 1.03 Term + 18911 19132 222 1 0 25 47 149 0.057 1.42 1.04 PlyA + 19389 19394 6 -0.45 2.00 Prom + 19570 19609 40 -5.46 2.01 Init + 22148 22213 66 1 0 67 98 69 0.594 6.87 2.02 Intr + 38864 39013 150 1 0 62 27 135 0.242 5.16 2.03 Intr + 40882 40933 52 0 1 88 77 5 0.041 -2.12 2.04 Intr + 50415 50461 47 1 2 81 105 51 0.186 4.23 2.05 Intr + 54932 55042 111 1 0 72 85 33 0.232 1.98 2.06 Term + 56843 57133 291 1 0 37 52 161 0.246 2.74 2.07 PlyA + 58935 58940 6 1.05 3.00 Prom + 63037 63076 40 -5.46 3.01 Init + 72356 72529 174 1 0 84 63 109 0.922 7.35 3.02 Intr + 73521 73859 339 2 0 19 9 278 0.608 8.67 3.03 Term + 73864 74265 402 0 0 40 52 190 0.680 5.75 3.04 PlyA + 75235 75240 6 -1.75 4.17 PlyA - 75315 75310 6 1.05 4.16 Term - 76258 76172 87 1 0 70 42 82 0.057 -0.54 4.15 Intr - 79222 79125 98 2 2 54 77 66 0.066 1.83 4.14 Intr - 81941 81623 319 2 1 7 35 171 0.125 -0.77 4.13 Intr - 86632 86285 348 1 0 82 83 475 0.669 41.95 4.12 Intr - 108926 108807 120 2 0 63 26 137 0.126 5.69 4.11 Intr - 109271 109177 95 0 2 101 115 3 0.997 3.98 4.10 Intr - 110020 109854 167 0 2 82 87 54 0.999 4.30 4.09 Intr - 110224 110108 117 0 0 67 89 81 0.942 5.68 4.08 Intr - 113329 113263 67 2 1 109 103 78 0.945 9.36 4.07 Intr - 119797 119637 161 0 2 71 27 105 0.117 2.33 4.06 Intr - 124236 124144 93 2 0 94 99 -4 0.051 0.38 4.05 Intr - 129371 129333 39 1 0 24 91 95 0.031 0.74 4.04 Intr - 149577 149506 72 2 0 45 80 136 0.046 7.02 4.03 Intr - 155623 155494 130 2 1 21 -16 217 0.027 4.35 4.02 Intr - 158817 158723 95 2 2 80 67 23 0.158 -0.99 4.01 Init - 159044 159001 44 2 2 119 52 68 0.357 4.68 4.00 Prom - 167460 167421 40 -3.26 5.09 PlyA - 171136 171131 6 1.05 5.08 Term - 180623 180499 125 0 2 76 41 94 0.944 2.05 5.07 Intr - 181863 181759 105 1 0 73 110 50 0.858 5.89 5.06 Intr - 185955 185874 82 0 1 -8 113 50 0.287 -2.89 5.05 Intr - 191328 191201 128 1 2 131 80 87 0.877 12.60 5.04 Intr - 191465 191382 84 0 0 75 92 35 0.748 2.39 5.03 Intr - 191979 191668 312 1 0 72 72 165 0.942 9.46 5.02 Intr - 192562 192478 85 1 1 77 92 63 0.353 4.99 5.01 Init - 206038 206036 3 1 0 64 103 0 0.014 -0.90 5.00 Prom - 208296 208257 40 -5.76 6.02 PlyA - 208371 208366 6 1.05 6.01 Term - 210002 208441 1562 0 2 114 43 571 0.883 45.76 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 100138 99998 141 1 0 79 44 91 0.856 1.73 S.002 Sngl - 138787 138566 222 1 0 100 55 142 0.870 7.33 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:178503620_178723416|GENSCAN_predicted_peptide_1|136_aa MPRQFPSAVCLPQEAPHFVDEETVGSGAERCQHRAQASDREASALSTFTGHKANGCCKPS CCGPTTVPARDCDEPVVSTCNEARGSLEKECTQGQAMRLYLRTTGTVVEGTIDALSVILP QSDDRTNTKRCDPRKR >gi568815593r:178503620_178723416|GENSCAN_predicted_CDS_1|411_bp atgcccaggcagtttccttctgcagtctgccttccccaggaagcgccccattttgtagat gaggaaactgtgggctcaggggcagaacgttgtcagcacagagcccaggcctcagaccgg gaggccagtgccctgtctaccttcacaggccataaagccaatggctgctgcaaacccagc tgctgtgggcccacaactgtgcctgcccgagattgtgacgagccggtggtcagcacatgt aatgaagcaagggggtcactggagaaagaatgtacccaagggcaggcaatgcgcttgtat ctgaggacaacggggacagtggtggagggcacgatagacgcgctgtctgtgatcctccca cagtctgatgacaggaccaacaccaaaagatgtgatccaagaaagcgctaa >gi568815593r:178503620_178723416|GENSCAN_predicted_peptide_2|238_aa MEKDNVSKAHRKNFINMKNEDQEGPALVTAAIPLADRGKEQKDQSERVLETNQKALTSER VKTTGVERNVEEKTARPVATPAVSTLRPRGGKPRHKEDKCEVTEPTSVPAHNFMLVVAGG NPLNGFGVSQMGRDSQFKRHTQLLHQYHPANRVRLLCWKPTREPARVNVGKKGPRPVHED DSLTPQSTFRTMLFSAIPSKRRPNAGARRELNLHLPGGQTHSDGASRAVRIFASPSNF >gi568815593r:178503620_178723416|GENSCAN_predicted_CDS_2|717_bp atggaaaaagataatgtgtccaaggctcataggaaaaacttcattaacatgaaaaatgaa gatcaggaagggcctgcgctggtcactgcagccatccctctggctgaccgggggaaagaa cagaaagaccagagtgagagagtgctggaaacaaaccagaaggctcttacatcggaaagg gtgaaaactactggtgtggaaagaaatgtggaagagaaaacagcaagaccggtggccacc cccgctgtgagtactctgaggcccagaggtggaaaaccgaggcacaaggaggataagtgt gaggtcacggagccgacttctgtgccagctcacaatttcatgctggtggttgctggcggc aacccactaaacggatttggtgtctcacaaatggggcgtgactcccagttcaaaagacac acacagctcctccaccaataccacccagcaaaccgagtccgtttactttgctggaaaccg accagagagcctgccagggtcaatgtgggaaagaaaggccccaggccagtgcacgaagac gacagcctgacaccccagtccaccttccggaccatgctgttctcagcaatcccaagcaaa cggcggccgaacgcaggagccagaagggagctcaaccttcacttacctggggggcagaca cattcggatggagcttcccgagcagtccggatcttcgctagtccgtccaacttctga >gi568815593r:178503620_178723416|GENSCAN_predicted_peptide_3|304_aa MDKNPTQIGLDSDVERGSVTKPQCFWQSLNTPSDCVSPCAASRCWELQLTAVGGDFAKAQ RVPALPSGDPAGGPEPGARRAVHSRVLAGPASSRALSLRLVVPACWPRLPSSRDTSPVGS GPPHNDLCHLIRLFKDPIPKGGRIRRPRGEDFATGRRHTIQPMTPPPRRGAGEAQALTED RTRGIRWGFQQGLARGAWGGDLGNSPGPSQNPRLPPAAWMGSLYQPRPERVSRGLLWPRC APAGQPQKGHCSVLTKTSEADSASPFPEEEAEVRKGQPTCPRTLAGSEAALDSDPGFPIL KPTA >gi568815593r:178503620_178723416|GENSCAN_predicted_CDS_3|915_bp atggataaaaacccaactcaaattggcctggacagtgatgtcgagaggggctcggtcacg aagccgcagtgcttttggcagtccctcaataccccttcagactgtgtttctccttgcgca gcaagtcgatgctgggagctccaacttacagccgttgggggtgattttgcgaaggcccag cgggtccctgctctgccgtccggggatcccgcgggtggtccggagcccggggcgcggcgc gccgttcattcccgcgtcctggcggggcctgcttcttcccgagccctctccctgcggctg gtggtcccagcgtgctggccgcgtctcccctcttctcgggacaccagccctgttggatca gggcccccccacaacgacctgtgccacctcatccgcctctttaaagaccccatccccaag ggcggccgcatccggaggcccaggggtgaggacttcgccacgggaaggaggcacacgatt cagcccatgacaccgccacctcggcgtggtgctggggaagctcaggcactcaccgaggac aggacccggggaatccgctggggctttcagcagggcctggcgcgcggggcctggggagga gacctggggaacagcccagggccctcccagaacccgcgcctgccccccgctgcctggatg gggtctctgtaccagcccagaccagagcgagtgtcaagggggctcctctggccgcgctgc gcccccgcggggcagccccagaagggccattgctcagtcctcaccaagaccagcgaggcg gattcggccagcccctttccagaagaggaagccgaggttcggaaaggtcaaccaacttgt cccaggacgctggccggttcagaggcagctctggactcagacccaggcttcccaattcta aagcccactgcatga >gi568815593r:178503620_178723416|GENSCAN_predicted_peptide_4|683_aa METSLTPMALPLVRRLMCPGSDKGLRLGREEGSLATPIEHHVDQGAVHVEDPAEALAGIK HQQLAAAIIVTSIIISTVTIVTKAACRWVCVIHNSQDMETTDMSIDGGMDEEACGSDNPD GYYVTVRAPTLTLIFLMLHTYTGELPEGPQNCRIPYLRCGIPKELTVLIGIAEKAGDMKA IVEVTSGRGDLIVAHKRTGIVNHITSLKNLIDEIVDTLGEGAFGKVVECIDHGMDGMHVA VKIVKNVGRYREAARSEIQVLEHLNSTDPNSVFRCVQMLEWFDHHGHVCIVFELLGLSTY DFIKENSFLPFQIDHIRQMAYQICQSINFLHHNKLTHTDLKPENILFVKSDYVVKYNSKM KRDERTLKNTDIKVVDFGSATYDDEHHSTLVSTRHYRAPEPLPQQHRPLGESTRDERAMG PGERAGGGGDAGKGNAAGGGGGGRSATTAGSRAVSALCLLLSVGSAAACLLLGVQAAALQ GRVAALEEERELLRRAGPPGALDAWAEPHLERLLREHQPGLWSVTPASSRDVECYSSVIQ GCGVLPPASTRDVECYPSVNQGCGVLPTHQSGPRSVTLESAGAIECYLSVSRGHKCYPRV SRGVLPQCLLGPWNVSVVTPEEASWGMFSRGVFSEDVSGLVGESGEYPKSSGPGMPHGRP GGNNDLALSSSFFCECGHPCRHL >gi568815593r:178503620_178723416|GENSCAN_predicted_CDS_4|2052_bp atggagacctccctgacccccatggccttgcccctggtgcgcaggctgatgtgtccaggc tcagataagggcctgcgtctggggagagaggagggttcactagccacccccatagagcat cacgtggaccagggagcagtgcatgtagaagacccagcagaagctctggcaggtattaaa caccagcagctggcggccgccatcattgtcacatccatcatcatctccaccgtgaccatc gtcacaaaggctgcctgcagatgggtgtgtgttattcacaatagccaagatatggaaaca accgacatgtccatcgacggaggaatggatgaagaagcatgtggatctgacaacccagat ggctactacgtgactgtcagggctccgactttgaccctaatatttctcatgttacatact tacacgggggaactgcctgagggtcctcagaactgcagaataccttatttgagatgcggc attccaaaagaactcactgtcctgattgggatagcagagaaagctggggacatgaaagct atcgtggaagtcacaagcggaagaggagatctcatagtagcacacaagagaacaggcatt gtaaaccacatcaccagtttaaagaatctgattgatgaaatcgtggacactttgggtgaa ggagcctttggcaaagttgtagagtgcattgatcatggcatggatggcatgcatgtagca gtgaaaatcgtaaaaaatgtaggccgttaccgtgaagcagctcgttcagaaatccaagta ttagagcacttaaatagtactgatcccaatagtgtcttccgatgtgtccagatgctagaa tggtttgatcatcatggtcatgtttgtattgtgtttgaactactgggacttagtacttac gatttcattaaagaaaacagctttctgccatttcaaattgaccacatcaggcagatggcg tatcagatctgccagtcaataaattttttacatcataataaattaacccatacagatctg aagcctgaaaatattttgtttgtgaagtctgactatgtagtcaaatataattctaaaatg aaacgtgatgaacgcacactgaaaaacacagatatcaaagttgttgactttggaagtgca acgtatgatgatgaacatcacagtactttggtgtctacccggcactacagagctcccgag cctctgccccagcagcaccgccccctcggagagtccacgcgcgacgaacgcgccatgggc ccaggcgagcgcgccggtggcggcggcgacgcggggaagggcaatgcggcgggcggcggc ggcggagggcgctcggcgacgacggccgggtcccgggcggtgagcgcgctgtgcctgctg ctctccgtgggctcggcggctgcctgcctgctgctgggtgtccaggcggccgcgctgcag ggccgggtggcggcgctcgaggaggagcgggagctgctgcggcgcgcggggccgccaggc gccctggacgcctgggccgagccgcacctggagcgcctgctgcgggagcatcaaccaggg ctgtggagtgttactccagcgtcatccagggacgtggagtgttactccagcgtcatccag ggctgtggagtgttacccccagcatcaaccagggacgtggagtgttaccccagtgtcaac cagggctgtggagtgttacccacacatcagtcagggccgcggagtgttaccctagagtca gctggggccatagagtgttacctgagcgtcagccggggccacaagtgttaccccagggtc agccgtggagtgttaccccagtgtctgctggggccatggaatgtttctgttgtcacccca gaggaggcttcctggggaatgttctcacggggagtgttctcagaggatgtctctggcctg gttggggagtctggggagtaccccaagtcctcgggacctggaatgcctcatggcagacct ggtggaaacaatgatttggctttgagcagcagctttttctgcgagtgtggacatccttgt agacatctgtaa >gi568815593r:178503620_178723416|GENSCAN_predicted_peptide_5|307_aa MIVILPTETTINIQKMEQENTAQGSEKPSVQSVKPWSDQEIRSFLQEWEFLEREVYRVKK KYHIVSKAIAQRLKQRGINKSWKECLQMLISLQDLYFTIQEANQRPRCQPLPCPYGEALH RILGYRWKISVFSVIILRTLFVDLFRMDMALGLGEVTSKDSGPPCADVVNLAPPEHPPQA YGVPIVFQEPMWAPTPVIYVENPQLLNTSVPTTHLDPGNDQRMGSGTRWAHENDSLLIQR RYWNTSDWGKEDMCVTQERSCSSQVAVKKGRNEKRTKQNFYSSYYHMDTSTHILILPVTT DLCSSVQ >gi568815593r:178503620_178723416|GENSCAN_predicted_CDS_5|924_bp atgatagtaatacttcccactgaaactaccataaacatccagaaaatggagcaggaaaac acagcccagggatcagaaaagccctcagtccagtcagttaaaccttggagtgaccaggaa atccggagtttcctgcaagaatgggaatttcttgaacgtgaggtgtacagggtgaagaag aagtatcacatagtatcaaaagcaattgctcagcgtctcaagcagaggggtatcaacaag agctggaaggaatgtctccagatgctaataagcttgcaggacttatacttcactattcag gaggccaaccagaggccaaggtgccaacccttgccatgtccttatggtgaggccctgcac aggattctggggtacagatggaagatcagcgtcttctcagtaataattctgaggactttg ttcgtggacttgttcagaatggacatggccctggggctgggtgaggtcacaagcaaggat tcaggtcctccctgtgcagatgtggttaacctcgcacctcccgagcacccgccccaggcc tatggcgttcccatagtctttcaggagccgatgtgggccccaacacctgtgatctatgtg gaaaatcctcagctgctgaacacgtctgttccaactacacatctggacccgggaaatgac cagagaatgggttccgggacccgctgggcccacgagaatgacagtttgctcattcagcgc aggtactggaatacctcagactggggtaaagaagacatgtgtgttacacaggaacggagc tgcagctcacaagtagcagtgaaaaagggaagaaatgaaaagagaacaaaacagaatttc tacagctcctattaccacatggacacaagtacccacatcctcatccttcctgttaccaca gatctgtgctcttctgtacagtaa >gi568815593r:178503620_178723416|GENSCAN_predicted_peptide_6|520_aa XSKSSHKTTKSTQTQDSSFQGLILKRSNRNVPWDLKLEKPYIYEGRLEKKQDKKGSFQIV SATHKKIPTIERSHKNTELSQNFSPKSVLIRQQILPREKTPPKCEIQGNSLKQNSQLLNQ PKITADKRYKCSLCEKTFINTSSLRKHEKNHSGEKLFKCKECSKAFSQSSALIQHQITHT GEKPYICKECGKAFTLSTSLYKHLRTHTVEKSYRCKECGKSFSRRSGLFIHQKIHAEENP CKYNPGRKASSCSTSLSGCQRIHSRKKSYLCNECGNTFKSSSSLRYHQRIHTGEKPFKCS ECGRAFSQSASLIQHERIHTGEKPYRCNECGKGFTSISRLNRHRIIHTGEKFYNCNECGK ALSSHSTLIIHERIHTGEKPCKCKVCGKAFRQSSALIQHQRMHTGERPYKCNECGKTFRC NSSLSNHQRIHTGEKPYRCEECGISFGQSSALIQHRRIHTGEKPFKCNTCGKTFRQSSSR IAHQRIHTGEKPYECNTCGKLFNHRSSLTNHYKIHIEEDP >gi568815593r:178503620_178723416|GENSCAN_predicted_CDS_6|1563_bp ngatcgaagagcagtcataaaaccacaaagtcaacgcaaacacaagactcttcatttcag ggactgatactgaaaagatccaacaggaatgtaccttgggatttgaaattagaaaagcct tacatatatgaaggcagattagagaaaaagcaggataaaaagggaagttttcagatagtt tcagccacccacaaaaaaatccccactatagaaagaagccataaaaatactgaattgagc caaaacttcagcccaaagtcagtgcttattaggcaacagatacttcccagagaaaaaaca ccaccaaaatgtgaaatacaaggaaacagcctcaaacagaattcacaattacttaatcaa ccaaaaattacagcagataaacgctataaatgtagtctgtgtgaaaaaaccttcattaac acttcatcccttcgtaaacatgagaaaaaccatagtggagagaaactatttaagtgtaaa gaatgttcaaaagcctttagccaaagttcagctcttattcaacatcaaataacgcatact ggagagaaaccctacatatgtaaagaatgtgggaaagcctttactctcagtacatccctt tataaacatctaagaacccatactgtggagaaatcctacagatgtaaagaatgtggtaaa tccttcagccgaaggtcaggcctttttatacatcaaaaaattcatgctgaagaaaaccct tgtaagtataatccgggtaggaaggcatctagttgcagcacatccctttctggatgtcaa agaattcattctagaaagaagtcctacttatgtaatgaatgtggcaacacctttaagtct agctcatcccttcgttatcatcagagaattcacactggagagaagccttttaaatgtagt gaatgtgggagagccttcagccagagtgcctctcttattcaacatgaaagaattcacacc ggagaaaagccctatagatgcaatgaatgtgggaaaggctttacttctatttcacgactt aatagacaccgaatcattcatactggagagaagttttataattgtaatgaatgtggtaaa gccttaagctcccactcaacacttattattcacgagcgaattcatactggagaaaaacca tgtaaatgtaaagtatgtggaaaagccttcagacagagttcagctctcattcaacatcag agaatgcatactggagaaagaccctataaatgtaacgagtgtgggaaaacattcaggtgt aactcatcacttagtaatcaccagagaattcatactggagagaaaccatatcgatgtgag gaatgtgggatatcttttggccaaagttcagctcttattcagcatcgaaggattcataca ggagaaaaaccctttaaatgtaatacatgtggaaaaacttttagacaaagctcatcacgt attgcacatcagagaattcatactggagagaaaccctatgaatgtaatacatgtgggaaa cttttcaaccataggtcatcccttactaatcattataaaattcatatcgaagaggacccc tag