GENSCAN 1.0	Date run: 5-Nov-116	Time: 07:36:30

Sequence gi568815582r:48261583_48462428 : 200846 bp : 42.59% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1196   1290   95  2  2  111  107    16 0.625   4.49
 1.02 Intr +   8434   8692  259  2  1   63   75   251 0.915  16.90
 1.03 Intr +  15756  15897  142  0  1   98   94   162 0.969  17.23
 1.04 Intr +  34433  34583  151  1  1  121   82    85 0.996  10.01
 1.05 Intr +  38080  38206  127  2  1   89   92   134 0.979  12.72
 1.06 Intr +  41590  41723  134  1  2   49   81   203 0.910  15.07
 1.07 Intr +  72634  72776  143  2  2   99   45   191 0.634  15.05
 1.08 Intr +  72916  73020  105  0  0   64   33    90 0.126   0.59
 1.09 Intr +  85925  86132  208  1  1  100   58   268 0.992  22.73
 1.10 Intr +  86518  86708  191  2  2   63  100   161 0.944  13.18
 1.11 Intr +  89999  90197  199  1  1  148   30   166 0.888  14.80
 1.12 Term +  94442  94677  236  0  2   76   31   164 0.542   5.00
 1.13 PlyA +  95429  95434    6                               1.05

 2.02 PlyA -  97690  97685    6                               1.05
 2.01 Sngl - 100846  99998  849  1  0   61   44   583 0.989  46.72
 2.00 Prom - 100914 100875   40                             -12.72

 3.00 Prom + 101290 101329   40                              -4.25
 3.01 Init + 101493 101640  148  2  1   55  103    87 0.945   7.20
 3.02 Intr + 103861 104094  234  2  0   38   50   129 0.387   0.84
 3.03 Intr + 104540 104702  163  0  1   70   15   152 0.014   4.21
 3.04 Intr + 112907 113000   94  2  1   57   98    67 0.014   3.65
 3.05 Intr + 119806 119938  133  0  1   28   36   140 0.469   2.10
 3.06 Term + 122406 122569  164  1  2   61   42   220 0.697  11.82
 3.07 PlyA + 124525 124530    6                               1.05

 4.00 Prom + 137006 137045   40                              -6.85
 4.01 Init + 139166 139286  121  1  1   68   94    63 0.973   5.30
 4.02 Intr + 139596 139706  111  1  0  108  110    59 0.873   9.53
 4.03 Term + 145052 145185  134  0  2   70   43   104 0.703   1.37
 4.04 PlyA + 145393 145398    6                               1.05

 5.00 Prom + 146361 146400   40                              -6.25
 5.01 Init + 149851 149960  110  0  2   57  105   115 0.759   9.84
 5.02 Intr + 166144 166262  119  1  2  104   58    51 0.486   2.89
 5.03 Term + 170547 170998  452  1  2   43   48   240 0.796   9.86
 5.04 PlyA + 171175 171180    6                               1.05

 6.06 PlyA - 171229 171224    6                               1.05
 6.05 Term - 172260 171892  369  0  0  -10   36   273 0.032   5.96
 6.04 Intr - 173198 173069  130  1  1   24   20   104 0.044  -3.02
 6.03 Intr - 173920 173793  128  1  2   90   78    76 0.567   5.36
 6.02 Intr - 175722 175492  231  0  0   81   66   186 0.467  12.75
 6.01 Init - 184559 184416  144  2  0   58   76    92 0.196   5.17
 6.00 Prom - 189136 189097   40                              -7.85

 7.00 Prom + 190730 190769   40                              -4.55
 7.01 Init + 193962 194171  210  2  0   75   93   137 0.975  10.39
 7.02 Intr + 194358 194480  123  2  0   98   72    93 0.784   8.56
 7.03 Term + 196177 196227   51  0  0   84   43    47 0.597  -3.75
 7.04 PlyA + 199853 199858    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  85385  85387    3  1  0  113   81     0 0.902   1.85
S.002 Term + 104540 104730  191  0  2   70   40   169 0.842   6.93
S.003 Sngl - 192116 191541  576  2  0   41   54   227 0.877  10.52

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815582r:48261583_48462428|GENSCAN_predicted_peptide_1|663_aa
XLKKMPQSMPEYALTRNYLELMVELPWNKSTTDRLDIRAARILLDNDHYAMEKLKKRVLE
YLAVRQLKNNLKGPILCFVGPPGVGKTSVGRSVAKTLGREFHRIALGGVCDQSDIRGHRR
TYVGSMPGRIINGLKTVGVNNPVFLLDEVDKLGKSLQGDPAAALLEVLDPEQNHNFTDHY
LNVAFDLSQVLFIATANTTATIPAALLDRMEIIQVPGYTQEEKIEIAHRHLIPKQLEQHG
LTPQQIQIPQVTTLDIITRYTREAGVRSLDRKLGAICRAVAVKVAEGQHKEAKLDRSDVT
EREGCREHILEDEKPESISDTTDLALPPEMPILIDFHALKDILGPPMYEMEVVGVAALLA
AVDCMGAQASTALVAHHLLAVVFLGKVSQRLSQPGVAIGLAWTPLGGEIMFVEASRMDGE
GQLTLTGQLGDVMKESAHLAISWLRSNAKKYQLTNAFGSFDLLDNTDIHLHFPAGAVTKD
GPSAGVTIVTCLASLFSGRLVRSDVAMTGEITLRGLVLPVGGIKDKVLAAHRAGLKQVII
PRRNEKDLEGIPGNVRQDLSFVTASCLDEVLNAAFDGGFTVKTRPVVPVNLTSNHASGAE
ILTCLVVTNANTVKNVIFVIVPEIVHEKSMTVKYCDSDVYHCELPVLGDWSAFTVTKISY
VAR

>gi568815582r:48261583_48462428|GENSCAN_predicted_CDS_1|1992_bp
nnactcaaaaaaatgcctcagtcaatgccagaatatgctctgactagaaattatttggaa
cttatggtagaacttccttggaacaaaagtacaactgaccgcctggacattagggcagcc
cggattcttctggataatgaccattacgccatggaaaaattgaagaaaagagtactggaa
tacttggctgtcagacagctcaaaaataacctgaagggcccaatcctatgctttgttggc
cctcctggagttggtaaaacaagtgtgggaagatcagtggccaagactctaggtcgagag
ttccacaggattgcacttggaggagtatgtgatcagtctgacattcgaggacacaggcgc
acctatgttggcagcatgcctggtcgcatcatcaacggcttgaagactgtgggagtgaac
aacccagtgttcctattagatgaggttgacaaactgggaaaaagtctacagggtgatcca
gcagcagctctgcttgaggtgttggatcctgaacaaaaccataacttcacagatcattat
ctaaatgtggcctttgacctttctcaagttctttttatagctactgccaacaccactgct
accattccagctgccttgttggacagaatggagatcattcaggttccaggttatacacag
gaggagaagatagagattgcccataggcacttgatccccaagcagctggaacaacatggg
ctgactccacagcagattcagataccccaggtcaccactcttgacatcatcaccaggtat
accagagaggcaggggttcgttctctggatagaaaacttggggccatttgccgagctgtg
gccgtgaaggtggcagaaggacagcataaggaagccaagttggaccgttctgatgtgact
gagagagaaggttgcagagaacacatcttagaagatgaaaaacctgaatctatcagtgac
actactgacttggctctaccacctgaaatgccgattttgattgatttccatgctctgaaa
gacatccttgggcccccgatgtatgaaatggaggttgttggtgtggccgcacttcttgca
gcagttgactgcatgggggcgcaggcgagcacagctcttgtggcacatcatcttcttgca
gttgtatttctgggcaaggtatctcagcgtttgagtcagccaggagtagcaataggtttg
gcttggactcccttaggtggagaaatcatgttcgtggaggcgagtcgaatggatggcgag
ggccagttaactctgaccggccagctcggggacgtgatgaaggagtccgcccacctcgct
atcagctggctccgcagcaacgcaaagaagtaccagctgaccaatgcttttggaagtttt
gatcttcttgacaacacagacatccatctgcacttcccagctggagctgtcacaaaagat
ggaccatctgctggagttaccatagtaacctgtctcgcctcactttttagtgggcggctg
gtacgttcagatgtagccatgactggagaaattacactgagaggtcttgttcttccagtg
ggtggaattaaagacaaagtgctggcggcacacagagcgggactgaagcaagtcattatt
cctcggagaaatgaaaaagaccttgagggaatcccaggcaacgtacgacaggatttaagt
tttgtcacagcaagctgcctggatgaggttcttaatgcagcttttgatggtggctttact
gtcaagaccagacctgtggtccctgtgaaccttacctcaaaccatgcatctggggcagag
atccttacttgcttggtggttacaaatgcaaatacagtgaagaatgtcatctttgtgatt
gttcctgaaatagttcacgagaaatccatgaccgtaaagtactgtgatagtgatgtctac
cactgtgagcttccagtactaggtgattggtctgcattcacagtgaccaaaatcagctat
gtggccaggtaa

>gi568815582r:48261583_48462428|GENSCAN_predicted_peptide_2|282_aa
MSRQTATALPTGTSKCPPSQRVPALTGTTASNNDLASLFECPVCFDYVLPPILQCQSGHL
VCSNCRPKLTCCPTCRGPLGSIRNLAMEKVANSVLFPCKYASSGCEITLPHTEKADHEEL
CEFRPYSCPCPGASCKWQGSLDAVMPHLMHQHKSITTLQGEDIVFLATDINLPGAVDWVM
MQSCFGFHFMLVLEKQEKYDGHQQFFAIVQLIGTRKQAENFAYRLELNGHRRRLTWEATP
RSIHEGIATAIMNSDCLVFDTSIAQLFAENGNLGINVTISMC

>gi568815582r:48261583_48462428|GENSCAN_predicted_CDS_2|849_bp
atgagccgtcagactgctacagcattacctaccggtacctcgaagtgtccaccatcccag
agggtgcctgccctgactggcacaactgcatccaacaatgacttggcgagtctttttgag
tgtccagtctgctttgactatgtgttaccgcccattcttcaatgtcagagtggccatctt
gtttgtagcaactgtcgcccaaagctcacatgttgtccaacttgccggggccctttggga
tccattcgcaacttggctatggagaaagtggctaattcagtacttttcccctgtaaatat
gcgtcttctggatgtgaaataactctgccacacacagaaaaagcagaccatgaagagctc
tgtgagtttaggccttattcctgtccgtgccctggtgcttcctgtaaatggcaaggctct
ctggatgctgtaatgccccatctgatgcatcagcataagtccattacaaccctacaggga
gaggatatagtttttcttgctacagacattaatcttcctggtgctgttgactgggtgatg
atgcagtcctgttttggctttcacttcatgttagtcttagagaaacaggaaaaatacgat
ggtcaccagcagttcttcgcaatcgtacagctgataggaacacgcaagcaagctgaaaat
tttgcttaccgacttgagctaaatggtcataggcgacgattgacttgggaagcgactcct
cgatctattcatgaaggaattgcaacagccattatgaatagcgactgtctagtctttgac
accagcattgcacagctttttgcagaaaatggcaatttaggcatcaatgtaactatttcc
atgtgttga

>gi568815582r:48261583_48462428|GENSCAN_predicted_peptide_3|311_aa
MTPFSIRDLSILRFWYGEGGPRTNPRIPRIECSSNAKESEYHLPLTLLNGVQRRWSSLSR
HEKSGHALRFPQEVKEQAPPGLFLLLSTEDPNSRLRSGSKYHLLLRSTSYRGGERKPFHP
QEATTPPSTSNLPQHFHQFKIRWNHAIFGRSEQSNIGSFLWFRSKSYSVSGKCVATLALR
SRKLLEDKLFKYKGNNQEKERGYNKQKTASRRKQSHSLAFTQRSYKLMATQKPANGRLQQ
LYSELPRLRSQPRHPSVGLEVDTVVTPILQIRKAKHREVSAQGHKPVEQDLTQSRQSDFT
ARALNIQLLKS

>gi568815582r:48261583_48462428|GENSCAN_predicted_CDS_3|936_bp
atgacaccattttctatcagggacttgagcatcctcaggttttggtatggagaagggggt
cctagaactaatccgaggataccgcggattgaatgtagttccaatgccaaagaatcagag
tatcaccttccattaactcttcttaatggagtacagagaaggtggagtagcctttcccgt
catgagaagtcaggccacgcattgcgtttcccccaagaagtaaaggaacaggcccctcct
gggctctttctgctcctaagcacagaagacccaaattcgcgtctgaggtcaggatccaaa
taccatttacttctcagaagcaccagttacagaggtggagaaaggaagcctttccatccc
caggaggcaaccacacctccctcgacatcgaaccttcctcagcatttccaccagtttaaa
attaggtggaaccacgctattttcggtaggtcagaacagtcgaatatcggcagtttctta
tggtttcgatctaagagctacagcgtatctgggaaatgcgtggccacactggcgctacgc
tccaggaagctcctggaagataagctcttcaaatacaagggaaataaccaagaaaaagaa
cggggttacaataaacagaagactgcatctaggagaaagcaatcacattccttggcattt
acccaaaggagttacaaactcatggccacacaaaaacctgcaaatggacgtttacagcag
ctttattcagaattgccaagacttcgaagccaaccaagacatccttcagtagggcttgag
gtagatactgtcgttacacccattttgcagataaggaaagcgaagcacagagaagtatct
gcccaaggtcacaaaccagtggagcaggatttgacccaaagcagacagtcggacttcaca
gcccgtgctctcaacatccaactgctgaagagttaa

>gi568815582r:48261583_48462428|GENSCAN_predicted_peptide_4|121_aa
MTLNEHAAFKHLFNKAHLALPLIHLTVSGHSTCFREHRVGGLKRLDGLIQKIQENFPISG
SAQQQLDVICNLNTPLAVSLPPYEIVTMVSPTAQRIEQIQRAQKLHKSDGFLRLRELPSG
K

>gi568815582r:48261583_48462428|GENSCAN_predicted_CDS_4|366_bp
atgactcttaacgagcatgctgccttcaagcatctgtttaacaaagcacatcttgcactg
cccttgatccatttaaccgtgagtggacacagcacatgtttcagagagcaccgggttgga
ggactcaaaagattagatgggctcatccagaaaattcaggagaactttcccatttcaggg
tcagctcagcagcaacttgatgtcatctgcaaccttaatacccctttggcagtatcatta
ccaccctatgagatagtcaccatggtcagccccactgcacagagaatagaacagatccag
cgagctcagaagctgcacaagtcagatggattccttaggctcagggaacttccatctggg
aagtaa

>gi568815582r:48261583_48462428|GENSCAN_predicted_peptide_5|226_aa
MKIRSQKYKHMQMWKRDSADTNPCGECPWMHMGNNKSRSQMPYWRSGSGDSKMTTQSLLR
GIHSSETRQRPSSSKPGQGCFLGSQQPGDSSCKKILEHLFCSQRYSKWNQRQVEHLCLQL
KKLETQKGEVSHKVYFLIHLEITITIVTIAASMECRGVAVTQRSFQHSGLGLIIPVLQSQ
RGRVACQVTQQEVAQLGTELSSVELQSRMPTVKRAIKVAETLSLPS

>gi568815582r:48261583_48462428|GENSCAN_predicted_CDS_5|681_bp
atgaaaatcagatcccagaagtacaagcacatgcagatgtggaagcgtgacagtgccgat
acgaatccctgtggagagtgcccgtggatgcatatggggaataataagagtaggtctcaa
atgccctattggaggtcaggctctggagattccaagatgaccacacaatccctcctccgt
ggaattcacagttctgagacaagacagagaccaagcagctccaagccggggcaagggtgc
ttcttgggaagccagcagccaggagactcctcttgcaagaagatcctggaacatctcttc
tgttcacagagatattccaagtggaaccaaagacaagtcgagcacctttgtttacagctg
aagaaactggaaacccagaaaggggaagtttcacacaaggtctacttcctcatccactta
gagatcacaataacaatagtcacaattgctgcctcaatggagtgcaggggagtggcagtc
actcaaagatccttccaacactctggcctgggcttgatcatccctgttctccagagtcag
agaggacgagtggcttgtcaggtcacacagcaggaagtagcccagcttggcacagaactc
agttctgtggaacttcaaagcaggatgccaacagttaagagggctataaaggtggcagag
acactgtctttgccctcatga

>gi568815582r:48261583_48462428|GENSCAN_predicted_peptide_6|333_aa
MCHGAGLVFTFLPIQVPETGAAFLRDIGVCGSEIGICSSCLPPDNFGEWLCIIYGSKPFC
GSDSCQTTLTSMGRALTGSPAMVLAAATNTGFWASSPQPWSKKLPAGAKLCSSLSRSAFH
PSIIYVSIRGAPGRGQLETAPSKTRQTARWVARVQEQVLATQRETHTKNPQGKRKRNIYP
NGNDQAMPKEKSEQTVRDTERRAIREDFSEEGHRKEYGSLRRRKTSEMTETEALKTAGLK
VRLEVREGLAELEREQPVKEGQELTARRSGRPGPSRQGSCVDNARKAATAYLFRQVQIWI
GSGVRRTVEKQIGLERSTRPRGQGEVLTRGSDQ

>gi568815582r:48261583_48462428|GENSCAN_predicted_CDS_6|1002_bp
atgtgccacggtgctggactggtctttacttttcttccaatccaagttccagaaacagga
gctgcctttctccgagatataggtgtttgtggttcagagattgggatctgttcttcctgc
ttaccacctgacaactttggagagtggctgtgtatcatctatggttccaagcccttctgt
ggctctgactcctgccagacaaccctcacctccatggggcgagctctcacgggcagccct
gccatggttctagctgctgccaccaacactggcttctgggcttcatctcctcaaccttgg
agtaagaagcttcctgctggtgctaagctgtgttcctcactatcccgttcagcctttcat
ccttccattatctatgtttccatcaggggtgctcccggcagggggcagctggaaactgcc
ccaagcaagactcgccaaacagcaagatgggtggcaagggtccaggaacaggttttggca
acacagagggaaacacatactaaaaacccacagggtaaaaggaagaggaacatttatcca
aatggaaatgaccaagccatgcctaaggagaagagtgagcaaaccgtcagagacacagaa
aggagggctatcagggaagacttctcagaggagggacataggaaagaatatggctcccta
agaaggagaaagacaagtgaaatgacagaaacagaggcattgaaaacagcagggctcaaa
gttaggctggaggtcagggagggcctcgcagaactggagcgtgagcagcctgtgaaggaa
ggacaggaactcactgcccggcgatcaggacggccagggccttccaggcaggggagctgt
gtggacaacgcaaggaaggcggcaactgcgtatctgttcaggcaggtgcaaatatggatt
ggtagtggggtaaggaggaccgtggagaaacagatcgggctggagaggtcaaccaggcct
cgtggccagggagaagttctgacccgggggtctgatcaataa

>gi568815582r:48261583_48462428|GENSCAN_predicted_peptide_7|127_aa
MASTSQGHCLWAGAPGGFLEEDWGKKGLCLEAVMTLESLDGRNSQEPPAPSWQRCRLGLR
LCSESPGAQKAHVGAFPKSRGRTTSLFGESSSMVTANTAGLRDQRTLDKSEGTTLLTSDT
RNEFDFV

>gi568815582r:48261583_48462428|GENSCAN_predicted_CDS_7|384_bp
atggccagcacatcccaggggcactgtctgtgggctggggctccagggggcttcctggag
gaggactgggggaaaaaggggctctgtttggaagctgtgatgacccttgagagtcttgat
ggacgcaacagccaagagcccccagctcccagttggcagaggtgcaggctggggctgagg
ctgtgctcagagtcccctggtgcccagaaggcacatgtgggtgcttttcccaaaagccgg
ggtagaaccacctctctctttggagaaagttcttccatggtaactgctaacacagcaggg
ctcagggatcagcggaccctcgacaaaagtgaagggaccactcttctgacttccgacacc
agaaatgagtttgattttgtttga