GENSCAN 1.0 Date run: 6-Nov-116 Time: 05:44:01 Sequence gi568815588r:17049043_17301634 : 252592 bp : 39.09% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.23 PlyA - 781 776 6 1.05 1.22 Term - 1437 1378 60 0 0 102 48 43 0.010 -1.47 1.21 Intr - 12607 12462 146 2 2 122 70 17 0.387 2.38 1.20 Intr - 16596 16466 131 2 2 28 92 69 0.342 0.72 1.19 Intr - 19238 19022 217 0 1 49 116 45 0.497 0.04 1.18 Intr - 19728 19563 166 0 1 60 86 65 0.476 2.11 1.17 Intr - 22562 22384 179 0 2 50 94 27 0.503 -1.78 1.16 Intr - 22929 22785 145 0 1 97 99 85 0.977 9.43 1.15 Intr - 35419 35229 191 2 2 99 78 199 0.348 18.38 1.14 Intr - 36717 36555 163 0 1 96 89 -4 0.591 -0.87 1.13 Intr - 39303 39122 182 1 2 70 84 89 0.857 5.37 1.12 Intr - 44124 44082 43 0 1 52 78 63 0.086 -1.41 1.11 Intr - 54195 54083 113 1 2 71 94 63 0.610 4.38 1.10 Intr - 55563 55320 244 0 1 105 64 143 0.499 9.75 1.09 Intr - 56533 56415 119 2 2 88 78 24 0.753 0.66 1.08 Intr - 60693 60598 96 1 0 102 63 75 0.930 5.46 1.07 Intr - 62008 61877 132 2 0 84 53 98 0.945 5.60 1.06 Intr - 67752 67678 75 1 0 104 73 80 0.452 6.67 1.05 Intr - 73599 73484 116 2 2 38 63 145 0.432 6.17 1.04 Intr - 73856 73753 104 2 2 85 93 32 0.976 1.45 1.03 Intr - 74647 74546 102 1 0 105 113 18 0.952 5.45 1.02 Intr - 77757 77719 39 0 0 77 96 28 0.470 0.00 1.01 Init - 85529 85473 57 2 0 88 45 75 0.395 4.56 1.00 Prom - 91962 91923 40 -5.55 2.05 PlyA - 92122 92117 6 1.05 2.04 Term - 100098 99998 101 1 2 122 28 107 0.971 5.41 2.03 Intr - 104594 104465 130 2 1 97 88 93 0.999 9.55 2.02 Intr - 105692 105635 58 1 1 76 79 43 0.995 0.17 2.01 Init - 108736 108399 338 1 2 54 73 278 0.564 19.70 2.00 Prom - 110450 110411 40 -3.05 3.00 Prom + 113022 113061 40 -2.75 3.01 Init + 115052 115906 855 1 0 43 86 338 0.094 23.77 3.02 Intr + 147745 147879 135 0 0 47 83 89 0.011 4.14 3.03 Intr + 165948 166051 104 2 2 41 119 51 0.015 1.55 3.04 Intr + 166487 166650 164 2 2 17 98 108 0.572 3.40 3.05 Intr + 171220 171310 91 2 1 52 94 65 0.245 1.63 3.06 Intr + 177945 177996 52 0 1 91 81 72 0.534 4.79 3.07 Intr + 179327 179361 35 1 2 96 87 40 0.531 0.80 3.08 Intr + 180257 180943 687 2 0 39 72 833 0.346 66.53 3.09 Intr + 181608 181668 61 0 1 40 72 34 0.691 -5.18 3.10 Intr + 184545 184640 96 2 0 83 88 142 0.999 12.99 3.11 Intr + 184728 184889 162 2 0 123 97 277 0.998 31.35 3.12 Intr + 185651 185776 126 1 0 72 84 260 0.980 23.96 3.13 Intr + 186127 186347 221 0 2 104 91 373 0.732 35.38 3.14 Intr + 204830 204943 114 2 0 47 97 49 0.030 0.34 3.15 Intr + 216044 216156 113 2 2 86 64 55 0.121 2.00 3.16 Intr + 219233 219368 136 0 1 -4 71 158 0.193 3.61 3.17 Intr + 220208 220266 59 2 2 78 78 46 0.060 0.21 3.18 Term + 226601 226686 86 0 2 65 33 154 0.099 4.44 3.19 PlyA + 227257 227262 6 1.05 4.00 Prom + 231783 231822 40 -5.75 4.01 Sngl + 232896 233408 513 2 0 49 42 430 0.958 30.29 4.02 PlyA + 235508 235513 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:17049043_17301634|GENSCAN_predicted_peptide_1|939_aa MRKKEKGETMRQDRSRSQQLVDLERKFQGLQQTVDKKVCSSNPCQNGGTCLNLHDSFFCI CPPQWKGPLCSADVNECEIYSGTPLSCQNGGTCVNTMGSYSYFYLTEEKLMSPLLSPREE ILALTSPRNRKKAPVIGTEKTAKKLPPQQLVLGPTTGKPNPVPGGWQGNGYICEDINECE INNGGCSVAPPVECVNTPGSSHCQACPPGYQGDGRVCTLTDICSVSNGGCHPDASCSSTL GSLPLCTCLPGYTGNGYGPNGCVQLSNICLSHPCLNGQCIDTVSGYFCKCDSGWTGVNCT ENINECLSNPCLNGGTCVDGVDSFSCECTRLWTGALCQVPQQGTVSISYATDAHSIAAAS FICGESLSGINGSFSYRSPDVGYVHDVNCFWVIKTEMGKYAAPKPPDALVALNQCGGILT GPYGSIKSPGYPGNYPPGRDCVWIVVTSPDLLVTFTFGTLSLEHHDDCNKDYLEIRDGPL YQDPLLGKFCTTFSVPPLQTTGPFARIHFHSDSQISDQGFHITYLTSPSDLRCGGNYTDP EGELFLPELSGPFTHTRQCVYMMKQPQGEQIQINFTHVELQCQSDSSQNYIEVRDGETLL GKVCGNGTISHIKSITNSVWIRFKIDASVEKASFRAVYQVACGDELTGEGVIRSPFFPNV YPGERTCRWTIHQPQSQVILLNFTVFEIGSSAHCETDYVEIGSSSILGSPENKKYCGTDI PSFITSVYNFLYVTFVKSSSTENHGFMAKFSAEDLACGEILTESTGTIQSPGHPNVYPHG INCTWHILVQPNHLIHLMFETFHLEFHYNCTNDYLEVYDTDSETSLGRYCGKSIPPSLTS SGNSLMLVFVTDSDLAYEGFLINYEAISAATGQGSHLLISGWLVLFFISDPGMSSEEASS SAKLRNDDKHAQRGLLYTKRNFHIVPAALATLIFASSAQ >gi568815588r:17049043_17301634|GENSCAN_predicted_CDS_1|2820_bp atgaggaagaaagagaaaggtgagaccatgcgacaggaccggagcagaagtcagcagctg gtggatcttgagagaaaattccaaggcttgcagcagactgttgacaaaaaggtttgcagc agcaatccttgccagaatggtggaacctgcctcaatctgcatgattcctttttttgtatc tgtcccccacagtggaagggtcctctctgctcagctgatgttaacgaatgtgagatttac tcaggaacacccttgagctgccagaatggaggcacatgtgttaatacaatgggaagttac agctatttttacttaactgaagaaaagctcatgagtcctttactctcaccccgtgaagaa atccttgctctaacatccccaaggaaccggaagaaggctcccgtgattggcacagagaaa actgccaaaaagctgccaccccagcaattggttctggggcccaccacaggcaagccgaat cctgtgcctggaggctggcaaggcaatggatatatttgcgaagatatcaatgaatgtgag ataaataacggcggctgttctgtggctccacccgttgagtgtgtgaatacacctgggtct tcccactgccaggcctgtccaccagggtaccagggtgacggaagagtgtgcacactcaca gacatctgctcagtcagtaatggaggctgccacccagatgcctcatgctcctcaactcta ggttccttacctctctgcacgtgtctcccgggttatactggaaatggttatgggccaaat ggatgtgtgcagctcagtaatatttgcctaagtcacccctgtctaaatggacaatgcatc gacactgtctctggttatttttgtaagtgtgactcaggttggacaggtgtcaactgtaca gaaaacatcaatgagtgtttgagcaacccctgtttgaatggaggaacttgtgttgatggc gttgattctttcagttgtgaatgcacacgtctctggactggagctctctgtcaggttcct cagcaaggtacagtcagcatttcttatgccacggatgctcacagcatagcagcagcatcc tttatttgtggagagtccctctcaggaataaatggaagcttcagctacaggagcccggat gttggttatgttcatgatgttaactgcttctgggttatcaaaactgaaatgggaaagtat gcagctccaaaaccgccagatgctttggtagccctaaatcagtgtggaggtatcctgact ggtccttacggttctattaagtctccggggtatcctggaaactatcccccaggaagagat tgtgtctggattgttgtaactagtcctgacctcctggtaacatttacttttgggaccttg agcctcgagcaccatgatgactgcaacaaagattaccttgagattcgagatggtcctttg tatcaggacccccttcttgggaagttctgcaccactttctctgtcccaccgctccagact actggcccctttgccagaattcacttccattcagactcccagattagtgaccaaggcttc catatcacctacttaacatcaccttcggatctgcgttgtggtgggaactacacggaccca gagggtgaactcttcttgcctgagttgtctgggcctttcactcacaccaggcaatgcgtc tatatgatgaagcagccccagggagaacaaatacaaatcaacttcacccacgtggagctg caatgccagagtgacagttctcagaattacattgaggttcgagatggtgaaaccttactt ggaaaagtctgtggcaacggaaccatctctcacattaaatccattactaatagtgtctgg atcaggtttaaaatagatgcttctgttgaaaaagctagtttcagagctgtttatcaagtc gcttgcggggatgaattaactggagaaggggtcattcgctcgcctttttttcctaacgtg tatcctggagaaagaacctgtaggtggaccatccaccagccccaaagccaagtcattctc ctcaacttcactgtctttgaaattggaagttctgcccactgtgaaacagattatgttgag attggtagcagttccattttgggttctcctgaaaataaaaagtattgcggtacagacata ccttcatttataacatctgtgtacaattttctttatgtcacattcgtgaaaagttcttct actgaaaaccatggtttcatggctaagttcagtgctgaggatttggcatgtggagaaatt cttacagaatcaacagggaccattcaaagtcctggccatccaaatgtctacccccacggt atcaactgtacttggcatatattagtccaacctaatcacctgattcatttaatgttcgaa acatttcatctggagtttcattacaattgcacaaacgactacttggaagtttatgacacc gactctgagacatcccttggaagatactgtggaaagtcgatcccgccatctctcacaagc agtggtaactcattgatgctggtgtttgtgactgactccgacctcgcttatgaaggcttc ttaataaactatgaagcaatcagtgcagcaacagggcagggttcccacctactcatcagt gggtggttagtgctgttttttatcagtgatcctggaatgagctcagaagaggcatctagc agtgcaaagttaagaaatgatgacaaacatgcccaaagaggccttctttacactaaacgg aatttccatatagttccagctgctttggcaactctaatctttgcatcatcagctcagtag >gi568815588r:17049043_17301634|GENSCAN_predicted_peptide_2|208_aa MEFPKIESVHPQKYAMDVENKIQEKNVEPNISFDGSIQCSGKDAILFKLETAEEIHRKNQ QDSDLSVKMLKDFLEDDTDVNQYLLPPKSLLRYALLLDIVQPTCRRSVCFTKGYGSYIEG TGSVLQTAEDVQVENIYKSLTNLSQEEQITKLLILKLRYFTPKEIANLLGFPPEFGFPEK ITVKQRYRLLGNSLNVHVVAKLIKILYE >gi568815588r:17049043_17301634|GENSCAN_predicted_CDS_2|627_bp atggagttccccaaaattgaatctgtacatccacaaaaatatgcaatggatgtagaaaat aaaattcaagaaaagaacgttgaaccaaatattagctttgatggcagcatacagtgttct ggaaaagatgccattctttttaagcttgaaactgcagaagaaattcacaggaaaaatcaa caagatagtgatctctctgtgaaaatgctaaaagattttcttgaagatgacactgacgtg aaccagtatcttttaccaccaaagtcattgctgcgatatgctcttctgttagacattgtt cagcccacttgtagaaggtccgtgtgctttaccaaaggatatggaagctacatagaaggg acagggtctgtgttacagactgcagaggatgtgcaggttgagaatatctacaaatccctt accaatttgtcacaagaagaacagataacaaagctgttaatacttaaactgcgatatttc actcctaaagaaatagcaaatctccttggatttcctccagagttcggatttcctgagaag ataacagtgaaacagcgttatcgcctacttggaaatagtctcaacgtgcatgtagtagct aaactaatcaaaatcttatatgaataa >gi568815588r:17049043_17301634|GENSCAN_predicted_peptide_3|1098_aa MNIDAKIHNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNTSKSINVIQHINRTNDKNHM IISINAEKTFDKTQQPFMLKTLNKLGIDGTYLKIIRAIYDKPTANIIPNGQKLEAFPLKT GTRQGCPLSSLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLEKPIISA QNLLKLTSNFSKVSGYKINVQKSQAFLYTNKRQTESQIMSELPFTIASKRIKYLGIQLTR DVKDLFKKIYQSLLNDIKEDTNKWKNIPCSGIGRINIVKMATLPKCLPTTCIHMIRPQFR VSSVRRDFNSEVAMEELKQTRKPIFLAREVRTLTSFPIFQPYGFLILRGHRQEQFTSTVV LRITWTKGILVIQVNPWLTTLEAGISKAPDKPWNPSQVAPALPQKPAQVLGLWLAFSVNR TWMQLETIILSKLMQEQKSKYCMLSLISGSESVSKRRIALGSALHVEGPTAHSKAMAQLS PRQRRSRAPTTHTHRALVRLFSGSQSAPPPPPRPSPPSAAMSTRSVSSSSYRRMFGGPGT ASRPSSSRSYVTTSTRTYSLGSALRPSTSRSLYASSPGGVYATRSSAVRLRSSVPGVRLL QDSVDFSLADAINTEFKNTRTNEKVELQELNDRFANYIDKVRFLEQQNKILLAELEQLKG QGKSRLGDLYEEEMRELRRQVDQLTNDKARVEVERDNLAEDIMRLREKLQEEMLQREEAE NTLQSFRQDVDNASLARLDLERKVESLQEEIAFLKKLHEEEIQELQAQIQEQHVQIDVDV SKPDLTAALRDVRQQYESVAAKNLQEAEEWYKSKFADLSEAANRNNDALRQAKQESTEYR RQVQSLTCEVDALKGTNESLERQMREMEENFAVEAANYQDTIGRLQDEIQNMKEEMARHL REYQDLLNVKMALDIEIATYRKLLEGEESRGAIVLLQISGYYNQLRACVQKSPKRSRDST YPSEQLLSRQTKFYDSDLRVNSSPSMDSMDLMYLILWFLEIHFIRSQKYNTAFGSVIHPL WVYLEPFDPAELWPSLLQNEEECPLGPNGTREAQQLGRTQFPPYFDTPEITVSLQFGTCK RLADLEDVSKASTQAHRK >gi568815588r:17049043_17301634|GENSCAN_predicted_CDS_3|3297_bp atgaacatcgatgcaaaaatccacaataaaatactggcaaaccgaatccagcagcacatc aaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaac acaagcaaatcaataaatgtaatccagcatataaacagaaccaacgacaaaaaccacatg attatctcaataaatgcagaaaagacctttgacaaaactcaacaacccttcatgctaaaa actctcaataaattaggtattgatgggacatatctcaaaataataagagctatctatgac aaacccacagccaatatcataccgaatgggcaaaaactggaagcattccctttgaaaact ggcacaagacagggatgccctctctcatcacttctattcaacatagtgttggaagttctg gccagggcaatcaggcaggagaaagaaataaagggcattcagttaggaaaagaggaagtc aaattgtccctgtttgcagatgacatgattgtatatctagaaaaacccatcatctcggcc caaaatctccttaagctgacaagcaacttcagcaaagtctcaggatacaaaatcaatgtg caaaaatcacaagcattcttatacaccaataaaagacaaacagagagccaaatcatgagt gaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagg gatgtgaaggacctcttcaagaagatctaccaatcactgctcaatgacataaaagaggat acaaacaaatggaagaacattccatgctcagggataggaagaatcaatattgtgaaaatg gccacattgcccaagtgtctaccaaccacctgcatccacatgataagacctcagtttaga gttagcagtgtgaggagagacttcaattcagaggtagcaatggaagaacttaaacaaacg aggaaacccatctttctggcaagggaggtgcgaacactcaccagtttccccattttccaa ccctatggttttcttattctacgtggccacagacaggaacaattcactagcacagtggtt ctcagaatcacctggacaaaaggtatactagtgattcaggtaaatccttggctgaccaca ctggaagctggcatctccaaggctcccgacaaaccctggaacccaagccaagttgctcct gcactaccccagaagccagctcaggtccttggcttgtggcttgctttttctgtcaacaga acatggatgcagctggagaccatcatccttagcaaactaatgcaggaacagaaaagcaaa tactgcatgttgtcacttataagtgggagtgagtcagtcagcaagcgtcgcattgccctg ggatcggcactgcacgtagaaggcccgaccgcacacagcaaggcgatggcccagctgtcc ccgcgccagagacgcagccgcgctcccaccacccacacccaccgcgccctcgttcgcctc ttctccgggagccagtccgcgccaccgccgccgcccaggccatcgccaccctccgcagcc atgtccaccaggtccgtgtcctcgtcctcctaccgcaggatgttcggcggcccgggcacc gcgagccggccgagctccagccggagctacgtgactacgtccacccgcacctacagcctg ggcagcgcgctgcgccccagcaccagccgcagcctctacgcctcgtccccgggcggcgtg tatgccacgcgctcctctgccgtgcgcctgcggagcagcgtgcccggggtgcggctcctg caggactcggtggacttctcgctggccgacgccatcaacaccgagttcaagaacacccgc accaacgagaaggtggagctgcaggagctgaatgaccgcttcgccaactacatcgacaag gtgcgcttcctggagcagcagaataagatcctgctggccgagctcgagcagctcaagggc caaggcaagtcgcgcctgggggacctctacgaggaggagatgcgggagctgcgccggcag gtggaccagctaaccaacgacaaagcccgcgtcgaggtggagcgcgacaacctggccgag gacatcatgcgcctccgggagaaattgcaggaggagatgcttcagagagaggaagccgaa aacaccctgcaatctttcagacaggatgttgacaatgcgtctctggcacgtcttgacctt gaacgcaaagtggaatctttgcaagaagagattgcctttttgaagaaactccacgaagag gaaatccaggagctgcaggctcagattcaggaacagcatgtccaaatcgatgtggatgtt tccaagcctgacctcacggctgccctgcgtgacgtacgtcagcaatatgaaagtgtggct gccaagaacctgcaggaggcagaagaatggtacaaatccaagtttgctgacctctctgag gctgccaaccggaacaatgacgccctgcgccaggcaaagcaggagtccactgagtaccgg agacaggtgcagtccctcacctgtgaagtggatgcccttaaaggaaccaatgagtccctg gaacgccagatgcgtgaaatggaagagaactttgccgttgaagctgctaactaccaagac actattggccgcctgcaggatgagattcagaatatgaaggaggaaatggctcgtcacctt cgtgaataccaagacctgctcaatgttaagatggcccttgacattgagattgccacctac aggaagctgctggaaggcgaggagagcagaggggccatagtgttgcttcagatctctggg tactataatcagttgagagcctgtgttcaaaagtcccccaaaaggtctagagattccact taccccagtgaacagttgctcagtaggcagactaaattctatgactctgatctgagggtt aacagtagtccatctatggattctatggacttgatgtatcttatactgtggttcctggaa attcattttatcaggagccagaaatacaacacagcttttggctctgtcattcatccactg tgggtctacctggagccatttgatcctgctgagctgtggccttcccttttgcaaaatgaa gaagaatgcccccttgggccaaatgggacaagagaagcccagcagcttggtaggacccag ttcccaccttattttgatactccagaaataactgtttctctgcagtttggcacctgcaag cggttagcagacctggaggatgtcagcaaagcctctacccaggcccacaggaagtag >gi568815588r:17049043_17301634|GENSCAN_predicted_peptide_4|170_aa MWNNFGNGVTGRDWNNLEGSEEERKMWESLELPKDLLNGFDQNADSDVNNEVQAEVVSDE DEELVGNWNKGDPRYALAKRLAAFCPCTRDLWNFELDRDDLGYLTEEILKQQSIQEVTEH KSLENLQPDNTREKKNLFSGEKFKPAAEICISNEGPNVNHQANGGDVSRA >gi568815588r:17049043_17301634|GENSCAN_predicted_CDS_4|513_bp atgtggaataactttggaaatggggtaacaggcagagattggaacaatttggagggctca gaagaagaaaggaagatgtgggaaagtttggaacttcctaaagacttgttaaatggtttt gaccaaaatgctgatagtgatgtgaacaatgaagtccaggctgaggtggtctcagatgaa gatgaggaacttgttgggaactggaataaaggtgaccctcgctatgctttagcaaagaga ctggcagcattttgcccctgcactagagatctgtggaactttgaacttgacagagatgac ttagggtatctgacagaagaaattcttaaacagcaaagcattcaagaggtgacagaacat aaaagtttggaaaatttgcagcctgacaatacaagagaaaagaaaaatctattttctggg gagaaattcaagccagctgcagaaatttgcataagtaacgaggggccaaatgttaatcac caagccaatgggggagatgtctccagggcataa