GENSCAN 1.0 Date run: 3-Nov-116 Time: 05:32:51 Sequence gi568815592r:40291946_40533113 : 241168 bp : 47.26% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2754 2782 29 2 2 98 76 38 0.169 2.70 1.02 Term + 28165 28285 121 1 1 99 49 57 0.047 0.85 1.03 PlyA + 34366 34371 6 1.05 2.11 PlyA - 39398 39393 6 1.05 2.10 Term - 40926 40811 116 1 2 101 49 99 0.924 6.03 2.09 Intr - 47827 47582 246 2 0 39 49 127 0.070 1.33 2.08 Intr - 48814 48774 41 0 2 53 111 51 0.133 1.77 2.07 Intr - 51062 50956 107 2 2 92 24 63 0.186 -0.69 2.06 Intr - 54726 54630 97 2 1 67 77 63 0.135 3.11 2.05 Intr - 61584 61494 91 1 1 99 91 4 0.154 0.85 2.04 Intr - 78160 78091 70 1 1 39 77 81 0.159 0.85 2.03 Intr - 85135 85032 104 2 2 104 48 26 0.077 0.09 2.02 Intr - 86623 86560 64 1 1 4 111 78 0.337 -0.01 2.01 Init - 86788 86645 144 1 0 64 16 132 0.393 3.72 2.00 Prom - 95491 95452 40 -5.26 3.12 PlyA - 96900 96895 6 1.05 3.11 Term - 100967 99998 970 1 1 112 43 1336 0.491 123.03 3.10 Intr - 111049 110916 134 1 2 115 4 36 0.014 -2.66 3.09 Intr - 116472 116417 56 1 2 120 24 78 0.231 3.30 3.08 Intr - 118811 118772 40 2 1 75 90 12 0.031 -2.10 3.07 Intr - 125403 125269 135 0 0 69 41 96 0.263 3.76 3.06 Intr - 139662 139621 42 0 0 108 110 3 0.255 2.94 3.05 Intr - 141186 139783 1404 0 0 140 69 2081 0.683 200.88 3.04 Intr - 149561 149460 102 2 0 103 56 21 0.214 0.77 3.03 Intr - 167423 167313 111 2 0 30 98 114 0.748 7.18 3.02 Intr - 173961 173874 88 2 1 117 58 7 0.110 0.57 3.01 Init - 176895 176837 59 0 2 93 74 20 0.142 1.88 3.00 Prom - 177169 177130 40 -10.84 4.00 Prom + 177360 177399 40 -5.56 4.01 Init + 178263 178317 55 2 1 94 90 -1 0.448 2.15 4.02 Intr + 180297 180355 59 1 2 110 75 19 0.615 1.40 4.03 Intr + 185833 185880 48 0 0 89 89 66 0.790 5.68 4.04 Intr + 191709 191813 105 2 0 97 16 63 0.460 0.41 4.05 Intr + 193555 193690 136 0 1 65 7 122 0.670 1.94 4.06 Intr + 195521 195617 97 0 1 101 80 72 0.910 6.77 4.07 Term + 202698 202809 112 0 1 59 38 94 0.112 -0.47 4.08 PlyA + 203402 203407 6 1.05 5.07 PlyA - 206505 206500 6 1.05 5.06 Term - 207684 207622 63 0 0 104 35 57 0.043 -0.11 5.05 Intr - 211608 211477 132 0 0 57 55 123 0.130 6.74 5.04 Intr - 225411 225359 53 1 2 103 97 6 0.239 1.63 5.03 Intr - 226808 226690 119 1 2 73 73 21 0.072 -0.79 5.02 Intr - 232016 231887 130 0 1 103 77 11 0.277 1.25 5.01 Init - 234621 234525 97 0 1 93 110 41 0.328 7.27 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 217489 217395 95 1 2 70 56 127 0.814 7.55 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:40291946_40533113|GENSCAN_predicted_peptide_1|49_aa MRVVSLKVHTFCFSILGGKRHSGDESGSCSEEKMAVIVECCIPGESTVL >gi568815592r:40291946_40533113|GENSCAN_predicted_CDS_1|150_bp atgagggtggtgtccctgaaggtgcacactttctgttttagcattttgggaggaaaaagg cattctggtgatgaaagtgggtcctgctcggaggagaagatggctgtcattgtagagtgc tgcatccctggggaatccacagtcctctga >gi568815592r:40291946_40533113|GENSCAN_predicted_peptide_2|359_aa MEFFFRERENTEVKKSRMGEYQTAAAAKPGKRLKRVLGGSRRTFPPQPARTGSSGPGKAS VETSKTTLPGSAFSPFAFYHDWKCPEASPEAETAMYAFCTACGTDGYYQKNQSVEEDVEK LESLYIVGNQHKRPGPGGGDAGAPTQRLYFLFAVFQARLHRKHDPVSAPGEASGSFYSWW KMKQEQVRHMFKLVIVMLLPYHPLTFKSQLFLNKERIGLSLRHGEGKPTDDKHYVGPGAV LHSSLAKGGLKVGKCQGYISPAYQRGSAVPGFSHLILNACLHPAWIQKPPAPHKGDNMPS MAAASDIDTSTSTTINAVLQAGATFLNLVSETGSGSELGKQAASIMEIKKAVNGCVFIS >gi568815592r:40291946_40533113|GENSCAN_predicted_CDS_2|1080_bp atggaattcttcttccgtgagagagaaaatacggaggtgaagaagagcagaatgggcgag taccagactgcagctgcagccaagccagggaagaggctaaagagggtcctcggcggctcc cgcaggacattccctccccagccggcgaggacgggcagcagcggccctgggaaggcttcc gtggaaacttccaaaaccaccttgccaggaagtgctttctccccctttgccttctaccat gactggaagtgtcctgaggcctccccagaagcagaaactgctatgtatgctttctgtaca gcctgtggaactgatggctattatcagaaaaaccaaagtgttgaagaggatgtggaaaaa ctggaatccttgtacattgttggcaaccaacacaaaaggcctggaccaggtggtggagat gctggggctcctacacagaggctctattttctgtttgcagtcttccaggccaggctacac aggaagcacgacccagtatctgctcctggtgaggcctcagggagcttttactcttggtgg aagatgaagcaggaacaggtgcgacacatgtttaagctggtaattgtcatgctgctcccg tatcatccactgacatttaagtcccagctctttctaaataaggagcggattggcctcagc ttgcgccatggggagggcaagcccacagatgacaagcattacgtgggaccaggagcagtt ctgcacagttccctggcaaaaggtgggctgaaggtgggaaaatgccaagggtatatctca cctgcttaccaacggggctctgctgtgcctggattcagccacctcatcctcaatgcctgc cttcaccctgcctggatacagaagcccccagctccccacaagggagacaacatgccctcc atggcagctgccagtgatatagacacctccacatccaccaccattaatgcagtcctacag gcaggtgctacattcttaaatcttgtatcagaaactggttcaggcagtgagcttggcaaa caagcagcttccatcatggagatcaagaaagcagtaaatggctgtgtcttcatcagttag >gi568815592r:40291946_40533113|GENSCAN_predicted_peptide_3|1046_aa MIPILQMKPERRRDMHEEMGLKPPFPSRNLPEGSSGKLEVTAVLLWVDSRDQGNTSSLTI EVPLAYYSYDFSIIDFVHFVENVVAKADSLTSRPPLVKSSPPLPTQPSIKRHTRSLTSIH ALSDQTMETLLGGLLAFGMAFAVVDACPKYCVCQNLSESLGTLCPSKGLLFVPPDIDRRT VELRLGGNFIIHISRQDFANMTGLVDLTLSRNTISHIQPFSFLDLESLRSLHLDSNRLPS LGEDTLRGLVNLQHLIVNNNQLGGIADEAFEDFLLTLEDLDLSYNNLHGLPWDSVRRMVN LHQLSLDHNLLDHIAEGTFADLQKLARLDLTSNRLQKLPPDPIFARSQASALTATPFAPP LSFSFGGNPLHCNCELLWLRRLERDDDLETCGSPGGLKGRYFWHVREEEFVCEPPLITQH THKLLVLEGQAATLKCKAIGDPSPLIHWVAPDDRLVGNSSRTAVYDNGTLDIFITTSQDS GAFTCIAANAAGEATAMVEVSIVQLPHLSNSTSRTAPPKSRLSDITGSSKTSRGGGGSGG GEPPKSPPERAVLVSEVTTTSALVKWSVSKSAPRVKMYQLQYNCSDDEVSVGILGQTVPF VQVCVDRSGGCRGTEHGDRILEAEMVAAAAAGTEHQIFLPPVKMPPRAAHLLPPGGVLLR DLTGPPKPIDNRTKKPLPRPHRPPHCFYLKAFALAPPSCLETLPMDIHVVPSLPSEFPGA CVPKMIPASNKAFVVNNLVSGTGYDLCVLAMWDDTATTLTATNIVGCAQFFTKADYPQCQ SMHSQILGGTMILVIGGIIVATLLVFIVILMVRYKVCNHEAPSKMAAAVSNVYSQTNGAQ PPPPSSAPAGAPPQGPPKVVVRNELLDFTASLARASDSSSSSSLGSGEAAGLGRAPWRIP PSAPRPKPSLDRLMGAFASLDLKSQRKEELLDSRTPAGRGAGTSARGHHSDREPLLGPPA ARARSLLPLPLEGKAKRSHSFDMGDFAAAAAGGVVPGGYSPPRKVSNIWTKRSLSVNGML LPFEESDLVGARGTFGSSEWVMESTV >gi568815592r:40291946_40533113|GENSCAN_predicted_CDS_3|3141_bp atgattcccattttacaaatgaagccagaaaggagaagagacatgcatgaggagatgggg ttaaagcctcccttcccctccaggaacttacctgagggcagcagtggcaagttagaggtc actgctgttcttctctgggtggacagtagagaccaggggaacacatcgtccctcacaatt gaagttcctctggcctattactcttatgacttctccatcatagactttgttcactttgtt gaaaatgtggtggccaaggcagactcactcacatccaggcccccactggtgaaatccagc cctcctttacccacacagccctccatcaagagacacacccgttcactcacctccattcat gcgctgagtgaccagaccatggagaccctgcttggtggcctgctagcgtttggcatggcg tttgccgtggtcgacgcctgccccaagtactgtgtctgccagaatctgtctgagtcactg gggaccctgtgcccctccaaggggctgctctttgtaccccctgatattgaccggcggaca gtggagctgcgcctgggcggcaacttcatcatccacatcagccgccaggactttgccaac atgacggggctggtggacctgaccctgtccaggaacaccatcagccacatccagcccttt tcctttctggacctcgagagcctccgctccctgcatcttgacagcaatcggctgccaagc cttggggaggacaccctccggggcctggtcaacctgcagcaccttatcgtgaacaacaac cagctgggcggcatcgcagatgaggcttttgaggacttcctgctgacattggaggatctg gacctctcctacaacaacctccatggcctgccgtgggactccgtgcgacgcatggtcaac ctccaccagctgagcctggaccacaacctgctggatcacatcgccgagggcacctttgca gacctgcagaaactggcccgcctggatctcacctccaatcggctgcagaagctgccccct gatcccatctttgcccgctcccaggcttcggctttgacagccacaccctttgccccaccc ttgtcctttagttttgggggtaacccacttcactgcaattgtgagcttctctggctgcgg aggctcgagcgggacgatgacctggaaacctgtggctccccagggggcctcaagggtcgc tacttctggcatgtgcgtgaggaggagtttgtgtgcgagccgcctctcatcacccagcac acacacaagttgctggttctggagggccaggcggccacactcaagtgcaaagccattggg gaccccagcccccttatccactgggtagcccccgatgaccgcctggtagggaactcctca aggaccgctgtctatgacaatggcaccctggacatcttcatcaccacatctcaggacagt ggtgccttcacctgcattgctgccaatgctgccggagaggccacggccatggtggaggtc tccatcgtccagctgccacacctcagcaacagcaccagccgcactgcaccccccaagtcc cgcctctcagacatcactggctccagcaagaccagccggggaggtggaggcagtgggggc ggagagcctcccaaaagccccccggaacgggctgtgcttgtgtctgaagtgaccaccacc tcggccctggtcaagtggtctgtcagcaagtcagcaccccgggtgaagatgtaccagctg cagtacaactgctctgacgatgaggtgtctgtggggattctggggcaaactgtgccattc gtgcaggtgtgcgtggacaggtcgggaggctgccgaggcactgagcatggagacaggatc ttggaagctgagatggtggcagcagcggcagccggcactgaacaccagatatttttacca cccgtcaaaatgccaccgcgtgcggctcatcttctgccaccaggaggggtcctgctgcga gatctgacaggaccacccaagcccatagacaatagaaccaagaagcccctgccgaggcct caccgtcctcctcactgtttctacctcaaggcctttgcactggcacctccttcctgcctg gaaactctgcccatggatatccacgtggttccctcacttccctcagaattcccaggagca tgcgtcccgaagatgatcccagcctccaacaaggccttcgtggtcaacaacctggtgtca gggactggctacgacttgtgtgtgctggccatgtgggatgacacagccacgacactcacg gccaccaacatcgtgggctgcgcccagttcttcaccaaggctgactacccgcagtgccag tccatgcacagccagattctgggcggcaccatgatcctggtcatcgggggcatcatcgtg gccacgctgctggtcttcatcgtcatcctcatggtgcgctacaaggtctgcaaccacgag gcccccagcaagatggcagcggccgtgagcaatgtgtactcgcagaccaacggcgcccag ccaccgcctccaagcagcgcaccagccggggccccgccgcagggcccgccgaaggtggtg gtgcgcaacgagctcctggacttcaccgccagcctggcccgcgccagtgactcctcttcc tccagctccctgggcagtggggaggctgcggggctgggacgggccccctggaggatccca ccctccgccccgcgccccaagcccagccttgaccgcctgatgggggccttcgcctccctg gacctcaagagtcagagaaaggaggagctgctggactccaggactccagccgggagaggg gctgggacgtcggcccggggccaccactcggaccgagagccactgctggggccccctgcg gcccgggccaggagcctgctccccttgccgttggagggcaaggccaaacgcagccactcc ttcgacatgggggactttgctgctgcggcggcgggaggggtcgtgccgggcggctacagt cctcctcggaaggtctcgaacatctggacgaagcgcagcctctctgtcaacggcatgctc ttgccctttgaggagagtgacctggtgggggcccgggggacttttggcagctccgaatgg gtgatggagagcacggtctag >gi568815592r:40291946_40533113|GENSCAN_predicted_peptide_4|203_aa MGWVHLRKHNAHSFHTCSELDRCFQVKTIGRFHQNPQQSVLLKELQAKLFSAMVGNKGSE VAGNEEQQFHDVGMRRGKEDYLAASRRKQLGPTPLDRDKSADNAASDKINSSNIDSKALK CPLIEKNSLAQCPRAFLISLNGNSMEPILASLQNAHIPKKSADGKGSTKDLILLFNIPSQ GHNLDLIIIHNTSPLKALFQAAP >gi568815592r:40291946_40533113|GENSCAN_predicted_CDS_4|612_bp atgggctgggtgcatctgagaaaacacaatgcccactccttccacacctgctcagagctg gataggtgtttccaggtcaagacaattggtcgattccatcaaaacccacagcagagtgtt ttgctgaaagagctgcaggcaaagctctttagcgccatggtgggaaacaaaggctcagaa gtggcaggaaatgaggagcagcaattccatgatgtgggcatgagaagggggaaggaagat tacctggcagcttccagacggaagcagctgggccccactcccctggacagagataaatca gcagacaatgcagctagtgacaagataaactcctccaacatcgacagcaaggctctgaaa tgccctctcatagagaagaattctctggctcagtgcccccgagccttcctcatctcacta aatggcaactccatggagcctatccttgcatctctgcaaaatgcacacatcccgaaaaag tcggcagatgggaaagggagcaccaaggacctcatccttctcttcaacatcccctcacaa ggccacaatctagacctcatcatcatccataacacctcacctctgaaagctttatttcaa gcagccccatga >gi568815592r:40291946_40533113|GENSCAN_predicted_peptide_5|197_aa MVEPPELQGRGENVMQVWRTGVLKTGTHSQDPDTVGWQLEEYWPWEDPALSLRLCLIISL SLGQSTHLFVLPQRQSAWNTVGTQYVYWLNDNEWMTKCILNDLYWLKEGLCGSHQSRPGP PPRNHTFIPHTHKDLFLDVLKVPKRYLSTIELVGFSSNGGLLAWLCVPEDRPPSAAPLGK REKPRTHCNLKRSPAKP >gi568815592r:40291946_40533113|GENSCAN_predicted_CDS_5|594_bp atggtggaacccccagagctccagggcaggggagagaacgtcatgcaggtgtggagaact ggtgtgctaaagacaggtactcattcacaagatccagacacagttggatggcaattggaa gagtactggccctgggaagacccagctttgagcctcaggttgtgtctaatcatctctctg tccttgggccagtcaactcacctgttcgtgctgccacagagacagagtgcttggaataca gtaggcactcaatatgtgtattggttgaatgacaatgaatggatgactaagtgtatcctg aatgatctgtactggctcaaggaggggttgtgtgggagccatcaatcacgacctggccct ccacctaggaatcataccttcatcccccatactcataaggatctcttcttggatgtctta aaagtccccaaacgctacctgtctacaatagagcttgtgggcttctccagcaatggtggt cttcttgcatggctctgtgtcccagaggaccgccctccatcggctgcaccactggggaaa agagagaaaccacgaacacattgcaacctgaagaggagccctgcgaagccctaa