GENSCAN 1.0    Date run: 3-Nov-116    Time: 18:08:24

Sequence gi568815597r:47151139_47410319 : 259181 bp : 44.56% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1891   1979   89  0  2   68   98    45 0.284   3.61
 1.02 Intr +   7612   7736  125  1  2   85   54    36 0.092   0.13
 1.03 Intr +  18101  18348  248  0  2   46   41   179 0.136   6.18
 1.04 Term +  19045  19875  831  0  0   20   41   229 0.887   4.42
 1.05 PlyA +  20915  20920    6                               1.05

 2.06 PlyA -  21396  21391    6                               1.05
 2.05 Term -  21898  21752  147  1  0   75   47   120 0.303   4.50
 2.04 Intr -  32905  32854   52  0  1  115   68   100 0.243   9.61
 2.03 Intr -  33959  33864   96  1  0   65   94    65 0.899   3.92
 2.02 Intr -  36289  36181  109  2  1   73   78   165 0.862  13.34
 2.01 Init -  38794  38728   67  1  1   95   65   116 0.979   9.27
 2.00 Prom -  44930  44891   40                              -5.96

 3.03 PlyA -  46177  46172    6                               1.05
 3.02 Term -  48244  48145  100  0  1   78   41    83 0.738   0.20
 3.01 Init -  49032  48851  182  0  2   87   93   101 0.809   9.16
 3.00 Prom -  50984  50945   40                              -6.06

 4.00 Prom +  51917  51956   40                              -5.06
 4.01 Init +  58690  58721   32  0  2   83   59    15 0.228  -2.67
 4.02 Intr +  59554  59731  178  1  1  113   89    26 0.559   5.12
 4.03 Intr +  63673  63801  129  0  0   93   82    52 0.877   5.99
 4.04 Term +  65417  65506   90  1  0    8   42   149 0.689   0.02
 4.05 PlyA +  65544  65549    6                               1.05

 5.06 PlyA -  67267  67262    6                              -3.24
 5.05 Term -  69036  68582  455  1  2  109   45   645 0.989  57.62
 5.04 Intr -  72960  72866   95  2  2  103  101   110 0.979  13.41
 5.03 Intr -  74751  74305  447  2  0  115   71   408 0.028  34.26
 5.02 Intr -  80042  79909  134  2  2   63   65    66 0.003   1.24
 5.01 Init -  81277  81173  105  1  0   87   92    94 0.003   8.51
 5.00 Prom -  94790  94751   40                              -5.36

 6.14 PlyA -  97458  97453    6                               1.05
 6.13 Term - 100784  99998  787  1  1   97   37   344 0.605  23.00
 6.12 Intr - 109401 109151  251  0  2   66   93    58 0.554   0.34
 6.11 Intr - 111975 111765  211  2  1   71   91    94 0.597   6.92
 6.10 Intr - 118728 118497  232  1  1   73   80   209 0.862  15.43
 6.09 Intr - 121055 120938  118  2  1   60   91    96 0.553   7.14
 6.08 Intr - 130071 129808  264  0  0   88   69    94 0.962   5.21
 6.07 Intr - 131321 131219  103  1  1   98   81    44 0.795   4.88
 6.06 Intr - 136522 136413  110  1  2   70   83    75 0.969   4.28
 6.05 Intr - 138447 138297  151  2  1   71  106    61 0.960   6.36
 6.04 Intr - 149014 148767  248  1  2   47  106    17 0.098  -4.34
 6.03 Intr - 150610 150423  188  2  2   96   86    89 0.807   8.91
 6.02 Intr - 151208 151096  113  1  2   82   85    71 0.781   6.22
 6.01 Init - 159181 159138   44  1  2   97   80    10 0.307   1.02
 6.00 Prom - 159887 159848   40                              -8.56

 7.00 Prom + 161471 161510   40                              -2.36
 7.01 Init + 182808 182978  171  2  0    6   78   342 0.966  22.44
 7.02 Intr + 217331 217477  147  1  0   94   86   129 0.991  13.73
 7.03 Intr + 221817 221969  153  2  0  108   80    90 0.993  10.47
 7.04 Intr + 223771 223847   77  0  2  100   65    12 0.987  -1.59
 7.05 Intr + 224059 224155   97  1  1   68  100    98 0.986   9.01
 7.06 Term + 225566 225607   42  1  0  130   31    75 0.993   3.16
 7.07 PlyA + 226487 226492    6                               1.05

 8.02 PlyA - 226548 226543    6                               1.05
 8.01 Term - 230005 229896  110  2  2  118   48    68 0.528   4.47

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  80232  81079  848  1  2   94   53   330 0.884  23.14
S.002 Term + 183051 183161  111  2  0   57   37    91 0.925  -0.54

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:47151139_47410319|GENSCAN_predicted_peptide_1|430_aa
MVVKRCRICCQDGQIGAAPVCSSQKDQRRRIRKSYFKFHVEQKRPSIAKTILSKKNKAGD
ITLSYCKLYYKEIQTTIREYYKHPYENKLENLEEMGKFLDTYTHPRLNHEEAESLNRPIT
GSEIEAIINSLPTKKSPRPDGFTAKFYQRYKEELLIGNFSKVSGYKINVRKSQAFLYTNN
RQTESQIMSELPFTIASKRIKYLGIQITRDVKDLFKGNYKPLLNEIKEDTNKWKNIPCSW
VGRINIVKMAILPKVIYRFNTIPIKLPMTFFTELEKTTLKFIWNQKTAHIAKSILSQKNK
AGGIMLPDFKLYYKAVITKTAWYWYQNRDIDQWNRTEPSEIMPHIYNHLIFDKPDKNKKW
GKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLKENLGNTIQ
DIGMGKDFNV

>gi568815597r:47151139_47410319|GENSCAN_predicted_CDS_1|1293_bp
atggttgtaaaacgctgtaggatctgctgccaagatggccaaataggagcagctccagtc
tgcagctcccagaaagaccaacgcagaagaattagaaaaagctactttaaatttcatgtg
gaacaaaaaaggcccagtatagccaagacaatcctaagcaaaaagaacaaagctggagac
atcaccctatcttactgcaagctatactacaaggaaatacaaactaccatcagagaatac
tataaacacccctatgaaaataaactagaaaatctagaagaaatgggtaaattcctggac
acatacacccacccaagactaaaccatgaagaagctgaatctctgaatagaccaataaca
ggctctgaaattgaggcaataattaatagcttaccaaccaaaaaaagtccaagaccagat
ggattcacagccaaattctaccagaggtacaaggaggagctgctaataggcaacttcagc
aaagtctcaggatacaaaatcaatgtgcgaaaatcacaagcattcttgtacaccaataac
agacaaacagagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaata
aaatacctaggaatccaaattacaagggatgtgaaggacctcttcaaggggaactacaaa
ccactgctcaatgaaataaaagaggatacaaacaaatggaagaatattccatgctcatgg
gtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttatagattcaat
accatccccatcaagctaccaatgactttcttcacagaattggaaaaaactactttaaag
ttcatatggaaccaaaaaacagcccacattgccaagtcaatcctaagccaaaagaataaa
gctggaggcatcatgctacctgacttcaaactatactacaaggctgtaataaccaaaaca
gcatggtactggtaccaaaacagagatatagaccaatggaacagaacagagccctcagaa
ataatgccacatatctacaaccatctgatctttgacaaacctgacaaaaacaagaaatgg
ggaaaggattccctatttaataaatggtgctgggaaaactggctagccatatgtagaaag
ctgaaactggatcccttccttacaccttatacaaaaattaattcaagatggattaaagac
ttaaacgttagacctaaaaccataaaaaccctaaaagaaaacctaggcaataccattcag
gacataggcatgggcaaggacttcaatgtctaa

>gi568815597r:47151139_47410319|GENSCAN_predicted_peptide_2|156_aa
MSALSLLILGLLTAVPPASCQQGLGNLQPWMQGLIAVAVFLVLVAIAFAVNHFWCQEEPE
PAHMILTVGNKADGVLVGTDGRYSSMAASFRSSEHENAYENVPEEEGKTSSLQNCEYYAE
ESPHQHLTNLTMLHPHLRRSASRTVNTMLKRALTST

>gi568815597r:47151139_47410319|GENSCAN_predicted_CDS_2|471_bp
atgtcggccctcagcctcctcattctgggcctgctcacggcagtgccacctgccagctgt
cagcaaggcctggggaaccttcagccctggatgcagggccttatcgcggtggccgtgttc
ctggtcctcgttgcaatcgcctttgcagtcaaccacttctggtgccaggaggagccggag
cctgcacacatgatcctgaccgtcggaaacaaggcagatggagtcctggtgggaacagat
ggaaggtactcttcgatggcggccagtttcaggtccagtgagcatgagaatgcctatgag
aatgtgcccgaggaggaaggcaagacatccagcctccagaactgtgagtactatgctgaa
gagagccctcatcagcacctgaccaacctgaccatgctgcaccctcatcttagacgttca
gcctccagaactgtgaatactatgctgaagagagccctcaccagcacctga

>gi568815597r:47151139_47410319|GENSCAN_predicted_peptide_3|93_aa
MELRPEMSSGCCKQPVHTGTHRHRRIALCSLHPLTCELPLARLWSYGTLEQCLAQEDTYE
SLLTAHFPVRTILPAAVGESLPKCKTFPHRIFA

>gi568815597r:47151139_47410319|GENSCAN_predicted_CDS_3|282_bp
atggagctgcgtcctgagatgtcctccgggtgctgcaaacagcctgtccacactgggact
catcgtcatcggcgcattgctctttgctcactgcacccactaacctgtgagctccctttg
gcacggctgtggagctatggtactttagaacagtgcttggcacaggaagacacctacgaa
agcctgctcactgcccacttccctgttcggaccattcttccagctgcagttggagagagc
ttgccaaaatgcaagactttcccacatcgcatctttgcctaa

>gi568815597r:47151139_47410319|GENSCAN_predicted_peptide_4|142_aa
MAVFKGSPDVGCLTSCPGCSFQSTAHSFPRLYLCPAVPATLNAIPLFAYLTNSYSSFKNQ
LLLEALLDPQPLTTGPFLILLLSPVSTDQANTTTAEIHSQLTPRLNLTILSSQGASLQQR
VTYHRNHKYGQTHPQKAEIVVG

>gi568815597r:47151139_47410319|GENSCAN_predicted_CDS_4|429_bp
atggcagtattcaagggctcacctgatgtagggtgccttaccagctgcccaggctgcagc
ttccaaagcacagcccactctttcccaaggctgtacctctgtcctgctgttcctgctacc
ctgaatgccattcctctctttgcatacctgacaaattcctattcatccttcaagaaccaa
cttcttctggaagccttgcttgacccccaacccttaaccacaggtcccttcctcatactg
ctcctctcacctgtgagcacagaccaggccaacaccaccactgctgagatccacagccag
ctcaccccacggctaaatctcaccatcctatcatcccagggggcaagtcttcagcagagg
gtcacgtaccatcgaaatcacaagtacggccagacccacccccaaaaggccgaaatagtt
gtcggctga

>gi568815597r:47151139_47410319|GENSCAN_predicted_peptide_5|411_aa
MAEALIGGGRTRGSALSLRRTRPRRASAHANEAWNVFEGFVGEWEAPPGLKLSSARRGRR
LRAGSCGLRFPLHSGFIHFWMTERPPSEAARSDPQLEGRDAAEASMAPPHLVLLNGVAKE
TSRAAAAEPPVIELGARGGPGGGPAGGGGAARDLKGRDAATAEARHRVPTTELCRPPGPA
PAPAPASVTAELPGDGRMVQLSPPALAAPAAPGRALLYSLSQPLASLGSGFFGEPDAFPM
FTTNNRVKRRPSPYEMEITDGPHTKVVRRIFTNSRERWRQQNVNGAFAELRKLIPTHPPD
KKLSKNEILRLAMKYINFLAKLLNDQEEEGTQRAKTGKDPVVGAGGGGGGGGGGAPPDDL
LQDVLSPNSSCGSSLDGAASPDSYTEEPAPKHTARSLHPAMLPAADGAGPR

>gi568815597r:47151139_47410319|GENSCAN_predicted_CDS_5|1236_bp
atggccgaggcgcttatcgggggcgggcggacccgcggcagtgccttatctctgcggcgc
acacggccccggcgcgcctcggcccatgctaacgaggcctggaacgtgtttgaggggttc
gttggtgagtgggaggctcctccaggcttgaagctgagcagcgctcgccgaggccggcgc
cttcgagccggtagctgcgggcttcggttcccgctgcactccggcttcatccatttctgg
atgaccgagcggccgccgagcgaggcggctcgcagtgacccccagctagagggacgggac
gcggccgaggccagcatggcccccccgcacctggtcctgctgaacggcgtcgccaaggag
acgagccgcgcggccgcagcggagcccccagtcatcgaactgggcgcgcgcggaggcccg
gggggcggccctgccggtgggggcggcgccgcgagagacttaaagggccgcgacgcggcg
acggccgaagcgcgccatcgggtgcccaccaccgagctgtgcagacctcccgggcccgcc
ccggcccccgcgcccgcctcggttacagcggagctgcccggcgacggccgcatggtgcag
ctgagtcctcccgcgctggctgcccccgccgcccccggccgcgcgctgctctacagcctc
agccagccgctggcctctctcggcagcgggttctttggggagccggatgccttccctatg
ttcaccaccaacaatcgagtgaagaggagaccttccccctatgagatggagattactgat
ggtccccacaccaaagttgtgcggcgtatcttcaccaacagccgggagcgatggcggcag
cagaatgtgaacggggcctttgccgagctccgcaagctgatccccacacatcccccggac
aagaagctcagcaagaatgagatcctccgcctggccatgaagtatatcaacttcttggcc
aagctgctcaatgaccaggaggaggagggcacccagcgggccaagactggcaaggaccct
gtggtgggggctggtgggggtggaggtgggggagggggcggcgcgcccccagatgacctc
ctgcaagacgtgctttcccccaactccagctgcggcagctccctggatggggcagccagc
ccggacagctacacggaggagcccgcgcccaagcacacggcccgcagcctccatcctgcc
atgctgcctgccgccgatggagccggccctcggtga

>gi568815597r:47151139_47410319|GENSCAN_predicted_peptide_6|939_aa
MEPIYPFARPQMNTRNPKLVVTEKTIRLAYRHAKQNKKNSSCFLLGSLTADEDEEGVTLT
VDRFDPGREVPECLEITPTASLPGDFLIPCKVHTQELCSREMIVHSVDDFSSALKALQCH
ICSKDSLDCGKLLSLRVHITSRESLDSVEFDLHWAAVTLANNFKCTPVKPIPIIPTALAR
NLSSNLNISQVQGTYKYGVFSESGNFIIVLYSMTHKEPEFYECFPCDGKIPDFRFQLLTS
KETLHLFKNVEPPDKNPIRCELSAESQNAETEFFSKASKNFSIKRSSQKLSSGKMPIHDH
DSGVEDEDFSPRPIPSPHPISKIQPSVPELSLVLDGNFIESNPLPTPLEMVNNENPPLIN
HLEHLKPLQPQLYDEKHSPEVEAGEPSLRGIPNQLNQDKPALLRHCKTTAVEDTVQAGRQ
MELVSVEAQSSPGLHMRKGVSIAVSTGASLFWNAAGEDQEPDSQMKQDDTKISSEDMNFS
VDINNEVTSLPGSASSLKAVDIPSFEESNIAVEEEFNQPLSVSNSSLVVRKEPDVPVFFP
SGQLAESVSMCLQTGPTGGASNNSETSEEPKIEHVMQPLLHQPSDNQKIYQDLLGQVNHL
LNSSSKETEQPSTKAVIISHECTRTQNVYHTKKKTHHSRLVDKDCVLNATLKQLRSLGVK
IDSPTKVKKNAHNVDHASVLACISPEAVISGLNCMSFANVGMSGLSPNGVDLSMEANAIA
LKYLNENQLSQLSVTRSNQNNCDPFSLLHINTDRSTVGLSLISPNNMSFATKKYMKRYGL
LQSSDNSEDEEEPPDNADSKSEYLLNQNLRSIPEQLGGQKEPSKNDHEIINCSNCESVGT
NADTPVLRNITNEVLQTKAKQQLTEKPAFLVKNLKPSPAVNLRTGKAEFTQHPEKENEGD
ITIFPESLQPSETLKQMNSMNSVGTFLDVKRLRQLPKLF

>gi568815597r:47151139_47410319|GENSCAN_predicted_CDS_6|2820_bp
atggagcctatatatccttttgcacggccccagatgaataccagaaatccaaagcttgtg
gtgactgagaagaccatccgacttgcttatcgtcatgctaagcagaataaaaaaaattcg
tcatgctttttacttggttctctgacagcagacgaagatgaagaaggtgtaacattgaca
gtagatcgctttgatcctggtcgagaagtacctgaatgcctagaaataacccctactgct
tctcttcctggggactttttgattccatgcaaagttcatactcaagaactttgttcaaga
gaaatgatagttcacagtgtagatgacttcagttcagctttaaaggctctacagtgccat
atatgtagcaaagattccttggactgtggtaagctgctttccctaagagttcatatcact
tccagggagagtttggacagtgtggaatttgacttgcattgggcagcagtaactctagca
aataactttaaatgcacacctgtgaagcccatccccattattccaacagctctggcaaga
aacttgagcagtaatctgaatatttctcaagttcaagggacttataaatatggggttttt
tcagaatctggaaatttcatcatagttctctattctatgacacataaggaacctgagttt
tatgaatgcttcccttgtgatggcaagatacctgactttcggtttcagttgctaaccagt
aaggaaacattacatcttttcaaaaatgttgaacctcctgacaaaaatccaatccgttgt
gaactgagcgctgaaagccaaaatgcagaaacagagtttttcagtaaggcttccaagaat
ttttcaattaagaggtcttcccaaaagttatcttctgggaagatgccaatacatgatcac
gactctggtgttgaagatgaagatttttctccaagaccaattcctagtcctcatccaatt
tctaagatccaaccatcagttcctgaactttcacttgtgttggatggcaatttcatagaa
tcaaaccctctgcctactccattggaaatggtgaataatgaaaatcctcctttgattaac
cacttggaacacttgaagccattgcaaccccagctttatgatgagaaacacagtccagaa
gttgaagctggagagccttccttgagaggaataccaaatcagttaaaccaggataaacca
gctcttttgagacactgcaaaacaactgctgttgaagacacagtgcaagctggaagacaa
atggagttggtttctgtggaagcacagtcttcccctggcttgcacatgagaaaaggtgta
agcattgctgtgagcacaggtgctagcttgttttggaatgcagcaggtgaggatcaagag
cctgactctcaaatgaagcaagatgataccaaaatttccagtgaggacatgaatttttct
gtcgatattaataatgaagtcacaagtcttccaggtagtgcatcttcattaaaagcagtt
gatattcccagttttgaagagagcaacattgctgtggaagaagaatttaaccagccactt
tctgtatccaactcttctctagttgtgagaaaagaacctgatgtacctgtgttctttcca
agtggccagctggcagaaagtgtaagcatgtgtttacagactggaccaacagggggtgcc
agtaacaattctgaaacatcagaggaaccaaaaattgagcatgtaatgcaacccttgctt
catcaaccatcagataaccagaaaatttaccaggatttattgggtcaagtaaaccaccta
ttaaatagttcctccaaggaaactgagcagccgtctaccaaagcagtaattatcagtcat
gaatgcaccagaacccaaaacgtttaccatacaaagaaaaaaacacatcattcaagactg
gtggacaaagattgtgtccttaatgcaactcttaagcaactaagaagccttggagtaaaa
attgattctcccactaaagtgaagaaaaatgcacataacgtggatcacgccagtgtgttg
gcatgcatcagcccagaagcagtgatctctggattaaactgcatgtcatttgctaatgtt
ggcatgagcggcttaagccccaatggtgtggatttgagcatggaggcaaatgctatagct
ctgaaatatttaaatgaaaatcagctgtcacaactgtctgtcactcgatcgaaccaaaat
aattgtgacccattcagccttctccatattaatacagacagaagcacagtggggcttagt
ttaatttcaccaaacaacatgtcatttgcaaccaaaaaatatatgaagagatatggactc
ctacaaagcagtgacaatagtgaagatgaagaggaacctcccgacaatgcagatagcaag
agtgaatatttattgaatcagaaccttaggtccatacctgaacagcttggtggtcagaaa
gagccttctaagaatgaccatgaaataattaattgttctaactgtgaatctgtggggacc
aacgcagatacgccagtattgagaaatattacaaatgaagttttgcagacaaaagcaaaa
cagcagttgactgaaaagccagctttcttagtaaagaaccttaaaccaagtcctgcagtg
aaccttcgaaccgggaaagcagagttcactcaacatcctgagaaagaaaatgaaggggac
attacaatttttcctgaaagtttgcaaccttctgaaacgctaaagcagatgaatagcatg
aattcagtaggcaccttcttagatgtaaaacgtctcagacagttaccaaaattattttaa

>gi568815597r:47151139_47410319|GENSCAN_predicted_peptide_7|228_aa
MLSRCRSGLLHVLGLSFLLQTRRPILLCSPRLMKPLVVFVLGGPGAGKGTQCARIVEKYG
YTHLSAGELLRDERKNPDSQYGELIEKYIKEGKIVPVEITISLLKREMDQTMAANAQKNK
FLIDGFPRNQDNLQGWNKTMDGKADVSFVLFFDCNNEICIERCLERGKSSGRSDDNRESL
EKRIQTYLQSTKPIIDLYEEMGKVKKIDASKSVDEVFDEVVQIFDKEG

>gi568815597r:47151139_47410319|GENSCAN_predicted_CDS_7|687_bp
atgctgagccgctgccgcagcgggctgctccacgtcctgggccttagcttcctgctgcag
acccgccggccgattctcctctgctctccacgtctcatgaagccgctggtcgtgttcgtc
ctcggcggccccggcgccggcaaggggacccagtgcgcccgcatcgtcgagaaatatggc
tacacacacctttctgcaggagagctgcttcgtgatgaaaggaagaacccagattcacag
tatggtgaacttattgaaaagtacattaaagaaggaaagattgtaccagttgagataacc
atcagtttattaaagagggaaatggatcagacaatggctgccaatgctcagaagaataaa
ttcttgattgatgggtttccaagaaatcaagacaaccttcaaggatggaacaagaccatg
gatgggaaggcagatgtatctttcgttctcttttttgactgtaataatgagatttgtatt
gaacgatgtcttgagaggggaaagagtagtggtaggagtgatgacaacagagagagcttg
gaaaagagaattcagacctaccttcagtcaacaaagccaattattgacttatatgaagaa
atggggaaagtcaagaaaatagatgcttctaaatctgttgatgaagtttttgatgaagtt
gtgcagatttttgacaaggaaggctaa

>gi568815597r:47151139_47410319|GENSCAN_predicted_peptide_8|36_aa
XGQSHEGKASLKKRNLREAGSALMNSHKGIPIKVWA

>gi568815597r:47151139_47410319|GENSCAN_predicted_CDS_8|111_bp
ngaggacaaagccatgaaggaaaagcctcactgaagaagaggaacctaagagaagcaggt
tctgctctgatgaactcccataaaggcattcccataaaggtgtgggcctga