GENSCAN 1.0 Date run: 6-Nov-116 Time: 22:30:25 Sequence gi568815591r:117178158_117422989 : 244832 bp : 39.16% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 3317 3356 40 -2.25 1.01 Init + 6456 6542 87 2 0 65 62 66 0.427 2.39 1.02 Intr + 12174 12271 98 2 2 49 35 146 0.439 3.29 1.03 Intr + 12677 12779 103 2 1 87 87 42 0.591 3.26 1.04 Intr + 31630 31780 151 0 1 76 92 32 0.041 1.21 1.05 Intr + 32037 32221 185 1 2 9 70 175 0.042 6.39 1.06 Intr + 38093 38114 22 1 1 74 97 13 0.437 -3.00 1.07 Intr + 40927 41019 93 2 0 114 103 118 0.928 14.92 1.08 Term + 46653 46903 251 1 2 89 39 214 0.670 11.48 1.09 PlyA + 47124 47129 6 1.05 2.04 PlyA - 48479 48474 6 1.05 2.03 Term - 48813 48721 93 0 0 67 39 121 0.226 1.95 2.02 Intr - 50513 50383 131 0 2 42 92 47 0.109 -0.01 2.01 Init - 52153 52015 139 1 1 72 94 63 0.270 5.66 2.00 Prom - 55209 55170 40 -5.05 3.05 PlyA - 56823 56818 6 1.05 3.04 Term - 60709 60257 453 1 0 43 42 251 0.211 10.17 3.03 Intr - 62763 62676 88 2 1 119 91 15 0.240 3.95 3.02 Intr - 75830 75719 112 0 1 53 61 116 0.283 3.92 3.01 Init - 79081 78961 121 1 1 46 92 39 0.400 0.50 3.00 Prom - 79468 79429 40 -3.45 4.04 PlyA - 79888 79883 6 1.05 4.03 Term - 85032 84790 243 0 0 14 43 226 0.993 5.62 4.02 Intr - 85238 85050 189 2 0 89 87 107 0.857 9.56 4.01 Init - 86568 86527 42 0 0 117 36 44 0.599 2.57 4.00 Prom - 89915 89876 40 -6.95 5.00 Prom + 91729 91768 40 -5.25 5.01 Init + 96325 96381 57 0 0 79 81 46 0.044 4.36 5.02 Intr + 100055 100260 206 1 2 33 47 153 0.039 2.78 5.03 Term + 101800 102073 274 1 1 106 43 97 0.029 0.96 5.04 PlyA + 102269 102274 6 1.05 6.00 Prom + 102827 102866 40 -6.95 6.01 Init + 103703 103772 70 1 1 42 71 113 0.787 6.26 6.02 Intr + 104171 104490 320 0 2 -11 65 222 0.402 4.75 6.03 Term + 105452 105550 99 1 0 123 47 83 0.937 4.85 6.04 PlyA + 106088 106093 6 1.05 7.15 PlyA - 106302 106297 6 -0.45 7.14 Term - 106999 106701 299 2 2 84 33 219 0.992 10.44 7.13 Intr - 119719 119455 265 1 1 109 111 266 0.720 27.16 7.12 Intr - 122912 122688 225 2 0 -4 63 190 0.006 4.56 7.11 Intr - 137191 136914 278 2 2 53 86 289 0.382 21.51 7.10 Intr - 142636 142410 227 0 2 118 100 201 0.658 21.01 7.09 Intr - 144766 144599 168 0 0 48 89 136 0.903 7.84 7.08 Intr - 145754 145290 465 1 0 104 -2 249 0.626 8.81 7.07 Intr - 169519 169368 152 1 2 -25 76 127 0.000 -1.76 7.06 Intr - 189281 189195 87 2 0 68 71 82 0.268 3.75 7.05 Intr - 190560 190455 106 2 1 80 61 51 0.752 0.90 7.04 Intr - 201890 201781 110 2 2 47 98 120 0.837 7.06 7.03 Intr - 207652 207541 112 0 1 110 37 144 0.995 10.96 7.02 Intr - 208475 208333 143 2 2 57 57 96 0.363 1.63 7.01 Init - 227007 226918 90 0 0 80 93 99 0.820 10.14 7.00 Prom - 236810 236771 40 -4.95 8.03 PlyA - 237354 237349 6 1.05 8.02 Term - 242117 242002 116 0 2 74 48 29 0.266 -4.75 8.01 Intr - 244202 244080 123 0 0 49 90 163 0.945 12.24 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:117178158_117422989|GENSCAN_predicted_peptide_1|329_aa MFEEGQGGYKYKGVAPGSSSVVMEQIYILFGHAHTLCRNGPKQARLLLKLQRFNLHRSTD IGFSPEAASRRGLSTAEMNAVEAIHRAVEFNPHVPKYLLEMKSLILPPEHILKRGDSEAI AYAFFHLAHWKRVEGALNLLHCTWEGRAPCHPENPVWKLGQPHSIMVIFGIGLGFVLDTD DHFLMAFVGGLAKSHSYPPLEPDRPSKKQIHNQNQAFRMIPYPLEKGHLFYPYPICTETA DRELLPWTLLGKLNLRAATCSCMVCHHSCSGGSFSGISGFPGSTLESSKNLKVTEACPHN RNFLGLGSAWATQLSSTPLVMRMYNQCIE >gi568815591r:117178158_117422989|GENSCAN_predicted_CDS_1|990_bp atgtttgaggaaggacagggagggtacaaatataaaggggtagcaccagggagttcctct gtggtaatggaacagatctatatcttgtttgggcatgcacacaccctctgccggaatggc cccaagcaggctcgcctgctgctgaagctgcaacgatttaacctccatcgcagcactgac attggattctctcctgaggctgcatctcggcgggggctgagcacagcagagatgaatgca gtagaggccattcatagagctgtggaattcaatcctcatgtgccaaaatacctactagaa atgaaaagcttaatcctacccccagaacatatcctgaagagaggagacagtgaagcaata gcatatgcattctttcatcttgcacactggaagagagtggaaggggctttgaatcttttg cattgtacgtgggaaggcagggcaccttgtcaccctgaaaaccctgtttggaaacttgga cagccccacagcatcatggtgatctttggcattgggttaggcttcgtgctggacactgat gaccacttcttgatggcctttgtgggagggttggctaagagccacagttatcctccactg gaaccagaccggccatccaaaaagcaaattcacaaccaaaatcaagcttttcggatgatc ccttatcccttggaaaaggggcacctattttatccttacccaatctgtacagaaacagca gaccgagagctgcttccatggacactgcttggaaagctgaaccttcgagcagccacttgc agctgcatggtgtgccatcacagctgttcaggtggaagcttttcaggaatcagtggtttc cctggctctaccttggaatcatcgaaaaacttaaaagttactgaggcctgtccccacaac cggaactttcttggtctggggtccgcatgggcaactcagttgtcaagcactcccctggtg atgcggatgtacaaccagtgtattgaatag >gi568815591r:117178158_117422989|GENSCAN_predicted_peptide_2|120_aa MVLLNPRDSIFQASAYKGFSGPAEMCSISTETMMWHSVDPGSQETQGKCSVDCGSSRRGM VGKFPKFADPESVLLFHFVFGPVEHTLGRQQSTVVPSFLDDTCPSQQGSTRIRSPNASNV >gi568815591r:117178158_117422989|GENSCAN_predicted_CDS_2|363_bp atggtcctgttgaacccaagagattccattttccaagcctctgcctataaggggttctca gggcctgctgagatgtgttccatcagcacagaaaccatgatgtggcattcagtagatcca ggttcccaggaaactcaaggtaaatgttctgtggactgtgggtcttcgagaagaggtatg gtgggcaagtttcccaaatttgctgacccagaatctgtcctgttgtttcattttgttttt ggaccagtagaacacacactgggaaggcagcaaagtactgtggtcccttctttcctggat gacacctgcccttctcaacaaggttccacaagaatacgaagccccaatgcttccaacgta tag >gi568815591r:117178158_117422989|GENSCAN_predicted_peptide_3|257_aa MKIQNALLGCGCTGKQRSFLPWPPYLARCPHLPECGKQVLVTRAVQGCVSLTQGPRLGLR ECDLILVASSAEPAFKRRCPWAVRRPQGLPGGPWPTFKPSTKGPSKQWDFGVWSSLSQRV VEEKNRSSLEKLVAVGGGGLGGWPPLPRRPRVSRRQAGEVRGITGWSVGLQVASRPQKRQ QGTHMGSLAWGARGSCGLSMLQKQKAEDFRDAVGLDDAMLSETPRIQVSEGTSALEVESI LFRTHVTFPAGLRGWGS >gi568815591r:117178158_117422989|GENSCAN_predicted_CDS_3|774_bp atgaaaatccaaaatgccctactaggctgtggctgtactggcaagcagagatcttttctc ccctggcccccttatcttgcaagatgcccacaccttcctgaatgtggtaaacaagttctg gttaccagggctgtgcagggctgtgtcagtctgacacagggaccacgtctaggactcagg gagtgtgacctgatccttgtcgcctcctctgcagagcctgccttcaaaagaaggtgtcca tgggcggttagaaggccccagggccttcctgggggtccatggcccacattcaaacccagc accaaggggccttctaagcagtgggattttggagtttggagctctttatcccagagagtt gtcgaggagaaaaacagaagcagcctggaaaagcttgtggctgtcggaggtggggggctg gggggatggcccccactgccgagacggcccagggtgtccaggaggcaggctggagaggtg agaggcataacggggtggtctgtgggcctgcaagtggcatccagacctcagaagaggcag caggggacacacatggggagcctggcatggggggcccgggggtcctgtggcctgagcatg ctgcagaagcagaaagctgaggacttcagggatgctgtgggcctggatgatgctatgctg tcagagacacctaggatacaggtgtctgaaggcacatccgctctggaggtggagagcatc ctgttcaggactcacgtgacattccctgctggcttgaggggatggggctcataa >gi568815591r:117178158_117422989|GENSCAN_predicted_peptide_4|157_aa MVQADSVMNGSPHKIIHSSIGRLTEVKSLQTGAVMRRVFASVNFGDHKFMDKIVIRKQHT MNGHNCEVGKSLAKQEIVKVVLKTSAVVVEVILEGMATLIMEEITMVVIALVAAMMVVDM MPVGIVIMVLVMIEAILEVVEATVDNGNYNNLQILDP >gi568815591r:117178158_117422989|GENSCAN_predicted_CDS_4|474_bp atggtgcaagcagattcagtcatgaatggaagtccacacaagataatacatagcagtata ggaagattgactgaagtgaaatcattacagactggggcagtgatgaggagagtctttgct tctgtaaactttggtgaccataagttcatggataagattgtcattcggaaacaacatact atgaatggccataactgtgaagtagggaaatccctggctaagcaagagattgtcaaagtg gttctgaaaacttcagcagtcgttgtggaggttattttagagggaatggcaactttaatt atggaggaaattacaatggttgtgatagctttggtggcagccatgatggtggtggatatg atgccagtggggatagttataatggtgttggtaatgatagaagcaattttggaggtggtg gaagcgacagtggacaatggcaattacaataatcttcaaattctggacccataa >gi568815591r:117178158_117422989|GENSCAN_predicted_peptide_5|178_aa MAQEVGAECRDREVTAGRKSFQAVLTAHGAAPVELTPTLGHPGDMGGVVASPTAHDFTAV HAPGSQVAHTACCTQGAWKTSQGVLLKSPLCVGIIYRNVMLPCLFGMTREAARFKQGTGS RRGEGGAAFGQNDWRRRAGRLAGELHEDRKDGKRPGSGWRVGQPGMAPSLHRAEEAWG >gi568815591r:117178158_117422989|GENSCAN_predicted_CDS_5|537_bp atggctcaggaagtaggtgctgaatgtagggacagggaagtgacagcaggaagaaagagc ttccaggcagtcctgacagcgcacggcgcagcaccagtggaacttacacccacacttggt catccgggtgacatgggaggtgtcgtagcctctcccacagcacatgacttcacagctgtc catgccccgggaagtcaggttgcacacacggcctgctgtacccagggagcctggaagaca agccagggagtgttactgaaaagtccgctctgtgttggaattatctacagaaatgtcatg ttgccctgcctgttcgggatgaccagggaagcagcaaggttcaagcagggcacggggtca agaagaggagagggtggggcagcgtttggtcagaatgattggcggcggagagcaggaaga ctggccggggaacttcatgaggacaggaaggatggcaaaaggccggggtcagggtggagg gtgggtcagccagggatggctccgagcctgcatcgtgctgaagaagcatggggttag >gi568815591r:117178158_117422989|GENSCAN_predicted_peptide_6|162_aa MRAVSVDDIDECGGGEDMGTVSVEEGGRGRGEGEGGERRGRGRRERGRGGGRQGGGREEE EEKEEERKEEGEEEKEEKGRGGGREGESNGGMCQGQEMSKALQRVRSRKVFPVAGIQYSK GRVSGDEAGVMQKTEPDCLNRRIAQCSDLNTIRTQREEQTKL >gi568815591r:117178158_117422989|GENSCAN_predicted_CDS_6|489_bp atgagagcagtttcagtggatgacatagatgaatgtggtggaggagaggacatggggaca gtgagtgtagaagaaggtggtagaggaagaggagaaggggagggaggagaaagaagggga agaggaagaagggaaagaggaagaggaggaggaagacaaggaggaggaagagaagaggaa gaggagaaagaggaggagaggaaggaggagggggaggaggagaaagaagaaaaaggaaga ggaggaggaagagaaggagaaagcaatggtggcatgtgccaaggccaggaaatgagcaaa gctctgcaacgtgtgaggagcaggaaagtgttccctgtggctggtatacagtactcaaag gggagagtgtcaggagatgaggctggagtgatgcagaaaactgaaccagactgtctgaac agaaggattgctcagtgtagtgacctcaacaccatcagaactcagagagaggagcagaca aagctatga >gi568815591r:117178158_117422989|GENSCAN_predicted_peptide_7|908_aa MESSSGKEEEDWDESIEVPILSACTLATTKFFVYKSDLEDDSGKLGSESQTSFKTDHDVM FSPSRADKIWCPWNRPRRRLMTPIMYAARDGHTQVVALLVAHGAEVNTQDENGYTNGITS KDQQKILAALKELQVEEIQFGELSEETKLEISGDEFLNFLLKLNKQCGHLITAVQNVITE LPVNSQKNFTSVCEELVNNVEDLSEKVCKLKDLIQKANQHKTRALNETTITDPHRVHFTP LLHPPEQVLYSLLKTDHVTGLFADIPRSGEELGVRKNCIFPGCFGRYCWVRSRGSWTLQV PLPMGSVFKPSTKESKPTGHFAEAWPLPNLALHRRGSSKFTSASSSSQLQGEKRSNELRI ISGFLYPMHSNLLIPWPPSSFPHIARSGTCLPADGLGNPERTLREPTDALWPGASLKVPV FASTLHGGKSTRTGAGDFGDRNGDRIGHIDPSWLRSPAAAAVSDSGPNPGYCPLIAPWYM RATGGSSRVMCDNVPGLVSSQRQLCHRHPDVMRAISQGVAEWTAECQHQFRQHRWNCNTL DRDHSLFGRVLLRSSRESAFVYAISSAGVVFAITRACSQGEVKSCSCDPKKMGSAKDSKG IFDWGGCSDNIDYGIKFARAFVDAKERKGKDARALMNLHNNRAGRKIPKALKKGGKFTNL DHKKSGGRAGPKWAHRAAQQHPQNPDLHICQVCLQLAACRHCCPQVAERLLQKPASRDRD KAVKRFLKQECKCHGVSGSCTLRTCWLAMADFRKTGDYLWRKYNGAIQVVMNQDGTGFTV ANERFKKPTKNDLVYFENSPDYCIRDREAAYHELKKKALSPKSYMSGSLHVTHLIKELEE STGKTLLHWSHNVIKQAAFWCLAGHQDWALGKGSECHDCQELELYSQASLLANLVLATSC LCDLGQVT >gi568815591r:117178158_117422989|GENSCAN_predicted_CDS_7|2727_bp atggagagctctagtgggaaggaggaagaggactgggatgagtccatagaggttcccata ctgtctgcatgtacattggccaccaccaagttctttgtgtataaatcagatcttgaggac gactcaggaaaattgggctcagagtctcagacctcctttaaaacagaccatgatgttatg ttcagcccaagtagagctgataaaatctggtgtccttggaataggcccagaaggagactt atgaccccaatcatgtatgctgctcgagatggtcacacccaggttgttgctctccttgtt gctcatggagcagaagttaatacccaggatgagaatggttacactaatggaattaccagt aaagaccagcagaaaattctggctgctcttaaagaactacaggtagaagagatacaattt ggagagctatctgaagagacaaagttggaaatcagtggtgatgagttcctcaactttctt ctcaaattaaataaacagtgtggccatttaataacagctgtacagaatgttattactgag ttacctgtaaattctcaaaagaattttacttcagtttgtgaagaattggttaataatgtt gaagatttgagtgaaaaggtctgcaaactaaaagacctaattcaaaaggccaaccaacac aaaaccagggcactaaatgaaactacaatcacggaccctcacagagtccatttcactccc ctgctacatccaccggagcaggtgttgtattcactgctgaagacagatcacgtcacagga ctctttgcagacattcctcgctccggagaggagctgggtgtgcgtaaaaactgcatcttt cctgggtgctttggcagatactgctgggttaggtcccgagggtcatggaccctccaagtt cccctccctatgggctctgtatttaagcccagcaccaaggaatcaaagccaacaggtcat ttcgcagaggcctggcccctccctaaccttgccctgcatagacgcggcagctccaaattt acaagtgctagctcttcatcccagcttcagggagagaagcgaagcaatgagttgagaatc atctctggattcttgtatcccatgcatagtaatctccttatcccctggcccccttcctcg tttcctcacattgcacgctcagggacttgtttgccagcggatggcctcggcaatccggaa cgcacgctccgagagcccacggatgctctttggcctggagcttccctaaaggttcctgta ttcgcgtcaactcttcatggtggtaagtccacgcgcacgggcgcgggggattttggagat cggaatggggaccgcattggccacatagacccttcctggctgcgttcccccgcggccgca gcagtctcggattctggcccgaaccctggctactgccctcttatcgccccatggtacatg agagctacaggtggctcctccagggtgatgtgcgataatgtgccaggcctggtgagcagc cagcggcagctgtgtcaccgacatccagatgtgatgcgtgccattagccagggcgtggcc gagtggacagcagaatgccagcaccagttccgccagcaccgctggaattgcaacaccctg gacagggatcacagcctttttggcagggtcctactccgaagtagtcgggaatctgccttt gtttatgccatctcctcagctggagttgtatttgccatcaccagggcctgtagccaagga gaagtaaaatcctgttcctgtgatccaaagaagatgggaagcgccaaggacagcaaaggc atttttgattggggtggctgcagtgataacattgactatgggatcaaatttgcccgcgca tttgtggatgcaaaggaaaggaaaggaaaggatgccagagccctgatgaatcttcacaac aacagagctggcaggaagatcccaaaggcactaaaaaaaggaggaaagtttactaacttg gatcacaaaaagtctggcggtagggcaggtccaaagtgggctcatcgagcagctcaacag catccccagaacccagatcttcacatctgtcaagtgtgcctgcagcttgctgcctgtcgg cattgctgccctcaggttgcagagaggctgctgcagaaaccagcatctagagacagagac aaggctgtaaagcggttcttgaaacaagagtgcaagtgccacggggtgagcggctcatgt actctcaggacatgctggctggccatggccgacttcaggaaaacgggcgattatctctgg aggaagtacaatggggccatccaggtggtcatgaaccaggatggcacaggtttcactgtg gctaacgagaggtttaagaagccaacgaaaaatgacctcgtgtattttgagaattctcca gactactgtatcagggaccgagaggcagcatatcatgagctgaagaaaaaggccctctcc ccaaaaagttacatgtcaggatctttgcacgttactcatttgataaaagaattagaagag agtacgggaaagacgctgctgcactggtcacataatgttataaagcaggctgctttctgg tgcctggccgggcaccaggactgggcactcggcaaagggtcagaatgtcacgactgtcaa gagcttgagctctacagtcaggcaagcctgcttgcaaatcttgtccttgccaccagctgc ctgtgtgatcttgggcaagttacttaa >gi568815591r:117178158_117422989|GENSCAN_predicted_peptide_8|79_aa XISVDSNFQYGWTPLMYAASVANAELVRVLLDRGANASFEKDKQSILITACSAHGSEEQI LKCVELLLSRNADPNVACR >gi568815591r:117178158_117422989|GENSCAN_predicted_CDS_8|240_bp ngcattagtgtagattccaactttcagtatggatggactccccttatgtatgctgctagt gttgccaatgcagagctggttcgggtccttttggacagaggtgctaatgcaagctttgag aaggataagcaaagtattttgataactgcatgttctgctcatggctcagaggaacagatc ttgaagtgtgtagaactactactttcaagaaatgctgatccaaatgttgcttgtaggtaa