GENSCAN 1.0    Date run: 3-Nov-116    Time: 00:01:45

Sequence gi568815596r:19933208_20151520 : 218313 bp : 43.05% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.13 Intr -   2396   2264  133  1  1  109   50    46 0.268   3.55
 1.12 Intr -   3158   3012  147  1  0   72   68   150 0.733  10.75
 1.11 Intr -   4739   4536  204  1  0   31  115   148 0.963  10.12
 1.10 Intr -   5194   5058  137  1  2   92   -4    92 0.677  -0.23
 1.09 Intr -   8632   8552   81  1  0   92   86     2 0.358   0.23
 1.08 Intr -  12789  12579  211  2  1   54   83    98 0.468   4.82
 1.07 Intr -  20771  20627  145  0  1   62   71    79 0.275   2.94
 1.06 Intr -  32351  32195  157  2  1   70   42    52 0.405  -1.62
 1.05 Intr -  33702  33517  186  0  0  112   96    54 0.977   8.49
 1.04 Intr -  40501  40356  146  2  2   89   84   177 0.968  17.40
 1.03 Intr -  41426  41261  166  2  1   43   78    93 0.835   3.33
 1.02 Intr -  42456  42323  134  1  2   63  115     9 0.784   1.46
 1.01 Init -  45655  45544  112  1  1   64   50   107 0.694   4.87
 1.00 Prom -  46675  46636   40                               -5.76

 2.00 Prom +  47031  47070   40                               -4.56
 2.01 Sngl +  52002  52334  333  2  0   58   36   407 0.706  28.52
 2.02 PlyA +  53169  53174    6                                1.05

 3.06 PlyA -  54115  54110    6                                1.05
 3.05 Term -  56999  56548  452  0  2   69   42   339 0.302  22.75
 3.04 Intr -  67359  67234  126  1  0  110  110     0 0.961   5.05
 3.03 Intr -  70079  69954  126  0  0   94   94   122 0.990  14.05
 3.02 Intr -  73103  72537  567  0  0  113  103   812 0.987  77.95
 3.01 Init -  79424  79202  223  2  1  108   92   331 0.987  32.22
 3.00 Prom -  90216  90177   40                               -4.56

 4.00 Prom +  91814  91853   40                               -5.06
 4.01 Init +  92363  92471  109  1  1   76  100    55 0.406   5.88
 4.02 Term +  99184  99347  164  2  2   43   42   123 0.689   1.30
 4.03 PlyA +  99354  99359    6                                1.05

 5.08 PlyA -  99464  99459    6                                1.05
 5.07 Term - 100072  99998   75  1  0   94   49    71 0.963   1.64
 5.06 Intr - 101208 101110   99  0  0   90   71    93 0.993   8.11
 5.05 Intr - 101855 101760   96  2  0   75  102     9 0.692   1.21
 5.04 Intr - 104231 104109  123  2  0   69   94    48 0.674   4.28
 5.03 Intr - 104407 104331   77  2  2   72   84   -18 0.461  -4.57
 5.02 Intr - 107804 107684  121  2  1  104   84   102 0.957  11.47
 5.01 Init - 118313 118203  111  2  0   76   91   160 0.889  15.31
 5.00 Prom - 122150 122111   40                               -6.26

 6.00 Prom + 124557 124596   40                               -4.36
 6.01 Init + 125329 125407   79  0  1   52   83    99 0.761   7.02
 6.02 Term + 126125 126252  128  0  2   52   49    75 0.701  -1.56
 6.03 PlyA + 126314 126319    6                               -0.45

 7.03 PlyA - 126538 126533    6                                1.05
 7.02 Term - 128197 128097  101  2  2  115   54    46 0.668   2.19
 7.01 Init - 133152 133107   46  0  1   60   71    66 0.377   2.96
 7.00 Prom - 134350 134311   40                               -4.66

 8.00 Prom + 141114 141153   40                               -0.16
 8.01 Init + 149809 149953  145  0  1   60  106    60 0.866   5.34
 8.02 Intr + 159865 159915   51  2  0  122   64    39 0.163   3.78
 8.03 Intr + 169420 169572  153  2  0  -73   65   299 0.000  11.54
 8.04 Intr + 172971 173097  127  1  1   40  111    97 0.954   6.94
 8.05 Term + 173397 173856  460  0  1   31   42   145 0.908  -1.24
 8.06 PlyA + 173913 173918    6                                1.05

 9.00 Prom + 176123 176162   40                               -3.46
 9.01 Init + 180554 180688  135  1  0   73   93   119 0.742  11.36
 9.02 Intr + 180741 180849  109  2  1   64   70    52 0.984   0.86
 9.03 Intr + 184620 184940  321  1  0  -11   55   385 0.426  21.13
 9.04 Intr + 191551 191638   88  2  1   85   59    70 0.374   2.83
 9.05 Term + 198039 198162  124  0  1  107   44   141 0.987   9.46
 9.06 PlyA + 198893 198898    6                                1.05

10.00 Prom + 199172 199211   40                               -9.65
10.01 Init + 199923 200101  179  2  2  109  105    52 0.458   7.73
10.02 Intr + 208163 208262  100  2  1   31   86    67 0.108   0.91
10.03 Term + 208432 208602  171  0  0   88   53    74 0.647   1.73
10.04 PlyA + 209150 209155    6                                1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr + 169418 169572  155  2  2   38   65   299 0.901  22.32

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:19933208_20151520|GENSCAN_predicted_peptide_1|653_aa
MINNRNKSVVRSMSWNADGQKICIVYEDGAVIVGSVDGNRIWGKDLKGIQLSHVTWSADS
KVLLFGMANGEIHIYDNQGNFMIKMKLSCLVNVTGAISIAGIHWYHGTEGYVEPDCPCLA
VCFDNGRCQIMRHENDQNPVLIDTGMYVVGIQWNHMGSVLAVAGFQKAAMQDKDVNIVQF
YTPFGEWGYCSNTVVYAYTRPDRPEYCVVFWDTKNNEKYVKYVKGLISITTCGDFCILAT
KADENHPQNTGSQALSFEVGVWKSSQENSTCSREKPSKDTEIESPQGNSPAHHLTVKTPV
IPLFVAMTKTHVIAASKEAFYTWQYRVAKKLTALEINQITRSRKEGRESRLAIIDISGVL
TFFDLDARVTDSTGQQVVGELLKLERRDVWDMKWAKDNPDLFAMMEKTRMYVFRNLDPEE
PIQTSGYICNFEDLEIKSVLLDEILKDPEHPNKDYLINFEIRSLRDSRALIEKVGIKDAS
QFIEDNPHPRLWRLLAEAALQKLDLYTAEQAFVRCKDYQGIKFVKRLGKLLSESMKQAEV
VGYFGRFEEAERTYLEMDRRDLAIGLRLKLGDWFRVLQLLKTGSGDADDSLLEQANNAIG
DYFADRQKWLNAVQYYVQGRNQERLAECYYMLEDYEGLENLAISLPENHKLLP
>gi568815596r:19933208_20151520|GENSCAN_predicted_CDS_1|1959_bp
atgatcaacaatcgaaataaatcagttgttcgcagtatgagctggaatgctgacggacag
aagatctgcattgtatatgaagatggggctgtgatagttggttcagtggatggcaatcgt
atttggggaaaagacctgaagggtatacagctatcccatgtaacatggtctgcggacagt
aaagtcttactttttggaatggcaaatggggaaatacacatttacgataatcaaggaaat
tttatgataaaaatgaaactgagttgtttggtgaatgtcactggagctatcagcattgct
ggaattcattggtaccatggcacagaaggctacgtggagcctgattgcccttgccttgct
gtttgctttgataatggaagatgccaaataatgagacatgagaatgaccaaaatcccgtt
ttgattgacactggcatgtacgtagtaggcatccagtggaaccacatgggcagcgtgtta
gctgtggcaggcttccagaaggcagccatgcaggacaaagatgtgaacattgtgcagttt
tacactccgtttggtgagtggggttattgctcaaacactgtagtttatgcatataccaga
cctgatcgtccagaatattgtgttgtcttctgggatacgaaaaacaatgaaaaatatgtt
aaatatgtgaagggtctcatttctattactacctgtggagatttctgcattttggctaca
aaagctgatgaaaatcatcctcagaacactggcagccaagcattgagctttgaggtggga
gtttggaaatcttctcaggagaattcgacctgttcaagagaaaagccttctaaagatact
gagattgagagtccccagggaaatagccctgcccatcaccttacagtaaagacgccagtc
ataccattgtttgttgcaatgaccaaaacccatgtgatagcagcctcgaaagaagcattt
tatacctggcaatatcgtgtggcaaagaagctcacagcattggaaattaatcagatcaca
cggtctcgaaaagaagggagagaaagccgtcttgctatcatagacatctcaggagttctg
actttctttgacttggatgctcgagtaacggacagtacgggacagcaagtagttggagag
ttgttaaaattggaacgaagagatgtctgggatatgaagtgggccaaagataatcctgat
ttgtttgcaatgatggagaagacaagaatgtatgttttcagaaacttggatcctgaggaa
cccattcagacctctggatatatttgtaattttgaggatttagaaattaaatctgttctt
ttggatgagatattaaaggatccagaacatccaaacaaggattacctaattaactttgag
attcggtctctgcgagatagccgagcactgattgagaaggttggaattaaagatgcatct
cagttcatagaggacaatccacacccccgactttggcgcctactggctgaagcagctctt
cagaaactggatctatacactgcagagcaagcatttgtgcgctgcaaagattaccaaggc
attaagtttgtgaagcgcttgggcaaactactgagtgagtcaatgaaacaggctgaagtt
gttggctacttcggcaggtttgaagaggctgaaagaacgtatctcgagatggacagaagg
gatcttgctattggcctccggctgaaattgggggattggtttagagtactccagctcctg
aaaactggatctggtgatgcagatgacagtctcctggaacaagccaacaatgccattgga
gactactttgctgatcgacaaaagtggttgaatgctgtacaatattatgtacaaggacgg
aaccaggaacgcttagctgaatgttactatatgttagaggattatgaagggttagagaac
cttgccatttcacttccagaaaaccacaagttacttcca
>gi568815596r:19933208_20151520|GENSCAN_predicted_peptide_2|110_aa
MRVEALAVVAEKARNATSILVRAIILKPYHKQLQPRAIILKPYHQQLQPRTVILKPYHKQ
LQPRTIILKPYHKQLQPRTIILKPYHKQLQPRTIILKPYHKQLQPRLSAL
>gi568815596r:19933208_20151520|GENSCAN_predicted_CDS_2|333_bp
atgagggtggaggccttggcagtcgtagcagaaaaagcaagaaatgccacctcaatccta
gtcagggccatcatcctcaaaccctaccacaagcagctccaacccagggccatcatcctc
aaaccctaccaccagcaactccaacccaggaccgtcatcctcaaaccctaccacaagcag
ctccaacccaggaccatcatcctcaaaccctaccacaagcagctccaacccaggaccatc
atcctcaaaccctaccacaagcagctccaacccaggaccatcatcctcaaaccctaccac
aagcagctccaacccaggctgtcagccctttaa
>gi568815596r:19933208_20151520|GENSCAN_predicted_peptide_3|497_aa
MPRPAPARRLPGLLLLLWPLLLLPSAAPDPVARPGFRRLETRGPGGSPGRRPSPAAPDGA
PASGTSEPGRARGAGVCKSRPLDLVFIIDSSRSVRPLEFTKVKTFVSRIIDTLDIGPADT
RVAVVNYASTVKIEFQLQAYTDKQSLKQAVGRITPLSTGTMSGLAIQTAMDEAFTVEAGA
REPSSNIPKVAIIVTDGRPQDQVNEVAARAQASGIELYAVGVDRADMASLKMMASEPLEE
HVFYVETYGVIEKLSSRFQETFCALDPCVLGTHQCQHVCISDGEGKHHCECSQGYTLNAD
KKTCSAQDKCALGTHGCQHICVNDRTGSHHCECYEGYTLNADKKTCSGALRRCPVRGRHR
KCPAAPRMRPGNSGCQGDGSFPELLVLPIGDVEPLLVEGLSGRRDPLGDPTMFFYLSKKV
SFLGGVLVSPPFPQPAAIPAAFPRSQAPSPGPGRGFAIPQPSFPPGPASDLFLRFPEPQS
LAELAALSQHSPIPAPG
>gi568815596r:19933208_20151520|GENSCAN_predicted_CDS_3|1494_bp
atgccgcgcccggcccccgcgcgccgcctcccgggactcctcctgctgctctggccgctg
ctgctgctgccctccgccgcccccgaccccgtggcccgcccgggcttccggaggctggag
acccgaggtcccgggggcagccctggacgccgcccctctcctgcggctcccgacggcgcg
cccgcttccgggaccagcgagcctggccgcgcccgcggtgcaggtgtttgcaagagcaga
cccttggacctggtgtttatcattgatagttctcgtagcgtacggcccctggaattcacc
aaagtgaaaacttttgtctcccggataatcgacactctggacattgggccagccgacacg
cgggtggcagtggtgaactatgctagcactgtgaagatcgagttccaactccaggcctac
acagataagcagtccctgaagcaggccgtgggtcgaatcacacccttgtcaacaggcacc
atgtcaggcctagccatccagacagcaatggacgaagccttcacagtggaggcaggggct
cgagagccctcttctaacatccctaaggtggccatcattgttacagatgggaggccccag
gaccaggtgaatgaggtggcggctcgggcccaagcatctggtattgagctctatgctgtg
ggcgtggaccgggcagacatggcgtccctcaagatgatggccagtgagcccctagaggag
catgttttctacgtggagacctatggggtcattgagaaactttcctctagattccaggaa
accttctgtgcgctggacccctgtgtgcttggaacacaccagtgccagcacgtctgcatc
agtgatggggaaggcaagcaccactgtgagtgtagccaaggatacaccttgaatgccgac
aagaaaacgtgttcagctcaagataaatgtgctttgggtacccatgggtgtcagcacatt
tgtgtgaatgacagaacagggtcccatcattgtgaatgctatgagggctacactctgaat
gcagataaaaaaacatgttcaggcgcactacgccggtgcccggtgcgagggcgtcaccgg
aagtgtcccgcggcgccccggatgcgaccgggcaacagcggttgccagggcgacgggagc
tttccggagctgctggtactcccgattggagacgtagaaccgttacttgtcgagggcctt
agcggccgccgtgaccctctcggggatcccacgatgttcttctacctgagcaagaaagtg
agtttcctggggggcgtcctcgtttctccgccattcccgcagcctgcggcgattcccgct
gccttcccgagaagccaggctccttcacccggtcctggtcgtggcttcgccattccgcag
ccatccttcccgccgggtccagcctccgacctcttcctgcggttcccagagcctcagtct
ttggccgagcttgctgccctctcgcagcactctcccattcctgcccctggctag
>gi568815596r:19933208_20151520|GENSCAN_predicted_peptide_4|90_aa
MTHSFVQCIHAACMYYPPIGHFVIILVIRSTDHKKKENNDKNKKSVHVQYRSNYPFFPPN
IFNPQSINSMDIEPMDTEPMDTKGQLYSQS
>gi568815596r:19933208_20151520|GENSCAN_predicted_CDS_4|273_bp
atgactcactcctttgtccagtgcatccacgctgcctgcatgtattacccacctattggg
cacttcgtaatcatcttggttatcagatcaacagatcacaagaagaaagagaacaacgac
aagaataaaaagtctgtacatgttcagtacagatccaactatccatttttccccccaaat
attttcaatccacagtcgattaactccatggatatagaacccatggatacagaacccatg
gatacgaagggccaactgtattctcaaagttaa
>gi568815596r:19933208_20151520|GENSCAN_predicted_peptide_5|233_aa
MVSMSFKRNRSDRFYSTRCCGCCHVRTGTIILGTWYMVVNLLMAILLTVEVTHPNSMPAV
NIQYEVIGNYYSSERMADNACVLFAVSVLMFIISSMLVYGAISYQVGWLIPFFCYRLFDF
VLSCLVAISSLTYLPRIKEYLDQLPDFPYKDDLLALDSSCLLFIVLVFFALFIIFKAYLI
NCVWNCYKYINNRNVPEIAVYPAFEAPPQYVLPTYEMAVKMPEKEPPPPYLPA
>gi568815596r:19933208_20151520|GENSCAN_predicted_CDS_5|702_bp
atggtgtccatgagtttcaagcggaaccgcagtgaccggttctacagcacccggtgctgc
ggctgttgccatgtccgcaccgggacgatcatcctggggacctggtacatggtagtaaac
ctattgatggcaattttgctgactgtggaagtgactcatccaaactccatgccagctgtc
aacattcagtatgaagtcatcggtaattactattcgtctgagagaatggctgataatgcc
tgtgttctttttgccgtctctgttcttatgtttataatcagttcaatgctggtttatgga
gcaatttcttatcaagtgggttggctgattccattcttctgttaccgactttttgacttc
gtcctcagttgcctggttgctattagttctctcacctatttgccaagaatcaaagaatat
ctggatcaactacctgattttccctacaaagatgacctcctggccttggactccagctgc
ctcctgttcattgttcttgtgttctttgccttattcatcatttttaaggcttatctaatt
aactgtgtttggaactgctataaatacatcaacaaccgaaacgtgccggagattgctgtg
taccctgcctttgaagcacctcctcagtacgttttgccaacctatgaaatggccgtgaaa
atgcctgaaaaagaaccaccacctccttacttacctgcctga
>gi568815596r:19933208_20151520|GENSCAN_predicted_peptide_6|68_aa
MRDLEEIVSTTMNLTVIIQDFYDEIREYEEVLDQKVSLETTNLSLLTALVDNGYKYSECD
SRILRAAT
>gi568815596r:19933208_20151520|GENSCAN_predicted_CDS_6|207_bp
atgagagatttggaagaaattgtgtccaccaccatgaacttgacagtgataattcaagac
ttttatgatgagattcgagagtatgaagaggttttagaccaaaaagtaagtttagaaacc
actaacttgagccttctgactgcccttgtggacaatggttacaaatattcagaatgtgac
tctaggatactaagagctgccacctga
>gi568815596r:19933208_20151520|GENSCAN_predicted_peptide_7|48_aa
MSNRKRLYEALTVSTGTEMELEAIILSKLTQEQKAKHSMFSLKVGAKQ
>gi568815596r:19933208_20151520|GENSCAN_predicted_CDS_7|147_bp
atgagcaacagaaagcggctttatgaagctcttactgtcagcacagggacagagatggaa
ctggaagccattatcctcagcaaattaacacaggaacagaaagccaaacacagcatgttc
tcacttaaagtgggagctaagcaatga
>gi568815596r:19933208_20151520|GENSCAN_predicted_peptide_8|311_aa
MGAVTRIGGPPLGDQSPVFLLFFHKKDPPMTSGPQTNQPKEHLTNFKLGYTATCKVLPSM
DHIEAARPCLKKRKKKKKEKEKEKEGEGEGEGEGEGEGEGEEEEEEEEEEEDDDNGGVKL
QTFTVSVRAHKGSKDPKSKQQQQELSQRAKHPQHGKTPTSPAGFISHWDSLLDFAAPNPG
TPAAHRELVPDNRGKEGKREREGDLLLWPTIPRRGNGGPRTGFSLPSSPAGAHLEPAQAR
KHRVQPRLPPAPLPSSRPRLSLHTSPPAEGAVSGLGQPQRGAPTAQRRAEGLLQCGQSGR
RSRGSAKSEGC
>gi568815596r:19933208_20151520|GENSCAN_predicted_CDS_8|936_bp
atgggtgccgtgactcggatcgggggacctcccttgggagatcaatcccctgtcttcctg
ctctttttccataagaaagatccacctatgacctctggtcctcagaccaaccagcccaaa
gaacatctcaccaattttaaattgggttacacagcgacatgcaaagtcctccccagtatg
gaccacattgaagcagcaagaccttgtctaaaaaaacgaaaaaagaagaagaaggagaag
gagaaggagaaagaaggagaaggagaaggagaaggagaaggagaaggagaaggagaagga
gaagaagaagaagaagaagaagaagaagaagaagacgacgacaacggtggagtgaagcta
cagaccttcacagtgagtgttagagctcataaaggcagcaaggacccaaaaagtaagcag
cagcagcaagagctctcccaaagagcaaagcacccacagcatggaaaaacaccaacaagc
ccagccggcttcatctctcactgggactcgctgctggactttgcagcaccaaaccccggc
actccggcagcccacagggagctagtcccagacaatcgaggaaaagaagggaagcgagaa
agagagggagacctgctattgtggccaacgatcccgcgaagaggcaacggcggtccacga
acgggattcagcctcccatcaagcccagccggcgctcacctggaacccgcgcaggcccgc
aagcaccgtgttcagccccggctcccgcccgcgcctctcccttcctcccgcccgcgcctc
tccctccacacctccccgccagcagagggagccgtctccggcctcggccagccccaaaga
ggggcccccacagcgcagcggcgggctgaagggctcctccagtgcggccagagcggacgc
cgaagccgaggaagcgccaagagtgagggctgctag
>gi568815596r:19933208_20151520|GENSCAN_predicted_peptide_9|258_aa
MASVWQGCARGTDSPRAQDSGNQSGDENYQSGNARKMQTVLASPQSFPEQEIKLERIPGL
APGLIEPRTVYHFPLHQAEKKSTEEEPGGGGQEQLEEQEEPGGGGQEQLEEQEEPRGGGQ
EQLEEQEDPGGGGQEQLEEQEEPRGGGQEQLEEQEEPGGGGQEQLEEQGEPGGGGQEQLE
EQEEPGGGGAVFEPSIQEGQMALELLDEVSSGECQGGRPHTELRGLCEEDRNEGDEGVEF
EASKPHGGKLAFEKRGLA
>gi568815596r:19933208_20151520|GENSCAN_predicted_CDS_9|777_bp
atggccagtgtctggcagggttgcgccaggggcacagacagccccagggctcaggactct
gggaaccagagtggggacgagaactaccaatcagggaatgcccgcaagatgcagacagtt
ttggcatctcctcagagttttccagagcaggaaatcaagctggagagaatacctggcttg
gccccaggcctcattgaacccagaactgtctaccactttccattgcaccaagcagagaag
aagagcactgaggaggagcccggaggtggaggtcaggagcagctggaagagcaggaggag
cccggaggtggaggtcaggagcagctggaagagcaggaggagcccagaggtggaggtcag
gagcagctggaagagcaggaggatcccggaggtggaggtcaggagcagctggaagagcag
gaggagcccagaggtggaggtcaggagcagctggaggagcaggaggagcccggaggtgga
ggtcaggagcagctggaggagcagggggagcccggaggtggaggtcaggagcagctggag
gagcaggaggagcccggaggtggaggggctgtctttgagccatctattcaggaaggacaa
atggccttggagctcctggatgaagtctcctctggagagtgtcagggcgggaggcctcac
acggagctgcggggcctctgtgaggaagacagaaatgaaggcgatgagggtgttgagttt
gaggcatccaagccacacggagggaagctggcatttgagaagagagggctggcgtga
>gi568815596r:19933208_20151520|GENSCAN_predicted_peptide_10|149_aa
MERRQGAVRRRVVQEDSTAQDREGSWSPRGWHNASKRYIEGCRSVTATGQSFQPEDRRLR
MATPNKGLWLSRAQNLNTSRPEPRDDALAPAPWAQLTTADITFSSTPFPANPDEPSLSSD
PLSSSESRMLEPKAISGNIWSSLTYSWGN
>gi568815596r:19933208_20151520|GENSCAN_predicted_CDS_10|450_bp
atggagaggaggcaaggagcagtccgtaggagggtagtgcaggaggacagcactgcacaa
gacagggagggcagctggtccccgagggggtggcacaatgccagcaagaggtacatcgag
ggctgtcgcagcgtcactgccaccggacagtctttccaacctgaggacaggaggctccgg
atggccactcccaacaagggcctatggctctccagagcccagaacctgaacacatccagg
ccagagccacgggatgatgcactggcccctgccccatgggcccagctcaccacggctgac
atcaccttcagctccacccctttcccagcaaaccctgatgagccctctctttcctctgac
ccattgtcctcctcggaatcaagaatgctagagccaaaagctatctcagggaacatctgg
agcagcctcacttacagctggggaaactga