GENSCAN 1.0 Date run: 7-Nov-116 Time: 22:19:24 Sequence gi568815585r:41160493_41360907 : 200415 bp : 39.77% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init - 10549 10278 272 1 2 49 91 282 0.232 21.09 1.00 Prom - 16577 16538 40 -5.95 2.02 PlyA - 16703 16698 6 1.05 2.01 Sngl - 33765 31711 2055 0 0 96 36 2151 0.999 203.67 2.00 Prom - 38615 38576 40 -5.35 3.00 Prom + 38749 38788 40 -5.75 3.01 Init + 53056 53132 77 0 2 1 101 111 0.495 4.31 3.02 Term + 53242 53650 409 1 1 46 45 216 0.738 6.80 3.03 PlyA + 54639 54644 6 1.05 4.12 PlyA - 55116 55111 6 1.05 4.11 Term - 56736 56623 114 0 0 84 36 125 0.963 4.49 4.10 Intr - 62862 62764 99 0 0 76 85 88 0.989 6.69 4.09 Intr - 66076 65940 137 2 2 107 110 70 0.995 10.47 4.08 Intr - 73515 73398 118 0 1 58 115 155 0.999 14.32 4.07 Intr - 79941 79769 173 1 2 110 76 218 0.744 21.54 4.06 Intr - 81330 81204 127 0 1 36 -9 131 0.194 -2.47 4.05 Intr - 87070 86805 266 2 2 33 38 302 0.228 16.01 4.04 Intr - 92260 92153 108 2 0 125 66 112 0.788 12.04 4.03 Intr - 92538 92457 82 0 1 57 78 68 0.945 0.99 4.02 Intr - 94128 94037 92 1 2 93 92 100 0.999 9.69 4.01 Init - 100214 100001 214 2 1 77 56 173 0.973 11.95 4.00 Prom - 110567 110528 40 -5.55 5.04 PlyA - 110695 110690 6 1.05 5.03 Term - 127601 127342 260 0 2 67 44 177 0.485 5.93 5.02 Intr - 127867 127786 82 1 1 69 37 122 0.627 3.49 5.01 Init - 129316 129137 180 1 0 77 16 221 0.662 13.04 5.00 Prom - 132702 132663 40 -6.85 6.00 Prom + 135360 135399 40 -2.85 6.01 Init + 138321 138365 45 2 0 87 84 81 0.510 8.33 6.02 Term + 146144 146248 105 1 0 85 37 79 0.003 -0.07 6.03 PlyA + 149516 149521 6 1.05 7.03 PlyA - 149691 149686 6 1.05 7.02 Term - 151408 151016 393 1 0 -13 43 454 0.962 24.85 7.01 Init - 151826 151683 144 2 0 64 84 98 0.603 7.17 7.00 Prom - 154164 154125 40 -5.95 8.00 Prom + 155015 155054 40 -7.65 8.01 Init + 159212 159260 49 1 1 88 58 28 0.601 -1.01 8.02 Intr + 160175 160332 158 0 2 72 91 84 0.935 5.91 8.03 Intr + 162564 162698 135 2 0 28 71 90 0.541 1.14 8.04 Intr + 165206 165359 154 1 1 82 115 71 0.987 7.92 8.05 Intr + 168232 168351 120 2 0 63 30 103 0.212 1.55 8.06 Intr + 176158 176264 107 2 2 71 93 -1 0.033 -2.29 8.07 Intr + 194652 194724 73 2 1 83 115 21 0.643 2.36 8.08 Intr + 197812 197981 170 2 2 68 93 89 0.829 6.14 8.09 Intr + 198318 198470 153 2 0 55 53 129 0.469 5.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 143634 143519 116 1 2 105 49 77 0.982 3.25 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585r:41160493_41360907|GENSCAN_predicted_peptide_1|91_aa MRSLGQNPTEAELQDMINEVDADGNGTVDFPEFLTMMARKMKDTDSEEEIRDAFCVFDKD GNGYISATELHHVMTNLGENLTDDEVDEMIS >gi568815585r:41160493_41360907|GENSCAN_predicted_CDS_1|273_bp atgaggtctcttgggcagaatcccacagaagcagagttacaggacatgattaatgaagta gatgctgatggtaatggcacagttgacttccctgaatttctaacaatgatggcaagaaaa atgaaagacacagacagtgaagaagaaattagagatgcattctgtgtgtttgataaggat ggcaatggctatattagtgcaacagaacttcaccatgtgatgacaaaccttggagagaat ttaacagatgacgaggttgatgaaatgatcagn >gi568815585r:41160493_41360907|GENSCAN_predicted_peptide_2|684_aa MQSREDVPRSRRLASPRGGRRPKRISKPSVSAFFTGPEELKDTAHSAALLAQLKSFYDAR LLCDVTIEVVTPGSGPGTGRLFSCNRNVLAAACPYFKSMFTGGMYESQQASVTMHDVDAE SFEVLVDYCYTGRVSLSEANVQRLYAASDMLQLEYVREACASFLARRLDLTNCTAILKFA DAFDHHKLRSQAQSYIAHNFKQLSRMGSIREETLADLTLAQLLAVLRLDSLDIESERTVC HVAVQWLEAAAKERGPSAAEVFKCVRWMHFTEEDQDYLEGLLTKPIVKKYCLDVIEGALQ MRYGDLLYKSLVPVPNSSSSSSSSNSLVSAAENPPQRLGMCAKEMVIFFGHPRDPFLCYD PYSGDIYTMPSPLTSFAHTKTVTSSAVCVSPDHDIYLAAQPRKDLWVYKPAQNSWQQLAD RLLCREGMDVAYLNGYIYILGGRDPITGVKLKEVECYSVQRNQWALVAPVPHSFYSFELI VVQNYLYAVNSKRMLCYDPSHNMWLNCASLKRSDFQEACVFNDEIYCICDIPVMKVYNPA RGEWRRISNIPLDSETHNYQIVNHDQKLLLITSTTPQWKKNRVTVYEYDTREDQWINIGT MLGLLQFDSGFICLCARVYPSCLEPGQSFITEEDDARSESSTEWDLDGFSELDSESGSSS SFSDDEVWVQVAPQRNAQDQQGSL >gi568815585r:41160493_41360907|GENSCAN_predicted_CDS_2|2055_bp atgcagtcccgggaagacgtcccgcgctctcgccgcctcgccagtccccgtggtgggagg cggcccaagaggatttccaagccctcggtttcggcctttttcacgggtccagaggagtta aaggacacggcccattctgcagccctgctggcacagctcaagtccttctacgacgcgcgg ctgctgtgtgatgtgaccatcgaggtggtgacgcctggcagcgggcctggcacgggtcgc ctcttttcctgcaatcgcaacgtgctagcagctgcgtgtccctacttcaagagcatgttc acaggtggcatgtacgagagccagcaggccagcgtgaccatgcacgatgtggacgccgag tccttcgaggtgttggtcgactactgctacacgggtcgtgtgtctctcagtgaggccaat gtgcagcgcctgtacgcggcctccgacatgctacagctggaatatgtgcgggaagcctgt gcctccttcttagcccgacgtcttgacctgaccaactgcaccgccatcctcaagtttgca gacgccttcgaccatcacaagcttcgatctcaggcccagtcctacatagctcacaacttc aagcagctcagccgaatgggttcaattcgggaggagactctagcagatctaaccctggcc cagctgctggctgtcctacgcctggatagtctggacatagagagtgagcggactgtatgc catgtagctgtgcagtggctggaggctgctgccaaagagcggggtcccagtgctgcagaa gtcttcaagtgcgtgcgctggatgcacttcactgaagaagatcaggactacttagaaggg ctgctgaccaagcccatcgtgaagaagtactgcctggacgttattgaaggggccctgcag atgcgctatggtgacctgttgtacaagtctctggtgccagtgccaaacagcagcagcagc agtagcagcagcaactctcttgtatctgcagcagaaaatccaccccagagactgggtatg tgtgccaaggagatggtgatcttctttggacatcctagagatccctttctctgctatgac ccttactcgggggacatttacacaatgccatcccctttgaccagctttgctcacactaag actgtcacctcctcagctgtctgtgtgtccccagaccatgacatctatctagctgctcag cccaggaaagacctctgggtgtataaaccagctcagaatagttggcagcaacttgcagat cgcttgctgtgtcgtgagggcatggatgtggcatatctcaatggctacatctacattttg gggggacgagaccctattactggagttaagttgaaggaagtggaatgctacagtgttcag agaaaccagtgggcattggtggctcctgtccctcattccttctattcctttgaactcata gtggttcagaactatctttatgctgtcaacagtaagcgcatgctttgctatgatcctagc cacaatatgtggctgaactgtgcttctcttaaacgtagtgactttcaggaagcatgtgtc ttcaatgatgaaatctattgtatctgtgacatcccagtcatgaaggtctacaacccagct aggggagaatggaggcggattagtaatattcctttggattcagagacccacaactaccag attgtcaatcatgaccaaaagttgcttctcatcacttctacaaccccacaatggaaaaag aaccgagtgacagtgtatgagtatgatactagggaagatcagtggattaatataggtacc atgttaggccttttgcagtttgactctggctttatttgcctttgtgctcgtgtttatcct tcctgccttgaacctggtcagagttttattactgaggaagatgatgcacggagtgagtct agtactgaatgggacttagatggattcagtgagctggactctgagtcaggaagttcaagt tctttttcagatgatgaagtctgggtgcaagtagcacctcagcgaaatgcacaggatcag cagggttctttgtaa >gi568815585r:41160493_41360907|GENSCAN_predicted_peptide_3|161_aa MFAQAVRLYNKQVVKPAKLVSVSSGWLAPKPQDKILHTLPSPFHKQRGLSLWPLPPLTHG EFCQTTTSVHLRPKGSSVSCDKCCQAWNSPLRVVKSPLAQCRSRNAIQEPGPGDPKSPLV LYTMAKLIPEAGMSESHSRSTAYCLCIAVGYSGPMGSLVSR >gi568815585r:41160493_41360907|GENSCAN_predicted_CDS_3|486_bp atgtttgcacaggccgtaaggctctacaataagcaggtggtgaagccagccaagcttgtg tccgtctcttcaggatggctggcacccaaaccacaagacaaaatccttcacactcttccc tcccctttccacaagcagaggggcctctccctatggcctctgccaccactgacccatggc gaattctgtcagaccaccactagtgttcacttaaggcccaagggctcttcagtcagctgt gacaaatgttgccaggcctggaactcacccctcagggtagtgaaatcccctctggcacag tgcaggtccagaaatgctatccaagagcctggacctggggaccccaagagcccgttggtg ctctacaccatggccaagctgatacctgaggctggcatgtccgagtctcactcaaggtcc acagcatactgcctgtgcatcgctgttggttactcagggcccatgggctctttagtcagc aggtga >gi568815585r:41160493_41360907|GENSCAN_predicted_peptide_4|509_aa MLWKHKALQKYMENLSKEYQTLEQCLQHIPVNEENRRSLNRRHAELAPLAAIYQEIQETE QAIEELESMCKSLNKQDEKQLQELALEERQTIDQKINMLYNELFQSLVPKEKYDKNDVIL EVTAGRTTGGDICQQFTREIFDMYQNYSCYKHWQFELLNYTPADYALLFRLYRRQVDPGE MMDYRKLNQAVNPIAAAIPDVVSSVEQINIIPGTWYTAVDVANAFFSIPVNESQQKQFAF IQQGHNTPSLSSVRLPNCCRYRGQQNRVTCTKNMQSAKTDCWKFYRSAAQVLHQIICGLH HAAARISGDGVYKHLKYEGGIHRVQRIPEVGLSSRMQRIHTGTMSVIVLPQPDEVDVKLD PKDLRIDTFRAKGAGGQHVNKTDSAVRLVHIPTGLVVECQQERSQIKNKEIAFRVLRARL YQQIIEKDKRQQQSARKLQVGTRAQSERIRTYNFTQDRVSDHRIAYEVRDIKEFLCGGKG LDQLIQRLLQSADEEAIAELLDEHLKSAK >gi568815585r:41160493_41360907|GENSCAN_predicted_CDS_4|1530_bp atgctctggaagcataaagcactacagaaatatatggagaacctgagtaaggagtaccaa acacttgagcaatgtctgcagcatatccctgtgaatgaggaaaaccgaaggtccttgaac agaaggcatgctgagttggcacctcttgcagccatttaccaagaaattcaggagactgaa caagcaattgaagaattagaatcaatgtgtaaaagcctaaataaacaagatgaaaagcag ttacaagaacttgcactggaagaaaggcaaaccattgatcaaaaaatcaacatgttgtac aatgagcttttccagagccttgtgccaaaggagaaatatgacaaaaatgatgttatttta gaggtgacagctggaaggactactggaggtgacatctgccaacaatttacccgagaaata tttgacatgtaccagaattattcgtgctataaacactggcaatttgaacttctgaattat acaccagcagattatgctctcctatttcgcctgtacagaagacaggtggatcctggagaa atgatggattatcgaaagcttaaccaggcagtgaatccaattgcagctgctataccagat gtggtttcatcagttgagcaaattaacataatccctggtacctggtatacagctgttgat gtggcaaatgcctttttctccatccctgtcaatgaaagccagcagaagcagtttgctttc atccagcaaggccacaatacaccttcattgtcctccgttaggctgcctaattgctgccga tacagagggcagcagaacagggtgacttgcaccaagaatatgcagtctgcaaaaacagac tgttggaaattctacaggtcagcagcccaagttcttcaccagatcatttgtggactacat catgcagccgcccgaatttccggtgacggtgtctataagcatttgaagtatgagggtggg attcaccgagttcagcgcatccccgaggtgggcctgtcctcaaggatgcagcgcattcac acaggaacgatgtcggttattgtccttcctcagccagatgaggtggatgtgaaattggac cccaaggatttgcgaatagatacatttcgagccaaaggagcaggagggcagcatgttaat aaaactgatagtgccgtcagacttgtccacatccccacagggctagtagtagaatgccaa caagaaagatcacagataaaaaataaagaaatagcctttcgtgtgttgagagctagactc taccagcagattattgagaaagacaagcgtcagcaacaaagtgctagaaaactgcaggtg ggaacaagagcccagtcagagcgaattcggacatataatttcacccaggatagagtcagt gaccacaggatagcatatgaagttcgtgatattaaggaatttttatgtggtgggaagggc ctggatcagctaattcagagactgcttcaatcagcagatgaagaagccattgctgaactt ttggatgaacaccttaaatcagcaaaataa >gi568815585r:41160493_41360907|GENSCAN_predicted_peptide_5|173_aa MEKNWALFVDQCRLQALRFSVDLFDLLSILVRCDSFAGIQKAVVDQIGSRPSNSDHDLCL CLTTTARAQRVTQVQAIKCVVVGDGATNIFLICFSLVSLASFENVCAEWHPEMQHHFPNT PIILVGSKLDLNVVKDTIEKLKEKKLTPITYLQFLARNKEIGALKYLKCLALT >gi568815585r:41160493_41360907|GENSCAN_predicted_CDS_5|522_bp atggagaagaattgggccctatttgttgaccaatgccggctgcaggctttgcggttttca gtggatctctttgatttgctgagcatacttgtcagatgtgatagttttgcagggattcag aaagctgtcgtggatcagatcggcagcagaccatcaaacagtgaccatgacctttgtctg tgcctaaccaccactgccagggcccagcgagtgacccaggtgcaagccatcaagtgtgtg gtggttggagatggagctacaaacatatttttaatatgcttctcccttgtgagtcttgca tcatttgaaaatgtctgtgcagagtggcatcctgaaatgcagcaccattttcccaacact cccatcattcttgtgggaagtaaacttgatcttaatgttgttaaagacacgattgagaaa ctgaaggagaagaagctgactcccatcacctatctgcagtttttagccaggaataaggag attggcgctttaaaatacctgaagtgcttggctctcacatag >gi568815585r:41160493_41360907|GENSCAN_predicted_peptide_6|49_aa MEPEQRKKEEEEEREAEREESVAHITYVKRALGIAKVFTDVVTVILLYA >gi568815585r:41160493_41360907|GENSCAN_predicted_CDS_6|150_bp atggagccagaacagcggaagaaggaggaagaggaggaaagggaggctgaacgagaagaa tcagtggcccatattacatatgtaaaaagggctttgggaattgcaaaggtctttacagat gtggttactgttattttactttatgcatga >gi568815585r:41160493_41360907|GENSCAN_predicted_peptide_7|178_aa MLPSNAKPASPRGFNYSSKACATKKQTAVLKLVTTQKRTSSQIFTSKERARQPLPAEAAV PGHCGERREVSGKSAAGVRRAAEDRGKEAHGGAARQPPWAGRSGKVGSRDLRGKQRAVGS GGQAWPAPAAARPLALKARGPGDRGAAARPTATHQDAFEEVALLGRQQHVRHRGRQGG >gi568815585r:41160493_41360907|GENSCAN_predicted_CDS_7|537_bp atgctcccctcaaatgcaaaaccagccagtccacgcggctttaattactcaagcaaggcc tgtgcgactaaaaagcaaacagctgttttgaagttggtgacaactcagaaacgcacttca agtcaaatattcacttccaaagagagagcacggcagccactaccggccgaggcggctgtg cccgggcactgcggagagcggcgtgaagtttctggaaagagcgcggcgggggtgaggcga gccgcagaagaccgcggaaaggaagcgcacggaggagcggctcggcagcccccgtgggcc ggccggagcgggaaagtggggtcccgcgacctccgaggtaaacaaagagcggtgggcagc ggaggccaagcctggcccgcgccggccgccgccagaccgcttgcccttaaggcccgagga cccggggaccggggggcggcagcgcggcctacggccactcaccaagatgcgtttgaagag gttgctctccttgggcggcagcagcacgttcggcatcgtggccggcagggaggctag >gi568815585r:41160493_41360907|GENSCAN_predicted_peptide_8|373_aa MEFRHVGQADLELLTSGWHVYGLLQRSDKKYDEAIKCYRNALKLDKDNLQILRDLSLLQI QMRDLEGYRETRYQLLQLRPTQRASWIGYAIAYHLLKDYDMALKLLEEFRQTQQVPPNKI DYEYSELILYQNQVMREADLLQESLEHIEMYEKQICDKLLVEEIKGEILLKLGRLKEASE VFKNLIDRNAENWCYYEGLEKALQISERFRELMDKFLRVNFSKGCPPLFTTLKSLYYNTE KVSIIQELVTNYEASLKTCDFFSPYENGEKEPPTTLLWVQYFLAQHFDKLGQYSLALDYI NAAIASTPTLIELFYMKAKIYKHIGNLKEAAKWMDEAQSLDTADRFINSKCAKYMLRANM IKEAEEMCSKFTR >gi568815585r:41160493_41360907|GENSCAN_predicted_CDS_8|1119_bp atggagtttcgccatgttggtcaggctgatctcgaactcctgacctcaggttggcatgta tatggactcttgcagcgttctgataaaaaatatgatgaagctataaaatgttaccgaaat gccctcaaattagataaagataacctgcaaattttgagggatctctcactgttgcagatc caaatgagagaccttgaaggttaccgagagacaagataccagcttcttcagttgcgcccc acacagcgtgcctcctggattggatatgctattgcataccatttgctgaaagattatgat atggccctaaaactgttggaagaatttagacaaactcagcaagttcctccaaacaaaata gattatgaatatagtgaattgatattataccagaatcaagtgatgagagaggcagatctg ttgcaggaatctttggaacatatagaaatgtatgagaaacaaatatgtgataaacttttg gtggaagaaattaaaggggaaatactgttgaaattgggaagattaaaagaagccagtgaa gtgttcaaaaacttgattgatcgaaatgcagaaaattggtgttattatgaaggcttggaa aaagctctacaaattagtgaaagatttagagaactaatggataagttcctgagggttaac ttcagtaaaggctgcccacccttgtttactactttgaaatctttatattacaatacagaa aaggtttctataatccaggaacttgttactaattatgaagcctctcttaaaacgtgtgac ttttttagcccatatgagaatggggagaaggaacccccgacaacactactctgggttcag tatttcctggcacagcactttgataaacttggacagtattctttggctttggattatatt aatgctgcaattgctagtactccaactctaatagaattattctatatgaaagcaaaaatt tacaagcatataggtaatctcaaagaagctgcaaagtggatggatgaagcacagtctttg gacacagctgatagattcatcaattccaaatgtgcaaaatacatgcttcgagcaaatatg ataaaagaagcagaggaaatgtgctccaagttcacaagg