GENSCAN 1.0    Date run: 7-Nov-116    Time: 03:12:02

Sequence gi568815586r:95418157_95634256 : 216100 bp : 42.13% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +  13675  13872  198  0  0  -65   49   532 0.985  30.42
 1.02 PlyA +  15270  15275    6                               1.05

 2.00 Prom +  16435  16474   40                             -10.75
 2.01 Sngl +  16807  17079  273  0  0   84   42   275 0.966  17.58
 2.02 PlyA +  17320  17325    6                               1.05

 3.00 Prom +  20479  20518   40                              -6.05
 3.01 Init +  26833  26985  153  0  0   60   20   206 0.612  11.03
 3.02 Intr +  29418  29497   80  2  2   61   57    36 0.169  -4.77
 3.03 Intr +  30205  30278   74  1  2   81   73    78 0.102   3.73
 3.04 Term +  31037  31125   89  0  2  100   36    70 0.165  -0.16
 3.05 PlyA +  32055  32060    6                              -0.45

 4.04 PlyA -  32897  32892    6                               1.05
 4.03 Term -  33346  33120  227  2  2   66   42   255 0.859  14.66
 4.02 Intr -  36996  36869  128  2  2   34   58   120 0.636   3.00
 4.01 Init -  39734  39640   95  2  2   61   89    66 0.382   3.90
 4.00 Prom -  47468  47429   40                              -2.65

 5.04 PlyA -  49134  49129    6                               1.05
 5.03 Term -  49606  49421  186  1  0  -18   38   297 0.685  10.61
 5.02 Intr -  50737  50638  100  0  1   61   58    80 0.704   1.49
 5.01 Init -  53498  53440   59  2  2   75  116    52 0.899   7.53
 5.00 Prom -  54972  54933   40                              -7.25

 6.00 Prom +  55620  55659   40                              -5.65
 6.01 Init +  56024  56174  151  1  1  120  103   236 0.996  28.55
 6.02 Intr +  57915  58022  108  1  0   28   94   201 0.672  14.04
 6.03 Intr +  65059  65124   66  2  0   66   29   145 0.629   4.36
 6.04 Intr +  67723  67825  103  2  1   67   82    53 0.747   0.91
 6.05 Intr +  75900  76061  162  0  0   30   58   172 0.986   6.57
 6.06 Intr +  76801  76982  182  1  2   79   44   123 0.991   5.59
 6.07 Intr +  77848  77942   95  2  2   75   95    46 0.974   2.76
 6.08 Intr +  85909  86005   97  0  1   75   95   134 0.999  11.46
 6.09 Intr +  93739  93842  104  2  2   92   87    56 0.993   4.87
 6.10 Intr +  94645  94760  116  0  2  117  100   105 0.999  12.93
 6.11 Term +  95496  95748  253  0  1   81   37   180 0.880   6.33
 6.12 PlyA +  96165  96170    6                               1.05

 7.07 PlyA -  96983  96978    6                               1.05
 7.06 Term - 100197  99998  200  1  2   93   42   101 0.941   2.58
 7.05 Intr - 103046 102841  206  1  2   87   65   140 0.979   9.52
 7.04 Intr - 105359 105150  210  1  0   45   72   140 0.131   5.21
 7.03 Intr - 106632 106524  109  1  1   86   62    11 0.159  -3.28
 7.02 Intr - 110846 110651  196  2  1   73   74   118 0.455   6.97
 7.01 Init - 116091 114673 1419  0  0   68   80   518 0.697  39.94
 7.00 Prom - 116885 116846   40                              -9.25

 8.00 Prom + 117539 117578   40                              -5.15
 8.01 Init + 126284 126354   71  1  2   72   60    70 0.623   3.17
 8.02 Intr + 130168 130347  180  1  0   61   -1   150 0.562   1.46
 8.03 Intr + 130564 130921  358  1  1  119   59   278 0.632  22.33
 8.04 Term + 131177 131236   60  1  0   93   40    44 0.455  -3.07
 8.05 PlyA + 131622 131627    6                               1.05

 9.06 PlyA - 132874 132869    6                               1.05
 9.05 Term - 140106 140000  107  1  2   80   54    59 0.446  -0.71
 9.04 Intr - 149005 148893  113  0  2   27  116   113 0.845   7.10
 9.03 Intr - 149634 149583   52  1  1   65   87    43 0.848  -1.05
 9.02 Intr - 150994 150841  154  1  1   49   55   138 0.490   5.32
 9.01 Init - 160270 160160  111  1  0   51   59    96 0.447   3.26
 9.00 Prom - 179995 179956   40                              -3.65

10.04 PlyA - 180336 180331    6                               1.05
10.03 Term - 182508 182396  113  1  2   80   48   121 0.943   5.04
10.02 Intr - 188039 187873  167  1  2   28   77   117 0.885   3.28
10.01 Init - 189541 189417  125  1  2   76   67    60 0.651   2.39
10.00 Prom - 190765 190726   40                              -5.05

11.00 Prom + 203461 203500   40                              -2.65
11.01 Init + 204486 204542   57  2  0  108   95   -20 0.416   1.85
11.02 Intr + 206405 206539  135  1  0   57   96    98 0.676   7.34
11.03 Intr + 212834 212981  148  1  1   42   55   107 0.020   1.69

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 103921 103869   53  1  2   62   86    55 0.827   3.38

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:95418157_95634256|GENSCAN_predicted_peptide_1|65_aa
KKEEEEEKEKDKDGEGEGEGEGEGEGEGEGEGEEEVEEEEEEEEEEEEEEEEEEEEEEEK
NLRQQ

>gi568815586r:95418157_95634256|GENSCAN_predicted_CDS_1|198_bp
aagaaggaggaggaggaggagaaggagaaggacaaagacggagaaggagaaggagaagga
gaaggagaaggagaaggagaaggagaaggagaaggagaagaagaagtagaagaagaagaa
gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaaaag
aatctgaggcagcaatga

>gi568815586r:95418157_95634256|GENSCAN_predicted_peptide_2|90_aa
MTRSVVGPQRSSKALPKAKFAPRKRHGHCLVVCCRSDPLQLSESWRNRYIREVFAANRCD
AQKTAVSTERAQFFFTTAPDRMLYNQCFKS

>gi568815586r:95418157_95634256|GENSCAN_predicted_CDS_2|273_bp
atgaccaggtcagtggttggaccacagagaagctccaaagcacttcccaaagccaaattt
gcaccgagaaaacgtcatggtcactgtttggtggtctgctgccggtctgatccactacag
ctttctgaatcctggcgaaatcgttacattcgagaagtattcgcagcaaatcgatgcgat
gcacagaaaactgcagtgtcaactgaaagggcccaattcttcttcacgacagcacctgac
cgcatgttgtacaaccaatgcttcaaaagttga

>gi568815586r:95418157_95634256|GENSCAN_predicted_peptide_3|131_aa
MKLRTFTVSVTALKGGTDPKSKQQQGLSRRTKEQSFHTMEGNPSGLPLLAGLASQGAPTV
GGLFPQTADFPHTGHQLSVSRVGSCQWVRGLADFKNEASDLCRHPNCCWEMGNDRSSYFL
LDRGKEGALQL

>gi568815586r:95418157_95634256|GENSCAN_predicted_CDS_3|396_bp
atgaagctgcggaccttcacggtgagtgttacagctcttaaaggtggcacagacccaaag
agtaagcagcagcaaggtttatcgcgtagaacaaaagaacaaagcttccacaccatggaa
gggaatccaagtggattaccactgctggctgggcttgcctcacaaggggctccaactgtg
gggggtctgtttccgcagaccgctgactttcctcacacggggcaccaattgagtgtgtcc
agagttggttcctgccagtgggttcgtggtcttgctgacttcaagaatgaagcttcggac
ctttgcagacaccccaactgctgttgggaaatgggcaatgaccgctctagctactttctg
ctggataggggcaaagaaggggccctgcagttgtag

>gi568815586r:95418157_95634256|GENSCAN_predicted_peptide_4|149_aa
MASLLRSARPRTHRQEPTPDISCGLVQDIATWHQEEQAEKEKRDKRKATALVMALRQTNL
GGSERPENGAGQSPVKVAGVESWIHYTQVKLWTPPEEPAGSSAQASQVQPDQPRYTCEPL
KDLRLLFWKETSQIKKAPTADPEEKPLPT

>gi568815586r:95418157_95634256|GENSCAN_predicted_CDS_4|450_bp
atggcttcattattgaggtcagcaagaccacgaacccaccggcaggaaccaactcctgac
atatcttgtgggctcgtccaggatattgccacgtggcaccaagaggaacaggccgaaaag
gaaaagcgagataagagaaaggccacagccttagtcatggccctcaggcaaacaaacctt
ggtggttcagagaggccagaaaacggagcaggccaatcacctgttaaggtggcaggagtg
gaatcttggattcactacacccaagttaaactttggacaccccctgaagaacctgcggga
tcatcagctcaggcgtcccaagttcagccagaccagcctcgatacacctgtgaaccattg
aaggatttgcgtctcctattttggaaggaaacgtcccagattaaaaaggctcccacagct
gatcctgaggaaaaaccccttcctacttaa

>gi568815586r:95418157_95634256|GENSCAN_predicted_peptide_5|114_aa
MAGMQRGGEGSECDEMRLERAHIEPAPTRKEQLAGSRIRSLQFPHSLSRTLPPGVDPKFL
RNMRFAKKHNKKGLKKMQANNAKAMSARAEAIKALVKPKEVKPKIPKGVSHKLD

>gi568815586r:95418157_95634256|GENSCAN_predicted_CDS_5|345_bp
atggctggaatgcaaagaggtggagaagggagtgagtgtgatgagatgagactagaaagg
gcccacatagagcctgctcccaccagaaaggagcaactggctggttccagaattcgttca
ctccagttcccacactcgctctctcgcacactccctcccggggtggaccccaagttcctg
aggaacatgcgctttgccaagaagcacaacaagaagggcctaaagaagatgcaggccaac
aatgccaaggccatgagtgcacgtgccgaggctatcaaggccctcgtaaagcccaaggag
gttaagcccaagatcccaaagggtgtcagccacaagctcgattga

>gi568815586r:95418157_95634256|GENSCAN_predicted_peptide_6|478_aa
MAGVEEVAASGSHLNGDLDPDDREEGAASTAEEAAKKKRRKKKKSKGPSAAGEQEPDKES
GASVDEVARQLERSALEDKERDEDDEDGDGDGDGATGKKKKKKKKKRGPKVQTDPPSVPI
CDLYPNGVFPKGQECEYPPTQDGRTAAWRTTSEEKKALDQASEEIWNDFREAAEAHRQVR
KYVMSWIKPGMTMIEICEKLEDCSRKLIKENGLNAGLAFPTGCSLNNCAAHYTPNAGDTT
VLQYDDICKIDFGTHISGRIIDCAFTVTFNPKYDTLLKAVKDATNTGIKCAGIDVRLCDV
GEAIQEVMESYEVEIDGKTYQVKPIRNLNGHSIGQYRIHAGKTVPIVKGGEATRMEEGEV
YAIETFGSTGKGVVHDDMECSHYMKNFDVGHVPIRLPRTKHLLNVINENFGTLAFCRRWL
DRLGESKYLMALKNLCDLGIVDPYPPLCDIKGSYTAQFEHTILLRPTCKEVVSRGDDY

>gi568815586r:95418157_95634256|GENSCAN_predicted_CDS_6|1437_bp
atggcgggtgtggaggaggtagcggcctccgggagccacctgaatggcgacctggatcca
gacgacagggaagaaggagctgcctctacggctgaggaagcagccaagaaaaaaagacga
aagaagaagaagagcaaagggccttctgcagcaggggaacaggaacctgataaagaatca
ggagcctcagtggatgaagtagcaagacagttggaaagatcagcattggaagataaagaa
agagatgaagatgatgaagatggagatggcgatggagatggagcaactggaaagaagaag
aaaaagaagaagaagaagagaggaccaaaagttcaaacagaccctccctcagttccaata
tgtgacctgtatcctaatggtgtatttcccaaaggacaagaatgcgaatacccacccaca
caagatgggcgaacagctgcttggagaactacaagtgaagaaaagaaagcattagatcag
gcaagtgaagagatttggaatgattttcgagaagctgcagaagcacatcgacaagttaga
aaatacgtaatgagctggatcaagcctgggatgacaatgatagaaatctgtgaaaagttg
gaagactgttcacgcaagttaataaaagagaatggattaaatgcaggcctggcatttcct
actggatgttctctcaataattgtgctgcccattatactcccaatgccggtgacacaaca
gtattacagtatgatgacatctgtaaaatagactttggaacacatataagtggtaggatt
attgactgtgcttttactgtcacttttaatcccaaatatgatacgttattaaaagctgta
aaagatgctactaacactggaataaagtgtgctggaattgatgttcgtctgtgtgatgtt
ggtgaggccatccaagaagttatggagtcctatgaagttgaaatagatgggaagacatat
caagtgaaaccaatccgtaatctaaatggacattcaattgggcaatatagaatacatgct
ggaaaaacagtgccgattgtgaaaggaggggaggcaacaagaatggaggaaggagaagta
tatgcaattgaaacctttggtagtacaggaaaaggtgttgttcatgatgatatggaatgt
tcacattacatgaaaaattttgatgttggacatgtgccaataaggcttccaagaacaaaa
cacttgttaaatgtcatcaatgaaaactttggaacccttgccttctgccgcagatggctg
gatcgcttgggagaaagtaaatacttgatggctctgaagaatctgtgtgacttgggcatt
gtagatccatatccaccattatgtgacattaaaggatcatatacagcgcaatttgaacat
accatcctgttgcgtccaacatgtaaagaagttgtcagcagaggagatgactattaa

>gi568815586r:95418157_95634256|GENSCAN_predicted_peptide_7|779_aa
MDTCKHVGQLQLAQDHSSLNPQKWHCVDCNTTESIWACLSCSHVACGRYIEEHALKHFQE
SSHPVALEVNEMYVFCYLCDDYVLNDNTTGDLKLLRRTLSAIKSQNYHCTTRSGRFLRSM
GTGDDSYFLHDGAQSLLQSEDQLYTALWHRRRILMGKIFRTWFEQSPIGRKKQEEPFQEK
IVVKREVKKRRQELEYQVKAELESMPPRKSLRLQGLAQSTIIEIVSVQVPAQTPASPAKD
KVLSTSENEISQKVSDSSVKRRPIVTPGVTGLRNLGNTCYMNSVLQVLSHLLIFRQCFLK
LDLNQWLAMTASEKTRSCKHPPVTDTVVYQMNECQEKDTGFVCSRQSSLSSGLSGGASKG
RKMELIQPKEPTSQYISLCHELHTLFQVMWSGKWALVSPFAMLHSVWRLIPAFRGYAQQD
AQEFLCELLDKIQRELETTGTSLPALIPTSQRKLIKQVLNVVNNIFHGQLLSQVTCLACD
NKSNTIEPFWDLSLEFPERYQCSGKDIASQPCLVTEMLAKFTETEALEGKIYVCDQCNSK
RRRFSSKPVVLTEAQKQLMICHLPQVLRLHLKRFREYECDLCLPKCVGDFSPPSKSSVLQ
WMRAGCPLIQFSSDTVYLEVASDPTDGELSPTRLTPTSDPSHNSRWSGRNNREKIGVHVG
FEEILNMEPYCCRETLKSLRPECFIYDLSAVVMHHGKGFGSGHYTAYCYNSEGGFWVHCN
DSKLSMCTMDEVCKAQAYILFYTQRVTENGHSKLLPPELLLGSQHPNEDADTSSNEILS

>gi568815586r:95418157_95634256|GENSCAN_predicted_CDS_7|2340_bp
atggatacgtgcaaacatgttgggcagctgcagcttgctcaagaccattccagcctcaac
cctcagaaatggcactgtgtggactgcaacacgaccgagtccatttgggcttgccttagc
tgctcccatgttgcctgtggaagatatattgaagagcatgcactcaagcactttcaagaa
agcagtcatcctgttgcattggaggtgaatgagatgtacgttttttgttacctttgtgat
gattatgttctgaatgataacacaactggagacctgaagttactacgacgtacattaagt
gccatcaaaagtcaaaattatcactgcacaactcgtagtgggaggtttttacggtccatg
ggtacaggtgatgattcttatttcttacatgacggtgcccaatctctgcttcaaagtgaa
gatcaactgtatactgctctttggcacaggagaaggatactaatgggtaaaatctttcga
acatggtttgaacaatcacccattggaagaaaaaagcaagaagaaccatttcaggaaaaa
atagtagtaaaaagagaagtaaagaaaagacggcaggaattggagtatcaagttaaagca
gaattggaaagtatgcctccaagaaagagtttacgtttacaagggctcgctcagtcgacc
ataatagaaatagtttctgttcaggtgccagcacaaacgccagcatcaccagcaaaagat
aaagtactctctacctcagaaaatgaaatatctcaaaaagtcagtgactcctcagttaaa
cgaaggccaatagtaactcctggtgtaacaggattgagaaatttgggaaatacttgctat
atgaattctgttcttcaggtgttgagtcatttacttatttttcgacaatgttttttaaag
cttgatctgaaccaatggctggctatgactgctagcgagaagacaagatcttgtaagcat
ccaccagtcacagatacagtagtatatcaaatgaatgaatgtcaggaaaaagatacaggt
tttgtttgctccagacaatcaagtctgtcatcaggactaagtggtggagcatcaaaaggt
agaaagatggaacttattcagccaaaggagccaacttcacagtacatttctctttgtcat
gaattgcatactttgttccaagtcatgtggtctggaaagtgggcgttggtctcaccattt
gctatgctacactcagtgtggagactcattcctgcctttcgtggttacgcccaacaagac
gctcaggaatttctttgtgaacttttagataaaatacaacgtgaattagagacaactggt
accagtttaccagctcttatccccacttctcaaaggaaactcatcaaacaagttctgaat
gttgtaaataacatttttcatggacaacttcttagtcaggttacatgtcttgcatgtgac
aacaaatcaaataccatagaacctttctgggacttgtcattggagtttccagaaaggtat
caatgcagtggaaaagatattgcttcccagccatgtctggttactgaaatgttggccaaa
tttacagaaactgaagctttagaaggaaaaatctacgtatgtgaccagtgtaactcaaag
cgtagaaggttttcctccaaaccagttgtactcacagaagcccagaaacaacttatgata
tgccacctacctcaggttctcagactgcacctcaaacgattcagagaatacgaatgtgac
ctctgcttaccaaaatgtgtaggagatttctccccaccaagcaagtcatcagttctgcag
tggatgcgagctgggtgtcctctaattcaatttagttctgacactgtctacctggaggta
gcatcagatcccacagatggagagctcagtcccacaagactgacccccacttcagacccc
agtcataactccaggtggtcaggacgtaataaccgagagaagattggtgttcatgttggc
tttgaggaaatcttaaacatggagccctattgctgcagggagaccctgaaatccctcaga
ccagaatgctttatctatgacttgtccgcggtggtgatgcaccatgggaaaggatttggc
tcagggcactacactgcctactgctataattctgaaggagggttctgggtacactgcaat
gattccaaactaagcatgtgcactatggatgaagtatgcaaggctcaagcttatatcttg
ttttatacccaacgagttactgagaatggacattctaaacttttgcctccagagctcctg
ttggggagccaacatcccaatgaagacgctgatacctcgtctaatgaaatccttagctga

>gi568815586r:95418157_95634256|GENSCAN_predicted_peptide_8|222_aa
MERDDTAVSPGGSYKRISGTKGQCAQPVPAPRPPLHDARAGISAKPLPVPEALARARSPP
ARPLQPAAPCRSISGHSEAAAGTGGPHSANRRGRAPSRTLTSPSRGGHLSAHPGPPPPGS
AAATTRCGGYDSRDAQQAPPPPTAPPLRRLFAGTENLPVAAFRASWGGRGHGRSPAPQVG
LREQQALPPRAGFAPVVPSGHGRKGERGSRTGTGREPLSRTA

>gi568815586r:95418157_95634256|GENSCAN_predicted_CDS_8|669_bp
atggaaagggacgatacagctgtaagccctggaggttcttacaagaggatatcaggtacc
aaaggtcaatgcgcccagcccgtgccagccccgcgcccacctctccacgacgctcgtgcc
gggatcagcgcgaagccccttccagtccccgaagccctcgcccgcgcccgttctccccca
gctcgccccctccagcccgctgcgccttgccgcagcatctccgggcactctgaggctgcc
gccgggacagggggtcctcactccgccaatcgccgcggccgcgcgccctcgcgcacactc
accagcccgagccggggcggccatcttagcgctcaccccggccccccgccccccggttcg
gcggccgcgacgacccggtgcggcggctacgacagccgtgacgcgcagcaggccccgccc
cctcccacagccccacccctgcgccggctcttcgcgggcaccgagaacctgccggtggcc
gccttccgcgcctcgtgggggggtcggggccacggacggtccccggcgccgcaagtgggt
ctgcgcgaacaacaagcactgcctccccgggcgggcttcgcacctgtagtgccgtcggga
cacgggaggaagggcgagagaggttccagaacgggcacaggaagggaaccgctatctaga
actgcctaa

>gi568815586r:95418157_95634256|GENSCAN_predicted_peptide_9|178_aa
MSSALAKERRQREQAYAKAQKRGDLDMRRGERIKTGEDDVWSDSENEENVSRKEARSALA
NSEVKSVAFEFDKKGIVGSFVSREHFQSKSYRIEKNTTETVLLPWLFLTDDVRKKRNNVV
ETGSNRPSNFFQDMDKYACLKAEASSACLDGAVEQVGELNLKSIDVCQEKGTRELMYV

>gi568815586r:95418157_95634256|GENSCAN_predicted_CDS_9|537_bp
atgagtagcgcgttagccaaagaaaggaggcagagggaacaagcatatgcaaaggctcag
aaaagaggtgatcttgatatgagaagaggagaaagaattaagactggagaggatgatgta
tggtctgacagtgaaaatgaagagaatgtttcaagaaaagaagcacgatcagcgttggca
aatagtgaagtaaaatcagtggcttttgaatttgacaagaaaggaattgttggtagcttt
gtttcaagagaacatttccaaagcaaaagttacaggattgagaaaaatacgactgagacc
gtcttattgccatggctttttctcactgatgacgtaaggaagaagaggaataatgtggtt
gagacaggaagcaatcggccaagcaatttctttcaagatatggataaatatgcatgcttg
aaagccgaagcctcatcagcctgcttggatggagctgtagagcaggtgggagagttgaat
ttaaagagtatagatgtttgtcaagagaaaggcacaagggagttgatgtatgtgtga

>gi568815586r:95418157_95634256|GENSCAN_predicted_peptide_10|134_aa
MTLAISKEFYKTITPATAKLQTGSINITLHLFKIMGPKDDIRASREPEFMNVIVPPEQRY
RDKQTIVSRHRFWQIKFYWNTATSIVEVLSMAALELQGWARGEAMWRTTAHSMAVLGWTY
YMAAGFPEKAARSS

>gi568815586r:95418157_95634256|GENSCAN_predicted_CDS_10|405_bp
atgacattggctatatcaaaagagttttataagacaatcacacctgctactgcaaaactc
caaactggcagtataaacatcactttgcacttatttaaaattatgggaccaaaagatgac
attagggcatccagagaaccagagttcatgaatgttattgttccaccagagcagagatac
agagataaacaaactatagttagcaggcaccggttttggcaaataaaattttactggaat
acggccacatccattgtagaagtcttgtcaatggctgctttggagctacaaggttgggct
cgtggagaggccatgtggaggacaactgctcacagcatggcggtcttaggatggacatat
tacatggcagctggcttccccgagaaagctgctaggtcttcttaa

>gi568815586r:95418157_95634256|GENSCAN_predicted_peptide_11|114_aa
MVKPCLYNTRISQACWCAPARIIKKFFYVNPKKGSFDLFVLDTLFARSKKCSRGIAAINM
GDLKRDSFFKAKDRGEVMGKEERRERGSRGGAGEGAEEEEETVADKKRLEVKRX

>gi568815586r:95418157_95634256|GENSCAN_predicted_CDS_11|342_bp
atggtgaagccctgtctctataatacacgaattagccaggcgtgttggtgtgcacctgca
agaataattaaaaagttcttttatgtcaacccaaagaaaggcagttttgacctttttgtc
ctcgacacattgtttgctcgaagtaagaaatgttcacgtggaatagctgccattaatatg
ggtgacctcaagagggacagttttttcaaagcaaaagacagaggagaggtgatggggaag
gaggagagaagagagagagggagtaggggaggagcaggggaaggggcagaagaagaagaa
gagactgttgcagataaaaagagacttgaggtcaagcgagnn