GENSCAN 1.0 Date run: 4-Nov-116 Time: 09:43:51 Sequence gi568815596f:200963089_201164048 : 200960 bp : 41.60% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.11 Intr - 607 402 206 2 2 61 101 202 0.000 16.80 1.10 Intr - 881 759 123 0 0 4 116 100 0.000 4.04 1.09 Intr - 18783 18268 516 1 0 84 98 563 0.360 48.90 1.08 Intr - 24444 24277 168 1 0 33 53 165 0.983 6.50 1.07 Intr - 25333 25174 160 1 1 60 113 158 0.925 14.14 1.06 Intr - 29280 29193 88 2 1 72 111 69 0.204 6.65 1.05 Intr - 29915 29799 117 1 0 31 103 90 0.067 3.46 1.04 Intr - 46000 45885 116 1 2 64 52 48 0.022 -2.87 1.03 Intr - 54082 53903 180 1 0 95 111 66 0.904 8.74 1.02 Intr - 59844 59743 102 0 0 83 75 114 0.994 9.05 1.01 Init - 60921 60871 51 0 0 52 94 45 0.994 2.71 1.00 Prom - 66494 66455 40 -5.25 2.02 PlyA - 67140 67135 6 1.05 2.01 Sngl - 72634 71942 693 1 0 88 49 619 0.998 53.95 2.00 Prom - 81550 81511 40 -6.65 3.00 Prom + 88006 88045 40 -3.75 3.01 Sngl + 100001 100906 906 1 0 83 45 1029 0.982 94.08 3.02 PlyA + 101280 101285 6 1.05 4.00 Prom + 104376 104415 40 -6.95 4.01 Init + 104525 104531 7 1 1 77 103 0 0.483 1.64 4.02 Intr + 108359 108609 251 0 2 50 72 274 0.728 18.23 4.03 Term + 108837 108905 69 2 0 69 48 119 0.975 2.96 4.04 PlyA + 109755 109760 6 1.05 5.00 Prom + 113258 113297 40 -6.75 5.01 Init + 115795 115934 140 0 2 99 111 86 0.983 11.66 5.02 Term + 122371 122527 157 1 1 118 49 75 0.992 3.02 5.03 PlyA + 123682 123687 6 1.05 6.03 PlyA - 124514 124509 6 1.05 6.02 Term - 154029 153530 500 1 2 73 55 327 0.178 21.60 6.01 Init - 155981 155660 322 2 1 94 39 307 0.575 23.94 6.00 Prom - 158551 158512 40 -6.95 7.00 Prom + 165550 165589 40 -6.75 7.01 Init + 166778 167058 281 1 2 37 59 309 0.980 19.43 7.02 Intr + 169941 170046 106 0 1 92 113 73 0.994 9.50 7.03 Intr + 172884 173019 136 2 1 53 99 142 0.722 11.02 7.04 Term + 173110 173339 230 2 2 89 46 38 0.701 -4.49 7.05 PlyA + 173484 173489 6 -3.44 8.06 PlyA - 173736 173731 6 1.05 8.05 Term - 175006 174428 579 1 0 22 55 777 0.980 61.30 8.04 Intr - 175287 175180 108 0 0 -11 96 120 0.464 2.46 8.03 Intr - 175783 175508 276 1 0 18 56 229 0.378 9.49 8.02 Intr - 175951 175825 127 0 1 92 28 103 0.536 4.46 8.01 Init - 177096 176873 224 0 2 64 47 175 0.736 7.10 8.00 Prom - 177695 177656 40 -7.65 9.00 Prom + 179214 179253 40 -6.25 9.01 Init + 182297 182344 48 1 0 62 74 36 0.061 0.60 9.02 Intr + 186666 186747 82 2 1 97 96 55 0.316 5.49 9.03 Intr + 197344 197854 511 2 1 103 105 403 0.783 34.60 9.04 Term + 200747 200885 139 2 1 105 48 137 0.819 7.85 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 408 542 135 0 0 78 94 159 0.886 14.26 S.002 Term + 616 841 226 1 1 55 48 190 0.997 6.97 S.003 Sngl + 63022 63702 681 0 0 63 39 205 0.949 9.03 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:200963089_201164048|GENSCAN_predicted_peptide_1|609_aa MLGTDRCVVEEWLSEFKALPDTQITSYAATLHRKKTLVPALYKVIQDSNNELLEPVCHQL FELYRSSEVRLKRFTLQFLPELMWVYLRLTVSRDRQSNGCIEALLLGIYNLPSTIGSMAL TEGALCQHDLIRVVYSDLHPQRETFTAQNRVCVSGFPRQHEKHWKELCGRIVLDPEFMVQ LLTGVYYAMYNGQWDLGQEVLDDIIYRAQLELFSQPLLVANAMKNSLPFDAPDSTQEGQK VLKVEVTPTVPRISRTAITTASIRRHRWRREDGFDFSNEADSSIPGSPIQHGSTDLGIKR VQEGEVLVRRTPEHGSPEPNSATATTEGAEGVNGGEESVNLNDADEGFSSGASLSSQPIG TKPSSSSQRGSLRKVATGRSAKDKETASAIKSSESPRDSVVRKQYVQQPTDLSVDSVELT PMKKHLSLPAGQVVPKINSLSLIRTASASSSKSFDYVNGSQASTSIGVGTEGGTNLAANN ANRYSTVSLQEDRLGQAGEEHFLKIATRQTATTQPATVQTSSDPNNTSFSLPRKRKEERK GREIRCAARRLPIPQLASGAPYLPGGRARITGNWLFPLGPKVPSLRRRLSGVSFGRRDAA GSAPDGNLA >gi568815596f:200963089_201164048|GENSCAN_predicted_CDS_1|1827_bp atgctgggaactgaccgttgtgttgtggaagaatggttatcagaattcaaggcattacct gacactcagatcaccagttatgcagcaactttacaccggaaaaaaacacttgtaccagcc ctctataaagttattcaagattcaaataatgagctcctggagcctgtctgccatcagctg tttgagctctatcgtagctcagaggttcgacttaagaggttcacactgcagttcttgcca gaattgatgtgggtttatttacggcttacagttagccgagacagacagagtaatggttgc attgaagcacttctgttaggaatttacaatttgccttcaacaattggatccatggctttg acagaaggggcattgtgtcagcatgatctcatcagagttgtttatagtgatcttcatcct cagagggaaacattcactgcacagaaccgggtttgtgtgagtggctttccacggcaacat gaaaaacactggaaagaactctgtggtcgaatagtattggatcctgaatttatggtgcaa cttctcacaggggtttattatgccatgtataatggacagtgggaccttggccaggaagtt cttgatgatatcatttatagagcccagctagagcttttttctcaaccactattggttgcc aatgccatgaaaaactcattaccatttgatgctcctgattctacacaagaaggccagaaa gtccttaaagttgaagtcactccaacagtgccgaggatttctcggactgcaattacaaca gcttcaatccgtcgtcatagatggagaagagaagatggctttgacttctcaaacgaggct gactcgagtattcctggctccccgatccaacacggctccactgacctagggatcaaacgt gtgcaagagggggaggtgctggtgcgcaggacccctgagcatggctcgccggagcccaac tcagcaacagccacaacagagggtgctgagggtgtaaatggaggagaggagtctgtaaac ctgaatgatgcagatgaaggattttcatcaggggcttccctcagcagtcagccaattggg accaaaccatcctcctcttctcagaggggaagcttaaggaaagtagcaactgggcgttca gccaaggataaagaaacagcctctgccatcaaatccagtgagagccctcgagattcagta gttcgcaagcagtatgtacagcaaccaactgatcttagtgtagattcagttgagctgaca ccaatgaagaaacacctgagcctgcctgctggccaggtggtgccaaaaatcaatagctta agtctaatccggacagccagtgcttcctcaagtaaatcatttgactatgtaaatggcagt caagcaagtaccagcattggggttggcactgagggaggtactaatttagcagccaacaat gctaatcgatactcaactgtcagtctgcaggaagaccggctaggtcaagctggcgaagaa cactttttaaagattgctacaagacaaacagctaccactcagccagctactgtgcaaact tcctcagatccaaacaacacttccttttcccttccccggaaacggaaagaagaaaggaaa gggcgggagattcggtgtgccgcacgcaggcttcccatcccccagctggcctccggtgct ccttacctccccggtggtcgggcgcgaattactggaaattggcttttcccgttggggccg aaggtaccttccctgcggcggcgactcagcggggtgtcgttcggccggcgtgacgcagcc ggatcggcgccagacggaaacctagcg >gi568815596f:200963089_201164048|GENSCAN_predicted_peptide_2|230_aa MGKKQSRKTENSKNQSTSPPPKERSSSPAMEQSWTENDFDELREEGFRRSDYSELKEEVR TRGKEVKNLEKRLDEWLTRITNAEKSLKDLMELKTTARERRDECTSLSSRFNQLEERVSV MEDEMNEMKRKEKFREKRIKRNEQSLQEIWDYMKRPNLRLIGVPESDGENGTKLENTLQE IIQENFPNLARQANIQIQEIQRTPQRYSSRGATPRHIIVRFTKVEMKEKM >gi568815596f:200963089_201164048|GENSCAN_predicted_CDS_2|693_bp atggggaaaaaacagagcagaaaaactgaaaattctaaaaatcagagcacctctcctcct ccaaaggaacgcagctcctcaccagcaatggaacaaagctggacggagaatgactttgat gagttgagagaagaaggcttcagacgatcagactactccgagctaaaggaggaagttcga acccgtggcaaagaagttaaaaaccttgaaaaaagattagatgaatggctaactagaata accaatgcagagaagtccttaaaggacctgatggagctgaaaaccacggcacgagaacga cgtgacgaatgcacaagcctcagtagccgattcaatcaactggaagaaagggtatcagtg atggaagatgaaatgaatgaaatgaagcgaaaagagaagtttagagaaaaaagaataaaa agaaacgaacaaagcctccaagaaatatgggactatatgaaaagaccaaatctacgtctg attggtgtacctgaaagtgacggggagaatggaaccaagttggaaaacactctgcaggag attatccaggagaacttccccaatctagcaaggcaggccaacattcaaattcaggaaata cagagaacgccacaaagatactcctcaagaggagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgtga >gi568815596f:200963089_201164048|GENSCAN_predicted_peptide_3|301_aa MSKSESPKEPEQLRKLFIGGLSFETTDESLRSHFEQRGTLTDCVVMRDPNTKCSTGFGFV TYATVKEVEAAMNARPQKVDGRVVEPKRAVSREDSQRPGAHLTVKKIFVGGIKEDTEEHH LRDYFEQYGKIEVIEIMTDRGSGKKRGFAFVTFDDHDSVDKIVIQKYHTVNGHNCEVRKA LSKQEMTSASSSQRGRSGSGNFGGGRGGGFSGNDNFGHGRNFSGHGGFGGSRGGGGYDGS GDGYNGFGNDGSHFGGGGSYNDFGNYKNQSSNFGPVKGGNFGGRSSGPYGGGGQYFAKPR N >gi568815596f:200963089_201164048|GENSCAN_predicted_CDS_3|906_bp atgtctaagtcagagtctcctaaagagcccgaacagctgaggaagctcttcattggaggg ttgagctttgaaacaaccgatgagagcctgaggagccattttgagcaacggggaacgctc acagactgtgtggtaatgagagatccaaacaccaagtgctccacgggctttgggtttgtc acatatgccactgtgaaggaggtggaggcagctatgaatgcaaggccacagaaggtggat ggaagagtcgtggaaccaaagagagctgtctcgagagaagattctcaaagaccaggtgcc cacttaactgtgaaaaagatatttgttggtggcattaaagaagacactgaagaacatcac ctaagagattattttgaacagtatggaaaaattgaagtgattgaaatcatgactgaccga ggcagtgggaagaaaaggggctttgcctttgtaacctttgatgaccatgactccgtggat aagattgtcattcagaaataccacactgtgaatggccacaactgtgaagttagaaaagcc ctgtcaaagcaagagatgactagtgcttcatctagccaaagaggtcgaagtggttctgga aactttggtggtggtcgtggaggtggtttcagtgggaatgacaactttggtcatggaaga aacttcagtggtcatggtggctttggtggcagccgtggtggtggtggatatgatggcagt ggggatggctataatggatttggtaatgatggaagccattttggaggtggtggaagctac aatgattttggcaattacaaaaatcagtcttcaaattttggacccgtgaagggaggaaat tttggaggcagaagctctggcccctatggcggtggaggccaatactttgcaaaaccacga aactaa >gi568815596f:200963089_201164048|GENSCAN_predicted_peptide_4|108_aa MNHATWFTRGSGTTNTPATAFPDHPRQLPSVVSAAEGEENPSSSSTLPPPKTAPPPTCHP AVSGFLPPPPPAAACQLQRNNQPSHWESLTQYRNVKRLLRFPAVRNSR >gi568815596f:200963089_201164048|GENSCAN_predicted_CDS_4|327_bp atgaaccatgctacctggtttacgagaggttccgggaccaccaacacaccggctaccgcc tttccagaccaccctcgccagctcccgtctgtagtttccgcggccgaaggagaagaaaac ccctcttcatctagcactttgcctcccccaaagacagcaccgccgcctacctgtcaccct gctgtctccggcttcctaccgccgccgcctcccgcagcagcctgccaactgcagcgcaac aaccaaccctctcattgggaatctctaacccagtatcggaacgttaagcggcttctccgc tttcctgccgtgagaaactcgaggtga >gi568815596f:200963089_201164048|GENSCAN_predicted_peptide_5|98_aa MAHEHGHEHGHHKMELPDYRQWKIEGTPLETIQKKLAAKGLRDPWGRNEAWRYMGGFAKS VSFSDVFFKGFKWGFAAFVVAVGAEYYLESLNKDKKHH >gi568815596f:200963089_201164048|GENSCAN_predicted_CDS_5|297_bp atggcccatgaacatggacatgagcatggacatcataaaatggaacttccagattataga caatggaagatagaagggacaccattagaaactatccagaagaagctggctgcaaaaggg ctaagggatccatggggccgcaatgaagcttggagatacatgggtggctttgcaaagagt gtttccttttctgatgtattctttaaaggattcaaatggggatttgctgcatttgtggta gctgtaggagctgaatattacctggagtccctgaataaagataagaagcatcactga >gi568815596f:200963089_201164048|GENSCAN_predicted_peptide_6|273_aa MPRSVVSPLEAQGDLGPKARTQSFSASIGGQCPDVPGAQGDTSPGAGIGQDSRSTHARAA AGSSHWISAGTNSHPRRTSSDLLLRKTRIGNAPDGTRNKDRKSKASTGRRLPLSEIERIH NCWESPRSQQLSKGKMPDLLNSSPQELNSQDHFKQSLHDAEGPPLKLAGGPLGLRLQGAA RVGQGLLPVSQVHRGAACTAVAQVAPFWESGAELRVEGVRGFGAGRKRKRGHSAKPAVNN RDKEPTKLSQVPEAPPPSPGHCRPPAVSRSAGL >gi568815596f:200963089_201164048|GENSCAN_predicted_CDS_6|822_bp atgccccggagtgttgtgagtcctctggaggctcaaggagacctagggccaaaagcgagg actcaatccttttctgccagtatcgggggacagtgccccgacgtcccaggagcccagggc gacaccagcccaggggcggggatcgggcaggattctcggtctactcacgcccgggcagcc gccggctcttcccactggatctccgccgggaccaactcccaccctcgccggacaagctca gacctgctgctccgcaaaaccagaattggaaatgctccagatgggacgaggaacaaggac agaaagtcaaaagcatcaacagggcggcggctcccactgagtgagatcgagcgcatacac aattgttgggaaagtccacgttcccaacaactttcgaaaggaaaaatgccagatcttctc aacagttcaccccaagaattgaactctcaagatcattttaagcaaagtttacacgacgct gagggcccgccactgaagctcgcgggtgggcctctggggctgcggctgcagggggcggcc agagttggccaagggttacttccggtctctcaggtccatcggggtgcagcttgtaccgct gtggcccaggtcgcccctttctgggaatccggggcggagttgcgggtagaaggggttcgg gggttcggagccgggcggaaacggaagcgggggcattccgcaaagcccgcggtaaacaac cgagataaggaacctacaaaactgagtcaggttccagaagcgcccccgccatcccctggc cactgccggccacctgcagtttcccggagcgcagggctgtga >gi568815596f:200963089_201164048|GENSCAN_predicted_peptide_7|250_aa MSAEVIHQVEEALDTDEKEMLLFLCRDVAIDVVPPNVRDLLDILRERGKLSVGDLAELLY RVRRFDLLKRILKMDRKAVETHLLRNPHLVSDYRVLMAEIGEDLDKSDVSSLIFLMKDYM GRGKISKEKSFLDLVVELEKLNLVAPDQLDLLEKCLKNIHRIDLKTKIQKYKQSGGWNGT WMTKASFSLWLEIASLYILHIPDLPTPLPCIFMISAKCCQAVLNISVFKKESGICTGWQS YGHSGVLHVC >gi568815596f:200963089_201164048|GENSCAN_predicted_CDS_7|753_bp atgtctgctgaagtcatccatcaggttgaagaagcacttgatacagatgagaaggagatg ctgctctttttgtgccgggatgttgctatagatgtggttccacctaatgtcagggacctt ctggatattttacgggaaagaggtaagctgtctgtcggggacttggctgaactgctctac agagtgaggcgatttgacctgctcaaacgtatcttgaagatggacagaaaagctgtggag acccacctgctcaggaaccctcaccttgtttcggactatagagtgctgatggcagagatt ggtgaggatttggataaatctgatgtgtcctcattaattttcctcatgaaggattacatg ggccgaggcaagataagcaaggagaagagtttcttggaccttgtggttgagttggagaaa ctaaatctggttgccccagatcaactggatttattagaaaaatgcctaaagaacatccac agaatagacctgaagacaaaaatccagaagtacaagcagtctggtggatggaatggaacc tggatgaccaaagcctcctttagcttgtggctagaaatcgcgtccctttatattcttcat atcccagatctgccaactccattgccctgtatattcatgatctctgcaaaatgctgccaa gcagttcttaacatttcagtcttcaagaaagaatctgggatctgcacaggctggcagagc tatggacattcaggggttttgcatgtatgctga >gi568815596f:200963089_201164048|GENSCAN_predicted_peptide_8|437_aa MEYRAPWPAPPLPQPPPPAHPAHPAPGCSVAAARLRVAAAAAFMSGRGVSRQHEGLPISS STGYVVEDGFTALKGHSNNLISLSFPHTVQQLFASDQGLTYNDFLILPGFIDFIADETPL ISSPMDTVTEADLAIVMALMGGTGFIHHNCTPEFQASEVQKVKKFEPGFITHPVVLSPLH TVGDVLEAKMRHGFSGIPITETGTMGSKLNRDYPVASKDSHEQLLGGAAVGTHEDDKYHL DLLTQGHGLRGSIYINQEVIACSQPQGTAVYKVAKHTQNFGVPIIADGGIQTMGHVVKAL ALGASTVMMGSLLAATMEAPGECFFSDGMQLKKYQGMGSLDAMEKSSSSQKQYFNDGDKA KITQDVLGSIQDKGSIQKFVPYLIVGIQHGCQDIGAHSLSVLRSMMYSGELKFEKQTMSA QIDGGIHGLHSYEKWLY >gi568815596f:200963089_201164048|GENSCAN_predicted_CDS_8|1314_bp atggagtatagggcgccctggcccgccccgccgctgccgcagcccccgcccccagcccac ccagcccacccggcgcctggctgcagcgtggcagcggcgcgcctccgcgtcgcagcagct gcagcgtttatgtcgggtcgcggggtctcgcggcagcatgagggactacctatcagcagc agcaccggctacgtggtcgaggacgggttcactgcgctgaagggacacagtaacaatctg atctctctttcttttccccacaccgtgcagcagctcttcgccagtgaccagggactcacc tacaacgacttcttgattctcccaggattcatagacttcatagctgatgagacgccgctg atctcctcccccatggacactgtgacagaggccgacctggccatcgtgatggctctgatg ggaggtactggtttcattcaccacaactgcaccccagagttccaggccagtgaggtgcag aaggtcaagaagtttgaaccgggctttatcacacaccccgtggtgctgagccccttgcac actgtgggtgatgtgttggaggccaagatgcgtcatggcttctctggcatccccatcact gagacgggtaccatgggcagcaagctgaaccgagactaccctgtggcctccaaggattcc catgagcagctgctgggcggggcagctgtgggtacccatgaggatgacaaataccacctg gacctgctcacccaggggcatggactgcgcggctccatctacatcaaccaggaagtgata gcctgcagtcagccccagggcactgctgtgtacaaggtggccaagcatacccagaacttt ggtgtgcccatcatagccgatggtggcatccagaccatggggcatgtggtcaaggccctg gccctaggagcctccacagtgatgatgggctccctgctggccgccaccatggaggccccc ggcgagtgcttcttctcagacggaatgcagctcaagaagtaccagggcatgggctcactg gatgccatggagaagagcagcagcagccagaaacaatacttcaacgacggggataaggcg aagatcacgcaggatgtcttgggctccatccaggacaaagggtccattcagaagttcgtg ccctacctcatagtgggcatccagcatggctgccaggatatcggggcccacagcctgtct gtccttcggtccatgatgtactcaggggagctcaagtttgagaagcagaccatgtcagcc cagatcgacggtggcatccatggcctgcactcttacgagaagtggctgtactga >gi568815596f:200963089_201164048|GENSCAN_predicted_peptide_9|259_aa MGEVKNKDLRNSLALNSIPEERYKMKSKPLGICLIIDCIGNETELLRDTFTSLGYEVQKF LHLSMHGISQILGQFACMPEHRDYDSFVCVLVSRGGSQSVYGVDQTHSGLPLHHIRRMFM GDSCPYLAGKPKMFFIQNYVVSEGQLEDSSLLEVDGPAMKNVEFKAQKRGLCTVHREADF FWSLCTADMSLLEQSHSSPSLYLQCLSQKLRQERKRPLLDLHIELNGYMYDWNSRVSAKE KYYVWLQHTLRKKLILSYT >gi568815596f:200963089_201164048|GENSCAN_predicted_CDS_9|780_bp atgggagaagtaaagaacaaagacttaaggaacagcttggcgctcaacagcatacctgaa gagagatacaagatgaagagcaagcccctaggaatctgcctgataatcgattgcattggc aatgagacagagcttcttcgagacaccttcacttccctgggctatgaagtccagaaattc ttgcatctcagtatgcatggtatatcccagattcttggccaatttgcctgtatgcccgag caccgagactacgacagctttgtgtgtgtcctggtgagccgaggaggctcccagagtgtg tatggtgtggatcagactcactcagggctccccctgcatcacatcaggaggatgttcatg ggagattcatgcccttatctagcagggaagccaaagatgttttttattcagaactatgtg gtgtcagagggccagctggaggacagcagcctcttggaggtggatgggccagcgatgaag aatgtggaattcaaggctcagaagcgagggctgtgcacagttcaccgagaagctgacttc ttctggagcctgtgtactgcggacatgtccctgctggagcagtctcacagctcaccatcc ctgtacctgcagtgcctctcccagaaactgagacaagaaagaaaacgcccactcctggat cttcacattgaactcaatggctacatgtatgattggaacagcagagtttctgccaaggag aaatattatgtctggctgcagcacactctgagaaagaaacttatcctctcctacacataa