GENSCAN 1.0 Date run: 4-Nov-116 Time: 14:44:14 Sequence gi568815597r:70162720_70454408 : 291689 bp : 38.30% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 PlyA - 1229 1224 6 1.05 1.07 Term - 9520 9353 168 1 0 45 33 130 0.297 0.10 1.06 Intr - 10990 10903 88 0 1 49 84 24 0.387 -2.85 1.05 Intr - 13263 13091 173 0 2 58 97 138 0.748 9.52 1.04 Intr - 22195 22066 130 0 1 32 83 60 0.156 -0.32 1.03 Intr - 24619 24546 74 1 2 46 98 72 0.151 1.29 1.02 Intr - 26554 26373 182 2 2 -12 95 105 0.029 -0.13 1.01 Init - 42821 42671 151 2 1 67 116 125 0.286 13.67 1.00 Prom - 43628 43589 40 -4.45 2.00 Prom + 56229 56268 40 -3.05 2.01 Init + 58918 59120 203 0 2 89 64 179 0.982 14.01 2.02 Intr + 65703 65836 134 0 2 88 76 93 0.927 7.47 2.03 Intr + 69549 69658 110 1 2 42 79 87 0.852 2.28 2.04 Intr + 71977 72069 93 0 0 49 74 74 0.713 1.34 2.05 Intr + 72782 72831 50 1 2 76 82 77 0.959 2.46 2.06 Intr + 74706 74833 128 0 2 86 97 221 0.999 22.20 2.07 Intr + 81965 82096 132 0 0 31 75 201 0.656 12.80 2.08 Intr + 84101 84188 88 0 1 -2 110 104 0.992 1.71 2.09 Intr + 87646 87784 139 1 1 66 53 212 0.999 15.05 2.10 Term + 87892 88086 195 0 0 33 42 290 0.933 15.33 2.11 PlyA + 88736 88741 6 1.05 3.10 PlyA - 88839 88834 6 1.05 3.09 Term - 100128 99998 131 1 2 75 36 198 0.960 10.66 3.08 Intr - 108237 108137 101 2 2 93 82 37 0.980 2.43 3.07 Intr - 112099 112001 99 0 0 92 81 21 0.502 0.11 3.06 Intr - 114125 114046 80 2 2 109 106 90 0.997 10.33 3.05 Intr - 129830 129669 162 2 0 76 60 189 0.792 14.15 3.04 Intr - 152841 152762 80 1 2 25 99 116 0.455 4.75 3.03 Intr - 162238 162134 105 2 0 82 54 63 0.421 1.57 3.02 Intr - 173380 173339 42 2 0 108 96 20 0.483 2.09 3.01 Init - 191689 191260 430 1 1 94 89 539 0.933 51.06 3.00 Prom - 201132 201093 40 -5.95 4.00 Prom + 205204 205243 40 -2.05 4.01 Init + 205333 205488 156 0 0 77 94 30 0.603 2.46 4.02 Intr + 206305 206508 204 0 0 117 96 49 0.566 7.07 4.03 Intr + 207011 207221 211 1 1 35 110 205 0.797 14.96 4.04 Term + 207677 208137 461 0 2 85 48 131 0.636 3.07 4.05 PlyA + 208948 208953 6 1.05 5.00 Prom + 210344 210383 40 -4.05 5.01 Init + 235241 235300 60 1 0 51 86 57 0.045 3.10 5.02 Intr + 247961 248144 184 1 1 48 78 131 0.056 6.54 5.03 Intr + 248760 248937 178 1 1 63 103 48 0.072 1.86 5.04 Intr + 251431 251449 19 1 1 82 95 23 0.139 -1.90 5.05 Intr + 253237 253318 82 0 1 77 93 44 0.446 2.09 5.06 Intr + 255218 255313 96 0 0 85 92 37 0.765 2.86 5.07 Intr + 258847 258956 110 2 2 109 88 64 0.945 7.58 5.08 Intr + 261566 261697 132 1 0 22 99 74 0.689 1.82 5.09 Intr + 267075 267132 58 2 1 51 95 25 0.852 -2.96 5.10 Intr + 267598 267675 78 2 0 96 93 53 0.964 5.20 5.11 Intr + 269364 269516 153 1 0 78 84 148 0.852 12.52 5.12 Intr + 271109 271230 122 0 2 69 92 58 0.952 3.59 5.13 Intr + 272406 272458 53 2 2 53 94 74 0.887 1.29 5.14 Intr + 273029 273214 186 2 0 32 44 138 0.377 1.68 5.15 Intr + 275969 276107 139 2 1 87 92 173 0.615 17.15 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 237316 237228 89 2 2 87 42 102 0.829 2.34 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:70162720_70454408|GENSCAN_predicted_peptide_1|321_aa MSRLKRIAGQDLRAGFKAGGRDCGTSVPQGLLKAARKSGQLNLSGRNLSEVPQCVWRINV DIPEEANQNLSFGATERWWEQTDLTKLIISNNKLQSLTDDLRLLPALTVLDIHDNQLTSL PSAIRELENLQKLNVSHNKLKILPEEITNLRNLKCLYLQHNELTCISEGFEQLSNLEDLE LHVGENQIEMLEAEHLKHLNSILVLDLRDNKLKSVPDEIILLRSLERLDLSNNDISSLPY SLGNLHLKFLALEGNPLRTIRREIISRTVIKAPADLVSGKDLFLIDGTFSMLSDGKRRLM SPQDFFYKGTIPVHEGSALNI >gi568815597r:70162720_70454408|GENSCAN_predicted_CDS_1|966_bp atgtcgcgcctgaagcggatagcggggcaggatctccgcgctggtttcaaagcaggtgga agagactgcggtacctcggtaccccaagggctgttgaaggcagcgaggaagagcggccag ttaaacctgtcgggtagaaacctcagtgaagtgccgcagtgtgtctggagaataaatgtg gatatccctgaggaagctaatcagaatctttcgtttggtgctactgaaagatggtgggag cagacagatttgaccaaactaataatatcaaacaataaacttcagtcacttacagatgac ctgcgactcttgcctgcactgactgttcttgatatacatgataatcagttgacatccctt ccttctgctataagagagctagaaaatcttcagaaacttaatgtcagccataataaactg aaaatactccctgaagaaattacaaacctaagaaacctgaagtgcctgtatctccagcat aatgaattaacctgcatatcagagggatttgaacaactttccaatttagaagatttagaa ttgcacgtaggtgaaaaccagattgaaatgttagaggcagaacatcttaaacatctgaat tcaattcttgtgctagacctgagggataacaagttaaaatctgttccagatgaaattata ctactacggtccttggaaaggcttgacctaagcaacaatgatattagtagtcttccctat tcattggggaaccttcatttgaaatttttggcattagaaggaaatcctttgagaacaatt cgaagagaaattataagtagaactgtgatcaaagcaccagcagatttggtgtctggtaag gacctgtttctcatagatggcaccttctccatgctttcagatggcaagaggaggttaatg agcccccaagacttcttttataagggcactattcccgttcatgagggttctgccctcaat atctag >gi568815597r:70162720_70454408|GENSCAN_predicted_peptide_2|423_aa MSNTTVVPSTAGPGPSGGPGGGGGGGGGGGGTEVIQVTNVSPSASSEQMRTLFGFLGKID ELRLFPPDDSPLPVSSRVCFVKFHDPDSAVVAQHLTNTVFVDRALIVVPYAEGVIPDEAK ALSLLAPANAVAGLLPGGGLLPTPNPLTQIGAVPLAALGAPTLDPALAALGLPGANLNSQ SLAADQLLKLMSTVDPKLNHVAAGLVSPSLKSDTSSKEIEEAMKRVREAQSLISAAIEPG GQEADRDGGHILSLGVGDDPKAQGGEDLIPEKEVEGQGAHQKQDKKKEDKEKKRSKTPPK SYSTARRSRSASRHKKEKKKDKDKERSRDERERSTSKKKKSKDKEKDRERKSESDKDVKV TRDYDEEEQGYDSEKEKKEEKKPIETGSPKTKECSVEKGTGDSLRESKVNGDDHHEEDMD MSD >gi568815597r:70162720_70454408|GENSCAN_predicted_CDS_2|1272_bp atgagcaacactaccgtcgtccccagcactgcaggtccgggccccagcggcgggcccggt ggcggaggtggtggtggcggcggaggcggcggcaccgaggtaatccaggtgactaatgtc tccccgagcgctagctctgagcagatgcggactctcttcggtttcctaggcaagatcgac gaactgcgcctcttcccgccggatgattcgcctttgccagtctcatctcgtgtctgcttt gttaagttccatgatccagactcagcagttgtggcacagcatctgacaaacactgtattc gttgacagagctttgatagtcgtaccatatgcagaaggagttattcctgatgaagctaaa gctttgtctctgttggcaccagctaatgcagtggcaggtcttctgcctggtggtggactc ctgcctactcctaacccacttacccagattggcgctgttccactggctgctttgggggct cctactcttgatcctgcccttgctgcacttgggcttcctggagcaaacttgaactctcag tctcttgctgcagatcagttgctgaagcttatgagtactgttgatcccaagttgaatcat gtagctgctggtctcgtttcaccaagtctgaaatcggatacctctagtaaagaaatagag gaagctatgaaaagagtacgagaagcacagtccctaatttctgctgctatagaaccaggc ggtcaagaagcagatcgagacggcggtcacattctaagtctaggagtcggcgacgatcca aaagcccaaggcggagaagatctcattccagagaaagaggtagaaggtcaaggagcacat caaaaacaagacaaaaagaaagaagacaaagaaaagaaacgttctaaaacaccaccaaaa agttacagcacagccagacgttctagaagtgcaagcagacataaaaaggagaagaagaaa gataaagacaaagaaagaagtagggatgaaagagaacgatcaacaagcaagaagaagaag agtaaagataaggaaaaggaccgggaaagaaaatcagagagtgataaagatgtaaaagtt acacgggattatgatgaagaggaacaggggtatgacagtgagaaagagaaaaaagaagag aagaaaccaatagaaacaggttcccctaaaacaaaggaatgttctgtggaaaagggaact ggtgattcactaagagaatccaaagtgaatggggatgatcatcatgaagaagacatggat atgagtgactga >gi568815597r:70162720_70454408|GENSCAN_predicted_peptide_3|409_aa MTGEKIRSLRRDHKPSKEEGDLLEPGDEEAAAALGGTFTRSRIGKGGKACHKIFSNHHHR LQLKAAPASSNPPGAPALPLHNSSVTANSQSPALLAGTNPVAVVADGGSCPAHYPVHECV FKGDVRRLSSLIRTHNIGQKDNHGNTPLHLAVMLGNKECAHLLLAHNAPVKVKNAQGWSP LAEAISYGDRQMTLLRKLKQQSRESVEEKRPRLLKALKEERVGNFLADFYLVNGLVLESR KRREHLSEEDILRNKAIMESLSKGGNIMEQNFEPIRRQSLTPPPQNTITWEEYISAENGK APHLGRELVCKESKKTFKATIAMSQEFPLGIELLLNVLEVVAPFKHFNKLREFVQMKLPP GFPVKLDIPVFPTITATVTFQEFRYDEFDGSIFTIPDDYKEDPSRFPDL >gi568815597r:70162720_70454408|GENSCAN_predicted_CDS_3|1230_bp atgaccggggagaagatccgctcactgcggagggaccacaagcccagcaaagaagaaggg gacctgctggagcccggggatgaggaagcggcggctgccctcggcggtacctttaccaga agcaggattggcaagggcggcaaagcttgtcataagatcttcagtaaccatcaccaccgg ctacagctgaaggcagctccggcctcctccaatccccccggcgccccggctctgccgctg cacaattcctccgtgactgccaactcccagtccccggcccttctggccggcaccaacccc gttgctgtcgtcgcggatggaggcagttgccccgcacactacccggtgcacgagtgcgtc ttcaagggggatgtgaggagactctcctctctcatccgcacgcacaatatcgggcagaaa gataatcacggaaatactcctttacaccttgctgtgatgttaggaaataaagaatgtgcc catttacttttggctcacaatgctccagtcaaggtgaaaaatgctcagggatggagccct ctggcggaagccatcagctatggagataggcagatgactcttttgaggaagcttaagcag caatccagggaaagtgttgaagaaaaacgacctcgattattaaaagccctgaaagaggaa agagtaggaaactttttggcagacttttacctggtgaatggacttgttttagaatcaagg aaaagaagagaacatctcagtgaagaggatattcttcgaaataaggccatcatggagagt ttgagtaaaggtggaaacataatggaacagaattttgagccgattcgaagacagtctctt acacctcctcctcagaacactattacatgggaagaatatatatctgctgaaaatggaaaa gctcctcatctgggtagagaattggtgtgcaaagagagtaagaaaacgtttaaagctacg atagccatgagccaggaatttcccttagggatagagttattattgaatgttttagaagta gtagctcccttcaagcactttaacaagcttagagaatttgttcagatgaagcttcctcca ggctttcctgtaaaattagatatacctgtgtttcccacaatcacagccactgtgactttt caggagtttcgatacgatgaatttgatggctccatctttactatacctgatgactacaag gaagacccaagccgttttcctgatctttaa >gi568815597r:70162720_70454408|GENSCAN_predicted_peptide_4|343_aa MHDIWCRDSDQGTSLGRSIRCPPALCSMRKIHLRPLVLRPTSPKNISPILNWTALPQVTP RHAESGSDSSSASAPPPYNPSITSPPHTRSSLQFHSVTSPPPPAQQFPLKEVAGAKGIVK PCPICVGPHWKLDCPTPLAATPRAPGTLAQGSLTDSFPDLLGLAAETDAAQSPRKPPGPS QMLLVTLTVEAHFKRIKARYHSPVTAWPFKAYKLSLQFPHFTCPKTGQVLQVSSGSVPYQ PNCFAYPPHGAKPIYSPILNTSLHNPSTTHNSVLDLKHAFFTIPLHPSSQPLFAFTWTDP DTHQPQQLTWAVLPQGFTDSPHYFSRAQISSSSIHYLSWHNSS >gi568815597r:70162720_70454408|GENSCAN_predicted_CDS_4|1032_bp atgcatgacatttggtgccgtgactcagatcaggggacctcccttgggagatcaatccgc tgtcctcctgctctttgctctatgagaaagatccacctacgacctctggtactcagacca accagcccaaagaacatctcaccaattttaaattggactgctcttcctcaggtcactccc cgccacgctgaatcaggctctgattcttcctcagcctctgctcccccaccctataatcct tctatcacctcccctcctcacaccaggtccagcttacagtttcattctgtgactagccct cccccacctgcccagcaatttcctcttaaagaggtggctggagctaaaggcatagtcaag ccatgtcccatctgtgtgggaccccattggaaattggactgtccaactcccctggcagcc actcccagagcccctggaaccctggcccaaggctctctgactgactccttcccagatctt cttggcttagcagctgaaactgatgctgcccaatcacctcggaagcctcctggaccatca cagatgcttttggtaactcttacagtggaggcacactttaaaaggattaaagcccgttat cactcacctgttacagcatggccttttaaagcctataaactctccttacaattcccccat tttacctgtccaaaaactggacaagtcttacaagttagttcaggatctgtgccttaccaa ccaaattgttttgcctatccaccccatggtgccaaacccatatactctcctattctcaat acctccctccacaacccctctacaacccataattctgttctggatctcaaacatgctttc tttactattcctttgcacccttcatcccagcctctgtttgctttcacttggactgaccct gacacccatcagcctcagcaacttacctgggctgtactgccacagggcttcacggacagc ccccattacttcagtcgagcccaaatttcttcctcatccatccattacctatcttggcat aattcttcatga >gi568815597r:70162720_70454408|GENSCAN_predicted_peptide_5|550_aa MIIHCGEGVGEEDMTKMLEKSSFRHRKHWVITEFSVPKTQAVTDENPSCKLGGLGNCFLN HPPRCGLRLLAPPSGQRMVVRGDPCGPGSRAMDLQGCSAPHLTVHHVQARGAWPALGELG LSGAVHSWAVKEIRLMNLKGMVQLQVMGFEYSRSGNPTRNCLEKAVAALDGAKYCLAFAS GLAATVTITHLLKAGDQIICMDDVYGGTNRYFRQVASEFGLKISFVDCSKIKLLEAAITP ETKLVWIETPTNPTQKVIDIEGCAHIVHKHGDIILVVDNTFMSPYFQRPLALGADISMYS ATKYMNGHSDVVMGLVSVNCESLHNRLRFLQNSLGAVPSPIDCYLCNRGLKTLHVRMEKH FKNGMAVAQFLESNPWVEKVIYPGLPSHPQHELVKRQCTGCTGMVTFYIKGTLQHAEIFL KNLKLFTLAESLGGFESLAELPFFKASEDIGTRSSLRSYSPAAYFTGKETKIQKVELIDS SVTQERVRSFTLGFASETVRGGKRAIMTHASVLKNDRDVLGISDTLIRLSVGLEDEEDLL EDLDQALKAA >gi568815597r:70162720_70454408|GENSCAN_predicted_CDS_5|1650_bp atgattatacactgtggggaaggagttggagaagaagacatgacaaaaatgttggagaag tcatcgtttcgtcatcgtaagcactgggtcattacagagttcagtgtacctaaaacgcaa gctgtgactgatgagaatccaagctgcaaactgggcggcttgggcaactgttttcttaac caccctccacgttgtggcttacgtcttttagcaccgccctcagggcaaaggatggtggtc agaggcgatccatgtgggccaggatccagagcaatggacctccagggctgtagtgccccc catctcactgtccaccacgttcaagcaaggggcgcctggccagcactcggtgagctgggt ctgtctggggctgtccacagttgggcggtaaaagagatacggcttatgaatttgaaaggc atggtacagctgcaggtcatgggttttgaatatagccgttctggaaatcccactaggaat tgccttgaaaaagcagtggcagcactggatggggctaagtactgtttggcctttgcttca ggtttagcagccactgtaactattacccatcttttaaaagcaggagaccaaattatttgt atggatgatgtgtatggaggtacaaacaggtacttcaggcaagtggcatctgaatttgga ttaaagatttcttttgttgattgttccaaaatcaaattactagaggcagcaattacacca gaaaccaagcttgtttggatcgaaacccccacaaaccccacccagaaggtgattgacatt gaaggctgtgcacatattgtccataagcatggagacattattttggtcgtggataacact tttatgtcaccatatttccagcgccctttggctctgggagctgatatttctatgtattct gcaacaaaatacatgaatggccacagtgatgttgtaatgggcctggtgtctgttaattgt gaaagccttcataatagacttcgtttcttgcaaaactctcttggagcagttccatctcct attgattgttacctctgcaatcgaggtctgaagactctacatgtccgaatggaaaagcat ttcaaaaacggaatggcagttgcccagttcctggaatctaatccttgggtagaaaaggtt atttatcctgggctgccctctcatccacagcatgagttggtgaagcgtcagtgtacaggt tgtacagggatggtcaccttttatattaagggcactcttcagcatgctgagattttcctc aagaacctaaagctatttactctggccgagagcttgggaggattcgaaagccttgctgag cttccttttttcaaagcatcagaggatattggtacaagaagcagtctaagatcatacagt ccagcagcttattttacaggcaaggaaacaaagatccagaaagttgagctgattgactca agtgtcacacaggaacgtgttcgcagcttcactcttggatttgcatcagaaacagtgagg ggaggtaaaagggcaatcatgactcatgcatcagttcttaagaatgacagagatgtcctt ggaattagtgacacactgattcgactttctgtgggcttagaggatgaggaagacctactg gaagatctagatcaagctttgaaggcagca