GENSCAN 1.0 Date run: 6-Nov-116 Time: 04:53:56

Sequence gi568815587f:107828569_108062369 : 233801 bp : 42.97% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   7188   7322  135  2  0   58   97   131 0.494  10.84
 1.02 Term +  12029  12193  165  1  0   47   38   113 0.665  -0.87
 1.03 PlyA +  13024  13029    6                               1.05

 2.04 PlyA -  13800  13795    6                               1.05
 2.03 Term -  20736  20635  102  0  0   69   39    66 0.141  -2.90
 2.02 Intr -  30386  29727  660  2  0   74   94   367 0.400  26.97
 2.01 Init -  49567  49487   81  1  0   59   55    84 0.072   1.22
 2.00 Prom -  78162  78123   40                              -2.05

 3.00 Prom +  79292  79331   40                              -8.55
 3.01 Init +  79852  79981  130  0  1   84   44   240 0.546  19.73
 3.02 Term +  80084  80196  113  0  2   71   32   193 0.940   9.74
 3.03 PlyA +  80214  80219    6                               1.05

 4.00 Prom +  80583  80622   40                              -9.55
 4.01 Init +  82364  82374   11  1  2   68   77    15 0.111  -2.14
 4.02 Term +  88308  88617  310  0  1   43   55   300 0.722  15.75
 4.03 PlyA +  89691  89696    6                               1.05

 5.08 PlyA -  89716  89711    6                               1.05
 5.07 Term -  95260  95036  225  1  0  -13   39   407 0.984  21.50
 5.06 Intr - 106614 106537   78  0  0   88   61    73 0.002   3.43
 5.05 Intr - 115089 115071   19  2  1   81   94   -25 0.000  -6.90
 5.04 Intr - 125602 125300  303  0  0    1   92   162 0.142   2.68
 5.03 Intr - 126109 125868  242  1  2   61   51   178 0.034   6.83
 5.02 Intr - 145230 145042  189  0  0   38   95   127 0.106   7.26
 5.01 Init - 146165 146025  141  2  0   66   67    80 0.118   3.88
 5.00 Prom - 146947 146908   40                             -11.24

 6.00 Prom + 148214 148253   40                              -4.45
 6.01 Sngl + 150906 151499  594  2  0   58   49   222 0.817  11.24
 6.02 PlyA + 151747 151752    6                              -1.75

 7.02 PlyA - 152493 152488    6                               1.05
 7.01 Sngl - 154343 153915  429  2  0   53   48   331 0.972  21.63
 7.00 Prom - 154387 154348   40                             -18.07

 8.00 Prom + 154409 154448   40                              -8.65
 8.01 Init + 155121 155259  139  2  1   85  108   112 0.600  13.25
 8.02 Intr + 155831 156086  256  0  1   61   68   260 0.983  16.88
 8.03 Term + 156120 156600  481  0  1  -30   42   274 0.976   4.28
 8.04 PlyA + 160502 160507    6                               1.05

 9.04 PlyA - 160684 160679    6                               1.05
 9.03 Term - 161364 161243  122  1  2   39   47   187 0.941   7.36
 9.02 Intr - 164140 164077   64  1  1   79   67    62 0.165   0.57
 9.01 Init - 171858 171658  201  0  0   56   37   199 0.862  10.52
 9.00 Prom - 176522 176483   40                              -4.05

10.00 Prom + 178418 178457   40                              -5.95
10.01 Init + 178927 179004   78  0  0   64   89    68 0.987   5.61
10.02 Intr + 180410 180576  167  1  2   78   17   186 0.528   8.34
10.03 Intr + 180823 180962  140  1  2   28   25   141 0.054   0.99
10.04 Term + 191700 191836  137  1  2  -17   44   219 0.123   4.20
10.05 PlyA + 191881 191886    6                               1.05

11.00 Prom + 203126 203165   40                              -4.15
11.01 Init + 203654 203707   54  1  0   45  109    10 0.861   0.14
11.02 Intr + 205234 205343  110  0  2   91   88    89 0.870   7.36
11.03 Intr + 217702 217801  100  1  1   72   92    56 0.776   3.59
11.04 Intr + 221322 221498  177  2  0   76  116    95 0.997  10.29
11.05 Intr + 224092 224233  142  0  1   82   80    53 0.971   2.91
11.06 Intr + 226079 226224  146  0  2   96   92    30 0.981   3.28
11.07 Intr + 226307 226387   81  1  0   88   92   114 0.882  10.72

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  96850  96836   15  1  0   80   58     5 0.886  -3.03
S.002 Init + 100001 100227  227  1  2   81   91   189 0.996  16.49
S.003 Term + 106528 106625   98  2  2   98   38    76 0.812   0.75
S.004 Term + 117368 117508  141  1  0   86   43   128 0.924   5.05
S.005 Term + 133378 133804  427  1  1  104   38   261 0.975  16.49

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587f:107828569_108062369|GENSCAN_predicted_peptide_1|99_aa
ILTLCQAMGAEKRSPCPHGIHNTAVKKNSKATIPNVVRVNKEMYKMKIINWHAAGRMKPT
ANAMHKGLHRILLWFGLLAPAASLVSLWFGSVTLHDSVK

>gi568815587f:107828569_108062369|GENSCAN_predicted_CDS_1|300_bp
atcctaacactgtgtcaagctatgggagcagagaaacggagcccctgtcctcatggaatt
cacaatacagcagtgaagaagaatagtaaagcaacaattccaaatgtggtgagggtgaac
aaagagatgtacaagatgaagataatcaactggcatgcagctggaagaatgaagccaaca
gcaaacgcaatgcacaaaggcttacacagaatcctgctgtggtttggactcctggctcct
gctgcttctctggtctctctgtggtttggcagcgtaaccctccatgattctgtaaagtaa

>gi568815587f:107828569_108062369|GENSCAN_predicted_peptide_2|280_aa
MRLRGQAPWLTPVIPALWEARAGRSPEGLTVNDWSLKEPSGGARGTRASERGRERRARAE
PQPAAQAPPPVFRDPALRGFWAPETRKGADGGRLASGPRRPRAPRGGSGGRVLQPAAQDK
RQTLHLVSGRGVLQLRWGVGAAAGPARTWTRSTLSEAPSREAEVAAARTSVRGSTRTPGM
VRFTWRREALGASPSAPVSKSWRCILEDLGGNPPCGTIRSVLWGVPRWRDEVPGDAVKEE
CPPRAGQTVVIASIHASIYPRPQSSRYTEVRVMPNKTKEY

>gi568815587f:107828569_108062369|GENSCAN_predicted_CDS_2|843_bp
atgaggctccgaggccaggcgccgtggctcacaccggtaatcccagcactctgggaggcc
agagcaggcagatcacctgagggcttgacagttaatgattggtcgctgaaggagccgagc
ggaggagcgagagggacgcgagcaagcgagcgagggagggagcgcagggccagagcggag
ccccagccggccgcacaggcaccgcctcccgtcttccgggacccggcacttcgggggttt
tgggcgccggagacgcgcaagggcgccgatggaggcagactcgccagcgggccccggcgc
cccagagcccctcgcggagggagcggcggccgagttctccagcctgctgcgcaggataaa
aggcaaactcttcacctggtgagtggaaggggcgtgctgcagctccgctggggcgtgggg
gcggctgcgggccctgcgcgcacgtggacaaggagcacactctctgaggcccccagcagg
gaggcagaggtggcggccgcacggacttcggttcggggctcaacacgcacaccaggtatg
gtgcgcttcacctggcgtcgggaagcacttggtgcgtctccctcggctcccgtttcgaaa
tcgtggcgatgtattttggaggatttggggggaaacccaccctgtggcactattaggtcg
gtcctctggggcgttccgcggtggagggacgaggtccccggggatgctgtgaaggaggag
tgcccacctcgggctgggcagactgttgtaatagcttccatccatgcctccatctatccc
cgaccccaatcatctcgctacacagaagtcagagtgatgcctaataaaacgaaagagtat
taa

>gi568815587f:107828569_108062369|GENSCAN_predicted_peptide_3|80_aa
MRYVASYLLAALGGNSSPSAKDIKKILDSVGIEEDYDRLNKVISSAAPAAGSAPAAAEEK
KDEKEESEESDDDMGFGLFD

>gi568815587f:107828569_108062369|GENSCAN_predicted_CDS_3|243_bp
atgcgctacgtcgcctcctacctgctggctgccctcgggggtaactcctcccccagcgcc
aaggacatcaagaagatcttggacagcgtgggcatcgaggaggattacgaccggctcaac
aaggttatcagctctgcagcccctgctgctggttccgcccctgctgcagcagaggagaag
aaagatgagaaggaggagtctgaggagtcggacgatgacatgggatttggcctctttgat
taa

>gi568815587f:107828569_108062369|GENSCAN_predicted_peptide_4|106_aa
MLASPHVIRFFQYTRARSRDTESPCDKTESLIELLNTSHLQRAKLQEHTVTQAHWGFESC
KHLTLDAAVGSEPKNTPQYLHPLHAPPRGLSSRGLKKRATSLLHAL

>gi568815587f:107828569_108062369|GENSCAN_predicted_CDS_4|321_bp
atgttggccagcccacatgtgatccgatttttccagtacactagggcaagaagccgggat
acagaaagcccttgtgataagacagagagtctaattgagctgcttaacacaagccacctg
cagagagcaaaactgcaagagcacactgtaacacaggcccactggggcttcgagagctgt
aaacacttaaccctagatgctgctgtggggtcggagcccaaaaacactccccagtacctg
catcctctacatgctccccctagaggtttgagcagccgaggactgaagaagcgagccaca
tccctgttgcatgccctgtga

>gi568815587f:107828569_108062369|GENSCAN_predicted_peptide_5|398_aa
MLDEGEGNPEWIVEEKMMSVSCGLETSCRRSFLYFVPLTFLFFLSFPESSTQESTSCPLQ
INYIGPFPSWKGQRSVLSEIDTYSGCGFTFALRNCFGSSTILELAECLIQRKKIQVCLTN
GSKQYAGISKPRSMENLKVSGKVPFVESLVLHLTVYFFQRKSWRDVGKDPHQVMSRREGF
GMLGKKQDRGCSYRHPEYCTLLWGTIIEGTQICPHCNIFNLDLGHASTTIHEHADCLIHS
HGILHIATDQVIHLTVVEIGKWNDAMGFIDHITPLSSRGVWPYKTVKWVIKSAWHIFKTC
PEFFLVQDPRTLSWGLDRDPFPVTKKKKRRRGRRGSGSRRRRRRRKKKKKKKRRRKRRKR
RKKKKRKKKKKKKEEEEEEEEEEEEDEGKGKEKEVISQ

>gi568815587f:107828569_108062369|GENSCAN_predicted_CDS_5|1197_bp
atgctagatgagggtgaggggaatccagaatggattgtagaggagaagatgatgagtgtc
agttgtggccttgagactagttgccgaagaagcttcttgtattttgttccactaaccttc
ctcttcttcttaagtttccctgaatcctccacacaggaatcaaccagctgtccattgcag
attaattacattggaccttttccatcatggaaggggcagagatctgttcttagtgaaata
gatacatattctggatgtggttttacctttgctcttcgcaattgttttggcagcagtacc
atccttgaacttgcagaatgccttatccagaggaaaaaaattcaagtctgtcttacaaat
ggctctaaacaatatgctggcatcagtaagccacgctcaatggagaacttgaaggtcagt
ggaaaagttccatttgtggaatctctagtgctacatctcactgtttactttttccagagg
aagagttggagagacgtaggtaaagatccacaccaagtcatgagcagacgcgaagggttt
ggcatgctgggtaaaaaacaagatcggggctgtagctaccgacatcctgaatattgtacc
ttactctgggggaccatcattgaggggacacagatatgtcctcattgtaatatatttaat
ctggatttaggccatgctagtaccaccatccatgaacatgcagactgccttatacatagt
catggtatcctacacattgccacagatcaagtaattcacttaacagtggtggagataggg
aaatggaatgatgccatgggattcattgatcatatcacaccactatcatctagaggcgtc
tggccctataaaactgtgaaatgggttattaagagtgcctggcatatatttaagacttgc
cctgaattctttcttgtgcaagatccaagaaccctctcttggggtctggatcgggacccc
tttcctgtaacaaagaagaaaaaaagaagaagaggaagaagaggaagtggaagtagaaga
agaagaagaagaagaaagaagaagaagaagaagaagaggaggaggaagaggaggaagagg
aggaagaagaagaagaggaagaagaagaagaagaagaaggaggaggaggaggaggaggag
gaagaggaggaggaggacgaggggaaggggaaggagaaggaagttataagccaatag

>gi568815587f:107828569_108062369|GENSCAN_predicted_peptide_6|197_aa
MPDLPSFNVEEGIQRLREIVMVEWISHLKPTHLSWEGPEDIPLTRPSQNIFVRAAPASLK
SPVIALLYMSDLMVRTAVTQLQNLNTMGIIGSQGGRDRVAPLNHQRQSGHGYHNGQQRES
GNQNSLTGVELWHWLINHVVSRSEVDRKPTTFLLNLYKQKTSRSNGRKTSLNYKNRESRP
LNQFPDLSQFTDPEPLE

>gi568815587f:107828569_108062369|GENSCAN_predicted_CDS_6|594_bp
atgcctgatctcccttcgtttaatgtagaggaagggatccaaaggcttagggagattgtg
atggtggaatggattagtcacttgaaacctactcatctcagctgggagggtccagaagat
atacccttgaccaggccctcgcaaaatatatttgtgagggcagcacctgcatctttgaag
agccctgtaattgctcttctctatatgtcagatctaatggtgagaactgcagtcactcaa
ctacaaaatttaaatacaatgggaataattggatcccaagggggcagggaccgagtggca
ccactcaaccatcaaaggcaaagtgggcatggctaccataatggacagcagagggaaagt
ggcaatcagaatagtctgactggtgtagagctctggcactggctaattaatcatgttgtt
tctagaagtgaagttgataggaagcctactaccttcctacttaatttatacaagcagaaa
acttctaggtcgaatggacgaaagactagtttgaattataaaaacagagaatcacggccc
ctcaatcaattcccagacttgagccagtttacagacccagaaccccttgaatga

>gi568815587f:107828569_108062369|GENSCAN_predicted_peptide_7|142_aa
MAYLIKEIDVVAKGLPHCLQVVAAVAVLVSEAVKMIQGRDLTVWTSHDVNGILTAKGDLW
LSDNCLLKYQALLLEGPVMWLHTCATLNPATFLPDNEKKIEHNCQQVIAQTYATRGDLLE
VPLTDPDLNLYTDESSFVEKGI

>gi568815587f:107828569_108062369|GENSCAN_predicted_CDS_7|429_bp
atggcatacctgattaaggaaattgatgtagtggcaaagggtttgcctcactgtttacag
gtagtggcggcagtagcagtcttagtatctgaagcagttaaaatgatacagggaagagat
cttactgtgtggacatctcatgatgtcaacggcatactcactgctaaaggagacttgtgg
ctgtcagacaactgtttacttaaatatcaggctctattacttgaagggccagtgatgtgg
ctgcacacttgtgcaactcttaacccagccacatttcttccagacaatgaaaaaaagata
gaacataactgtcaacaggtgattgctcaaacctatgccactcgaggggaccttctagag
gttcccttgactgatcctgacctcaacttgtatactgatgaaagttcctttgttgaaaaa
ggaatttga

>gi568815587f:107828569_108062369|GENSCAN_predicted_peptide_8|291_aa
MRPELPIINWVLSGPSSHKVGYAQQHSIIKWKWYIHDQFRAGPEGTTTPVIAQWAHEQSG
HGGRHGGYAWTQQHGLPLTKADLAMATAECPICQQQKPTLSPRYGTIPQGDQPATWWQVD
YIGPLASRKGQRYGFAYLARNVSAKTTIHGLMECLIHHHGIPHIIASDQGTHFVAKEVRQ
WAHAHGIHWFHHVAYHPEAGGLTEWWNGLLKSQLQRQLGDNTLQDWGKVPQKAMYALIQC
PIYGIVSPIARIHRSRNQGVKVEMAPFTITPSDPLAKCLLPVLVTLRSAGL

>gi568815587f:107828569_108062369|GENSCAN_predicted_CDS_8|876_bp
atgcgacctgaactgcctatcataaactgggtgctttctggcccatctagccataaagtg
ggttatgcacagcagcattccatcatcaaatggaagtggtacatacatgatcagtttcga
gcaggtcctgaaggcacaaccacccctgtcattgcccaatgggcccatgaacaaagtggc
catggtggcaggcatggaggttatgcatggactcagcaacatggacttccactcaccaag
gctgacctggctatggccactgctgagtgcccaatttgccagcagcagaaaccaacactg
agccctcgatatggcaccattcctcagggtgatcagccagctacctggtggcaggttgat
tatattggacctcttgcatcacggaaagggcagagatatgggtttgcctatcttgcacgc
aatgtttctgccaagactaccatacatggactcatggaatgccttatccaccatcatggt
attccacacatcattgcctctgaccaaggcactcactttgtggctaaagaagtgcggcag
tgggctcacgctcatggaattcactggtttcaccatgttgcctatcatcctgaagcaggt
ggattgacagaatggtggaatggccttttgaagtcacaattacaacgccaactaggtgac
aatactttgcaggactggggcaaagttccccagaaggccatgtatgctctgattcagtgt
ccaatatatggtattgtttctcccatagccaggattcacaggtccaggaatcaaggggtg
aaagtggaaatggcaccattcaccatcacccctagcgatccactagcaaaatgtttgctt
cctgttcttgtgacattacgttctgctggcctatag

>gi568815587f:107828569_108062369|GENSCAN_predicted_peptide_9|128_aa
MVSENKIKENSDQNAQGMLLLSCEVDPSLLNLHFTFPVAASDPGSSVPLSTTVDSLPQCT
LTLHMGQAMDQYQSTAQGLGTPGLRDMAEEGIVILRGDSSICVIVPEDLPVGPDVEVEDS
DIDDPDPV

>gi568815587f:107828569_108062369|GENSCAN_predicted_CDS_9|387_bp
atggtttcagagaataaaatcaaagaaaacagcgaccaaaatgctcagggcatgcttctc
ttgtcctgtgaggtggacccttcacttctgaaccttcacttcacctttcctgtggctgcc
tccgaccctggttcctctgtgcccctgtctacaactgtggactcgctcccacaatgtacc
ctgactcttcacatggggcaggccatggaccagtaccagtccacggcccaggggttgggc
acccctggactaagggacatggcagaagaaggcattgtcatcctaagaggtgatagctct
atatgtgttattgtccctgaagaccttcctgtgggaccggatgtggaggtggaagacagt
gatattgatgatcctgatcctgtgtag

>gi568815587f:107828569_108062369|GENSCAN_predicted_peptide_10|173_aa
MADNRWGQWNGELIEVRVVAVVFPEQVAAAGATTKANPKESGRLVESMLPLPSQVGSRYL
LSIRRSGLPERVHEVFRVGKLRLGFTVWPPGSALWERHGSGSAAKCGSVQGLSTGREAAV
AAGEYRNAGGIPEEGVVTIGDDSPIQVSVPEDFPVGQDVKVEDSDDDPDPVEA

>gi568815587f:107828569_108062369|GENSCAN_predicted_CDS_10|522_bp
atggcagataacaggtggggtcaatggaatggagaacttattgaagtcagagttgtagca
gtggtgtttccagaacaagtggccgcggcgggtgcaaccacaaaggctaatccgaaggag
tcggggaggctcgtggagtcgatgcttcctcttccaagtcaggtcggctcccgttacctt
ctcagcattcgccgttccggtcttcctgagcgcgtgcatgaggtctttcgcgtggggaag
ctccgcttgggttttactgtgtggccgccgggttcggctctttgggaaaggcacggctca
ggttcagctgcgaagtgtgggagtgttcaggggttgtccactggcagggaagccgcggtg
gcagccggcgagtaccggaacgcgggaggcattccagaagaaggcgttgttaccatagga
gatgacagccccattcaggttagcgttcccgaagactttccagtgggacaagatgtgaag
gttgaagacagtgatgatgatcctgaccctgtggaggcctag

>gi568815587f:107828569_108062369|GENSCAN_predicted_peptide_11|270_aa
MVLVQWLTAITSAFWETQNKGSLQFEDKWDFMRPIVLKLLRQESVTKQQWFDLFSDVHAV
CLWDDKGPAKIHQALKEDILEFIKQAQARVLSHQDDTALLKAYIVEWRKFFTQCDILPKP
FCQLEITLMGKQGSNKKSNVEDSIVRKLMLDTWNESIFSNIKNRLQDSAMKLVHAERLGE
AFDSQLVIGVRESYVNLCSNPEDKLQIYRDNFEKAYLDSTERFYRTQAPSYLQQNGVQNY
MKYADAKLKEEEKRALRYLETRRECNSVEA

>gi568815587f:107828569_108062369|GENSCAN_predicted_CDS_11|810_bp
atggtcctagttcagtggcttactgctataacctcagcattttgggagactcagaataaa
ggttctcttcagtttgaagacaaatgggattttatgcgcccgattgttttgaagctttta
cgccaggaatctgttacaaaacagcagtggtttgatctgttttcggatgtgcatgcagtc
tgtctttgggatgataaaggcccagcaaaaattcatcaggctttaaaagaagatattctt
gagtttattaagcaagcacaggcacgagtactgagccatcaagatgatacggctttgcta
aaagcatatattgttgaatggcgaaagttctttacacaatgtgatattttaccaaaacct
ttttgtcaactagagattactttaatgggtaaacagggcagcaataaaaaatcaaatgtg
gaagacagtattgttcgaaagcttatgcttgatacatggaatgagtcaatcttttcaaac
ataaaaaacagactccaagatagtgcaatgaagctggtacatgctgagagattgggagaa
gcttttgattctcagctggttattggagtaagagaatcctatgttaacctttgttctaat
cctgaggataaacttcaaatttatagggacaattttgagaaggcatacttggattcaaca
gagagattttatagaacacaagcaccctcgtatttacaacaaaatggtgtacagaattat
atgaaatatgcagatgctaaattaaaagaagaagaaaaacgagcactacgttatttagaa
acaagacgagaatgtaactccgttgaagca