GENSCAN 1.0    Date run: 6-Nov-116    Time: 17:37:09

Sequence gi568815595f:186753059_186954701 : 201643 bp : 44.41% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    283    315   33  0  0  127   72     3 0.601   1.02
 1.02 Intr +    993   1065   73  2  1  105  115    18 0.912   5.18
 1.03 Intr +   8382   8555  174  1  0   40   54   149 0.455   6.61
 1.04 Intr +  11136  11160   25  1  1  116   52    -7 0.407  -4.42
 1.05 Intr +  16182  16245   64  0  1   83   96    29 0.558   1.92
 1.06 Term +  19366  19791  426  0  0   93   44   253 0.946  16.70
 1.07 PlyA +  25982  25987    6                               1.05

 2.00 Prom +  30486  30525   40                               0.74
 2.01 Init +  31396  31419   24  0  0   63   94    37 0.882   1.34
 2.02 Intr +  31506  31638  133  2  1   74   93   125 0.998  11.82
 2.03 Intr +  31904  32043  140  0  2   86   72    88 0.824   7.18
 2.04 Intr +  33444  33587  144  2  0   62   78   103 0.965   7.18
 2.05 Intr +  34069  34206  138  0  0   72   99   135 0.999  13.66
 2.06 Intr +  34437  34526   90  2  0   48  111    59 0.952   4.39
 2.07 Intr +  34745  34824   80  1  2   75   94     6 0.966  -1.75
 2.08 Term +  36067  36211  145  1  1   89   31   122 0.975   3.98
 2.09 PlyA +  36686  36691    6                               1.05

 3.10 PlyA -  36860  36855    6                               1.05
 3.09 Term -  37006  36911   96  1  0   75   39    74 0.959  -0.83
 3.08 Intr -  37197  37084  114  0  0   55   97    96 0.955   7.84
 3.07 Intr -  37348  37268   81  1  0   54   98    76 0.966   5.03
 3.06 Intr -  38792  38667  126  2  0   99   78    62 0.990   7.18
 3.05 Intr -  39552  39432  121  2  1  100   76    89 0.983   9.40
 3.04 Intr -  39889  39746  144  0  0   53   88    85 0.800   4.40
 3.03 Intr -  41723  41600  124  0  1   12   89   104 0.445   2.54
 3.02 Intr -  53926  53690  237  2  0   41  115   139 0.887   9.49
 3.01 Init -  57751  57682   70  1  1   81  105    61 0.912   8.31
 3.00 Prom -  62456  62417   40                              -5.06

 4.00 Prom +  62768  62807   40                              -4.16
 4.01 Init +  64574  64632   59  1  2   51   77    69 0.394   2.88
 4.02 Intr +  71923  72152  230  1  2   72   38   107 0.291   1.61
 4.03 Intr +  73494  73564   71  1  2   71  105   120 0.151  10.90
 4.04 Intr +  73989  74086   98  2  2   48   92    64 0.712   1.61
 4.05 Term +  74159  74504  346  2  1   88   39   167 0.889   5.77
 4.06 PlyA +  76207  76212    6                               1.05

 5.00 Prom +  97597  97636   40                              -5.96
 5.01 Init +  99980 100214  235  1  1   46  110   203 0.749  14.60
 5.02 Term + 101126 101646  521  0  2   80   43   645 0.999  53.66
 5.03 PlyA + 103798 103803    6                               1.05

 6.04 PlyA - 103824 103819    6                               1.05
 6.03 Term - 121609 121515   95  2  2   86   49    54 0.273  -0.71
 6.02 Intr - 136969 136585  385  1  1   33   76   457 0.615  33.42
 6.01 Init - 140921 140841   81  2  0   38   98    22 0.630  -0.83
 6.00 Prom - 145977 145938   40                              -3.96

 7.00 Prom + 146214 146253   40                              -9.46
 7.01 Sngl + 147140 147499  360  1  0  110   41   300 0.989  23.57
 7.02 PlyA + 147512 147517    6                               1.05

 8.04 PlyA - 147557 147552    6                               1.05
 8.03 Term - 153282 153014  269  1  2   44   37   183 0.017   4.46
 8.02 Intr - 162818 162651  168  0  0   72   52    62 0.006   0.92
 8.01 Init - 169569 169260  310  0  1   65   42   351 0.315  25.68
 8.00 Prom - 174186 174147   40                              -2.56

 9.03 PlyA - 178805 178800    6                               1.05
 9.02 Term - 189553 189380  174  1  0  107   48    70 0.497   2.66
 9.01 Init - 198313 198230   84  1  0   86   55    82 0.622   3.52

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:186753059_186954701|GENSCAN_predicted_peptide_1|264_aa
GTAASQRINAQIVFPLSEIHNHEPGCLLLVLTRRHASKNRNEIVVKLLEGGVDPHAEDHC
EATAMHRAAAKGNLKMIPILLYYRTSTNIQDTEGKTQRPSRRKGVPIQTPREDSWISHKK
DFRGLSKAEVFLCARRSHSCQLGWRRVPPALVGGGQSWPGGRVPWGVAQTVCPERPRRER
RLRGEREEEAADKVMARRWRKSRRRKRKEMEERMSPEETEGTNFDFAGEAVGQAERKGPA
FSAAEESFHEEEKRRRKEQSDLTF
>gi568815595f:186753059_186954701|GENSCAN_predicted_CDS_1|795_bp
ggaactgctgcctcccagaggataaatgcgcagatagtctttccattatccgaaattcac
aaccatgaacctggctgtttgttgttggtgctgactcgcaggcatgcttccaaaaacagg
aatgagattgttgtcaagttactagaaggcggggttgatccacatgctgaggaccattgt
gaggctacagcaatgcaccgggcagcagccaagggtaacttgaagatgattcctatcctt
ctatactacagaacatccacgaacatccaagacactgaggggaagactcaaagaccttct
aggaggaagggggtcccgatccagaccccaagagaggattcttggatctcgcacaagaaa
gatttcaggggtctttcaaaagccgaggttttcctgtgcgctcggaggagccatagctgc
cagctgggctggcggcgggtgcccccagcccttgtgggcggcgggcagagctggcctggg
ggccgggtcccgtggggggtcgcgcagacagtgtgtccggagcgcccccggcgggagcgc
aggctgcggggcgagagggaagaggaggcggcggataaggtgatggcgagaagatggagg
aagagcagacgaaggaagagaaaggagatggaagagagaatgtcaccagaggagaccgaa
ggaacaaattttgactttgcaggagaagctgtgggccaggcagaaagaaaaggaccagct
ttttctgcagctgaagaaagtttccatgaggaagaaaaacggaggcgaaaggagcagagc
gacctgaccttctga

>gi568815595f:186753059_186954701|GENSCAN_predicted_peptide_2|297_aa
MDPDGVIESNWNEIVDNFDDMNLKESLLRGIYAYGFEKPSAIQQRAIIPCIKGYDVIAQA
QSGTGKTATFAISILQQLEIEFKETQALVLAPTRELAQQVVLLSATMPTDVLEVTKKFMR
DPIRILVKKEELTLEGIKQFYINVEREEWKLDTLCDLYETLTITQAVIFLNTRRKVDWLT
EKMHARDFTVSALHGDMDQKERDVIMREFRSGSSRVLITTDLLARGIDVQQVSLVINYDL
PTNRENYIHRIGRGGRFGRKGVAINFVTEEDKRILRDIETFYNTTVEEMPMNVADLI
>gi568815595f:186753059_186954701|GENSCAN_predicted_CDS_2|894_bp
atggaccccgatggtgtcatcgagagcaactggaatgagattgttgataactttgatgat
atgaatttaaaggagtctctccttcgtggcatctatgcttacggttttgagaagccttcc
gctattcagcagagagctattattccctgtattaaagggtatgatgtgattgctcaagct
cagtcaggtactggcaagacagccacatttgctatttccatcctgcaacagttggagatt
gagttcaaggagacccaagcactagtattggcccccaccagagaactggctcaacaggtt
gtgttgctttctgccacaatgccaactgatgtgttggaagtgaccaaaaaattcatgaga
gatccaattcgaattctggtgaaaaaggaagaattgacccttgaaggaatcaaacagttt
tatattaatgttgagagagaggaatggaagttggatacactttgtgacttgtacgagaca
ctgaccattacacaggctgttatttttctcaatacgaggcgcaaggtggactggctgact
gagaagatgcatgccagagacttcacagtttctgctctgcatggtgacatggaccagaag
gagagagatgttatcatgagggaattccggtcagggtcaagtcgtgttctgatcactact
gacttgttggctcgcgggattgatgtgcaacaagtgtctttggttataaattatgatcta
cctaccaatcgtgaaaactatattcacagaattggcagagggggtcgatttgggaggaaa
ggtgtggctataaactttgttactgaagaagacaagaggattcttcgtgacattgagact
ttctacaatactacagtggaggagatgcccatgaatgtggctgaccttatttaa

>gi568815595f:186753059_186954701|GENSCAN_predicted_peptide_3|370_aa
MAPGCSASGEGLMLPYDMEEKQKDTIRSKSHVSSRGQGKASSDVSALKAHAFCSATKQDK
EDRDLHLRRPAPTKIAGSGPSVDRQGSRGPRAAAPPSLLSLPDRPELFRLRVLELNASDE
RGIQVVREKVKNFAQLTVSGSRSDGKPCPPFKIVILDEADSMTSAAQAALRRTMEKESKT
TRFCLICNYVSRIIEPLTSRCSKFRFKPLSDKIQQQRLLDIAKKENVKISDEGIAYLVKV
SEGDLRKAITFLQSATRLTGGKEITEKVITDIAGVIPAEKIDGVFAACQSGSFDKLEAVV
KDLIDEGHAATQLVNQLHDVVVENNLSDKQKSIITEKLAEVDKCLADGADEHLQLISLCA
TVMQQLSQNC
>gi568815595f:186753059_186954701|GENSCAN_predicted_CDS_3|1113_bp
atggcgccaggctgttcagcttctggtgagggcctcatgctgccttatgacatggaggag
aagcagaaagacactattagaagtaaaagccacgtgtcctcaagagggcaaggcaaagca
tcttcagatgtctctgcccttaaagcacacgcgttctgctctgcgacgaagcaggacaag
gaggacagggacctgcacctccggaggcccgcacctacgaagatagcgggctcgggacct
tcggtggaccggcagggttccagaggcccgcgcgccgccgccccgccctcattgctgagc
ctgccagataggcctgaacttttccgattaagagttcttgagttaaatgcatctgatgaa
cgtggaatacaagtagttcgagagaaagtgaaaaattttgctcaattaactgtgtcagga
agtcgctcagatgggaagccgtgtccgccttttaagattgtgattctggatgaagcagat
tctatgacctcagctgctcaggcagctttaagacgtaccatggagaaggagtcgaaaacc
acccgattctgtcttatctgtaactatgtcagtcgaataattgaacccctgacctctaga
tgttcaaaattccgcttcaagcctctgtcagataaaattcaacagcagcgattactagac
attgccaagaaggaaaatgtcaaaattagtgatgagggaatagcttatcttgttaaagtg
tcagaaggagacttaagaaaagccattacatttcttcaaagcgctactcgattaacaggt
ggaaaggagatcacagagaaagtgattacagacattgccggggtaataccagctgagaaa
attgatggagtatttgctgcctgtcagagtggctcttttgacaaactagaagctgtggtc
aaggatttaatagatgagggtcatgcagcaactcagctcgtcaatcaactccatgatgtg
gttgtagaaaataacttatctgataaacagaagtctattatcacagaaaaacttgccgaa
gttgacaaatgcctagcagatggtgctgatgaacatttgcaactcatcagcctttgtgca
actgtgatgcagcagttatctcagaattgttaa

>gi568815595f:186753059_186954701|GENSCAN_predicted_peptide_4|267_aa
MVSDANSEGSKNSAGIYSGCPGIIRSTEAMDTQGMVKVVAPSVLPFLPHRAFSKIDGHSS
VLTAPGIRGPQGWMTSLKRNLSGFPAAAPKSRLEDEGCSSREFSSMDTFEGYEGAEDMEK
LDDSHETCSSKSAFLERRESTQGEKEERDEEATATGSTKKERAGASTPADSCGGDGWFPG
EGAAPEDLHPGELPSLRSPRLPGSRAPAFPREAMWGPGPCPATQKAAGAAAAAERGRPGP
PPGSPRGALRARLAPAARRRTSRDGDP
>gi568815595f:186753059_186954701|GENSCAN_predicted_CDS_4|804_bp
atggtgagtgatgcaaacagcgagggctctaagaattcagcagggatctacagtggatgc
cccggcatcatccggagcacagaggccatggacactcaggggatggtcaaggtcgtagct
ccctccgttcttccgttcctccctcacagagctttcagcaagatagatggtcattcttct
gtcctcactgctcccggcataaggggaccccagggctggatgacttccctaaagaggaat
ctctcagggtttcctgcagccgctcccaagtcccgtctggaggatgaaggctgctccagc
agggagtttagctccatggacacctttgaaggctatgaaggtgcagaagacatggaaaag
ttggatgacagtcacgaaacctgttcctccaaatccgctttcctggagaggagagaaagc
acccagggagaaaaagaagaacgcgatgaggaagcaacggcgaccggcagcaccaagaag
gagcgtgctggggcgtccacgccggctgactcctgcgggggcgacggctggtttccaggc
gagggcgcggcgcccgaggacctccaccccggagagctgccctccctgcggtcgccccgg
ctccccggctccagagcgcccgcattcccgagagaggcgatgtgggggcccgggccctgc
ccagctacgcagaaagcagccggggccgcggcggcggcagaaaggggacgcccagggccg
cctcccgggagcccgaggggtgccctgcgtgctcgtctagctcccgccgcccggcgaagg
acctcgcgggacggggacccctag

>gi568815595f:186753059_186954701|GENSCAN_predicted_peptide_5|251_aa
MWIPGLRMLLLGAVLLLLALPGHDQETTTQGPGVLLPLPKGACTGWMAGIPGHPGHNGAP
GRDGRDGTPGEKGEKGDPGLIGPKGDIGETGVPGAEGPRGFPGIQGRKGEPGEGAYVYRS
AFSVGLETYVTIPNMPIRFTKIFYNQQNHYDGSTGKFHCNIPGLYYFAYHITVYMKDVKV
SLFKKDKAMLFTYDQYQENNVDQASGSVLLHLEVGDQVWLQVYGEGERNGLYADNDNDST
FTGFLLYHDTN
>gi568815595f:186753059_186954701|GENSCAN_predicted_CDS_5|756_bp
atgtggattccagggctcaggatgctgttgctgggagctgttctactgctattagctctg
cccggtcatgaccaggaaaccacgactcaagggcccggagtcctgcttcccctgcccaag
ggggcctgcacaggttggatggcgggcatcccagggcatccgggccataatggggcccca
ggccgtgatggcagagatggcacccctggtgagaagggtgagaaaggagatccaggtctt
attggtcctaagggagacatcggtgaaaccggagtacccggggctgaaggtccccgaggc
tttccgggaatccaaggcaggaaaggagaacctggagaaggtgcctatgtataccgctca
gcattcagtgtgggattggagacttacgttactatccccaacatgcccattcgctttacc
aagatcttctacaatcagcaaaaccactatgatggctccactggtaaattccactgcaac
attcctgggctgtactactttgcctaccacatcacagtctatatgaaggatgtgaaggtc
agcctcttcaagaaggacaaggctatgctcttcacctatgatcagtaccaggaaaataat
gtggaccaggcctccggctctgtgctcctgcatctggaggtgggcgaccaagtctggctc
caggtgtatggggaaggagagcgtaatggactctatgctgataatgacaatgactccacc
ttcacaggctttcttctctaccatgacaccaactga

>gi568815595f:186753059_186954701|GENSCAN_predicted_peptide_6|186_aa
MCQGHKTIVGRGLADKHVNKGLCIIDKPGRQSETVSQKKKKKEREKEKRKRKRRNKKKKE
EKEKERKRKRRKKKKKEEKEKEKEKRKKKERERERKRKRKKRKKKKKRKKKKKKKKKKII
PHIFLNLKPPRFVLCRVGSISLLVSDNGDVNEMIIDMATIRFLNLFPTFPPFPFHKTAIV
IMARSQ
>gi568815595f:186753059_186954701|GENSCAN_predicted_CDS_6|561_bp
atgtgtcagggtcacaagacaatagtggggagagggttagcagacaaacacgtgaacaaa
ggtctttgcatcatagacaagcctgggcgacagagcgagactgtgtctcaaaaaaaaaaa
aagaaggagagggagaaggagaagaggaagaggaagaggaggaacaagaagaagaaggaa
gagaaggagaaggagaggaagaggaagaggaggaagaagaagaagaaggaggagaaggaa
aaggagaaggagaagaggaagaagaaggagagggagagggagaggaagaggaagaggaag
aagaggaagaagaagaagaagaggaagaagaagaagaagaagaagaagaagaagattatt
cctcacatcttcctcaacctcaagcctcccagatttgtcctatgcagagtgggcagcatc
tccctcttggtgagtgacaatggagatgtcaatgagatgatcattgacatggcaaccatc
cgatttctcaatcttttccccacctttcccccctttccattccacaaaaccgccattgtc
atcatggcccgttctcaatga

>gi568815595f:186753059_186954701|GENSCAN_predicted_peptide_7|119_aa
MAFKDTGKAPVEPEVAIHRIRITLTSRSVKSLEKVGADLIRGAKAKNLKVKGGVRMPSKT
LRITTRKTPCGEGSKTWDRFQMRIHKRLFDLHSPSEIVKQITSISTEPGVEVEVTIADA
>gi568815595f:186753059_186954701|GENSCAN_predicted_CDS_7|360_bp
atggcttttaaggataccggaaaagcacccgtggagccggaggtggcaattcaccgaatt
cgaatcaccctaacaagccgcagcgtaaaatccttggaaaaggtgggtgctgacttgatc
agaggcgcaaaagcaaagaatctcaaagtgaaaggaggagttcgaatgccttccaagact
ttgagaatcactacaagaaaaactccttgtggtgaaggttctaagacgtgggatcgtttc
cagatgagaattcacaagcgactctttgacttgcacagtccttctgagattgttaagcag
attacttccatcagtactgagccaggagttgaggtggaagtcaccattgcagatgcttaa

>gi568815595f:186753059_186954701|GENSCAN_predicted_peptide_8|248_aa
MCFAKKHNKKSLKKMQANSAEAMSAHAEAIRALVKPKEAKPKIPKDVSHKLNRLAYIPHP
KLGKRARARVAKGLRLCRPKAKAKDQTKVQAAAPASTPAQAPKDYKAEALEEETTCTRKL
LDKDGKMTLDDIISLLPTFNTMNSKSKSNRKKVPSISAFEIQATIREYYKHHCANKLENL
EEMDKFLDTYTLSRLNQEEVESLNRPITGSEIEAVINNLPTKKNPESDGFTAEFYQRYNE
ELVPFLLK
>gi568815595f:186753059_186954701|GENSCAN_predicted_CDS_8|747_bp
atgtgctttgccaagaagcacaacaagaagagcctaaagaagatgcaggccaacagtgcc
gaggccatgagtgcacatgccgaggctatcagggcccttgtgaagcccaaggaggctaag
cccaagatcccaaaggatgtcagccacaagctcaatcgacttgcctacattccccacccc
aagcttgggaagcgtgctcgtgcccgcgttgccaaggggctcaggctgtgccggccaaag
gccaaggccaaggaccaaaccaaggtccaggctgcagctccagcttcaactccagctcag
gctcccaaagattataaagctgaggctctagaagaagaaaccacttgcacaagaaaactt
ctggataaagatggcaaaatgactctagatgacataatctctctcctcccaacatttaac
acaatgaacagcaaaagtaaaagcaacagaaaaaaagtaccatcaatttccgcttttgaa
atacaagctaccatcagagaatactataaacaccactgtgcaaataaactagaaaatcta
gaagaaatggataaattcctggacacatacaccctttcaagactaaaccaggaagaagtc
gagtccctgaatagaccaataacaggctctgaaattgaggcagtaattaataatctacca
accaaaaaaaatccagaatcagacggattcacagccgaattctaccagaggtacaacgag
gagctggtaccattccttttgaaatga

>gi568815595f:186753059_186954701|GENSCAN_predicted_peptide_9|85_aa
MAKNYGLAQWLMPVIPALWEADVGGLPEERKGSALNTVGALDSSTEDHREVVKGTAFLKL
KLEARHCTKENSINQQTDADLQRLQ
>gi568815595f:186753059_186954701|GENSCAN_predicted_CDS_9|258_bp
atggccaagaattacggcctggcgcagtggctcatgcctgtaatcccagcactttgggag
gccgatgtgggcggattgcctgaggaaagaaagggatcagccttaaacacggtgggtgcc
ctcgattccagcacagaggaccatcgagaggttgtaaagggtacagcattcctaaaatta
aagctggaagctcggcactgtactaaggaaaactccatcaaccaacagacggatgcagat
ttacaaagactgcaatga