GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:43:57 Sequence gi568815591f:63793285_63994641 : 201357 bp : 40.78% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 11670 11709 40 -1.75 1.01 Init + 21939 22088 150 2 0 68 75 79 0.835 4.69 1.02 Term + 28002 28103 102 2 0 18 52 164 0.801 3.10 1.03 PlyA + 28204 28209 6 1.05 2.00 Prom + 33066 33105 40 -7.35 2.01 Init + 33136 33360 225 0 0 92 35 164 0.148 10.02 2.02 Term + 44205 44324 120 2 0 23 38 159 0.432 1.89 2.03 PlyA + 45372 45377 6 1.05 3.00 Prom + 45410 45449 40 -5.75 3.01 Sngl + 50003 50128 126 1 0 76 42 211 0.449 7.93 3.02 PlyA + 50358 50363 6 1.05 4.00 Prom + 55407 55446 40 -5.45 4.01 Init + 72682 72826 145 0 1 78 103 115 0.588 12.33 4.02 Term + 83049 83497 449 1 2 40 49 541 0.895 39.79 4.03 PlyA + 83555 83560 6 -8.91 5.00 Prom + 83594 83633 40 -14.53 5.01 Init + 83660 83900 241 1 1 9 103 349 0.940 26.58 5.02 Intr + 92233 92443 211 2 1 77 12 123 0.004 0.65 5.03 Intr + 96013 96069 57 1 0 100 94 42 0.017 3.08 5.04 Intr + 97777 97884 108 1 0 58 86 93 0.047 4.58 5.05 Intr + 98091 98250 160 0 1 61 110 27 0.519 1.27 5.06 Intr + 98717 98819 103 1 1 95 -27 76 0.190 -4.37 5.07 Intr + 99997 100883 887 2 2 6 21 888 0.803 64.01 5.08 Term + 100968 101360 393 2 0 -27 52 449 0.781 23.85 5.09 PlyA + 101687 101692 6 1.05 6.00 Prom + 102343 102382 40 -6.85 6.01 Init + 107649 107754 106 2 1 83 87 88 0.672 8.68 6.02 Intr + 108034 108264 231 2 0 98 66 64 0.293 2.02 6.03 Term + 113100 113221 122 1 2 107 34 73 0.128 1.46 6.04 PlyA + 113259 113264 6 1.05 7.03 PlyA - 113333 113328 6 1.05 7.02 Term - 114328 114222 107 2 2 30 42 73 0.408 -5.51 7.01 Init - 115894 115696 199 1 1 88 100 275 0.921 27.81 7.00 Prom - 116209 116170 40 -7.45 8.07 PlyA - 117467 117462 6 1.05 8.06 Term - 121121 120924 198 2 0 36 48 129 0.071 0.12 8.05 Intr - 132152 132062 91 1 1 87 57 64 0.200 2.28 8.04 Intr - 132493 132368 126 0 0 65 51 79 0.264 0.77 8.03 Intr - 133824 133633 192 2 0 83 36 188 0.439 10.79 8.02 Intr - 134434 134250 185 1 2 47 28 116 0.394 -0.84 8.01 Init - 134855 134691 165 2 0 57 37 64 0.445 -2.12 8.00 Prom - 135312 135273 40 -8.45 9.00 Prom + 135670 135709 40 -7.85 9.01 Init + 137775 138215 441 2 0 56 67 455 0.653 34.31 9.02 Term + 138247 138714 468 0 0 13 55 404 0.322 23.59 9.03 PlyA + 139322 139327 6 1.05 10.00 Prom + 141221 141260 40 -6.15 10.01 Init + 141488 141558 71 1 2 72 55 80 0.123 3.67 10.02 Intr + 148330 148446 117 1 0 62 64 95 0.284 3.16 10.03 Term + 148824 148953 130 0 1 72 43 163 0.239 6.87 10.04 PlyA + 150220 150225 6 1.05 11.04 PlyA - 154019 154014 6 1.05 11.03 Term - 158317 158252 66 1 0 98 44 72 0.270 0.76 11.02 Intr - 158766 158606 161 1 2 9 89 147 0.155 5.69 11.01 Init - 160190 160154 37 2 1 65 80 24 0.064 -0.37 11.00 Prom - 164160 164121 40 -6.65 12.07 PlyA - 164813 164808 6 1.05 12.06 Term - 169318 169190 129 1 0 100 38 167 0.351 10.10 12.05 Intr - 170451 170315 137 1 2 95 75 43 0.294 3.07 12.04 Intr - 172616 172537 80 1 2 78 78 69 0.163 3.18 12.03 Intr - 187175 187085 91 0 1 32 96 79 0.468 1.23 12.02 Intr - 188192 188066 127 2 1 108 68 144 0.996 13.73 12.01 Intr - 188878 188771 108 1 0 32 55 152 0.928 5.86 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 33136 33420 285 0 0 92 42 170 0.815 8.09 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:63793285_63994641|GENSCAN_predicted_peptide_1|83_aa MNEFVGLVEIIQGEILHRWRRKCGKNKILETCLNLPRAFRIQESEIIIKEVNGSKRRTVA APEETLGFLIPVQSSYTDAHRGK >gi568815591f:63793285_63994641|GENSCAN_predicted_CDS_1|252_bp atgaatgaatttgtgggacttgttgagattatacagggtgagatcctacacagatggaga agaaaatgtggcaagaataagattttggaaacttgcctaaacttgcctagagctttcagg attcaagaatcagaaatcataatcaaagaggtgaatggttctaaaaggcgaacagtggca gcaccggaggagacactgggcttcctgatcccagtgcagagctcttatactgacgcacac agaggaaagtga >gi568815591f:63793285_63994641|GENSCAN_predicted_peptide_2|114_aa MEEGERHILHGSRQRRNERAKQKGKSLIKSSALVRLVHYHENSVRETTPTIQLSPTGSLP QNVGNMGAAIEDEIWSSCRKSISTLYPLNSNGDIEPLKIQASVSVMECISCRQE >gi568815591f:63793285_63994641|GENSCAN_predicted_CDS_2|345_bp atggaggaaggtgaaaggcacatcttacatggcagcaggcaaagaaggaatgaaagagcc aagcaaaaggggaaatcccttataaaatcatcagctcttgtgagacttgttcactaccat gagaacagtgtaagagaaaccacccccacaattcaattatctcccactgggtccctccca caaaacgtgggaaatatgggagctgcaattgaagatgaaatttggtccagctgccgaaag tctatctcaacactctaccctctaaacagcaacggggatattgagcctctgaagatccaa gcttcagtctccgtaatggaatgcatcagctgtagacaggaatga >gi568815591f:63793285_63994641|GENSCAN_predicted_peptide_3|41_aa MGGEWMAGTMSGSKGGKKKALKQPKKQAKEKDEEDKAFKQK >gi568815591f:63793285_63994641|GENSCAN_predicted_CDS_3|126_bp atgggtggggaatggatggcaggcaccatgtctggcagcaaaggtggcaagaagaaggcc ctgaaacagcccaagaagcaggccaaggagaaggacgaggaagataaggctttcaagcag aaataa >gi568815591f:63793285_63994641|GENSCAN_predicted_peptide_4|197_aa MQPYMGICESEGANTSSNCYKLVSAGKDLLLWGPEADGTPFGLAVEMGEHKTFPKVILRR YESYGIQNLNLRKDWECVGDCKGQKKSYNGLNQCLSTTLSKIFQCDECGNAFNQCSILTQ HKRIHTREKPYKCEECGKAFNWFSNLIQNKRICTVGKPYKCEEFGKAFNQCSHLIGHKRI HTGEKPYKCEECGKAFN >gi568815591f:63793285_63994641|GENSCAN_predicted_CDS_4|594_bp atgcagccttatatgggtatctgtgaatctgaaggagcaaacacttcttctaattgttat aaactggtgtcagcaggtaaagatcttctcttgtggggtcccgaggctgatgggacccct tttgggcttgcagtggagatgggtgagcataaaacattcccaaaagtgatactgagaaga tatgaaagctatggcattcaaaatttaaacttaagaaaagactgggaatgtgtaggtgac tgtaagggacagaaaaaaagttataatggacttaaccaatgtttatcaactacccttagc aaaatctttcaatgtgatgaatgtggcaacgcttttaaccagtgctcaatccttactcaa cataagagaattcacacaagagagaaaccatacaaatgtgaggaatgtggcaaagccttt aactggttctcaaaccttattcaaaataagagaatttgtactgtagggaaaccctacaaa tgtgaagaatttggcaaagcctttaaccagtgctcacaccttattggacataagagaatt catactggagagaaaccttacaaatgtgaagaatgtggcaaagccttcaactag >gi568815591f:63793285_63994641|GENSCAN_predicted_peptide_5|719_aa MHGSILTKHKRIHTGEKCYKCKEYDKTFNQSSHLIGHKRIHTGEQPYKCGKCGKAFNWFS NLTKHKRIQTGKKLKKCEECGLNPSRDLEHSQALIKTCRMLPTALKETSRGPKPNSLKPH INSITRPTHGRHTGVEHLCFVPRKDTLQPPLESLPVCTKDGDSRGISQARFLTDLVKLDT AMKDCVNGSEPGAGKSGTRILRLGRRVGYKQINRPPRNLSMIPVPYPHPPIGMGLLLCVS NPPNKSRMQRMEPGSALPRLQPVLLLSPIPVGQPHSQGQEDQAMKNLRSTECLATTKREA EELIEIEIDGTEKAECTEESIVEQTYAPAECVSQAIDINEPIGNLKKLLEPRLQCSLDAH EICLQDIHLDPERSLFDQGVKTDGTVQLSVQVISYQGIEPKLNILEIVKPADTVEVVIDP DAHHAESEAHLVEEAQVITLDGTKHITTISDETSEQVTRWAAALEGYRKEQERLGIPYDP IQWSTDQVLHWVVWVMKEFSMTDIDLTTLNISGRELCSLNQEDFFQRVPQGEILWSHLEL LRKYVLASQEQQMNEIVTIDQPVQIIPASVQSATPTTIKVINSSVKAAKFLLELLTDKDA RDCISWVGDKGEFKLNQPELVAQKWGQRKNKPTMNYEKLSRALRYYYDGDMICKVQGKRF VYKFVCDLKTLTGYSAAELNRLVTECEQKKLAKMQLHGIAQPVTAVALATASLQTEKDN >gi568815591f:63793285_63994641|GENSCAN_predicted_CDS_5|2160_bp atgcatggctcaatccttactaaacataagagaattcatactggagagaaatgctacaaa tgtaaagaatatgacaaaacctttaatcagagctcacaccttattggacacaagagaatt catactggagagcaaccctacaaatgtggaaaatgtggcaaagcctttaactggttctca aaccttactaaacataagagaattcaaactggaaagaaactcaagaaatgtgaagaatgt ggcttaaacccaagccgggaccttgaacattcccaggcactgataaaaacgtgtaggatg ttgcccacagcattgaaagaaactagccgtggccctaagccaaattccttaaagcctcat ataaactccataaccagacccactcatggcagacatactggggtagaacatctctgtttt gtccctcgcaaggatacgctgcagccccctctggagagtcttcctgtctgcaccaaggat ggggacagcagaggaatctcacaggccaggttcctcactgacctggtgaagctggacact gccatgaaggactgtgtgaatgggagtgagcctggggcaggcaagtcgggtaccaggatc ctgaggcttgggagaagagttggatataaacagataaacaggccccctaggaatttgtcc atgattcctgtcccctacccacatcctccaattgggatgggtctcctcctctgtgtatca aaccctcccaataaatccaggatgcagaggatggagccagggagtgctctacccaggcta caacctgtcctgctgctgagccccatcccagtaggccagccacactctcaaggccaagaa gaccaggccatgaagaatctcaggtccactgagtgcctggccacgactaaaagagaagca gaggagctgatagaaattgagattgatggaacagagaaagcagagtgcacagaagaaagc attgtagaacaaacctacgcgccagctgaatgtgtaagccaggccatagacatcaatgaa ccaataggcaatttaaagaaactgctagaaccaagactacagtgttctttggatgctcat gaaatttgtctgcaagatatccacctggatccagaacgaagtttatttgaccaaggagta aaaacagatggaactgtacagcttagtgtacaggtaatttcttatcaaggaattgaacca aagttaaacatccttgaaattgttaaacctgcggacactgttgaggttgttattgatcca gatgcccaccatgctgaatcagaagcacatcttgttgaagaagctcaagtgataactctt gatggcacaaaacacatcacaaccatttcagatgaaacttcagaacaagtgacaagatgg gctgctgcactggaaggctataggaaagaacaagaacgccttgggataccctatgatccc atacagtggtccacagaccaagtcctgcattgggtggtttgggtaatgaaggaattcagc atgaccgatatagacctcaccacactcaacatttcggggagagaattatgtagtctcaac caagaagatttttttcagcgggttcctcagggagaaattctctggagtcatctggaactt ctccgaaaatatgtattggcaagtcaagaacaacagatgaatgaaatagttacaattgat caacctgtgcaaattattccagcatcagtgcaatctgctacacctactaccattaaagtt ataaatagtagtgtgaaggcagccaaatttttgctagaacttcttactgataaggacgct cgagactgtatttcttgggttggtgataaaggtgaatttaagctaaatcagcctgaactg gttgcacaaaaatggggacagcgtaaaaataagcctacgatgaactatgagaaactcagt cgtgcattaagatattattatgatggggacatgatttgtaaagttcaaggcaagagattt gtgtacaagtttgtctgtgacttgaagactcttactggatacagtgcagcggagttgaac cgtttggtcacagaatgtgaacagaagaaacttgcaaagatgcagctccatggaattgcc cagccagtcacagcagtagctctggctactgcttctctgcaaacggaaaaggataattga >gi568815591f:63793285_63994641|GENSCAN_predicted_peptide_6|152_aa MRGGGQPRRVGHGNVDLAASTTMLATDRVHFLAKETRAQFRRCGEKSAAWPASAQVQSFA RGDMGCGFVKMSPSPATFSRMWILRGQGGPLLAYPGDAKSRGENFSSQACVSDASSVWKH LVHVVGLASYVFGARGSLFLALPPYREGKALN >gi568815591f:63793285_63994641|GENSCAN_predicted_CDS_6|459_bp atgcgaggaggcgggcagcccaggagggttggacacggcaatgtggacctcgccgcttcc acaacgatgctggctactgatcgcgtgcacttccttgccaaagaaaccagagcgcagttc cgccgctgtggggagaaatccgccgcctggcccgccagtgcacaagttcagagctttgca cggggtgacatgggctgtggcttcgtgaaaatgtcaccctcaccagcgactttttcgcgg atgtggatattgagggggcagggagggccattattggcttacccaggagatgctaaaagc agaggagaaaatttcagttcccaggcgtgtgtctctgatgcttctagtgtttggaaacat ctggttcatgtggttggtctggcttcctatgtgtttggtgccagagggagtctgtttctt gctcttccaccatacagagaaggcaaagctctgaactga >gi568815591f:63793285_63994641|GENSCAN_predicted_peptide_7|101_aa MDTDDERQYQDFLEDLEEDEAIRKNVNIYRESTIPVESDDDEGAPGISLAEMPEDFHISH DATGEEEKNMNIKFYEAPNIFSKKNLIKIALGHIMIVKIQR >gi568815591f:63793285_63994641|GENSCAN_predicted_CDS_7|306_bp atggatacagatgatgaaaggcaataccaagattttcttgaagatcttgaagaagatgag gcaattagaaaaaatgtcaacatttacagagagtcaaccatccctgtggaaagtgatgat gatgaaggagcacctggaattagtctggctgagatgcctgaagactttcatatttcccac gatgccactggtgaagaagaaaagaatatgaacatcaagttttatgaagctccaaatatc ttcagcaagaagaacctaataaagattgcccttggacacattatgattgtcaaaatacaa agataa >gi568815591f:63793285_63994641|GENSCAN_predicted_peptide_8|318_aa MSRLKPGSRENQSLRPREDSGKGHRRTQPRARTQFSSVWELAAWSGWPCTAEETSPVGVS SVSEQVGHTKPSRTPGTQFHGVQPCKGQMSWVEKLTKSSFLPAKSGRKAPGLEQCSWSGG RVLPKAGELALWLEGVEGSSTMDSVWDLRPVPQQLRRAGGSLGWRRRDAFEGNRAEQREM DPTLYCSSPGYMRSAAPWSGDGSFLVSVLNRCLLPHVKVPSLWLVPRPRGPGVDLPAVIN AGAGPERLGPSQHCCKAYAKEKGRSCLWVPQAHTFLKSFSSQLLQTKDDISGKMRTCFQT STPVAMRSYGPKAMLTST >gi568815591f:63793285_63994641|GENSCAN_predicted_CDS_8|957_bp atgtcaagacttaagccagggagtagagagaatcagtctctcagacccagagaggattca gggaaagggcacaggagaacccagcccagagccagaacacagttcagctcagtgtgggaa ctcgcagcatggagtggatggccctgcacggctgaagagaccagtcctgtaggggtgtcc tcagtatcagaacaagtgggccacacaaagccctcgaggactccaggaacccagttccat ggggtgcagccctgcaagggccagatgagctgggtagaaaagctgacgaagtcttccttc cttccagccaaatctggaagaaaggcccctggcctggagcagtgttcctggtcaggcggc agagttctcccaaaagcgggtgaactggcgctctggctggagggtgtagagggcagcagc acaatggacagcgtctgggacctcaggccagttccacagcagcttaggagagcaggcggc tccctgggctggaggaggcgcgacgcttttgagggaaaccgagctgagcagcgggagatg gaccccaccctctattgttcctcccctggctacatgcgcagtgctgccccttggtccggc gacggaagcttcctcgtgagtgtcctaaaccgttgtttgctgcctcatgtgaaggtgccc agtctctggctggttcccaggccccggggtcctggtgtggacctgcctgccgtaattaac gcaggtgcaggacctgagcgccttggtccctcccaacactgttgtaaggcctatgccaag gagaaaggaaggtcatgcctttgggtgccccaggcacacacctttctgaaatctttctcc agccagctgctgcagacaaaagatgacatttctgggaagatgaggacttgtttccagacc agcaccccagtggccatgaggtcttatggcccaaaggctatgcttacctccacctga >gi568815591f:63793285_63994641|GENSCAN_predicted_peptide_9|302_aa MLGPQTMLGARLPPAVARPRPRRPSGGSGEGDPGRRPGPRHRNRRPLPTERGRTHLQPDE RSHPRDTAHGFREYTVAATASRASTAASAPCSAALSPRRLWVLNVPVPQQPDEGCPGTAL QQRRLLCGWGARVAGAVVVIFPMDAIKTSSAEESPTGVREIVREQGDPAGPHGPRAEAGL EPGHPLLLLTPLHSWYRGDNPSKPMNPLVAGAFGAIAGAASVLGNAPLHGIETRMRGLKS TNAEHTGLRLQILRKEGLKAFYNGIVPHLGRVCLDVTTVFILCHEVGKLLNSVEDGVSPE GP >gi568815591f:63793285_63994641|GENSCAN_predicted_CDS_9|909_bp atgctgggacctcagaccatgctgggcgcccgcctcccgcccgccgtggcccgaccgcgt ccaagaaggcccagcggaggctccggggaaggcgaccctggcaggcggccaggcccgcga catcgaaatcggcgtccgttgcccacggagcgcgggaggacgcacctgcagcccgacgag cgctcgcacccgcgggacacggcgcatggattccgcgagtacactgtcgcggccacggca tcccgggccagcaccgcggcctcggctccctgctctgccgctctatccccaaggcggctt tgggttctgaatgttcctgttcctcagcagccagacgagggatgcccagggacggcctta caacaacgcaggctgctgtgcggctggggcgcccgcgtggccggggccgtggtggtcata tttcccatggacgccatcaagacgtccagtgcagaggagtctcccaccggggttagggag attgtgcgggaacaaggggacccagcggggcctcacggcccccgcgctgaagcagggctg gaaccaggccatccgcttcttctcctgacccccctgcatagctggtaccgaggggacaac cccagcaagcccatgaacccgctggtcgctggggccttcggagccattgcaggcgcagcc agtgtcttgggaaacgctccactgcacgggatcgagacccggatgcggggcctgaaaagc acaaatgcagaacacacgggactgcggctgcaaatcctgagaaaggaagggctcaaggcc ttctacaacggcattgtcccccacctgggccgggtctgcctggatgtgaccacagtgttt atcctctgccatgaggtaggaaagctgctcaacagtgtggaagacggagtaagcccggaa gggccttga >gi568815591f:63793285_63994641|GENSCAN_predicted_peptide_10|105_aa MIATKKTSVSSGSMKNRLLMLSLIFKEARKFGLGGTQQSQQSGCGQTAILDSSSLGRASL KERASGGRGGCGHNLSGLKRFCLLALKRAADPDKEDFPSTVLELS >gi568815591f:63793285_63994641|GENSCAN_predicted_CDS_10|318_bp atgattgctacaaagaagacatctgtaagtagtggcagcatgaaaaacaggctcctgatg ttgtcccttatttttaaagaggccaggaagtttggactgggtggaactcaacagagccag caaagcggctgtggccagactgccattctagattcctcttcactgggcagagcatctctg aaagaaagagcatctgggggaaggggtggctgtgggcacaaccttagtggacttaaacgt ttctgcctgctggctctgaagagagcagcagatcccgacaaggaggattttcccagcaca gtgcttgagctcagctaa >gi568815591f:63793285_63994641|GENSCAN_predicted_peptide_11|87_aa MKVASRRHAELSGTSELLACLILGHGMQAIQDIEALHIVLNEGCLVKVVQVKVLNKNAIQ TACILQMHNQWDTKENEQGTHRGQEQD >gi568815591f:63793285_63994641|GENSCAN_predicted_CDS_11|264_bp atgaaggtggcatccagaagacatgctgagctgtcaggaacctctgagcttctggcctgt ctcatcctgggccatggaatgcaggccatacaggatattgaagcccttcacattgtgtta aacgaaggttgcctggtgaaggttgttcaggtgaaggtgctaaataaaaatgctatacaa actgcatgcattttgcaaatgcacaatcagtgggataccaaggagaatgaacaaggaaca catcgtggtcaggagcaagactga >gi568815591f:63793285_63994641|GENSCAN_predicted_peptide_12|223_aa VRVAHIPIAAAEIAAIPDRGGHLSVGKRKLTFWFFYGVLPFRDVAIEFSPEEWECLDSAQ QHLHRDAMLENYGNLVSLGLVISKPDLITLLEQRALECEETEDSSQIPRMGRGGTVNTTS LFSDFHAITISVTQVGPMLLSSAYMVVLLLSHQKKSWYLHSTCLSPRPSPEQSATQTILL LIRQGVRCAPDPTRLPLPLPIVLKAAIVPAGLSAWVRPNRAEH >gi568815591f:63793285_63994641|GENSCAN_predicted_CDS_12|672_bp gtgagggtggcccatattcctattgctgcagcagaaattgctgccatccctgacagagga gggcacctgagtgttggaaagcggaaactcacattttggtttttctatggagtgttgcca ttcagggatgtggccatagaattctccccagaagagtgggagtgcctggactctgctcag cagcatttgcacagggatgcgatgttagagaactatggaaacctggtgtctctgggtctt gttatttccaagccagacttgattacccttctggagcaaagagccctggaatgtgaagag acagaagacagtagtcaaatacccagaatgggcagaggtggaaccgttaacaccacttcc ctctttagtgacttccatgccatcaccatcagtgtgactcaagtaggaccgatgctgctt tcaagtgcatacatggtggttcttttgctcagccatcagaagaaatcctggtaccttcac agcacctgcctctccccaagaccttccccagagcagagtgccactcagaccatcctgctg ctaatccggcagggtgtccgctgtgctccggatccaacgagactcccattgccactcccg atcgtgctgaaggctgccattgttcctgcagggctaagtgcctgggttcgtcctaatcga gctgaacactag