GENSCAN 1.0 Date run: 4-Nov-116 Time: 18:40:49

Sequence gi568815581f:61356491_61583513 : 227023 bp : 46.75% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  11671  11770  100  0  1   75   59    71 0.691   3.34
 1.02 Intr +  11837  12004  168  0  0  103  103   278 0.989  30.72
 1.03 Intr +  15350  15378   29  0  2   44   86    64 0.265  -0.37
 1.04 Intr +  26640  26690   51  2  0  104  100     4 0.271   2.30
 1.05 Intr +  32056  32218  163  0  1  113  116   -26 0.770   2.25
 1.06 Intr +  35487  35631  145  1  1  110   55   136 0.823  11.84
 1.07 Intr +  38104  38229  126  1  0   -8   60   149 0.643   2.29
 1.08 Intr +  39198  39366  169  0  1   -3   80    75 0.662  -2.45
 1.09 Intr +  39456  39584  129  2  0   51   81    91 0.757   5.59
 1.10 Intr +  42035  42090   56  1  2   92   96    15 0.073   0.48
 1.11 Intr +  42353  42536  184  2  1   37   91    53 0.082   0.29
 1.12 Intr +  43519  44081  563  0  2   44   75   651 0.111  51.01
 1.13 Intr +  45194  45461  268  2  1  107   69   509 0.956  48.43
 1.14 Intr +  46571  46717  147  1  0  114   91   355 0.942  38.83
 1.15 Intr +  47802  48007  206  2  2   55  100   241 0.507  20.00
 1.16 Intr +  48116  48279  164  2  2  110   78   181 0.993  18.92
 1.17 Intr +  48712  49346  635  2  2  117  110   707 0.999  67.65
 1.18 Term +  51564  52016  453  2  0  130   55   514 0.990  47.46
 1.19 PlyA +  52111  52116    6                               1.05

 2.09 PlyA -  52676  52671    6                               1.05
 2.08 Term -  56123  55870  254  0  2   43   49   192 0.265   6.60
 2.07 Intr -  56474  56132  343  2  1   75    5   172 0.244   2.60
 2.06 Intr -  56623  56512  112  0  1   22  105    65 0.230   1.98
 2.05 Intr -  63578  63476  103  0  1  108   57    28 0.161   0.93
 2.04 Intr -  75257  75129  129  0  0   59  100    29 0.107   1.87
 2.03 Intr -  77425  77300  126  2  0   41   84    90 0.018   4.55
 2.02 Intr -  90883  90806   78  2  0  122   68    25 0.112   3.42
 2.01 Init -  93801  93789   13  0  1   45  115    -6 0.045  -1.72
 2.00 Prom -  93888  93849   40                              -4.86

 3.00 Prom +  96946  96985   40                              -6.66
 3.01 Init +  98745  98825   81  2  0   75   97    66 0.661   7.17
 3.02 Intr +  99998 100186  189  1  0   95   91   287 0.745  29.48
 3.03 Intr + 100510 100698  189  0  0   28   69   104 0.828   2.28
 3.04 Intr + 101047 101141   95  0  2   96   76   162 0.474  14.56
 3.05 Intr + 109329 109448  120  0  0   80  115   199 0.966  21.41
 3.06 Intr + 111020 111167  148  2  1   93   77   137 0.984  13.34
 3.07 Intr + 122137 122289  153  0  0   73   87   200 0.996  18.67
 3.08 Intr + 123391 123479   89  0  2  138   59   147 0.993  15.57
 3.09 Intr + 123600 123829  230  0  2  108   75   138 0.999  12.01
 3.10 Intr + 126410 126831  422  0  2   58   57   311 0.220  18.66
 3.11 Intr + 127948 128083  136  0  1   64  105    47 0.436   4.14
 3.12 Intr + 134187 134307  121  1  1   57   50    61 0.412  -1.25
 3.13 Intr + 134572 134666   95  1  2  111   97    35 0.942   6.31
 3.14 Term + 138236 138414  179  0  2  103   43    95 0.449   4.35
 3.15 PlyA + 139332 139337    6                               1.05

 4.06 PlyA - 139883 139878    6                               1.05
 4.05 Term - 141033 140931  103  2  1   86   54   113 0.359   5.45
 4.04 Intr - 147510 147327  184  1  1   64   -1   111 0.012  -1.35
 4.03 Intr - 153278 153223   56  1  2  111   92    13 0.039   2.62
 4.02 Intr - 193517 193390  128  2  2   67   53    43 0.011  -1.82
 4.01 Intr - 225464 225392   73  1  1   55   86    78 0.211   3.71

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 133640 133850 211 1 1 54 80 122 0.865 6.95 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:61356491_61583513|GENSCAN_predicted_peptide_1|1251_aa MSYFSCATQSVGGVLPSYRKATMDPGYGEERVGGTFDRSVTLLEVCGSWPEGFGLRHMSS MEHTEEGLRERLADAMAESPSRDVVGSGTGSLRMLKFEKGGGGERGASLPQSQGPKRSVY FSIVVHIFSKKIPQCFSFQKGKKKEKKNNANSPASVSNPTVTKHAFGMEEELQREGSIET LSNSSGSTSGSIPRNFDGYRSPLPTNESQPLSLFPTGFPGPLGIHVTYRFSRHYLVNGQL YYGLPLNLMKEKYLHDLERGRDGEGGGGTAGARIVARRHRAAPGNPDPLPAKAFPNANPE PVNAPASPPLPHRRPSQTSITTISPPCTSNARRCTDVHLTSADQVSQAYSGHPRNRGPGP KTTFRNQPECSRPSPIGSRRRALPRDEKKLKKWLAGRREEAGISEEGDFRLRYRIGRKFV GSRVENPGSARAVPPCAQRKSRVPLAVRRTGSPEEQLLRPPPGSSVHRARRRPGRGSEPR APGPGPGPRAPGPDVPMREPALAASAMAYHPFHAPRPADFPMSAFLAAAQPSFFPALALP PGALAKPLPDPGLAGAAAAAAAAAAAAEAGLHVSALGPHPPAAHLRSLKSLEPEDEVEDD PKVTLEAKELWDQFHKLGTEMVITKSGRRMFPPFKVRVSGLDKKAKYILLMDIVAADDCR YKFHNSRWMVAGKADPEMPKRMYIHPDSPATGEQWMAKPVAFHKLKLTNNISDKHGFTIL NSMHKYQPRFHIVRANDILKLPYSTFRTYVFPETDFIAVTAYQNDKAGGYQVDAPRVFPL RPAPADLGVPPTTLEEPRPDLAPPPWSPQITQLKIDNNPFAKGFRDTGNGRREKRKQLTL PSLRLYEEHCKPERDGAESDASSCDPPPAREPPTSPGAAPSPLRLHRARAEEKSCAADSD PEPERLSEERAGAPLGRSPAPDSASPTRLTEPERARERRSPERGKEPAESGGDGPFGLRS LEKERAEARRKDEGRKEAAEGKEQGLAPLVVQTDSASPLGAGHLPGLAFSSHLHGQQFFG PLGAGQPLFLHPGQFTMGPGAFSAMGMGHLLASVAGGGNGGGGGPGTAAGLDAGGLGPAA SAASTAAPFPFHLSQHMLASQGIPMPTFGGLFPYPYTYMAAAAAAASALPATSAAAAAAA AAGSLSRSPFLGSARPRLRFSPYQIPVTIPPSTSLLTTGLASEGSKAAGGNSREPSPLPE LALRKVGAPSRGALSPSGSAKEAANELQSIQRLVSGLESQRALSPGRESPK >gi568815581f:61356491_61583513|GENSCAN_predicted_CDS_1|3756_bp atgagctatttctcctgcgctacccagtctgttggcggcgtgcttccatcctacaggaag gctacaatggaccctgggtatggagaggagcgggtgggaggtacctttgacaggagcgtg accctgctggaggtgtgcgggagctggcctgagggcttcgggctgcggcacatgtcctcc atggagcacacggaggagggcctccgggagcgacttgccgacgccatggccgagtcacct agccgggacgtcgtgggatccggaacaggatccctgcgcatgctgaagtttgagaaggga gggggcggggagaggggcgcctccttgccgcagtcccagggccccaagagatcagtgtac 
ttctcaatcgttgtccatatcttttccaaaaagattcctcaatgcttttcttttcaaaag ggaaaaaaaaaggaaaaaaaaaacaatgccaacagcccagcgtccgtgagcaacccaaca gtaacaaagcatgcgttcgggatggaggaagaacttcagcgagagggaagcatcgagact ctgagtaacagctcaggctccaccagcggcagcataccaagaaactttgatggctaccga tctccgctgcccaccaatgagagccagcccctcagcctcttcccgactggcttcccggga cctttaggaatccatgtgacctaccgattctctcggcactacctggtcaacgggcagctt tactacggacttcctctcaatttgatgaaggaaaaataccttcatgatctggaaagaggc cgggatggggagggcggaggcggcacagctggagcccggattgtggcacgccgtcaccgt gctgctccggggaatcccgacccgctccctgcgaaagcgtttccgaacgcgaacccagag cctgtgaacgcgccggcaagccccccactcccccaccgccgcccgtcgcagacatccata accacgatctcgcctccatgcacatccaacgcacgacggtgcacagacgtgcacctgact tctgcggaccaggtgtctcaagcgtacagcggccacccgcggaaccgcggcccggggcca aaaacgaccttcaggaatcagcctgagtgttcgcgcccgagcccgattggaagcaggaga cgtgcgcttccgagagatgagaagaaactaaaaaagtggttggcagggcgaagagaggaa gccggtatttcggaagaaggagactttcgtctccggtaccggatagggcgaaaattcgtt gggtcgcgggtggagaatccggggtccgcccgcgcggtgccgccctgtgctcagcgaaag agccgggtgccccttgcggtgcgccggacgggaagccccgaggagcagctgctgcgcccg ccacccgggtcgtccgtccaccgcgcgcgccgccgcccgggccgggggtccgagccgcgc gcccccggccccggccccggcccccgggcgcctgggccggatgtcccgatgagagagccg gcgctggcggccagcgccatggcttaccacccgttccacgcgccacggcccgccgacttc cccatgtccgcctttctggcggcggcgcagccctccttcttcccggcactcgcgctgccg cccggcgcgctggccaagccgctgcccgacccgggcctggcgggggcggcggccgcggcg gcggcggcggcagcagcggccgaggcggggctgcacgtctcggcactgggcccgcacccg cccgccgcgcatctgcgctccctcaagagcctggagcccgaggacgaggtggaggacgac cccaaggtgacgctggaggccaaggagctgtgggaccagttccacaagctaggcacggag atggtcatcaccaagtccgggaggcggatgttcccccccttcaaggtgcgagtcagcggc ctggacaagaaggccaagtatatcctgctgatggacattgtagccgctgacgattgccgc tataagttccacaactcgcgctggatggtggcgggcaaggccgaccctgagatgcccaaa cgcatgtacatccacccagacagcccagccacgggggagcagtggatggctaagcctgtg gccttccacaagctgaagctgaccaacaacatctctgacaagcacggcttcaccatccta aactccatgcacaagtaccagccgcgcttccacatagtgcgagccaacgacatcctgaag ctgccttacagcaccttccgcacctacgtgttcccggagaccgacttcatcgccgtcact 
gcctaccagaatgacaaggcgggtgggtaccaagtggacgctccccgggtcttccctctg cggccagcacccgctgacctcggggtgccaccgaccacgctggaagagccacggcctgac ttagcgccgcccccttggtccccgcagatcacacagctgaagatcgacaacaacccgttt gccaagggcttccgggacaccgggaacggccggcgggagaaaaggaagcagctgacgctg ccgtctctacgcttgtacgaggagcactgcaaacccgagcgcgatggcgcggagtcagac gcctcgtcgtgcgaccctccccccgcgcgggaaccacccacctccccgggcgcagcgccc agtccgctgcgcctgcaccgggcccgagctgaggagaagtcgtgcgccgcggacagcgac ccggagcctgagcggttgagcgaggagcgtgcgggggcgccgctaggccgcagcccggct ccagacagcgccagccccactcgcttgaccgaacccgagcgcgcccgggagcggcgtagt cccgagaggggcaaggagccggccgagagcggcggggacggcccgttcggcctgaggagc ctggagaaggagcgcgccgaagctcggaggaaggacgaggggcgcaaggaggcggccgag ggcaaggagcagggcctggcgccgctggtggtgcagacagacagtgcgtcccccctgggc gccggacacctgcccggcctggccttttccagccacttgcacgggcagcagttctttggg ccgctgggagccggccagccgctcttcctgcaccctggacagttcaccatgggccctggc gccttctccgccatgggcatgggtcacctactggcctcggtggcaggcggcggcaacggc ggaggtggcgggcctgggaccgccgcggggctggacgcaggcgggctgggtcccgcggcc agcgcagcaagcaccgccgcgcccttcccgttccacctctcccagcacatgctggcatct cagggaattccaatgcccactttcggaggcctcttcccctacccctacacctacatggca gcagcagccgcagccgcctcggctttgcccgccactagtgctgcagctgccgccgccgca gccgccggctccctctcccggagccccttcctgggcagtgcccggccccgactgcgtttc agcccctatcagatcccggtcaccatcccgcctagcactagcctcctcaccaccgggctg gcctctgagggctccaaggccgctggtggaaacagccgggagcctagccccctgcccgag ctggctctccgcaaagtaggggccccatcccgcggtgccctgtcgcccagtggctcggcc aaggaggcggccaatgaactgcagagcatccagagactggtgagtgggctggagagccag cgagccctctccccaggccgggagtcgcccaagtga >gi568815581f:61356491_61583513|GENSCAN_predicted_peptide_2|385_aa MEGTGLLLSGRTRNFQVEGTEDWAAFISELDIIENVRKAYRYSCGQHRREKEDTVMFIYS TENQAIGEAGQQLMMMNKVVTLENCVSSEKELVSSENKMIYGFLSYAKNIPSESASPVLA NKLPFFFSLSWISVRTTEGFLTNMSRGQLRPRPEEPDEGGEEGGSEPRGPSPLEERWIPA DPSPIPPGRAPRALRMPATAPFGGAQEAGGRRLTSRRENARAQPRGRAEEDKLGSAAAWK VAELALPGGSGSRGEGPTPSLKPSAQCVRSPGVPRSLGGLCRAPDPALPVSVAVPPCPAC RVYLPLSPEAGCEAAARALACSAISWTCEELPRPEAWAGSGFAPTRAPGGELRELPRGLA GKPGAAVSAGNRCFRSGFGVPGPIQ 
>gi568815581f:61356491_61583513|GENSCAN_predicted_CDS_2|1158_bp atggaagggacaggccttctcctgtcaggcagaaccagaaacttccaggtagaaggaact gaggactgggctgccttcatctctgagctggatattattgaaaatgtcagaaaggcctac agatactcctgtgggcaacacaggagagaaaaagaggacacagttatgtttatctatagc acagaaaatcaagctattggagaagctggacagcagttgatgatgatgaacaaggtggtg actttggagaattgtgtatcaagtgagaaagaactggtgtcaagtgagaacaaaatgata tatggctttctttcatatgcaaaaaatataccttcagagtctgccagtcctgtgcttgcc aataaactgccctttttcttcagcttgagttggatttctgtccgtacgactgaaggattc ctgactaatatgtccagaggccagctgagaccccgccctgaagagccggatgagggtggt gaggagggcgggtcggagccgagggggccctctccgctggaggagcgctggatccccgcc gatccttcccctatcccgcccggccgtgcgccccgcgcgctgcggatgccagccacagcc cccttcggaggggcccaggaggccggaggccggcggctcacttcccgcagagagaacgcg agggcccagccccgcgggcgcgcagaagaggacaagctggggtctgcagcagcctggaag gtcgccgagctggcgctgccgggcggttctggcagcaggggagaaggccccaccccgagc ctcaagccctccgctcagtgtgtccgcagccccggagtccctcggtccctcggcggcctg tgccgggctccggacccggctctgcctgtgtcggtggctgtgcctccgtgccccgcttgc cgggtctatctcccgctctcgcccgaggcgggctgtgaggccgcggcccgggccctggcc tgcagcgccatctcgtggacctgtgaggagttgccgaggcccgaggcctgggcgggctct gggttcgctcccactcgtgctcccggcggggagctgcgggagctgccaagaggactcgct gggaaaccaggcgcggccgtcagcgctggaaatcggtgcttccggagtgggtttggggtc cccggccccatccaatag >gi568815581f:61356491_61583513|GENSCAN_predicted_peptide_3|748_aa MRRPLNKMGERPEPGDAPGGPGESGKKEMLQDKGLSESEEAFRAPGPALGEASAANAPEP ALAAPGLSGAALGSPPGPGADVVAAAAAEQRAWDPTARGQEEATPPVRTGAAAGIRGLWD VAEGFTVIIGRSQRTGCRLGICVHYLQARAWPRTIENIKVGLHEKELWKKFHEAGTEMII TKAGRRMFPSYKVKVTGMNPKTKYILLIDIVPADDHRYKFCDNKWMVAGKAEPAMPGRLY VHPDSPATGAHWMRQLVSFQKLKLTNNHLDPFGHIILNSMHKYQPRLHIVKADENNAFGS KNTAFCTHVFPETSFISVTSYQNHKITQLKIENNPFAKGFRGSDDSDLRVARLQSKEYPV ISKSIMRQRLISPQLSATPDVGPLLGTHQALQHYQHENGAHSQLAEPQDLPLSTFPTQRD SSLFYHCLKRRDGTRHLDLPCKRSYLEAPSSVGEDHYFRSPPPYDQQMLSPSYCSEVTPR EACMYSGSGPEIAGVSGVDDLPPPPLSCNMWTSVSPYTSYSVQTMETVPYQPFPTHFTAT TMMPRLPTLSAQSSQPPGNAHFSVYNQLSQSQEATLELMSGISRACSALRKSTLLERERP TEIKIVEKNLFLPVNSAAGTCTGHQTAEEEPLVSLPEDKLLSKTSVSACANINLRHRVPG 
RTVPSRALWNWELIENGEANGQRVFGGLVVKPPLPDAKVQIYVRLGAQGQRQESDAGEFL PGPPSPLSAPHCQGFGAHPGHRTAFKDT >gi568815581f:61356491_61583513|GENSCAN_predicted_CDS_3|2247_bp atgaggaggcctctgaacaagatgggcgagaggcctgagcccggggatgctccgggaggg cccggggagagcgggaagaaggagatgctgcaggataagggcctgtccgagagcgaggag gccttccgggccccgggcccagcgctcggagaggccagcgcagccaacgcccccgagccc gcgctggcagcgccgggcctcagcggagccgcgctaggcagccccccgggacccggggcc gacgtcgtcgccgccgccgccgcggagcagagggcctgggacccaacggcgagggggcag gaggaggccaccccgccggtgcgcacgggagccgctgcggggatccgaggcctctgggac gtcgccgagggcttcacagtgataatcggacgatcacagcgcacgggatgccgcctgggg atctgtgtgcactatctgcaggcgcgcgcctggccgaggaccatcgagaacatcaaggtg gggctgcatgagaaggagctctggaagaagttccacgaggcgggcaccgagatgatcatc actaaggctggcaggaggatgttccccagctacaaggtaaaagtcacaggcatgaacccc aagaccaagtatatcctgctgattgacattgtccctgccgatgaccatcgctacaagttc tgtgacaacaaatggatggtggcagggaaggctgagccagccatgccaggaaggctgtat gtccacccggattctcctgccacaggagcccactggatgcggcagctggtctccttccag aagctgaagctgacaaacaaccacctggacccctttggccatatcatcctcaactctatg cacaagtaccagccgcggctccacatcgttaaggctgatgagaacaatgctttcggctcc aaaaacactgctttctgcacccacgtgttcccagagacctccttcatctctgtgacctcc taccagaatcacaagatcacccagctgaaaattgagaacaacccttttgccaagggattc cggggcagtgatgacagtgacctgcgtgtggcccgactgcagagcaaagaataccccgtg atttccaaaagcatcatgaggcagaggctcatctccccccagctctcagccacaccggac gtgggccccctgctcggcacccaccaggcactccagcactaccagcacgagaacggggca cactcacagctcgcggagccgcaggacctgcccctcagcacctttcccacccagagggac tcaagcctcttctatcactgcctgaaaagacgagacggtacccgccacctggacttacct tgcaagcgatcctatctggaagccccctcttcggtgggggaggatcactatttccgttcc ccccctccctacgaccagcaaatgctgagcccctcctactgcagtgaggtgacccccaga gaagcatgtatgtactcaggttcagggcccgagattgccggggtgtctggggtggacgac ctgcccccacctccgctgagctgtaacatgtggacttcagtgtcgccgtacaccagctat agcgtgcagacgatggagactgtgccgtaccagcccttccccacgcacttcaccgccacc accatgatgccgcggctgcccaccctctccgctcagagctcccagccaccaggaaatgcc cactttagtgtctacaatcagctctcccagtctcaggaggccacactggagctgatgtct ggcatctccagggcctgttctgctctcagaaaatctacgcttctggagcgagaaaggcca 
acagaaataaagatagttgaaaaaaatttgtttcttccagtaaattcagccgcaggtacc tgcacgggccatcagacagcagaggaggagccactcgtcagcctgcctgaagataagcta ttgagcaagacaagtgtctcagcatgtgctaatataaacctccggcatcgggtgccaggc aggacagtcccaagtagggctctttggaattgggagctgatcgaaaacggggaggccaat ggccagagggtctttggaggcttggttgttaagcctcctttaccagacgccaaagtccag atctacgtgcggttaggtgcacaaggccagcggcaggagagtgacgctggggaattcctg cctggtcctcccagccctctgagcgccccccactgccaggggtttggggctcatccagga cacaggacagcattcaaggacacttag >gi568815581f:61356491_61583513|GENSCAN_predicted_peptide_4|181_aa XALKIVDENEKHSVYRKKIMLLNGKQWGGGGATGIYWVEARDAAEHLLMARTTSHNKELS SQTWVVPRLHYPYWHVRYMWLTYVSLLYSYLLKKVNYRIASGRPFRVIPEGVVIIEDDST LCVIAPEDLPVGQDVKVEDSDIDDPDPVPGCSLSWELLPTLMETASRTGFIDAAFKAKAA I >gi568815581f:61356491_61583513|GENSCAN_predicted_CDS_4|546_bp nnggcattaaaaattgtggatgaaaatgaaaaacattctgtataccgaaagaaaataatg ctgctaaatggtaagcaatggggtggtggtggtgctactggcatctattgggtagaggcc agagatgctgctgaacatcttctcatggccaggacaacctcccacaacaaagaattatcc agtcaaacatgggtagtaccaaggctgcactatccttactggcatgttcgttacatgtgg ctcacctacgtaagtctattgtactcctacttattaaaaaaagttaattatagaatagcc tcaggcaggcccttcagagttattccagaaggtgttgttatcatagaagatgatagcacc ctgtgtgttattgcccctgaggaccttccagtgggacaagacgtgaaggtggaagacagt gatattgatgatcctgaccctgtgcctggctgctccctcagctgggagctgctgcccact cttatggaaactgcatccaggacaggctttattgatgctgccttcaaagccaaggcggcc atctga