GENSCAN 1.0 Date run: 8-Nov-116 Time: 01:42:21 Sequence gi568815596f:190260103_190471399 : 211297 bp : 39.81% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1400 1423 24 1 0 112 111 -1 0.529 1.90 1.02 Intr + 2345 2384 40 1 1 59 111 54 0.634 1.68 1.03 Term + 8933 9207 275 0 2 0 54 211 0.521 3.55 1.04 PlyA + 9744 9749 6 1.05 2.14 PlyA - 11038 11033 6 1.05 2.13 Term - 15954 15745 210 0 0 76 42 106 0.314 1.11 2.12 Intr - 19930 19700 231 1 0 74 93 202 0.646 16.35 2.11 Intr - 21396 21252 145 2 1 99 92 29 0.929 3.76 2.10 Intr - 22043 21796 248 2 2 -21 -5 256 0.462 0.73 2.09 Intr - 27536 27484 53 0 2 115 98 41 0.892 5.51 2.08 Intr - 30383 30303 81 0 0 98 76 63 0.895 4.79 2.07 Intr - 34528 34444 85 1 1 27 88 79 0.871 0.27 2.06 Intr - 36851 36711 141 2 0 59 76 86 0.290 4.13 2.05 Intr - 46307 46171 137 0 2 47 87 76 0.023 2.77 2.04 Intr - 59890 59727 164 0 2 71 52 90 0.000 2.40 2.03 Intr - 78534 78417 118 1 1 77 72 43 0.041 0.20 2.02 Intr - 79519 79398 122 0 2 73 31 109 0.224 2.92 2.01 Init - 84086 83890 197 2 2 96 52 257 0.492 19.35 2.00 Prom - 86318 86279 40 -6.75 3.00 Prom + 94125 94164 40 -3.45 3.01 Init + 94423 94457 35 0 2 83 94 52 0.671 4.69 3.02 Intr + 96283 96359 77 1 2 47 94 79 0.521 2.64 3.03 Intr + 99975 100204 230 1 2 0 116 306 0.365 21.27 3.04 Intr + 106523 106793 271 1 1 106 97 155 0.981 14.39 3.05 Intr + 109001 109175 175 0 1 92 98 147 0.999 14.18 3.06 Term + 110742 111300 559 0 1 117 44 360 0.997 27.33 3.07 PlyA + 111459 111464 6 1.05 4.00 Prom + 116729 116768 40 -3.35 4.01 Init + 140886 140973 88 2 1 89 101 65 0.922 8.76 4.02 Intr + 147415 147648 234 2 0 59 85 72 0.548 0.74 4.03 Intr + 148026 148155 130 1 1 74 19 138 0.906 4.33 4.04 Intr + 148250 148401 152 2 2 60 109 138 0.905 11.99 4.05 Term + 170515 170609 95 2 2 85 49 71 0.371 -0.09 4.06 PlyA + 171439 171444 6 -0.45 5.00 Prom + 171785 171824 40 -8.05 5.01 Init + 175928 177459 1532 1 2 73 110 1215 0.334 114.04 5.02 Intr + 202474 202640 167 1 2 7 66 122 0.005 0.58 5.03 Term + 206653 206915 263 2 2 123 38 139 0.862 7.10 5.04 PlyA + 208054 208059 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:190260103_190471399|GENSCAN_predicted_peptide_1|112_aa RHIAIIHLLELDLDPAISITQQNRNWTALLTPYTKINLRWIKDLNVRPKTIKTLEENLGN TIQDIGIGKDFMTKTAKAMPTKAKIDKWDLIKLKSFCTAKETVINVNRQPME >gi568815596f:190260103_190471399|GENSCAN_predicted_CDS_1|339_bp agacacatagctattatccatttgttagaactagacctagatcctgccatcagcatcaca caacaaaacagaaactggaccgctctccttacaccttatacaaaaattaacttaagatgg attaaagacttaaacgtaagacctaaaaccataaaaaccctagaagaaaatctaggcaat accattcaggacataggcattggcaaagacttcatgactaaaacagcaaaagcaatgcca acaaaagccaaaattgacaaatgggatctaatcaaactaaagagcttctgcacagcaaaa gaaactgtcatcaacgtgaacaggcaacctatggaatga >gi568815596f:190260103_190471399|GENSCAN_predicted_peptide_2|643_aa MPGMPGRARPGAASLPGQGPRRPARRGEEPSRRHRLTYPRRGDGARRSPARAPDPRRRGS GAGTLWPWVIPLVQCVHTVDAPLSVSHFVVGLVVRPTVKVLQYLCPNSVPQKVEFFINYS SWGLMPVGFDQWVTPSVDWRMEKEKSSLSAFSLARSPIPSSPSTISSRRRSCPALPRRVA ETVALASWRTCHLCPAAQDGAKTFFFVESTAKDLFLHGGYLENDPGKEWSGPWVHGIPIH STPSSKRMSKHTDAAEEVLLEKKGCTGVITLNRPKFLNALTLNMIRQIYPQLKKWEQDPE TFLIIIKGAGGKAFCAGGDIRVISEAEKAKQKIAPVFFREEYMLNNAVGSCQKPYVALIH GITMGGTLERIWRSRIEKNLYCCERSIKGDSGEGSEEDKAGENVELLRDYLSGHGQNANR NMDSNGHFDEDPDENEEQDIGVKSILVIKPRALAQMVLGDGLGPPPISLPPGAAWASAPR IPAQCSSVVPAMAQVPQKEYTCRQFCYKSTANKGDRVIYNLTRSPYCCVRFPLAGTGPHI LYLSQLDSNLELSKRGKGRGEQRKEEVTCGMLRKDHTGKAMFHLLLQFFKEMLQDLDLTY LKFPLKALFLSAADLDAMVLALIGWKVCSALIFQSELCKLNQL >gi568815596f:190260103_190471399|GENSCAN_predicted_CDS_2|1932_bp atgccgggcatgcctggccgggcccggccgggggcggcctcgctgccggggcagggacct cgcaggccggcgcgcagaggcgaggaaccctcccggcgccatcgactcacctacccgcgg agaggcgacggcgcgcgccgctcaccagcccgggccccagatccaaggaggaggggcagc ggcgcggggaccctgtggccatgggtcatccctcttgttcagtgtgtccacactgtagat gctcccctgtctgttagtcactttgtagttggcttggttgtcagacctactgtcaaggta ttgcagtacttatgtccaaactctgtgcctcagaaggtggaatttttcataaactattcc agctggggtctcatgccagttggttttgaccaatgggtaacaccatcagtagattggagg atggaaaaggaaaaaagctcgctctccgccttctccctggcccggtccccgatcccttcc tctccctcaactatttcctccaggcgtcgttcttgccccgcccttccccgtcgcgtagcg gaaacggtcgctctggcttcctggaggacttgtcatctgtgcccggcggcgcaggacggc gcaaaaaccttcttcttcgtggagtctacagcaaaagacctttttcttcatggtggctac cttgaaaatgacccaggcaaggagtggtcaggcccttgggttcatggcatacctattcat agcacacccagcagtaagagaatgtccaagcacacagatgcagcagaagaggtgctattg gaaaaaaaaggttgcacgggagtcataacactaaacagaccaaagttcctcaatgcactg actcttaatatgattcggcagatttatccacagctaaagaagtgggaacaagatcctgaa actttcctgatcattataaagggagcaggaggaaaggctttctgtgccgggggtgatatc agagtgatctcggaagctgaaaaggcaaaacagaagatagctccagttttcttcagagaa gaatatatgctgaataatgctgttggttcttgccagaaaccttatgttgcacttattcat ggaattacaatgggtgggacacttgaaagaatttggaggagcaggatagaaaaaaacctg tattgctgtgaacggagcattaagggtgattctggtgagggctcagaagaagacaaggct ggggaaaatgtggaacttcttagagattatttaagtggtcatgggcagaatgctaataga aatatggacagtaatggccattttgatgaggacccggatgaaaatgaggaacaagatatt ggagtaaagagcatccttgtaataaagcccagagctctggcacaaatggttttgggtgat ggacttgggcctccccctataagcttaccgcccggggctgcttgggcctctgctcccaga attccagcacagtgctcctcagttgtccccgctatggctcaagtgccccagaaagagtac acttgccggcagttttgctacaagagtacagcgaacaaaggggacagggtaatttataac ctgacgcgttcaccctactgctgtgtccggtttccattggctggaacgggacctcacatt ctgtatttgtcccaattggatagcaacttagaactttctaaaagaggcaaaggcagagga gaacaaaggaaggaggaagtaacttgtggaatgctgagaaaggatcatactggtaaagcc atgtttcatctcctgttacagttcttcaaagaaatgcttcaggatcttgatctcacttac ttaaaatttccattgaaagctctgttcttgtctgcagcagatctggatgcaatggttttg gcactcatcggatggaaagtttgctcagctttaatttttcagtcagaattgtgtaaactg aaccaattataa >gi568815596f:190260103_190471399|GENSCAN_predicted_peptide_3|448_aa MPEINAHDGKARAAVNSQYFFPIGQVAVLAKQQLLRSAATEKQGSEMSDILRELLCVSEK AANIARACRQQEALFQLLIEEKKEGEKNKKFAVDFKTLADVLVQEVIKQNMENKLSLSVL SPLKLLKCNGLSCGFTLGEKITLRLCSTEEETAELLSKVLNGNKVASEALARVVHQDVAF TDPTLDSTEINVPQDILGIWVDPIDSTYQYIKGSADIKSNQGIFPCGLQCVTILIGVYDI QTGVPLMGVINQPFVSRDPNTLRWKGQCYWGLSYMGTNMHSLQLTISRRNGSETHTGNTG SEAAFSPSFSAVISTSEKETIKAALSRVCGDRIFGAAGAGYKSLCVVQGLVDIYIFSEDT TFKWDSCAAHAILRAMGGGIVDLKECLERNPETGLDLPQLVYHVENEGAAGVDRWANKGG LIAYRSRKRLETFLSLLVQNLAPAETHT >gi568815596f:190260103_190471399|GENSCAN_predicted_CDS_3|1347_bp atgccagagattaacgcccacgatggaaaggccagggctgcagtgaatagccagtacttc tttcccattggtcaagttgctgtgctcgctaagcagcaactgctcaggtcagctgcaact gaaaagcaaggttcagaaatgtcagatatcctccgggagctgctctgtgtctctgagaag gctgctaacattgcccgggcgtgcagacagcaggaagccctcttccagctgctgatcgaa gaaaagaaagagggagaaaagaacaagaagtttgcagttgacttcaagacgctggctgat gtactggtacaggaagttataaaacagaatatggagaacaagctctctctctctgttctt tctccactgaaactcttgaaatgtaatggcttatcgtgtggctttaccctaggggaaaag attaccttgaggttgtgttcaacagaggaggaaacagcagagcttcttagcaaagtcctc aatggtaacaaggtggcatctgaagcattagccagggttgttcatcaggatgttgccttt actgacccaactctggattccacagagatcaatgttccacaggacattttgggaatttgg gtggaccccatagattcaacttatcagtatataaaaggttctgctgacattaaatccaac cagggaatcttcccctgtggacttcagtgtgtcaccattttaattggtgtctatgacata cagacaggggttcccctgatgggagtcatcaatcaaccttttgtgtcacgagatccaaac accctcaggtggaaaggacagtgctattggggcctttcttacatggggaccaacatgcat tcactacagctcaccatctctagaagaaacggcagtgaaacacacactggaaacaccggc tctgaggcagcattctcccccagtttttcagccgtaattagtacaagtgaaaaggagact atcaaagctgcattgtcacgtgtgtgtggagatcgcatatttggggcagctggggctggt tataagagcctatgtgttgtccaaggcctcgttgacatttacatcttttcagaagatacc acattcaaatgggactcttgtgctgctcatgccatactgagggccatgggtgggggaata gtagacttgaaagaatgcttagaaagaaatccagaaacagggcttgatttgccacagttg gtgtaccacgtggaaaatgagggtgctgctggggtggatcggtgggccaacaagggagga ctcattgcatacagatccaggaagcggctggagacattcctgagcctcctggtccaaaac ctggcacctgcagagacgcatacctag >gi568815596f:190260103_190471399|GENSCAN_predicted_peptide_4|232_aa MVPAFAQLLVRPKKFTVMVEGKRGAPVSHEQDNKLLKNRMWHPPKPITGTQTYLPSEYST LPQLSACLTRNTTLIIFLVRHTRLAFQQDASSSFLKKGAFVSVFPLLGGGALHLAAGPQR PALKRVQPVVTGGGDCEAGAGEEEEPQRERSPSQPRRSARAARDGARCPDSRRPGAPSRR PVLRGQGSPRPAQLSAAAAAADTATIRFLNLFPTFPPFLFHKTAIVIMARSQ >gi568815596f:190260103_190471399|GENSCAN_predicted_CDS_4|699_bp atggtgccagcatttgctcagcttctggtaagacctaagaagtttacagtcatggtggaa ggcaaaaggggagccccagtatcacatgaacaagacaataaactccttaagaacaggatg tggcatcccccaaagcccatcacaggcacacagacatacttgccaagtgaatactccact ctcccccaactctctgcgtgtttaactagaaatacaactttaattatattcttagtgcga cacacacggttggcttttcagcaagatgccagtagcagcttcctgaaaaagggtgccttc gtctcagtgttcccccttcttggaggcggagcgctgcacctggcggccggtcctcagcgc ccggccctgaaacgggtccagccggtagtgaccggcggcggcgactgtgaggccggggcc ggggaggaggaggagccgcagcgggagagaagcccgtcgcagccccggaggagcgcgcga gcagcccgggatggtgcgcgctgccccgacagccgccgccccggcgccccgagccgccgc cccgtgctccggggacagggctccccgcgccccgcgcagctgagcgctgccgccgccgca gcagacacagcaaccatccgatttctcaatcttttccccacctttcccccttttctattc cacaaaaccgccattgtcatcatggcccgttctcaatga >gi568815596f:190260103_190471399|GENSCAN_predicted_peptide_5|653_aa MADDKVAILTDDEEEQKRKYVLADPFNGISREPEPPSNETPSSTETSAIPEEEIDWIEKH CVKINNDLLISKVFYFFFYSAYGSLYPLLPVYYKQLGMSPSQSGLLVGIRYFIEFCSAPF WGVVADRFKKGKIVLLFSLLCWVLFNLGIGFVKPATLRCVPKIRPTTHPTNASHQLTILP TNSSFTSFLTISPKMREKRNLLETRLNVSDTVTLPTAPNMNSEPTLQPQTGEITNRMMDL TLNSSTATPVSPGSVTKETTTVIVTTTKSLPSDQVMLVYDQQEVEAIFLVILVVVIIGEF FSASSVTIVDTVTLQYLGKHRDRYGLQRMWGSLGWGLAMLSVGIGIDYTHIEVLIDGKGC KPPEYRNYQIVFIVFGVLMTMALIVATQFRFRYNHFKNDDSKGKEVEIPQVERNNSTESS EETPTTTSHSQAFNFWDLIKLLCSVQYGSVLFVAWFMGFGYGFVFTFLYWHLEDLNGTTT LFGVCSVLSHVSELTAYFFSHKLIELIGHIRVSVFSVQQTVMSPLMETKSVAEEQRGTGG LIIPGDILGGFLEGMIPFNSGTLKIRGYLFAPKGLSIVKVEMWERREGYIKFIATQLVDG NLRPKGPRSCEPVTPVTEFHLGRGLKAKHNLPPKLDQLTLGYFKPGVLDYGLD >gi568815596f:190260103_190471399|GENSCAN_predicted_CDS_5|1962_bp atggcagatgataaagttgctatcttaacggatgatgaagaggaacagaagagaaagtat gtgcttgcagatccctttaatggtatttccagggaaccagaaccaccttcgaatgaaaca ccttcctccacagaaacatctgctattcctgaggaggaaatagactggatagagaaacat tgtgttaagataaacaacgatcttctaatttccaaggtcttttattttttcttttactct gcctatggctctctctatccccttttgcctgtgtattacaaacagctgggaatgtctcca agccagagtggactactagtaggtattcgttacttcattgaattctgcagtgcccccttt tggggtgtagttgcagaccgctttaaaaaaggcaaaattgtcctcctcttttctcttttg tgttgggttttattcaacctgggcattggatttgtcaaacctgctaccttgagatgtgta ccaaagattcgcccaacaactcaccccaccaatgcaagtcaccagttaactatcctgcca acaaattcttcctttacctctttcctcaccatatcaccaaaaatgcgtgagaaaagaaac cttttggaaacaaggctcaatgtctcagacaccgttactttgccaacagctccaaacatg aacagtgaacccactctgcagccccagacaggtgaaattactaaccgtatgatggacttg actttgaactcaagcacagcaacccctgtctccccaggaagcgtaaccaaggagacaacc actgttattgttaccaccaccaaatctttaccttctgaccaagtcatgcttgtttatgat caacaagaagttgaagctatattcttggtgatcttggtagttgtcataataggagaattt ttcagtgcctcttctgtcacaatcgtagacacggtcacactccagtatctgggaaaacac agagatcgctatgggttgcagcgcatgtggggctccctgggctggggcctggcgatgctg tctgtgggcatcgggatcgactacacccacatcgaagtgctcatcgatggaaaggggtgt aagccccccgagtacaggaattaccagatcgtcttcatcgtcttcggcgttctcatgacc atggccttgatcgttgccactcagttccggttccgctacaaccatttcaaaaacgatgat tctaaagggaaagaggtggagatcccgcaggtggaaaggaacaactctacagagtcctct gaggagacaccaaccaccacaagccactcgcaggccttcaacttttgggacttaatcaag ctgctctgcagcgtgcagtatggctcagtgctgtttgtggcttggttcatgggttttgga tatggcttcgtgttcacctttctctactggcatttggaagacctcaatggaactacaacc ctctttggggtctgttcagtcctgagtcatgtgtctgagctgacagcatatttttttagt cacaagcttattgaattgatcggccacatcagggtctcagtcttcagtgtgcaacagact gtgatgagtcctctaatggaaacaaagagtgtagcagaggaacagcgggggacaggagga ctaattatacctggagacatcttgggaggcttcttggaagggatgattcctttcaacagt gggactctcaaaatcagagggtacctgtttgctcctaaaggactatcaattgtgaaagta gaaatgtgggaaagaagggaaggctatatcaaattcatagccactcagcttgttgatggc aatctgagacctaagggtcccagaagttgtgaaccggtcactccagtgactgaattccat ctgggaagaggtttgaaggcaaagcacaatctacctcctaaactggatcaattgacactg ggttattttaagccaggagttttggactatggcctggattga