GENSCAN 1.0 Date run: 4-Nov-116 Time: 00:06:06 Sequence gi568815586f:71655175_71885670 : 230496 bp : 38.46% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 1225 1220 6 -1.95 1.05 Term - 1389 1328 62 1 2 93 42 84 0.932 1.29 1.04 Intr - 2099 1711 389 1 2 -31 90 332 0.063 15.01 1.03 Intr - 8401 7775 627 0 0 25 48 598 0.038 39.89 1.02 Intr - 8727 8544 184 1 1 82 90 152 0.964 12.72 1.01 Init - 9068 8849 220 2 1 114 37 101 0.764 6.44 1.00 Prom - 17560 17521 40 -6.75 2.00 Prom + 18793 18832 40 -4.85 2.01 Init + 20701 20782 82 0 1 63 84 41 0.021 2.38 2.02 Intr + 31004 31290 287 0 2 43 77 199 0.418 10.44 2.03 Intr + 31486 31636 151 0 1 85 115 -46 0.481 -3.39 2.04 Intr + 34417 34530 114 2 0 87 87 58 0.481 5.10 2.05 Intr + 41262 41399 138 1 0 94 93 91 0.971 9.71 2.06 Intr + 42106 42360 255 2 0 103 97 113 0.528 10.19 2.07 Intr + 43726 43935 210 2 0 117 99 104 0.901 12.36 2.08 Intr + 45658 45817 160 2 1 116 18 76 0.155 1.52 2.09 Intr + 56300 56409 110 2 2 74 53 73 0.016 1.41 2.10 Intr + 75779 75828 50 0 2 74 89 46 0.129 0.78 2.11 Intr + 76619 76816 198 1 0 -14 -26 300 0.857 7.03 2.12 Intr + 77105 77226 122 1 2 59 98 99 0.579 6.37 2.13 Term + 82870 82879 10 1 1 108 49 0 0.144 -5.21 2.14 PlyA + 83339 83344 6 1.05 3.00 Prom + 86291 86330 40 -6.05 3.01 Init + 99956 100114 159 1 0 80 80 306 0.118 26.87 3.02 Intr + 114626 114685 60 1 0 89 106 28 0.120 2.71 3.03 Intr + 118785 118848 64 2 1 91 99 -2 0.084 -1.43 3.04 Intr + 126857 126911 55 0 1 71 91 43 0.420 0.02 3.05 Intr + 127396 127484 89 1 2 70 56 65 0.419 0.20 3.06 Intr + 130357 130480 124 2 1 112 15 145 0.067 8.32 3.07 Intr + 149793 149908 116 0 2 70 36 71 0.005 -0.73 3.08 Term + 153518 153612 95 0 2 56 48 138 0.066 3.61 3.09 PlyA + 154234 154239 6 1.05 4.00 Prom + 162103 162142 40 -6.45 4.01 Init + 165650 165691 42 1 0 96 93 11 0.917 2.87 4.02 Intr + 168896 169110 215 1 2 48 95 93 0.642 2.59 4.03 Term + 170269 170536 268 1 1 81 48 123 0.902 1.58 4.04 PlyA + 170827 170832 6 1.05 5.07 PlyA - 170998 170993 6 1.05 5.06 Term - 179991 179853 139 2 1 45 50 162 0.870 4.55 5.05 Intr - 180306 180106 201 2 0 24 52 177 0.016 5.18 5.04 Intr - 184757 184412 346 0 1 59 1 218 0.005 3.73 5.03 Intr - 189782 189664 119 1 2 41 66 61 0.259 -1.61 5.02 Intr - 197522 197361 162 1 0 60 121 112 0.667 9.87 5.01 Init - 199693 199599 95 1 2 59 81 48 0.418 1.10 5.00 Prom - 200996 200957 40 -5.65 6.00 Prom + 201618 201657 40 -6.95 6.01 Init + 201988 202053 66 0 0 79 64 47 0.189 0.52 6.02 Intr + 216896 216994 99 1 0 101 115 53 0.967 8.69 6.03 Intr + 217755 217829 75 2 0 93 63 117 0.942 8.49 6.04 Intr + 225295 225433 139 0 1 53 64 126 0.181 5.82 6.05 Term + 229637 229851 215 0 2 76 42 63 0.014 -3.09 6.06 PlyA + 230413 230418 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 2127 1711 417 1 0 10 90 338 0.832 19.27 S.002 Init - 3842 3791 52 2 1 53 98 13 0.886 0.19 S.003 Term - 8401 7756 646 0 1 25 32 602 0.936 40.91 S.004 Term + 21515 21934 420 1 0 60 32 189 0.845 4.80 S.005 Term + 130357 130499 143 2 2 112 36 142 0.902 8.51 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:71655175_71885670|GENSCAN_predicted_peptide_1|493_aa MADTTRSFSPPPRGSGSTAFLWQRSSEQLKGRGVFPSDYYKLQEFYNKTLSRLQQQLHLL LHAPNCTKPDPAVGSDVSGTAATRVPFSRGGRTPIAEEQGGARKGNWVATVRRERAGVLG CGEVGEQRPLGVRVGGLSPKEEGELEDGEISDDDNNSQIRSRSSSSSSGGGLLPYPRRRP PHSARGGGSGGGGGSSSSSSSSQQQLRNFSRSRHASERGHLRGPSSYRPKEPFRSHPPSV RMPSSSLSESSPRPSFWERSHLALDRFRFRGRPYRGGSRWSRGRGVGERGGKPGCRPPLG GGAGSGFSSSQSWREPSPPRKSCILNVAVGAGCVTKVPFSVITRKQNYSSKNENCVEETF EDLLLKYKQIQLELECINKDEKLALSSKEENVQEDPKTLNFEDQTSTDNVSITKDSSKEV APEEKTQVKTFQAFELKPLRQKLTLPGDKNRLKKVKDGAKPLSLKSDTTDSSQGIPYRVK EGFTPIPGLKFSA >gi568815586f:71655175_71885670|GENSCAN_predicted_CDS_1|1482_bp atggcggacacaacccgttctttctccccgcccccaagagggagcggatctactgcgttt ctttggcagcggtcctcagagcaattaaaaggaagaggtgttttcccctccgattactat aaattacaagagttttacaataaaactctctcaaggctgcaacagcagctccacctactt ctgcatgcacccaactgtaccaagccagaccctgcggtgggaagtgacgtttcgggtaca gccgctaccagagtccctttctcgcgaggcggaagaaccccgatcgctgaggagcaaggg ggcgctaggaaagggaactgggttgcgacggtccggcgagagagagctggggtgctgggg tgcggggaagttggggagcagaggccgcttggtgtccgagtaggtggcctctcgccgaag gaagaaggggagcttgaagatggggaaatcagtgacgacgataataacagccagatacgg agtcggagcagcagcagcagcagcggcggcgggctgttaccctatccgcggcgaaggcct cctcactcggcccggggcggtggatctggcggaggcggtggctcttcctcgtcatcgtcc tcttctcagcagcagctgaggaatttctcacgctcgcggcacgcgtctgagcggggccac ctcaggggacccagcagctaccgacccaaagaaccgttccggtctcatccgccttctgta cggatgccttcgagctcactgtccgaaagcagtccccggccgtctttctgggagcggagc cacctcgccttggaccgtttccgctttcgaggcaggccttaccggggtgggagtcgctgg agtcgggggcgaggagtgggtgagcgaggaggcaagccggggtgcagacctcctctggga ggaggagcaggatccgggttcagcagcagtcagagctggcgagagccctctccacctcgg aagagctgtatccttaacgtggcggtcggcgctgggtgtgtcactaaagttccctttagc gtcattactagaaaacaaaattattcatcaaaaaatgaaaactgtgtggaagaaactttt gaagatttgcttttaaagtataaacaaatacagttggaactagaatgcatcaataaggat gaaaaactagcattgagtagcaaagaagagaatgtgcaggaagatcctaaaacattgaac ttcgaggaccaaactagcactgataatgtcagtattacaaaggattcaagtaaagaagta gctcctgaggagaaaacacaagtcaaaacttttcaggcatttgaattaaaaccactcagg caaaaattgactttaccaggagataagaaccgtttgaaaaaagttaaagatggagcaaaa ccactttccctgaaatccgacactactgattctagtcaaggtatcccttatcgggtaaag gagggttttactcctattcctggtttgaaattttcagcgtga >gi568815586f:71655175_71885670|GENSCAN_predicted_peptide_2|628_aa MWINEDQNEVERSIKNNIVRTDMIRAQWRLAQLSGRDCVQPRRACFPDLPKFGCPSCPFA AAGLPGPRVRRLRREVGGTRQRRDRGLAASAFSFPRGLRREALKRPAAGDRPSPSAGLRS AFAELISLFSMTDLNDNICKRYIKMITNIVILSLIICISLAFWIISMTASTYYGNLRPIS PWRWLFSVVVPVLIVSNGLKKKSLDHSGALGGLVVGFILTIANFSFFTSLLMFFLSSSKL TKWKGEVKKRLDSEYKEGGQRNWVQVFCNGAVPTELALLYMIENGPGEIPVDFSKQYSAS WMCLSLLAALACSAGDTWASEVGPVLSKSSPRLITTWEKVPVGTNGGVTVVGLVSSLLGG TFVGIAYFLTQLIFVNDLDISAPQWPIIAFGGLAGLLGSIVDSYLGATMQYTGLDESTGM VVNSPTNKARHIAGKPILDNNAVNLFSSVLIALLLPTAAWGFWPRGVTKLIAVYRPEYLC GDAKVQLRSYSTKVIVKLRRGLARQGTSIAGCALTDGGMAHPQAAGRGEEQICGRRHKRL DKGWHASRPSTNQTKWSLAQEVGKEPDRSEAQLQGKTIPLLAPPSRSEQRGTEEVSQTRI AHPVRGTKEPFRSNIKVVLTQGQDLQAS >gi568815586f:71655175_71885670|GENSCAN_predicted_CDS_2|1887_bp atgtggattaatgaggatcaaaatgaagtggagagatcaataaagaataatatagtcaga acagatatgatacgggcccagtggcggcttgcgcagctgtcgggacgtgactgcgttcag ccgcgtcgggcgtgcttcccagacttgcccaagttcgggtgccctagctgcccctttgca gccgctggcctacccggcccgcgggtgagaaggttgcgacgggaggtgggtggaactcgc cagcgccgggaccgcggattggctgcctcggctttctcttttccccgtgggctccggcgt gaggcgctgaagcggccggcagccggcgaccggccctcaccgtccgccgggttgcgctct gcttttgcggagctcatatccttattttccatgacagatcttaacgacaatatatgcaaa agatatataaagatgataactaatatagttatactgagcctgatcatttgcatttcgtta gctttctggattatatcaatgactgcaagcacctattatggtaacttacgacctatttct ccgtggcgttggctgttttctgttgttgttcctgttctgatcgtctctaatggccttaaa aagaaaagtctagatcacagtggggctctaggagggctagtcgttggatttatcctaacc attgcaaatttcagcttttttacctctttgctgatgtttttcttgtcttcttcgaaactc actaaatggaagggagaagtgaagaagcgtctagattcagaatataaggaaggtgggcaa aggaattgggttcaggtgttctgtaatggagctgtacccacagaactggccctgctgtac atgatagaaaatggccccggggaaatcccagtcgatttttccaagcagtactccgcttcc tggatgtgtttgtctctcttggctgcactggcctgctctgctggagacacatgggcttca gaagttggcccagttctgagtaaaagttctccaagactgataacaacctgggagaaagtt ccagttggtaccaatggaggagttacagtggtgggccttgtctccagtctccttggtggt acctttgtgggcattgcatacttcctcacacagctgatttttgtgaatgatttagacatt tctgccccgcagtggccaattattgcatttggtggtttagctggattactaggatcaatt gtggactcatacttaggggctacaatgcagtatactgggttggatgaaagcactggcatg gtggtcaacagcccaacaaataaggcaaggcacatagcagggaaacccattcttgataac aacgcagtgaatctgttttcttctgttcttattgccctcttgctcccaactgctgcttgg ggtttttggcccaggggagtcaccaagttaatagcagtataccgaccagaatacctctgt ggggatgctaaagttcagttgagaagctacagcaccaaagttattgtaaaactaagaagg gggctggcaaggcaaggtacttctatagcagggtgcgcccttacagatggaggaatggca cacccacaagctgctggacgtggagaggagcagatctgtggaagaagacacaagcggctg gataagggctggcatgccagcaggccatcgaccaaccagacgaagtggagtttggcccag gaagtcggaaaagagccagaccgctcagaggcccaactccaggggaaaaccatccccctt cttgctcccccatcgaggtctgagcagcggggcactgaagaagtgagccaaacccgcatc gcacaccctgtgagggggacaaaggaacctttccgttccaacatcaaggttgttttgacc caagggcaagatttacaagcatcataa >gi568815586f:71655175_71885670|GENSCAN_predicted_peptide_3|253_aa MAAAGGGGGGAAAAGRAYSFKVVLLGEGCVGKTSLVLRYCENKFNDKHITTLQASFLTKK LNIGGKRVNLAIWVKNWVKELRKMLGNEICLCIVGNKIDLEKERHVSIQEAESYAESVGA KHYHTSAKQNKGIEELFLDLCKRMIETAQVDERAKGNGSSQPGTARRGVQIIDDEPQAQT SGGGAGTPLKTGTAVLISLLVLLAQCLAAGECDSDECMGSHYGTDLDASDTAITENKRTD KISARMQSLMSMK >gi568815586f:71655175_71885670|GENSCAN_predicted_CDS_3|762_bp atggctgcggccggcggcggcggcggcggggcggcggcggcgggccgagcctactcgttc aaggtggtgctgctgggggaaggctgcgtggggaagacgtcgctggtgctgcgctactgc gagaacaagtttaacgacaagcacatcaccactctgcaggcatcattcttaacaaagaag ttaaatattggtgggaaaagagtaaaccttgccatatgggtaaaaaactgggtcaaagaa ttacggaaaatgttgggaaatgaaatctgtttatgtatagttggtaataaaatagacttg gaaaaggagagacatgtttccattcaagaagcagagtcgtatgcagaatctgtgggagca aaacattatcatacttcagccaaacagaacaaaggaattgaggaactctttcttgacctt tgtaaaaggatgatagaaacagcacaagtggatgagagagcaaaaggcaatggctctagt cagccgggaactgcaaggcgaggtgtacagattattgatgatgaacctcaagcccagacc agtggtggaggagcaggaactcctttgaagactggaactgctgttcttatctctttatta gtccttttggctcagtgtctggctgcaggtgaatgtgacagtgatgagtgcatggggagt cattatggcaccgatttagatgccagcgacacagcaattacagaaaacaaacgaacagac aaaatctctgcccgtatgcagtctttaatgtcaatgaagtga >gi568815586f:71655175_71885670|GENSCAN_predicted_peptide_4|174_aa MLYIRSLELIPLITQLRQAGSGESGLQKRKSRCSSSKSDQQVIYLKEQKVKGHGCCVHIP TLLWNTESGNAEPQTLAPAQNAKSSLLSEALSSNVMVYMLLNLHSYESGGQHLNRDTATR KFKPEILMNSWRPTRGQLQNVKLLEGPIKGGLPLPAPPWAFPPGNPRGSHRVKG >gi568815586f:71655175_71885670|GENSCAN_predicted_CDS_4|525_bp atgctgtacattagatccctagaacttattcctcttataactcagctaagacaggctggg tctggggaatcaggactgcagaaaagaaaatcaagatgcagcagcagcaagtctgaccag caggtaatctacctaaaagagcaaaaagttaagggccatggttgctgtgtccatatacca acattgctgtggaatactgaatctgggaatgctgaaccacagactctagcaccagcccaa aatgccaagagcagccttctatcagaagctttgtcttcaaatgtgatggtttatatgctc ctgaacttacattcttatgagtcaggaggccagcatcttaacagggacacagccaccagg aagttcaaaccagaaatcttgatgaattcctggaggccaacaagaggccagctccagaat gtgaagctgctggagggtcccataaagggaggtctccccctacccgctccaccctgggct tttcctccaggaaacccacgaggctcccatagggtcaaaggttaa >gi568815586f:71655175_71885670|GENSCAN_predicted_peptide_5|353_aa MKKYTLRKRKGRTDQRFEEVAPPVTKRICKRRDLWNLELEMDDLGYPVEEISKQQSLQDV AWLFLTTYAHMHTQRNDLKLELIFKRSFFTHAFSTFMSTNHMLEIKGVSGKNTHKEKCLT KLAVLYATGCPDTESPSCPSPNQLPGSKSIPVPEKSRGLPGGRYLPSRSQHPPPPCFLRV PGNDVAEPHVSLGHTTTSGKTRITSGREKGGEWREGSGGKEPGGKKPPDACVLNWAQRRL GEIRGDFASTKQEYSENKSKLLEIKNMMLNLVKITEVQATMQELSKEMLKAGIVEEMLEG TFESIEDHRGLGQSKVTAVLLEPESSRAMAASEDKEEEEALEPMQSWLVTLCS >gi568815586f:71655175_71885670|GENSCAN_predicted_CDS_5|1062_bp atgaaaaaatataccttgagaaaaagaaagggaagaacagatcaaagatttgaggaagta gcaccacctgtcaccaagcgaatctgcaaaagaagggatctgtggaacttggaacttgag atggatgatttagggtatccagtggaagaaatttctaagcagcaaagtcttcaagatgtg gcctggctgtttctaacaacctatgctcatatgcatacgcaaagaaatgacctgaaactg gaacttatatttaaaagaagtttctttactcatgcattcagcacatttatgagcaccaac cacatgctggaaataaaaggtgtcagtggcaaaaacacacacaaggaaaagtgtctgact aaactagcagtattatacgccacgggttgtccggacacagaaagtccctcgtgtccctcc cccaaccagctgccagggagcaaaagcatccccgtcccagagaaaagccgaggtcttcct ggaggccgttacctaccttcccgctcacaacacccgccgccgccatgtttcctgcgcgtg cctggtaatgacgtagcagaacctcacgtgtctctcggacatacaacaacatccggcaaa actagaataacatccgggcgggaaaagggcggggagtggcgggaagggagtggaggaaag gagccgggcggaaaaaagccgcctgacgcgtgcgtgctgaactgggctcagagacggctg ggagagataagaggagattttgcttccacgaaacaagaatattcagagaacaaaagtaag ctgttagaaattaaaaatatgatgctaaatcttgtgaagattacagaagtccaggccacc atgcaggagctgtccaaagaaatgttgaaggctgggatcgtagaggagatgttagagggc acttttgaaagcatagaagatcacaggggccttgggcaaagcaaagtgactgctgtcctt ctagagccagaatcttcaagagcgatggccgcctcagaggacaaggaggaagaagaggct ctggagcccatgcagtcctggctggttacactctgcagctag >gi568815586f:71655175_71885670|GENSCAN_predicted_peptide_6|197_aa MVLNSWPQVILLPWPPSVLGLQIIYEQEGVYIHSSCGKTNDQDGLISGILRVLEKDAEVI VDWRPLDDALDSSSILYARKDSSSVVEWTQAPKERGHRGSEHLNSYEAEWDMVNTVSFKR KPHTNGDAPSHRNGKSKWSFLFSLTDLKSIKQNKEGMGWSYLVFCLKDDVVLPALHFHQG DSKLLIESLEKYVVLCE >gi568815586f:71655175_71885670|GENSCAN_predicted_CDS_6|594_bp atggtcttgaactcctggcctcaagtgatcctcctgccttggcctcccagtgtcctggga ttacagattatatatgaacaagaaggagtatatattcactcatcttgtggaaagaccaat gaccaagacggcttgatttcaggaatattacgtgttttagaaaaggatgccgaagtaata gtggactggagaccattggatgatgcattagattcctctagtattctctatgctagaaag gactccagttcagttgtagaatggactcaggccccaaaagaaagaggtcatcgaggatca gaacatctgaacagttacgaagcagaatgggacatggttaatacagtttcatttaaaagg aaaccacataccaatggagatgctccaagtcatagaaatgggaaaagcaaatggtcattc ctgttcagtttgacagacctgaaatcaatcaagcaaaacaaagagggtatgggctggtcc tatttggtattctgtctaaaggatgacgtcgttctccctgctctacactttcatcaagga gatagcaaactactgattgaatctcttgaaaaatatgtggtattgtgtgagtaa