GENSCAN 1.0    Date run: 23-Apr-119    Time: 11:12:06

Sequence gi568815584f:72837979_73058802 : 220824 bp : 46.28% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   2197   2267   71  0  2   51   88    86 0.433   5.42
 1.02 Intr +  13981  14069   89  1  2   61   96    40 0.099   1.71
 1.03 Intr +  34539  35419  881  1  2   32   53   274 0.097   9.35
 1.04 Intr +  41764  41872  109  0  1   67   62    25 0.009  -2.34
 1.05 Term +  46080  46192  113  1  2  121   43    27 0.160   0.22
 1.06 PlyA +  48529  48534    6                               1.05

 2.05 PlyA -  49061  49056    6                               1.05
 2.04 Term -  49367  49261  107  0  2   98   42    86 0.191   3.57
 2.03 Intr -  60863  60750  114  0  0   81   84    55 0.567   4.82
 2.02 Intr -  63241  63134  108  2  0  107   80    82 0.764   9.56
 2.01 Init -  69077  69074    4  2  1   86   69     0 0.160  -1.94
 2.00 Prom -  79839  79800   40                              -2.86

 3.04 PlyA -  82592  82587    6                               1.05
 3.03 Term -  88160  87999  162  2  0  107   40    87 0.074   3.74
 3.02 Intr -  91831  91676  156  1  0  -47   79   227 0.169   8.51
 3.01 Init -  91952  91893   60  2  0  110   66   161 0.946  17.35
 3.00 Prom -  95291  95252   40                              -2.56

 4.00 Prom +  96769  96808   40                              -8.66
 4.01 Init + 101771 101924  154  1  1   81  110    30 0.959   4.65
 4.02 Intr + 102242 102399  158  0  2   36   86   244 0.974  18.73
 4.03 Intr + 103767 103846   80  2  2   14   63    70 0.377  -4.55
 4.04 Intr + 105016 105118  103  1  1   86  105   123 0.439  13.98
 4.05 Intr + 107906 108049  144  1  0  122  102   286 0.995  33.88
 4.06 Intr + 109164 109240   77  2  2   85   48    46 0.457  -1.39
 4.07 Intr + 113820 113899   80  0  2   63   92    23 0.271  -0.61
 4.08 Intr + 116186 116284   99  0  0   71   99    45 0.444   3.98
 4.09 Intr + 116408 116505   98  0  2   89   12   118 0.983   4.03
 4.10 Intr + 117545 117718  174  1  0   76   97   142 0.970  14.04
 4.11 Intr + 118408 118522  115  0  1  116   80   219 0.999  23.92
 4.12 Term + 120634 120827  194  2  2  102   31   221 0.999  15.38
 4.13 PlyA + 121650 121655    6                               1.05

 5.16 PlyA - 123085 123080    6                               1.05
 5.15 Term - 123606 123460  147  0  0   31   44    93 0.206  -2.90
 5.14 Intr - 125868 125777   92  1  2   68   84    77 0.162   5.01
 5.13 Intr - 133136 132915  222  0  0   92   17   238 0.084  15.20
 5.12 Intr - 136215 136102  114  1  0   71   52   212 0.935  16.32
 5.11 Intr - 136981 136801  181  1  1   33   65   177 0.992   9.34
 5.10 Intr - 137743 137573  171  1  0   69   60   201 0.996  15.54
 5.09 Intr - 137858 137832   27  2  0   80  113    12 0.699   1.21
 5.08 Intr - 140066 139949  118  1  1  111   46   141 0.662  12.67
 5.07 Intr - 140256 140159   98  0  2   57   92    45 0.992   0.61
 5.06 Intr - 140991 140883  109  2  1   45  113   111 0.766   9.59
 5.05 Intr - 143917 143811  107  1  2   93  106   133 0.986  14.61
 5.04 Intr - 155379 155165  215  1  2   77   86   241 0.752  21.23
 5.03 Intr - 160337 159833  505  2  1  126   85   432 0.833  39.25
 5.02 Intr - 186537 186048  490  2  1   66  115   317 0.036  25.21
 5.01 Intr - 210218 210049  170  2  2   88   87    76 0.351   6.24

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 133136 132904  233  0  2   92   40   246 0.916  16.84
S.002 Init - 167532 167476   57  0  0   80   87    60 0.882   4.41

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584f:72837979_73058802|GENSCAN_predicted_peptide_1|420_aa
MAFTCNTADLEGKACSRGTKLSPSIEQTSINPQKTLSRRTLSLLETQKLRGAAEISSTKL
PMTFFTELEKTTLKFIWNQKRARIAKAILSQKNKAGGIMLPDFKLYYKATVTKTAWYWYQ
NRDIDQWNRTEASEIMPHIYNYLIFDKPDKNKQWGKDSLFNKWCWENWLAIWRKMKLEPF
LTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGIGKDFMSKTPKAMATKAKIDKWD
LNKLKSFCTAKETTIRVNRQPTEWEKIFAIYSSDKGLISRIYNELKQIYKKKTNNPINKW
AKDINRHFSKEDIYAAKRHMKQCSSSLAIREMQIKTTMRYHLTPVRKGHRASPQTNKFTH
VSHTEVLTLEPMGLQGIQRWALPVISVEGGARDQARTKGPSSQKSTPLYQPRFCFLWFGG

>gi568815584f:72837979_73058802|GENSCAN_predicted_CDS_1|1263_bp
atggccttcacttgtaacacagctgatctggagggtaaagcctgcagccgtggaaccaag
ctctcgccaagcatagagcaaacatcaattaatcctcaaaagacgctttcaagaagaaca
ttatcccttttggagacgcagaagctaagaggtgccgcagaaatttcttccaccaagcta
ccaatgactttcttcacggaattggaaaaaactactttaaagttcatatggaaccaaaaa
agagcccgcattgccaaggcaatcctaagccaaaagaacaaagctggaggcatcatgcta
cctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaa
aacagagatatagaccaatggaacagaacagaggcctcagaaataatgccgcatatctac
aactatctgatctttgacaaacctgacaaaaacaagcaatggggaaaggattccctattt
aataaatggtgctgggaaaactggctagccatatggagaaaaatgaaactggagcccttc
cttacaccttatactaaaattaattcaagatggattaaagatttaaatgttagacctaaa
accataaaaaccctagaagaaaacctaggcaataccattcaggacataggcataggcaag
gacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggat
ctaaataaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaa
cctacagaatgggagaaaatttttgcaatctactcatctgacaaagggctaatatccaga
atctacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaacaagtgg
gcgaaggatataaacagacacttctcaaaagaagatatttatgcagccaaaagacacatg
aaacaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagatac
catctcacaccagttagaaagggacacagagcatcccctcagactaacaagttcacacac
gtctctcacactgaggttcttaccctagagcccatgggcctccagggcatccagagatgg
gctcttccagtcatctcagttgaaggaggagccagagaccaggcaaggaccaaaggccca
agttcccagaagagcaccccactctaccaaccgcgcttctgcttcctctggtttggtggg
tga

>gi568815584f:72837979_73058802|GENSCAN_predicted_peptide_2|110_aa
MGGLASCFDLGQKNYQDFRYGVHVGGLKEDLENVDLAAQCLEEHIRAGSGSQGTSTANQS
LRPWLVALWMDGGIEVIFFSLASHPEADENEELEDQLAYFDVDKHSVFGG

>gi568815584f:72837979_73058802|GENSCAN_predicted_CDS_2|333_bp
atgggaggccttgccagttgctttgacttaggacagaagaactaccaggatttcagatat
ggtgtccatgtgggtgggctgaaggaggacttggaaaatgtggacttggcagctcagtgc
ctggaggaacatatcagagctggttctggatctcagggaaccagcacagccaaccagtcg
ctgaggccatggctggttgctctgtggatggacggagggattgaggtgattttcttctca
ctagcatcccatccagaagctgatgagaatgaggaacttgaagaccaattggcctatttt
gacgtggataagcactcagtttttggtggttaa

>gi568815584f:72837979_73058802|GENSCAN_predicted_peptide_3|125_aa
MALRYLMAVGLNKGHKVTKNRDMIREVCGFASYELRAMESLKVFKEKWALKFIKKRVGTH
IRAKRKREELSKGDAALEKSSRCSGKSEGVCEARDRSNVMLRLAAKSSAVASGGRGRCPG
SAFRV

>gi568815584f:72837979_73058802|GENSCAN_predicted_CDS_3|378_bp
atggctctgcgctaccttatggccgtgggcctcaacaagggccacaaggtgaccaagaac
cgggacatgatccgggaggtgtgtggcttcgcttcgtacgagctgcgcgccatggagtca
ctgaaggtcttcaaggagaaatgggccctcaagttcatcaagaaaagggtggggacgcac
atccgtgccaagaggaagcgggaggagctgagcaagggtgatgcggccctggagaagtcc
tctcgctgtagtggaaagagcgaaggagtctgtgaggcgagggacagaagtaacgtgatg
ttacgactggcggcgaagtcttcggcagtggcctcaggcggccgaggccgctgccctggg
tctgcgttcagggtgtag

>gi568815584f:72837979_73058802|GENSCAN_predicted_peptide_4|491_aa
MAGTQVLGCKLTQLCVNRSDSRAAQPAHDSGHGDDESPSTSSGTAGTSSVPELPGFYFDP
EKKRYFRLLPGHNNCNPLTKESIRQKEMESKRLRLLQEEDRRKKIARMGFNASSMLRKSQ
LGFLNVTNYCHLAHELRLSCMERKKVQIRSMDPSALASDRFNLILADTNSDRLFTVNDVK
VGGSKYGIINLQSLKTPTLKVFMHENLYFTNRKVNSVCWASLNHLDSHILYPLAEEEVWL
CLMGLAETPGCATLLPASLFVNSHPGIDRPGMLCSFRIPGAWSCAWSLNIQANNCFSTGL
SRRVLLTNVVTGHRQSFGTNSDVLAQQFALMAPLLFNGCRSGEIFAIDLRCGNQGKGWKA
TRLFHDSAVTSVRILQDEQYLMASDMAGKIKLWDLRTTKCVRQYEGHVNEYAYLPLHVHE
EEGILVAVGQDCYTRIWSLHDARLLRTIPSPYPASKADIPSVAFSSRLGGSRGAPGLLMA
VGQDLYCYSYS

>gi568815584f:72837979_73058802|GENSCAN_predicted_CDS_4|1476_bp
atggctggaactcaagtcctgggctgcaagttaacacagctgtgtgtcaacaggtctgac
tcccgggcagcacagcccgctcacgattccggccacggtgatgacgagtctccgtcaacc
tcgtctggcacagctgggacctcctctgtgccagagctacctgggttttactttgaccct
gaaaagaaacgctacttccgcttgctccctggacataacaactgcaaccccctgacgaaa
gagagcatccggcagaaggagatggagagcaagagactgcggctgctccaggaagaagac
agacggaaaaagattgccaggatgggatttaatgcatcttccatgctacgaaaaagccag
ctgggttttctcaacgtcaccaattactgccatttagcccacgagctgcgtctcagctgc
atggagaggaaaaaggtccagattcgaagcatggatccctccgccttggcaagcgaccga
tttaacctcatactggcagataccaacagtgaccggctcttcacagtgaacgatgttaaa
gttggaggctccaagtatggtatcatcaacctgcaaagtctgaagacccctacgctcaag
gtgttcatgcacgaaaacctctacttcaccaaccggaaggtgaattcggtgtgctgggcc
tcgctgaatcacttggattcccacattctgtatcctttggcagaggaagaggtttggcta
tgcctcatgggactcgcagagactccaggctgtgccaccctgctcccagcatcactgttc
gtcaatagtcacccaggaatagaccggcctggcatgctctgcagtttccggatccctggt
gcctggtcctgtgcctggtccctgaatatccaagcaaataactgcttcagtacaggcttg
tctcggcgggtcctgttgaccaacgtggtgacgggacaccggcagtcctttgggaccaac
agtgatgtcttggcccagcagtttgctctcatggctcctctgctgtttaatggctgccgc
tctggggaaatctttgccattgatctgcgttgtggaaatcaaggcaagggatggaaggcc
acccgcctgtttcatgattcagcagtgacctctgtgcggatcctccaagatgagcaatac
ctgatggcttcagacatggctggaaagatcaagctgtgggacctgaggaccacgaagtgc
gtaaggcagtacgaaggccacgtgaatgagtacgcctacctgcccctgcatgtgcacgag
gaagaaggaatcctggtggcagtgggccaggactgctacacgagaatctggagcctccac
gatgcccgcctactgagaaccataccctccccgtaccctgcctccaaggccgacattccc
agtgtggccttctcgtcgcggctggggggctcccggggcgcgccggggctgctcatggct
gtcgggcaggacctttactgttactcctacagctaa

>gi568815584f:72837979_73058802|GENSCAN_predicted_peptide_5|921_aa
CGPHSKSDFPNAQKMAMLSPQTFEFRNKIELISEALPEDQERTFQDLQEPELSHTPNVSM
SAQTSPAEKGLNPGLMCQESYACSGTDEAIFECDECCSLQCLRCEEELHRQERLRNHERI
RLKPGHVPYCDLCKGLSGHLPGVRQRAIVRCQTCKINLCLECQKRTHSGGNKRRHPVTVY
NVSNLQESLEAEEMDEETKRKKMTEKVVSFLLVDENEEIQVTNEEDFIRKLDCKPDQHLK
VVSIFGNTGDGKSHTLNHTFFYGREVFKTSPTQESCTVGVWAAYDPVHKVAVIDTEGLLG
ATVNLSQRTRLLLKVLAISDLVIYRTHADRLHNDLFKFLGDASEAYLKHFTKELKATTAR
CGLDVPLSTLGPAVIIFHETVHTQLLGSDHPSEVPEKLIQDRFRKLGRFPEAFSSIHYKG
TRTYNPPTDFSGLRRALEQLLENNTTRSPRHPGVIFKALKALSDRFSGEIPDDQMAHSSF
FPDEYFTCSSLCLSCGVGCKKSMNHGKEGVPHEAKSRCRYSHQYDNRVYTCKACYERGEE
VSVVPKTSASTDSPWMGLAKYAWSGYVIECPNCGVVYRSRQYWFGNQDPVDTVVRTEIVH
VWPGGVSGSELATTDGFLKDNNNAAQRLLDGMNFMAQSVSELSLGPTKAVTSWLTDQIAP
AYWRPNSQILSCNKCATSFKDNDTKHHCRACGEGFCDSCSSKTRPVPERGWGPAPVRVCD
NCYEARNVQLAVTEAQVDDEGGTLIARKVGEAVQNTLGAVVTAIDIPLGLVKDAARPAYW
VPDHEILHCHNCRKEFSIKLSKHHCRACGQGFCDECSHDRRAVPSRGWDHPVRVCFNCNK
KPAVMSLPMWLKTLGAEHRITLITVDDKKLGEREGPEILGLAELAFLPREMRKISPLSLT
HVEKMDEIKINWTKWYKGSCE

>gi568815584f:72837979_73058802|GENSCAN_predicted_CDS_5|2766_bp
tgtgggcctcattctaagtctgacttccctaatgctcagaagatggctatgttgagtcca
caaacttttgaattcagaaacaagattgagttgatttcagaagctctcccagaagatcaa
gaaagaactttccaggatctccaggaacctgaattgagtcatacgccaaacgtgagtatg
agtgcccagacttccccagcagagaagggcctgaatccggggctgatgtgccaggaaagt
tacgcttgcagcgggactgatgaagctatctttgagtgtgatgagtgctgcagtctgcag
tgtctccgctgcgaggaggagctccatcggcaggagcgcctgagaaaccatgagcggata
agactcaaacctggccatgtcccttactgtgacctctgcaagggtctcagtgggcattta
ccaggtgttaggcagagggcaatagtgaggtgccagacctgcaaaattaacttgtgcctg
gagtgccagaagaggactcattctgggggtaacaaaaggagacaccctgttactgtgtac
aatgtcagtaatctccaggagtcactggaggcagaagagatggatgaggagaccaagagg
aagaagatgactgagaaggttgtgagtttcctcctagtagacgaaaatgaagaaattcag
gtaacaaatgaagaagactttattagaaaattggactgcaaacctgatcagcatctgaaa
gtggtttccatttttggaaatactggtgatggaaagtctcatactctcaaccacactttc
ttttatggtcgtgaagtctttaaaacctccccgacccaggagtcctgcactgtgggagtg
tgggcagcctatgacccagttcacaaagtagcagtgatcgatacggaagggctcctgggg
gccaccgtgaatctaagccagagaacacggctgctgcttaaggtcctggccatctcagac
ctcgtcatctatcgaactcatgcagaccggctgcataacgacctcttcaaattccttggg
gatgcctcagaagcttatctgaagcacttcaccaaggagctcaaggccaccactgctcgc
tgtggcctggatgtccctttatccacactgggccctgcagttatcatcttccatgagacc
gtgcacacccagctactgggctctgatcatccctcagaggtgccagagaagctcatccag
gaccggttccggaagctgggccgtttccctgaagcctttagttccattcactacaaggga
acgaggacttacaaccctcccacggacttttctgggcttcggcgtgctttggagcagcta
ctagagaataacaccacccgttctccccggcacccgggagtcatcttcaaagccctgaag
gcactaagtgaccgcttcagcggtgagatccccgatgaccagatggcgcacagctccttt
tttccagatgagtatttcacctgctcctccttgtgcctcagctgtggggttggatgtaag
aaaagcatgaatcatgggaaggaaggagtgcctcatgaagccaagagccgctgcagatac
tcccaccagtatgacaaccgagtgtatacctgcaaggcctgctatgagagaggcgaggaa
gtcagtgtagtgcccaaaacatctgcttccactgactccccctggatgggtctcgcaaaa
tatgcctggtctgggtatgtgatcgaatgtcctaactgtggcgtggtctatcgtagtcgg
cagtactggtttggaaaccaagatcctgtggatacggtggtgcggacagagattgtgcat
gtgtggcctggaggagttagtggttctgagttggccacgactgatgggtttctgaaggac
aacaacaatgctgcccagcgcctgttggacgggatgaacttcatggctcagtcggtgtcc
gagcttagccttggacccaccaaggctgtgacttcctggctgacagaccagatcgcccct
gcctactggaggcccaactcccagattctgagctgcaacaagtgtgcgacgtcctttaaa
gataacgacactaagcatcactgccgagcctgtggggagggcttctgtgacagctgttca
tcaaagactcggccagtgcctgagcggggctggggccctgcgccagtgcgggtctgtgac
aactgctacgaagccaggaacgtccagttagctgttaccgaggcacaagtggacgatgaa
ggtggaacgctcattgctcggaaggtgggcgaggccgtgcagaacactctgggagccgtg
gtgacagccattgacataccactaggtctggtaaaggacgcggccaggcctgcgtactgg
gtgcctgaccacgaaatcctccactgccacaactgccggaaggagttcagcatcaagctc
tccaagcaccactgccgggcctgcggacagggcttctgtgatgagtgctcccatgaccgc
cgggctgttccttctcgtggctgggaccatcccgtccgagtctgcttcaactgcaataaa
aagcccgctgtgatgagcctgcctatgtggctgaagacccttggtgctgaacaccggatc
accctaattacagttgatgacaagaaactaggggaaagggaggggccagaaattctgggc
ttggcagagcttgcctttctcccacgagaaatgagaaaaatctcacccttgtctctgacc
catgtggaaaaaatggatgagatcaagatcaattggaccaaatggtacaaaggatcatgt
gagtag