GENSCAN 1.0    Date run: 6-Nov-116    Time: 17:44:40

Sequence gi568815595f:87140698_87353819 : 213122 bp : 35.64% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   4611   4754  144  2  0   83   49    87 0.878   3.86
 1.02 Intr +   4909   4977   69  0  0  114   62    79 0.909   6.36
 1.03 Intr +   5969   6036   68  1  2   66  103    52 0.817   1.28
 1.04 Intr +  10976  11019   44  2  2   69   67    39 0.332  -3.13
 1.05 Intr +  13641  13702   62  1  2   84   73    88 0.632   4.43
 1.06 Intr +  16763  16873  111  1  0   86   84    77 0.861   6.76
 1.07 Term +  33027  33080   54  2  0   74   37    90 0.113  -0.92
 1.08 PlyA +  33856  33861    6                               1.05

 2.05 PlyA -  34503  34498    6                               1.05
 2.04 Term -  49603  49386  218  2  2   47   48   176 0.812   5.82
 2.03 Intr -  53924  53855   70  2  1   77  110    23 0.939   1.24
 2.02 Intr -  55631  55404  228  2  0   74   37   104 0.467   1.04
 2.01 Init -  60555  60487   69  0  0   48  116    87 0.502   8.60
 2.00 Prom -  62945  62906   40                              -6.85

 3.00 Prom +  67720  67759   40                              -7.25
 3.01 Init +  78613  78692   80  0  2   80   49   107 0.749   6.58
 3.02 Intr +  79786  79891  106  1  1   59   68    13 0.365  -4.30
 3.03 Intr +  85944  86071  128  2  2   77   99    58 0.678   4.36
 3.04 Intr +  86464  86638  175  1  1   66   33   206 0.253  11.92
 3.05 Intr +  87108  87319  212  2  2   66   52   169 0.189   7.89
 3.06 Intr +  92894  92976   83  2  2   88  103    30 0.549   2.86
 3.07 Intr + 100002 100093   92  1  2   70  119   120 0.998  12.09
 3.08 Intr + 105017 105211  195  1  0  111  106   137 0.957  16.49
 3.09 Intr + 109178 109280  103  1  1   -4   80   103 0.772  -0.87
 3.10 Intr + 112707 112813  107  1  2   82   86   218 0.995  20.01
 3.11 Term + 113015 113125  111  1  0   52   38   176 0.998   6.58
 3.12 PlyA + 113990 113995    6                               1.05

 4.07 PlyA - 114784 114779    6                               1.05
 4.06 Term - 119407 119197  211  0  1   51   44   218 0.943   9.48
 4.05 Intr - 120636 120576   61  1  1   71   61    59 0.962  -1.63
 4.04 Intr - 121538 121374  165  0  0   95   64   140 0.996  11.31
 4.03 Intr - 123815 123591  225  0  0   99   77   197 0.980  16.73
 4.02 Intr - 132799 132650  150  2  0   42  110    41 0.602   0.91
 4.01 Init - 135765 135624  142  0  1   52   84    88 0.891   5.14
 4.00 Prom - 135926 135887   40                              -3.35

 5.00 Prom + 144360 144399   40                              -2.75
 5.01 Sngl + 182571 183281  711  2  0   86   36   926 0.997  83.17
 5.02 PlyA + 184220 184225    6                               1.05

 6.03 PlyA - 184348 184343    6                               1.05
 6.02 Term - 202655 201184 1472  0  2   43   36   534 0.108  33.50
 6.01 Init - 203444 202778  667  2  1   49  -33   261 0.140   5.52
 6.00 Prom - 203536 203497   40                              -6.15

 7.02 PlyA - 203705 203700    6                               1.05
 7.01 Sngl - 204913 204584  330  1  0   88   44   320 0.846  23.37

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:87140698_87353819|GENSCAN_predicted_peptide_1|183_aa
VIVYLYYETLPSDASSKLERVNFSKLKLVGLGLGSGEGNPEAQHASKRAIKLQTVMQPEP
LTMVPSAGNLLKETQSSPGYQRSAKSTPKMQVTCASLCGYVPPRNKQQGPELERWNLAEV
AFPDSNGFYMHPVLEHQTLKSFNFGTHTGSPCSSACRQPIVGPCDHTGEREKDLDVFKGV
AAS

>gi568815595f:87140698_87353819|GENSCAN_predicted_CDS_1|552_bp
gtaattgtgtatctgtactatgagacactcccctcagatgcatcctccaaactggaaaga
gttaatttctcaaaacttaaactggttggcttaggactgggctcaggggaagggaaccca
gaagcccaacatgccagcaaaagggccatcaaactccaaacagtcatgcaaccagagcct
ctgacaatggtcccttctgctgggaacctgttgaaagaaacacagtcatctccaggttac
caaaggtcagcaaaatctactccaaaaatgcaagttacatgtgcctctctctgtggctac
gttcccccaagaaacaagcagcaaggtcctgagctggaacgatggaatctagcagaagtt
gcctttccagattcaaacggcttctatatgcatcctgtccttgaacatcagactcttaag
tccttcaattttggaactcacactggctctccttgctcctcagcctgcagacagcctatt
gtgggaccttgtgatcatactggagaacgagaaaaagatttagatgttttcaagggagta
gctgccagttaa

>gi568815595f:87140698_87353819|GENSCAN_predicted_peptide_2|194_aa
MMKTDENQKDWPQVPELFGGLEKVGIECLCFSRCTVQAVSESTILESGGRWPSFTTPPGP
APVGTLHVGSSPTFPLCTALAEVFHEGSVPSADFCLYIQRSKVERLSQRSTGIGVLSSLQ
TAVFVGYTRISCGLYEVGWVYSDGLRAYTLPEQSITSNINLLQQAGEKTKSKVVIVYISR
ENLKKDFETNNEKM

>gi568815595f:87140698_87353819|GENSCAN_predicted_CDS_2|585_bp
atgatgaaaactgatgaaaatcaaaaagattggccacaagttccagagctgtttggaggt
ctagagaaggttggcattgagtgtctgtgcttttccagatgcacagtgcaagctgtcagt
gaatctaccattctggagtctggaggacggtggccctctttcacaactccaccaggccct
gccccagtggggactctgcatgtgggctccagccccacatttcccttatgcactgcccta
gcagaggttttccatgagggctctgtcccttcagcagacttctgtctgtatatccagaga
tccaaagttgaaagactttcacaaagaagcacaggaattggagtcttatcttcattgcaa
actgcagtatttgtaggttacacaaggatcagctgtggcttatatgaagtcggttgggtt
tattctgatggtttacgagcttatactctgccagagcagtcaataacaagtaatataaac
ttgctacagcaagctggagagaaaacaaaaagcaaagttgttattgtctacatttcaaga
gaaaacctgaagaaagattttgaaacgaacaatgagaaaatgtga

>gi568815595f:87140698_87353819|GENSCAN_predicted_peptide_3|463_aa
MDNDSKGQHIRVNNVNSDLNKHMEDVRRTALERLTPMIQSPPTSSLPQHMGIMGATIQDE
IWARTLNKKCSIKKLPFVLETPPTLDTGSPETHAGSSESGAGGGSSRLAPLPTPGAGGTG
SRRREESGRGFKLRSAQAPHNAQAPPRSDFSKKCVSSRSPELRQDPERFEEASWPLSVNK
LGNLMDFLLVSSASIVALQLGDILLVGIHTGNWQSRNGEGGDNRLQLGNEVGSGTIRICP
VLLPVSQCPLFPLTLAVTELADVIKEQNRELRGTQRAIIRDRAALEKQEKQLELEIKKMA
KIGNKEACKVLAKQLVHLRKQKTRTFAVSSKVTSMSTQTKVMNSQMKMAGAMSTTAKTMQ
AVNKKMDPQKTLQTMQNFQKENMKMEMTEEMINDTLDDIFDGSDDEEESQDIVNQVLDEI
GIEISGKMAKAPSAARSLPSASTSKATISDEEIERQLKALGVD

>gi568815595f:87140698_87353819|GENSCAN_predicted_CDS_3|1392_bp
atggacaatgatagcaaaggacagcatattcgtgtcaataatgtcaacagtgaccttaat
aaacatatggaagatgttaggagaacagcactggaaagactcacccccatgattcaatca
cctcccaccagctccctcccacaacacatgggaattatgggagctacaattcaagatgag
atttgggcaagaaccttaaacaaaaagtgtagcattaaaaaacttcccttcgtcctcgag
acccccccaactctagacactggatccccggagacccacgctgggtcgtcggagtctgga
gccgggggcggcagctccagactcgccccgctgccaactcccggtgcaggaggaactggc
agccgcagacgtgaggaaagcggccgcggcttcaaactccgtagtgcgcaggcgccacac
aacgcgcaggcgccgcctagaagtgacttctccaaaaagtgtgttagttcccggtcacct
gagctccggcaggatcctgagcgtttcgaggaggctagctggcccttgagtgtcaataaa
cttggaaatctgatggatttcttgttggtttcctctgcttcgattgttgctcttcagttg
ggcgatatactacttgtaggcatccatacaggaaactggcagtctcggaatggggaggga
ggggataacaggctccagttaggaaatgaagttgggtctggcacaatcaggatttgtcca
gtgctgttaccagtgtcacagtgcccactattcccactgacactagctgttacagagctt
gcagatgtaataaaggaacagaatcgagagttacgaggtacacagagggctataatcaga
gatcgagcagctttagagaaacaagaaaaacagctggaattagaaattaagaaaatggcc
aagattggtaataaggaagcttgcaaagttttagccaaacaacttgtgcatctacggaaa
cagaagacgagaacttttgctgtaagttcaaaagttacttctatgtctacacaaacaaaa
gtgatgaattcccaaatgaagatggctggagcaatgtctactacagcaaaaacaatgcag
gcagttaacaagaagatggatccacaaaagacattacaaacaatgcagaatttccagaag
gaaaacatgaaaatggaaatgactgaagaaatgatcaatgatacacttgatgacatcttt
gacggttctgatgacgaagaagaaagccaggatattgtgaatcaagttcttgatgaaatt
ggaattgaaatttctggaaagatggccaaagctccatcagctgctcgaagcttaccatct
gcctctacttcaaaggctacaatctcagatgaagagattgaacggcaactcaaggcttta
ggagtagattag

>gi568815595f:87140698_87353819|GENSCAN_predicted_peptide_4|317_aa
MSCQAFTSADTFIPLNSDASATLPLIMHHSAAECLPVSNHATNVMSTVPSILSLIQTPKC
LCTHFSVTTLGNTATGLHYSVPSCHYGNQPSTYGVMAGSLTPCLYKFPDHTLSHGFPPIH
QPLLAEDPTAADFKQELRRKSKLVEEPIDMDSPEIRELEKFANEFKVRRIKLGYTQTNVG
EALAAVHGSEFSQTTICRFENLQLSFKNACKLKAILSKWLEEAEQVGALYNEKVGANERK
RKRRTTISIAAKDALERHFGEQNKPSSQEIMRMAEELNLEKEVVRVWFCNRRQREKRVKT
SLNQSLFSISKEHLECR

>gi568815595f:87140698_87353819|GENSCAN_predicted_CDS_4|954_bp
atgagttgccaagcttttacttcggctgatacctttatacctctgaattctgacgcctct
gcaactctgcctctgataatgcatcacagtgctgccgagtgtctaccagtctccaaccat
gccaccaatgtgatgtctacagtcccatctattttgtctttgatccaaactcctaaatgt
ttgtgcacacatttctcggtgacaacgttgggaaacacagcaacaggacttcattattct
gttccttcctgtcattatggaaaccagccatcaacctatggagtgatggcaggtagttta
accccttgtctttataaatttcctgaccacaccttgagtcatggatttcctcctatacac
cagcctcttctggcagaggaccccacagctgctgatttcaagcaggaactcaggcggaaa
agtaaattggtggaagagccaatagacatggattctccagaaatcagagaacttgaaaag
tttgccaatgaatttaaagtgagacgaattaaattaggatacacccagacaaatgttggg
gaggccctggcagctgtgcatggctctgaattcagtcaaacaacaatctgccgatttgaa
aatctgcagctcagctttaaaaatgcatgcaaactgaaagcaatattatccaaatggctg
gaggaagctgagcaagtaggagctttgtacaatgaaaaagtgggagcaaatgaaaggaaa
agaaaacgaagaacaactataagcattgctgctaaagatgctctggagagacactttgga
gaacagaataaaccttcttctcaagagatcatgaggatggctgaagaactgaatctggag
aaagaagtagtaagagtttggttttgcaaccggaggcagagagaaaaacgggtgaaaaca
agtctgaatcagagtttattttctatttctaaggaacatcttgagtgcagataa

>gi568815595f:87140698_87353819|GENSCAN_predicted_peptide_5|236_aa
MSIRVTQKFYKVFTSGPRAFSSNSYTSGPGVHISFSSFSRVGSSSFWDGLGRGCDGASSK
GGITAVTVNQSLLSPLNLQVDPNIQAMCTQEKEQIKTLNNKFASFIDKVWLLEQQNKMLE
TKWSLLQQQKMTLSNMHSMLQSYINNLQRQLETLGQENLKLGAELGSMQGLVEGFKNKYK
DKINKRTEMENEFVLIKKVVDDAYMNKAELESHLEGLTDEINFLRQLYEEEIRELS

>gi568815595f:87140698_87353819|GENSCAN_predicted_CDS_5|711_bp
atgtccatcagggtgacccagaagttctacaaggtgttcacctctggcccccgggccttc
agcagcaactcctacacaagtgggcctggtgtccacatcagtttctcaagcttctcccga
gtgggcagcagcagcttctgggatggcctgggcagaggctgtgatggggccagcagcaag
ggaggcatcacagctgtcacagtcaaccagagcctgctgagcccccttaacctgcaggtg
gaccccaacatccaggccatgtgcacccaggagaaggagcagatcaagaccctcaacaac
aagtttgcctccttcattgacaaggtatggttgctggagcagcagaacaagatgctggag
acgaagtggagccttctgcagcagcagaagatgactctgagtaacatgcacagcatgtta
cagagctacatcaacaaccttcagcggcagctggagactctgggtcaggagaacctgaag
ctgggggcagagcttggcagcatgcaggggctagtggagggcttcaagaacaaatataag
gataagatcaataagcgtacagagatggagaatgaatttgtcctcatcaagaaggttgta
gatgacgcttacatgaacaaggcagagctggagtctcacctggaagggctgactgatgag
atcaacttcctcaggcagctgtatgaagaggagatccgggagctgtcctag

>gi568815595f:87140698_87353819|GENSCAN_predicted_peptide_6|712_aa
MGDFNTPLSILDRSTRQKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS
KIDHILGSKALLSKCKGKEIITNYLSDHSAIKLELRIKNLTQNHSTTWKLNNLLLNDYWV
HNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFMALNAHKRKQERSKIDTLTSQLK
EQEKQEQTYSKDSRRQEITKIRAELKEIETQKPSKKFMNPGAEIQTTIREYYEHLYANKL
ENLEEMDKFLDTYTLPRLNQEEVESLNRPITGAEIVAIINSLPTKKSPGPDGFTAEFYQR
YKEELVPFLLKLFQSIEKEGILPNSFYEASVILIPKPGRDTTKKENFRPISLMNIDAKIL
NKILANRIQQHIKKLIHHDQVGFIPAMQGWFNTRKSINVIQHINKTKDKNHMIISIDAEK
AFHKIQQPFMLKTLNKLGIHGMYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPL
SPLLFNIVLEVLARAIRQEKEINGIQLRKEEVKLSLFAGDMIVYLENPIVSAQNLLKLIS
NFSKVSGYKINVQKSQAFLYTNNRQTESQIMSALPFTIASKRIQYLGIQLTKDVKELFKE
NYKPLLEEIKEGTNRWKNTPCSWVGRINIMNMAILPKVIYRFNAISIKLPMTFFIELEKT
TLKFIWNHKRDRIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQTEI

>gi568815595f:87140698_87353819|GENSCAN_predicted_CDS_6|2139_bp
atgggagattttaacaccccactgtcaatattagacagatcaacgagacagaaagtcaac
aaggatacccaggaattgaactcagctctgcaccaagcagacctaatagacatctacaga
actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc
aaaattgaccacatacttggaagtaaagctctcctcagcaaatgtaagggaaaagaaatt
ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc
actcaaaaccactcaactacatggaaactgaacaacctgctcctgaatgactactgggta
cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca
acataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatggcacta
aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa
gaacaagaaaagcaagagcaaacatattcaaaagatagcagaaggcaagaaataactaaa
atcagagcagaactgaaggaaatagagacacaaaaaccctccaaaaaattcatgaatcca
ggagctgaaatacaaactaccatcagagaatactatgaacacctctacgcaaataaacta
gaaaatctagaagaaatggataaattcctcgacacatacaccctcccaagactaaaccag
gaagaagttgaatctctgaatagaccaataacaggagctgaaattgtggcaataatcaat
agcttaccaaccaaaaagagtccaggaccagatggcttcacagccgaattctaccagagg
tacaaggaggaactggtaccattccttctgaaactattccaatcaatagaaaaagaggga
atcctccctaactcattttatgaggccagcgtcatcctgataccaaagccgggcagagac
acaaccaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatcctc
aataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgatcaa
gtgggcttcatccctgcgatgcaaggttggttcaatacacgcaaatcaataaatgtaatc
cagcatataaacaaaaccaaagacaaaaaccacatgattatctcaatagatgcagaaaag
gcctttcacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtattcat
gggatgtatctcaaaataataagagctatctatgacaaacccacagccaatatcatactg
aatgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccctctc
tcaccactcctattcaacatagtgttggaagttctggccagggcaattaggcaggagaag
gaaataaacggtattcaattaagaaaagaggaagtcaaattgtccctgtttgctggcgac
atgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaagctgataagc
aacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattcttatac
accaacaacagacaaacagagagccaaatcatgagtgcactcccattcacaattgcttca
aagagaatacaatacctaggaatccaacttacaaaggatgtgaaggagctcttcaaggag
aactacaaaccactgctcgaggaaataaaagagggtacaaacagatggaagaacactcca
tgctcatgggtaggaagaatcaatatcatgaatatggccatactgcccaaggtaatttac
agattcaatgccatctccatcaagctaccaatgacattcttcatagaattggaaaaaact
actttaaagttcatatggaaccataaaagagaccggatcgccaagtcaattctaagccaa
aagaacaaagctggaggcatcacactacctgacttcaaactatactacaaggctacagta
accaaaacagcatggtactggtaccaaacagagatatag

>gi568815595f:87140698_87353819|GENSCAN_predicted_peptide_7|109_aa
MGKKKSQKTGNSKKQSASPPPKERSSSPATEQSWMENDFHELREEDFRRSNYSELLEDIQ
TKGKEVENFEKSLEECIIRITNTEKCLKELMELKTKARELREECRSLRS

>gi568815595f:87140698_87353819|GENSCAN_predicted_CDS_7|330_bp
atggggaaaaaaaagagccaaaaaactggaaactctaaaaagcagagtgcctctcctcct
ccaaaggaacgcagttcctcaccagcaacggaacaaagctggatggagaatgactttcac
gagctgagagaagaagacttcagacgatcaaattactccgagctactggaggacattcaa
accaaaggcaaagaagttgaaaactttgaaaaaagtttagaagaatgtataattagaata
accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta
cgtgaagaatgcagaagcctcaggagctga