GENSCAN 1.0 Date run: 4-Nov-116 Time: 04:42:34 Sequence gi568815581f:1180002_1395623 : 215622 bp : 50.14% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2266 2310 45 1 0 127 110 29 0.863 6.52 1.02 Intr + 9021 9142 122 0 2 74 76 36 0.234 1.14 1.03 Intr + 11902 12031 130 2 1 107 95 67 0.991 9.05 1.04 Intr + 13292 13484 193 2 1 35 75 139 0.820 6.79 1.05 Term + 18191 18433 243 1 0 71 37 122 0.663 1.30 1.06 PlyA + 19834 19839 6 1.05 2.05 PlyA - 19938 19933 6 1.05 2.04 Term - 21250 21149 102 1 0 89 44 73 0.034 1.28 2.03 Intr - 36897 36781 117 0 0 65 47 112 0.354 5.46 2.02 Intr - 48774 48659 116 1 2 48 84 11 0.200 -3.13 2.01 Init - 49629 48792 838 0 1 57 80 1300 0.987 118.96 2.00 Prom - 53667 53628 40 -5.76 3.00 Prom + 56887 56926 40 -5.66 3.01 Sngl + 69984 70418 435 2 0 55 42 222 0.573 10.58 3.02 PlyA + 73056 73061 6 1.05 4.00 Prom + 79790 79829 40 -7.36 4.01 Init + 90563 91127 565 1 1 69 15 666 0.922 50.61 4.02 Term + 91300 91511 212 2 2 45 50 174 0.943 6.76 4.03 PlyA + 91797 91802 6 1.05 5.00 Prom + 93138 93177 40 -8.16 5.01 Init + 100001 100387 387 1 0 96 115 440 0.989 44.31 5.02 Term + 100818 100892 75 2 0 70 55 0 0.179 -7.26 5.03 PlyA + 103814 103819 6 1.05 6.04 PlyA - 104678 104673 6 1.05 6.03 Term - 106902 106784 119 1 2 26 48 158 0.630 4.30 6.02 Intr - 108432 108289 144 1 0 -58 56 184 0.407 0.85 6.01 Init - 108716 108434 283 2 1 64 3 371 0.332 23.40 6.00 Prom - 111432 111393 40 -9.75 7.03 PlyA - 111602 111597 6 1.05 7.02 Term - 113753 113429 325 1 1 47 44 469 0.544 32.64 7.01 Init - 114337 114297 41 1 2 70 97 -21 0.497 -3.33 7.00 Prom - 114870 114831 40 -9.46 8.00 Prom + 114987 115026 40 -10.94 8.01 Init + 115433 115622 190 1 1 60 94 336 0.414 28.57 8.02 Intr + 124779 125051 273 1 0 77 65 149 0.819 9.01 8.03 Intr + 147811 147902 92 2 2 93 56 38 0.444 0.81 8.04 Term + 148932 149510 579 2 0 18 49 385 0.878 22.09 8.05 PlyA + 150166 150171 6 -3.94 9.03 PlyA - 150312 150307 6 1.05 9.02 Term - 151883 151779 105 2 0 85 50 68 0.822 1.11 9.01 Init - 154885 154829 57 1 0 110 92 54 0.881 9.31 9.00 Prom - 159773 159734 40 -5.06 10.07 PlyA - 163655 163650 6 1.05 10.06 Term - 165498 165446 53 1 2 118 53 98 0.901 6.89 10.05 Intr - 174346 174210 137 0 2 57 115 143 0.998 14.11 10.04 Intr - 181297 181091 207 0 0 75 111 147 0.997 13.89 10.03 Intr - 182007 181901 107 0 2 97 51 30 0.991 -0.79 10.02 Intr - 185057 184858 200 0 2 91 81 182 0.426 16.87 10.01 Init - 202854 202764 91 0 1 86 80 45 0.354 2.15 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:1180002_1395623|GENSCAN_predicted_peptide_1|244_aa XSENRGPEQLSNPLRGSWHLVLSWSAQSHEHTVRCQELARETSTHDSCSPERRVKKGGKE RSTEHSLNPAAGSATNLLCDSEELPFPVCPLMFSPRHEEMIQDSPALPASLLNALRPTCP EHRQYLRSRHPVKHVTFTLRKQPLYVEATVTTPALQKRLKFTLLRPHMRLYMQAGHISFQ AHKQPRLPAKTTNPNVGTRGTLPKAELLLIFSVTFRTKKEEVLSGQGELGLRHQQTATSH LASA >gi568815581f:1180002_1395623|GENSCAN_predicted_CDS_1|735_bp nngtcagaaaaccgaggccctgagcagctgagtaacccactcagaggctcttggcacctt gtcctgagctggtcagctcagtctcatgaacacacagtccggtgccaggagctggcacga gaaactagcacacatgactcgtgttctcccgaaaggagagtgaagaagggagggaaagag cgctcgaccgagcacagcctgaatccagctgctggttctgccactaacttgctgtgtgac tctgaagaactgcctttccctgtctgtcccttgatgttctcacctcgacacgaagagatg attcaggactcccccgctctcccggcgagtttgctgaacgcccttcggccgacgtgtccc gagcatcgccagtatttacgttctaggcaccctgttaagcacgtaaccttcactcttcgc aagcagcctctctatgtggaagccacagtcaccacgcccgctttacagaagaggctcaag ttcacactgctaaggccacacatgaggctgtacatgcaggcaggccacatttccttccag gcccacaaacaacctcgacttcctgcaaagaccactaaccccaacgtcggtacacgtggc acgttgcccaaggccgagctgctgctgatttttagtgtgactttcaggaccaaaaaggaa gaggttctgtctggacaaggagagcttggcctgagacaccaacagacggcaacttcccac ctggcctctgcctga >gi568815581f:1180002_1395623|GENSCAN_predicted_peptide_2|390_aa MWDPRAFERRWRAEFPGAEAPVPRLESVRDAERELERRRLNLERLQQVLAEEQLKASLPQ AALARGGGGDSGEPGRPDPEAESPRGDPGRPDPEAESPRGDLPAGPGAAGEAGGGRSWAD AVHRQLLQPQLRFRAREDDAPLGDPQAAPGTDEGGDGGDHDFEMVDFNEKFILSHWLVAP CRFGTRERARSPRRWMHLIPGGRRPQRGTRDLEQREAEAKGRPGSPLAAEETPGREHLPP WRRRRFLRVPERDSPGHSSPERDSDGSRHSSDREDDFSADPARPRRAPPTPGCRDPFPAD SAALFNLELGAGERAGRRRAQSGFLNAELVGNLCGLAVSTQKLIIHINGLKELLFTENPH PDLKHCALHPEQQPRGHRPREAIADVNSLE >gi568815581f:1180002_1395623|GENSCAN_predicted_CDS_2|1173_bp atgtgggacccccgcgcgttcgagcggcgctggcgggccgagttccccggggcggaggcg ccggtcccgaggctggagtcggtgcgggacgcggagcgggagctggagcgccgcagactc aacctggagcggctgcaacaggtgctggcggaggagcagctcaaagcctcgctcccgcag gccgcgctggcccggggcggcggcggagactccggggagcccgggcgccccgaccccgag gccgagtctccccgcggggaccccgggcgccccgaccccgaggccgagtctccccgcggg gacctgcccgccgggcccggggcggccggagaagccggcgggggaaggagctgggccgac gccgtccaccggcagctcctgcagccccagctgcggttccgggcccgggaggacgacgcg cccctcggggacccccaggcggctcccggcacggacgaggggggcgacggcggcgaccac gacttcgagatggtggatttcaacgagaagttcatcctcagccactggctggtggcccct tgccggttcgggacccgcgagcgggcgcgcagcccccgccgctggatgcacctgatcccc ggggggcggcggccgcagcgggggacccgcgacctggagcagcgggaggccgaggccaag ggccgcccgggctcgcccctcgccgcggaggagacccccgggcgcgagcacctgcccccg tggcggcgccgcaggttcctgcgggtgcccgagcgggactcgcccggccacagctcgccg gagagagacagcgacggcagccggcacagctccgaccgcgaggacgacttctccgcagac cccgctcgaccccgccgcgcgccccccaccccgggctgccgggaccccttcccggccgac tccgcggccctgtttaacctcgagctcggggccggggagcgggcgggaaggaggagggca cagagtggcttcctgaatgccgagcttgttggtaacctttgtgggctggcagtgagcacg cagaaattaattattcatatcaatgggctgaaggagctgttgttcactgagaacccacat cctgacctaaagcattgtgctttgcatccggagcagcagcctcgtggacaccggccccgt gaagctattgctgatgtgaacagtcttgagtga >gi568815581f:1180002_1395623|GENSCAN_predicted_peptide_3|144_aa MGTLAGDCGGKIMGTLAGDCGGKIMGTLAGGCGGKIMGTLAGGCGGKIMGTLAGGCGGKI MGTLAGGCGGKIMGTLAGGCGGKIMGTLAGGCGGKIMGTLAGCCGGKIMGTLAGGCGGEE KSRERQTEFHSLQCTQLPLQQLCA >gi568815581f:1180002_1395623|GENSCAN_predicted_CDS_3|435_bp atggggacccttgctggggactgtggagggaagatcatggggacccttgctggggactgt ggagggaagatcatggggacccttgctgggggctgtggagggaagatcatggggaccctt gctgggggctgtggagggaagatcatggggacccttgctgggggctgtggagggaagatc atggggacccttgctgggggctgtggagggaagatcatggggacccttgctgggggctgt ggagggaagatcatggggacccttgctgggggctgtggagggaagatcatggggaccctt gctgggtgctgtggagggaagatcatggggacccttgctgggggctgtggaggggaagaa aagtcacgtgaacgacagacggaattccattccctccagtgcacacagctgccgctgcag caattatgtgcttaa >gi568815581f:1180002_1395623|GENSCAN_predicted_peptide_4|258_aa MLRGAPGLGLTARKGAEDSAEDLGGPCPEPGGDSGVLGANGASCSRGEAEEPAGRRRARP VRSKARRMAANVRERKRILDYNEAFNALRRALRHDLGGKRLSKIATLRRAIHRIAALSLV LRASPAPRGPCGHLECHGPAARGDTGDTGASPPPPAGPSLARPDAARPSVPSAPRCASCP PHAPLARPRRPGWQLDYKTRDCKGVLDRAELDRLRPGVEEKELAREKELRDPGLRPHIPA WGREKELRGLGLRPHIPA >gi568815581f:1180002_1395623|GENSCAN_predicted_CDS_4|777_bp atgctgcggggcgcgccaggactaggcctcacggcgcggaagggggccgaggactctgcg gaggacttggggggcccctgccccgagcccgggggcgattcgggggtgctgggggcgaac ggcgcttcctgcagccggggcgaggcggaggagccggcgggcaggaggcgcgcgcggccg gtgcggtccaaggcgcggcgcatggccgccaacgtgcgggagcgcaagcgcatcctagac tacaacgaggccttcaacgcgctgcgccgggcgctgcggcacgacctgggcggcaagagg ctctccaagatcgccacgctgcgcagggccatccaccgcatcgccgcgctctccctggtc ctgcgcgccagccccgcgccccgcgggccctgcggacacctggagtgccacggcccggcc gcgcgcggggacaccggggacacaggcgccagccccccgccgcctgcagggcccagcctc gcgcgcccagacgccgcccgcccctcggtgccgtccgcgccccgctgcgcctcgtgcccc ccgcacgcgcccctggcacggcccaggaggccgggctggcagctggactataaaacccgg gactgcaaaggcgtcttggacagagcagaactggaccggctgagacctggcgtggaagag aaggagctcgccagagagaaggagctccgggacccggggttgcgcccccacatcccagcc tggggcagagagaaggagctccggggcctggggctgcgcccccacatcccagcctag >gi568815581f:1180002_1395623|GENSCAN_predicted_peptide_5|153_aa MAHPVQSEFPSAQEPGSAAFLDLPEMEILLTKAENKDDKTLNLSKTLSGPLDLEQNSQGL PFKAISEGHLEAPLPRSPSRASSRRASSIATTSYAQDQEAPRDYLILAVVACFCPVWPLN LIPLIISIMATWTCSQPLPAPVSKSRSGFPAVP >gi568815581f:1180002_1395623|GENSCAN_predicted_CDS_5|462_bp atggcccacccggtgcagtccgagtttccttcagcacaggagccaggctccgccgcattc ctggacctgccggagatggagatactcctcaccaaggcagagaacaaggatgacaagacc ctgaatctgtccaagaccctctcggggcctctggatctggagcagaacagccagggccta cccttcaaggccatctccgaggggcacctggaggccccactgcctcggtccccctcccgg gccagctcaaggagggcgtcctccatcgccaccacctcctatgcccaagaccaagaagcc cccagagattacctcatcctggccgtcgtcgcctgcttctgccccgtctggcccctcaac ctcatccccctcatcatttccatcatggccacctggacatgctcccagcctctgccggct ccggtttccaagagtcgcagcggcttccctgccgtaccctga >gi568815581f:1180002_1395623|GENSCAN_predicted_peptide_6|181_aa MGNPVGDEEPVGGGEPVGDEKLEGDGEPMGDGEPVGDEKLEGDEEPMGDGEPIGDGEPVG DEKLEGDGEPVGDGKPMGDGEPVGDEEPVGDGEPMGDEEPVGVGEPVGDEEPMGDGEPMG DGEPVGDEEPVGDGEPVGDGEPAGANHTRLANNNTPQADNNTPQADNNTPQADNTPRADN T >gi568815581f:1180002_1395623|GENSCAN_predicted_CDS_6|546_bp atggggaacccggtgggggatgaggagcccgtggggggtggggaacccgtgggggatgag aagctggagggggatggggaacccatgggggatggggaacccgtgggggatgagaagctg gagggggacgaggagcccatgggggatggggaacccattggggatggggaacccgtgggg gatgagaagctggagggggatggggagcccgtgggggatgggaaacccatgggggatggg gaacccgtgggggatgaggaacccgtgggggatggggaacccatgggggatgaggaacct gtgggggttggggaacctgtgggggatgaggagcccatgggggatggggagcccatgggg gatggggaacccgtgggggatgaggagcccgtgggggatggggaacccgtgggggatggg gaacccgctggagcaaatcacacccggctggccaataacaacactccacaggccgacaac aacaccccacaggccgacaacaacaccccacaggccgacaacaccccacgggctgataac acctga >gi568815581f:1180002_1395623|GENSCAN_predicted_peptide_7|121_aa MWLHGGLVLCITPRSVWDSVTSSNSTCDIIKLNPQHQTQLMTSSNPAHDIIKPNLRHHQT QPTTSSNSTHDIIKPNPRHHQTQLVTSSNPTCDIIKHNPRHHQTQPTTSSNPTHDIIKLN S >gi568815581f:1180002_1395623|GENSCAN_predicted_CDS_7|366_bp atgtggcttcacgggggccttgttttatgtatcactcctcgctccgtctgggacagcgtg acatcatcaaattcaacctgtgacatcatcaaactcaacccacaacatcaaactcaactc atgacatcatcaaacccagcccatgacatcatcaaacccaacctgcgacatcatcaaaca caacccacgacatcatcaaactcaacccacgacatcatcaaacccaacccacgacatcat caaactcaactcgtgacatcatcaaacccaacctgcgacatcatcaagcacaacccacga catcatcaaactcaacccacgacatcatcaaacccaacccacgacatcatcaaactcaac tcgtga >gi568815581f:1180002_1395623|GENSCAN_predicted_peptide_8|377_aa MKLTHSLPGSRGLSVLSPQSRSSMQQGNVDGARRLGRLARLLSITLIIMGIVIIMVAVTV NFTGIPAPGVGPGSYAAFVLTAKWALLYRQKHENCKRLPFLGCLRLDAQPGVLRVVGFQR SWALPEQGTVCEAVLGRPARYALLGSFPGGGSTAGGDGVSICKPGPSADAGSACTFTSDF PASRTRNTGRTLTQYPTETTGRTLTQYPTGTTGRTLARYPTETAGRTLTQYPTETMGRTL TQYSRGPTGRTLTQYPTETTGRTLTKYTTETAGRTLTQYLRGPTGRTLSQYPTETARRTL TRYSTEIARRTLTQYPTETTGRTLTQYPTETAGRTLTQYPTVLRKRVSPRSCRGARSGSM GSAGGDSGPPLKMGFGS >gi568815581f:1180002_1395623|GENSCAN_predicted_CDS_8|1134_bp atgaagctgacccacagccttcccggttcccggggtctctctgtgctctctccgcagtct cgaagcagcatgcaacagggcaacgtggacggcgcccggaggctgggccgcctggctcgg ctgctcagcattaccctcatcatcatgggcatcgtcattatcatggtggccgtgaccgtc aacttcacaggaattcctgctcccggagttggcccaggttcatatgctgcctttgtgctg actgcaaagtgggccctcctatacagacaaaaacatgaaaactgcaaacgcctccccttc ctgggatgtttgcggctggacgcccagcctggtgtgctgagggtagttggatttcagagg tcctgggcactgcctgagcagggcacagtctgcgaggccgtcctgggcagacctgcccgc tacgccttgttggggtcctttcccggaggtggcagcacagccggtggggacggagtgtcc atctgcaaaccaggaccctcggccgacgctgggtctgcctgcaccttcacctcagacttc ccagcctccagaacgagaaacactggaaggactctgacccagtaccccacagagaccact ggaaggactctgacccagtaccccacagggaccactggaaggactctggcccggtacccc acagagactgctggaaggactctgacccagtaccccacagaaaccatgggaaggactctg acccagtactccagagggcccactggaaggactctgacccagtacccgacagagaccact ggaaggaccctgaccaagtacaccacagagactgctggaaggactctgacccagtacctc agagggcccactggaaggactctaagccagtaccccacagagaccgctagaaggactctg actcggtactccacagagattgctagaaggactctgacccagtaccccacagagaccact ggaaggactctgacccagtaccccacagagacagctggaaggactctgacccagtacccc acagttctaaggaagagggttagtccacggagctgcagaggtgcccgctccggctctatg gggtcagcaggaggtgactcaggaccacctctgaagatgggctttgggtcctga >gi568815581f:1180002_1395623|GENSCAN_predicted_peptide_9|53_aa MAPYEFLISVLEGIKADTQSCLLIFLSSLQLVITLFKDTVQLPVVCDRVWWYP >gi568815581f:1180002_1395623|GENSCAN_predicted_CDS_9|162_bp atggccccatacgagtttctaatctcggtgctcgagggaattaaggcagatactcagagc tgcctccttatcttcctcagcagcttacaacttgtcatcactctcttcaaggacacggtg cagctccctgttgtttgtgaccgtgtgtggtggtacccatga >gi568815581f:1180002_1395623|GENSCAN_predicted_peptide_10|264_aa MGFRQVGQAGLDLLTSSDPLASASQIAGNTEMVESMKKVAGMDVELTVEERNLLSVAYKN VIGARRASWRIISSIEQKEENKGGEDKLKMIREYRQMVETELKLICCDILDVLDKHLIPA ANTGESKVFYYKMKGDYHRYLAEFATGNDRKEAAENSLVAYKAASDIAMTELPPTHPIRL GLALNFSVFYYEILNSPDRACRLAKAAFDDAIAELDTLSEESYKDSTLIMQLLRDNLTLW TSDMQGDGEEQNKEALQDVEDENQ >gi568815581f:1180002_1395623|GENSCAN_predicted_CDS_10|795_bp atggggtttcgccaggttggccaggctggtctcgatctcctgacctcaagtgatccgctg gcctcagcctctcaaattgctgggaatacagaaatggtggagtcaatgaagaaagtagca gggatggatgtggagctgacagttgaagaaagaaacctcctatctgttgcatataagaat gtgattggagctagaagagcctcctggagaataatcagcagcattgaacagaaagaagaa aacaagggaggagaagacaagctaaaaatgattcgggaatatcggcaaatggttgagact gagctaaagttaatctgttgtgacattctggatgtactggacaaacacctcattccagca gctaacactggcgagtccaaggttttctattataaaatgaaaggggactaccacaggtat ctggcagaatttgccacaggaaacgacaggaaggaggctgcggagaacagcctagtggct tataaagctgctagtgatattgcaatgacagaacttccaccaacgcatcctattcgctta ggtcttgctctcaatttttccgtattctactacgaaattcttaattcccctgaccgtgcc tgcaggttggcaaaagcagcttttgatgatgcaattgcagaactggatacgctgagtgaa gaaagctataaggactctacacttatcatgcagttgttacgtgataatctgacactatgg acttcagacatgcagggtgacggtgaagagcagaataaagaagcgctgcaggacgtggaa gacgaaaatcagtga