GENSCAN 1.0 Date run: 3-Nov-116 Time: 02:28:29 Sequence gi568815591r:28855614_29058046 : 202433 bp : 40.62% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.11 PlyA - 2505 2500 6 1.05 1.10 Term - 12185 12086 100 1 1 105 48 100 0.438 4.32 1.09 Intr - 27506 27337 170 2 2 92 47 39 0.015 -1.98 1.08 Intr - 30478 30374 105 1 0 45 63 94 0.080 2.09 1.07 Intr - 34894 34861 34 0 1 62 76 76 0.476 1.11 1.06 Intr - 39752 39596 157 0 1 81 95 101 0.937 8.25 1.05 Intr - 68841 68721 121 0 1 34 94 51 0.005 -0.55 1.04 Intr - 71109 70986 124 2 1 64 99 62 0.069 4.57 1.03 Intr - 71368 71215 154 2 1 72 -13 175 0.070 4.01 1.02 Intr - 71606 71457 150 0 0 78 42 82 0.678 1.81 1.01 Init - 72660 72570 91 0 1 37 92 74 0.872 3.41 1.00 Prom - 72762 72723 40 -10.45 2.00 Prom + 73055 73094 40 -6.15 2.01 Init + 78839 78992 154 1 1 79 79 153 0.862 13.69 2.02 Term + 96288 96493 206 1 2 37 48 146 0.111 2.05 2.03 PlyA + 96997 97002 6 1.05 3.02 PlyA - 97194 97189 6 1.05 3.01 Sngl - 102433 99998 2436 1 0 87 42 3036 0.999 289.08 3.00 Prom - 110236 110197 40 -4.85 4.07 PlyA - 110327 110322 6 1.05 4.06 Term - 118922 118726 197 0 2 45 37 240 0.652 11.19 4.05 Intr - 122146 121960 187 1 1 104 23 111 0.626 4.54 4.04 Intr - 123764 123539 226 1 1 83 50 50 0.451 -2.24 4.03 Intr - 130017 129862 156 2 0 7 98 176 0.598 8.70 4.02 Intr - 132349 132220 130 2 1 90 64 110 0.625 7.63 4.01 Init - 133967 133823 145 2 1 46 64 75 0.330 1.23 4.00 Prom - 139633 139594 40 -5.35 5.03 PlyA - 139643 139638 6 1.05 5.02 Term - 140269 140159 111 1 0 112 37 136 0.988 8.48 5.01 Init - 144086 144057 30 2 0 47 96 42 0.436 0.70 5.00 Prom - 151934 151895 40 -3.15 6.00 Prom + 152127 152166 40 -5.45 6.01 Init + 152401 152490 90 0 0 54 89 38 0.182 1.04 6.02 Intr + 158413 158658 246 0 0 129 44 108 0.221 7.23 6.03 Intr + 163738 163796 59 0 2 78 48 126 0.176 4.36 6.04 Term + 164349 164484 136 0 1 84 48 53 0.189 -2.59 6.05 PlyA + 164789 164794 6 1.05 7.00 Prom + 166059 166098 40 -5.25 7.01 Sngl + 166113 166586 474 2 0 43 38 278 0.908 14.16 7.02 PlyA + 167589 167594 6 1.05 8.06 PlyA - 170409 170404 6 -0.45 8.05 Term - 173696 173682 15 2 0 113 40 10 0.436 -4.04 8.04 Intr - 175146 174964 183 0 0 84 91 223 0.678 21.26 8.03 Intr - 178004 177971 34 1 1 109 98 -1 0.401 0.31 8.02 Intr - 186522 186403 120 2 0 34 75 95 0.002 1.49 8.01 Init - 202054 201849 206 1 2 51 38 121 0.008 1.77 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 39986 39833 154 2 1 44 9 138 0.828 1.69 S.002 Term + 188267 188464 198 1 0 65 47 202 0.946 10.22 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:28855614_29058046|GENSCAN_predicted_peptide_1|401_aa MGEREDNAYALMGLPGFLPVSSFAPYKTPGAYCPLAVTSLCSTRKRGTNTNRFLRMKPRR ELAGGSGRGSTSRVTSGFADSPSGTLLLPPGYRTSSLQREFKATSMLNRKSILWVEISDE ERFPNARKGVATGASYFASAPLLSPCPSPDHLVNLSGTGTVLQMHVFVVLRALVGPGHHS VRVREASRAAHGRGEQFGCARQGLDSSPAQYLPEIAAVIPTFSNQHSDQSATIRIKARPS TSKDYNLQKAQMIIGIFKQLRNFKLRKQLEKPDRQALDRWSDTRYSIELEAEVIFRSYFS SECYFLVNSEDPMYDTPKGSLSRGSMTQESHLISRSFQLLARLSVNLTLMQSFVPEAGTS LVYAGKSQSHPDPHQLPNPMERANHVLVEVFPDLFEQLIPQ >gi568815591r:28855614_29058046|GENSCAN_predicted_CDS_1|1206_bp atgggagaaagagaagacaatgcctacgcccttatgggcctaccaggctttctacctgtg agctcatttgctccttacaagaccccaggagcatattgccctctagcggtcacttctctt tgtagcacacggaagagaggcaccaatacaaatcgctttttaagaatgaaacccaggagg gaactggctgggggaagtggacgtggttccacgagccgggtaacttctggcttcgcggat tcacccagtggcactttgcttctcccgccgggataccggacctcttctttgcagagagaa ttcaaagctacttccatgttaaacagaaagagcatcctgtgggtagaaatatccgacgag gagcgttttccgaatgcccggaagggagttgctactggagccagctactttgcaagtgct ccccttctttctccttgcccatccccagaccatttagttaacctcagtgggacaggaacc gtcctacaaatgcatgtgtttgtcgtcttaagagctttggtaggtcctggccatcatagc gtcagggtccgggaagcaagcagggcggctcatggacgaggagagcagtttgggtgtgca agacaaggtctggattcaagtcctgctcagtacttaccagaaattgccgcagtaattcca accttcagcaaccaacactctgatcagtcagcaaccatccgcatcaaagcaagaccctcc accagcaaagattacaacttgcagaaggctcagatgatcattggcatttttaaacaatta agaaattttaaattaagaaagcagcttgaaaaaccagaccggcaagcattggacagatgg tctgataccaggtactccattgagctggaagcggaagttatcttcaggtcatatttctca agtgaatgctatttcctggtgaattctgaagacccaatgtatgacacacccaaaggaagt ctaagcaggggctcaatgacacaagaatcacacctcatctcaaggtccttccagcttctg gcaaggctttctgtgaacttgacacttatgcagagttttgttcctgaggcaggcacctcc cttgtatatgcaggaaaaagccagagccatcctgacccccatcaactccccaaccccatg gaaagggcaaatcacgtcctcgtggaagtcttccctgatctctttgagcagctcatcccc cagtga >gi568815591r:28855614_29058046|GENSCAN_predicted_peptide_2|119_aa MELLKLQWNPEDEDKLGMCRKGRQVLREEGTACTALWGSDVELHGSMGLGHSVGTDFMTK TPKTTATKAKIDKWDLIKLKSCCTAKETINRVNRQATIEWEKIFANCASDKASVRNLNL >gi568815591r:28855614_29058046|GENSCAN_predicted_CDS_2|360_bp atggagttgctgaagctccagtggaatccagaggatgaagacaagcttggcatgtgtaga aaggggagacaggtattgcgggaggaaggaactgcgtgcactgccttgtgggggagtgat gtggaacttcatggatcaatgggactggggcacagcgtgggcacagatttcatgacaaag acaccaaaaacaacggcaacaaaagcaaaaattgacaaatgggatctaattaaacttaag agctgctgtacagcaaaagaaactatcaacagagtcaacagacaagctaccattgaatgg gagaaaatatttgcaaactgtgcatctgacaaagcatctgtaaggaacttaaatttataa >gi568815591r:28855614_29058046|GENSCAN_predicted_peptide_3|811_aa MEAARALRLLLVVCGCLALPPLAEPVCPERCDCQHPQHLLCTNRGLRVVPKTSSLPSPHD VLTYSLGGNFITNITAFDFHRLGQLRRLDLQYNQIRSLHPKTFEKLSRLEELYLGNNLLQ ALAPGTLAPLRKLRILYANGNEISRLSRGSFEGLESLVKLRLDGNALGALPDAVFAPLGN LLYLHLESNRIRFLGKNAFAQLGKLRFLNLSANELQPSLRHAATFAPLRSLSSLILSANN LQHLGPRIFQHLPRLGLLSLRGNQLTHLAPEAFWGLEALRELRLEGNRLSQLPTALLEPL HSLEALDLSGNELSALHPATFGHLGRLRELSLRNNALSALSGDIFAASPALYRLDLDGNG WTCDCRLRGLKRWMGDWHSQGRLLTVFVQCRHPPALRGKYLDYLDDQQLQNGSCADPSPS ASLTADRRRQPLPTAAGEEMTPPAGLAEELPPQPQLQQQGRFLAGVAWDGAARELVGNRS ALRLSRRGPGLQQPSPSVAAAAGPAPQSLDLHKKPQRGRPTRADPALAEPTPTASPGSAP SPAGDPWQRATKHRLGTEHQERAAQSDGGAGLPPLVSDPCDFNKFILCNLTVEAVGADSA SVRWAVREHRSPRPLGGARFRLLFDRFGQQPKFHRFVYLPESSDSATLRELRGDTPYLVC VEGVLGGRVCPVAPRDHCAGLVTLPEAGSRGGVDYQLLTLALLTVNALLVLLALAAWASR WLRRKLRARRKGGAPVHVRHMYSTRRPLRSMGTGVSADFSGFQSHRPRTTVCALSEADLI EFPCDRFMDSAGGGAGGSLRREDRLLQRFAD >gi568815591r:28855614_29058046|GENSCAN_predicted_CDS_3|2436_bp atggaggctgcccgcgccttgcgcctcctgctcgtggtgtgcggctgcctcgcgctcccg ccgctggccgagcccgtgtgcccggagcgctgcgactgccagcatccccagcatctcctg tgcaccaacagggggctccgcgtagtgcccaagaccagctcgctgccgagcccccacgac gtgctcacctacagcctcggcggcaacttcataaccaacatcacggccttcgacttccac cgtctggggcagctcagacggctggacctgcagtacaaccagatccgctctctgcacccc aagaccttcgagaagctctcgcggctggaagagctgtacctggggaacaacctcttgcag gcgctcgccccgggcacgctggccccgctgcgcaagctgcgcatcctctacgccaacggg aacgagatcagccgcctaagccgcggctccttcgagggcctggagagtctagtcaagctg cggctggacgggaacgccctgggggcgctgccggacgcggtcttcgctcccttgggcaac ctgctctacctacatctggagtccaaccggatccgctttctgggcaagaacgccttcgcc cagctaggcaagctgcgcttcctcaacctctctgccaacgagctacagccctccctgcgc cacgcggccaccttcgcaccgctgcgctccctctcctccctcatcctctcggccaacaac ctgcagcacctcgggccgcgcatcttccagcacctgccacgtctcggcctgctctcgctc aggggcaaccagctcacgcacctcgcgcctgaggccttttggggcttggaggccctgcgc gagctgcgcctggagggtaatcggctgagccagctgccaactgcgctgctggagcctctg cacagcctggaggcgctggacctgagcggcaatgagctgtccgccctgcacccggccacc ttcggccacctgggccggctgcgcgagctcagcctgcgcaacaacgcgctcagcgcccta tccggggacatcttcgccgccagcccagccctttatcggctggatctagacggcaacggc tggacctgcgactgccggctgcgaggcctgaagcgctggatgggcgactggcactcgcag ggccggctcctcactgtcttcgtgcagtgtcgccaccccccggccctgcgaggcaaatac ctggattacctggatgaccagcagctgcaaaatggatcctgcgcggatccctcgccctca gcttccctgaccgctgaccgcaggcggcagcccctacccacggccgcaggggaggagatg acgccacctgcaggtctcgcggaggagctgccgccgcagccgcagctccagcagcagggg cgatttctagctggggtggcctgggatggggccgccagggagctggtaggcaaccgcagc gccctaaggctgagtcggcggggcccgggcctccagcagcccagcccctccgtcgctgcc gccgcgggcccggctccacagtccctagacctgcacaagaagccccagcggggccgtccg actcgggcagatcccgccctcgcggagcccaccccaacggcctctcctggctctgcgcca tcgcccgccggcgacccctggcagcgcgcgacgaagcatcgtctgggcacggagcaccag gagcgtgccgcccagtccgacggtggggccgggctgccgccgctggtgtccgacccatgc gacttcaacaagttcattctgtgcaacctgacggtggaggcggtgggcgcagacagcgcc tcggtgcgctgggccgtgcgcgagcaccgcagtccccggccgctgggcggcgcgcgcttc cgcctgctctttgaccgctttggccagcagcccaagttccaccgcttcgtctacctgcct gagagcagcgactcggccacgctgcgcgagctgcgcggggacaccccctacctggtgtgc gtggagggcgtgcttgggggccgtgtctgccctgtggctccccgggaccactgcgcgggg ctggtcaccctaccggaggccgggagccggggcggcgtcgactaccagctgctgaccttg gccctgctgacggtcaacgcgctgctggtgctcctggccttggcggcctgggcgtctcgc tggctgcgtaggaaactgcgggctaggcggaagggcggggccccggtccacgttcggcac atgtactccacccgacggcccctgcgctccatgggcaccggcgtgtccgccgacttctcg ggattccagtcgcaccggccacgcaccaccgtgtgcgcgctcagtgaggcggacctcatc gaattcccctgcgaccgcttcatggacagtgcgggcggcggcgcgggcggcagcctgaga cgggaggaccgtctcctgcagcgatttgccgactag >gi568815591r:28855614_29058046|GENSCAN_predicted_peptide_4|346_aa MGTDAYEGKAGDHCINNPSVPPTLIDEGPKCFENRGQRMTNYSKPRSGAPGAGVTPTFEA QGAAGLCDEPFLPLCPKPPSEENNLAFGVDSRAGVKGSHESHLYWAESMTGTEDQQGAQA LRREAQLPQGELGDFSAIEGGSSSQQLKLVVWGSFLGKIIETFIFEGSKSLAVLQVAILF QKLLLLDMEVLRGTQENFQGFRHSPLCHHCVSRQPNYSLITCSVFGPERAPLTTFLIWAA MLQQSTSTWCLQTARDHLRTRPKCFLSLSTVNPKRSQMWTDVIMANGVKTYGDGYISIEK ADYKTACSQTQPVSQIPKGPWRKAAVDGPTMSREDSAMAQEATHGF >gi568815591r:28855614_29058046|GENSCAN_predicted_CDS_4|1041_bp atggggacagatgcttatgaagggaaggctggtgaccactgcataaataatcccagtgtg ccacctactttaatagatgaaggtccgaagtgctttgagaacagaggacagaggatgact aattactccaaaccaaggagtggggccccaggtgcaggtgtaacccccacctttgaagcc caaggtgctgctggtctttgtgatgagccatttctccccctctgcccaaagccaccaagt gaagaaaacaacttggcatttggagtagacagcagggctggagtgaagggcagccatgaa tctcacttgtattgggcagagtccatgaccggcactgaggaccagcaaggggcccaggcc ctgagaagagaagctcagctaccccagggagagcttggtgatttcagtgccatagaaggt ggcagctcaagccagcaactgaagttagttgtctggggttcttttcttggcaagattatt gaaaccttcatttttgaagggtctaagtctttagctgtcctgcaggttgcaatattattc cagaaacttttactcttggacatggaagtgctaagaggcacccaggagaatttccagggc ttcagacatagtcctctctgccaccattgtgtgtcacggcaacccaattactccctgata acatgctcagtgtttggtcctgagagagctcctcttaccacgttcctgatttgggctgca atgcttcagcagtccacatctacctggtgcctgcaaactgcaagggaccatctgcgaacc aggcccaaatgttttcttagtctgtcaacagtcaatcctaagaggtcacagatgtggact gatgtaataatggcaaatggtgtgaaaacatatggagatggttatatctccatagagaag gcagactacaaaacagcatgttctcagacacagccagtttcccagatccccaagggacca tggagaaaggcagctgtggatggtcctaccatgagccgtgaggattctgccatggcccaa gaagcaactcatgggttttaa >gi568815591r:28855614_29058046|GENSCAN_predicted_peptide_5|46_aa MAAQKLDQGQVIIRGGGHILPYDQPLRAFDMINRFIYGKGWDPYVG >gi568815591r:28855614_29058046|GENSCAN_predicted_CDS_5|141_bp atggcagctcagaaattggaccagggtcaggtaattattcgaggtggaggacatatttta ccctatgaccagcctctgagagcttttgacatgattaatcgattcatttatggaaaagga tgggatccttatgttggataa >gi568815591r:28855614_29058046|GENSCAN_predicted_peptide_6|176_aa MERVAKKEVMWGKIKHTKIAKNERQMGVFQVLVSNKLFAPQTLPQCLLLEDLTCDFREAL WLGGGVWFDYCVLVFCPQSDFHKWTEELCPCLEDRDKEPVKEDIHHCRRRLLVRANKEEQ EALALHRISEGCQRSTISSLNVFCIHKKSQRVECDICHQFLTGGQFLLPPLWDLSP >gi568815591r:28855614_29058046|GENSCAN_predicted_CDS_6|531_bp atggagagagttgccaagaaggaggtaatgtggggaaagatcaagcataccaagattgcc aagaatgaaaggcaaatgggagttttccaggtgttggtctctaacaaactatttgcaccc caaactttgcctcagtgcctgcttttggaggacctaacctgtgacttcagagaagctctc tggctgggaggtggggtgtggtttgattactgtgttctggttttctgcccacagtcagat ttccataagtggacagaggaactttgcccctgtctggaagatagagacaaagaacccgtg aaagaggacatccatcattgtagaaggcgactgctggtcagagccaacaaggaggaacaa gaagcacttgctcttcatcggatctctgaaggctgccagaggagtaccatttcatctcta aatgtgttttgcattcacaagaagagtcagcgggttgagtgtgatatttgccaccagttt ctgactggcggccaattccttttaccacctctgtgggacttgtcgccctga >gi568815591r:28855614_29058046|GENSCAN_predicted_peptide_7|157_aa MFTLGPTHPLRFKQLRHGAILRGRPPPDYITPLGPNTHYISTSLNPCRHPPTSTQRFTAS RYWLEPAVQMRPQNFSSCNVLRPKEWVVKHTGEAALGTKGAKACTLQSLRAACLGLLPLT ANPYPPAAGWLHTCTYLNRAWELAHPGTVQGPEDRPI >gi568815591r:28855614_29058046|GENSCAN_predicted_CDS_7|474_bp atgttcacactgggtcccacacatcccctgagattcaagcagctacggcatggtgccatt ttgagaggccggcccccaccagactacatcacacctttggggcccaatacccactacata tctacatccctgaacccctgcagacacccccctacatctactcaaaggtttacagcatca cgatactggctggagccagcagtgcagatgcgtccccaaaactttagctcatgcaatgtc ttacgccccaaggaatgggtagtgaaacacactggggaggctgccctagggacaaaggga gccaaagcatgtactctccagagcctgagagctgcctgcctggggctgttgccactgaca gcaaatccttaccccccagcagcagggtggctgcacacctgcacatatcttaacagggcc tgggaactggctcacccaggcactgtccaaggacctgaggacaggcccatctag >gi568815591r:28855614_29058046|GENSCAN_predicted_peptide_8|185_aa MFSEWIVDLKVKHKTITLVGINLGENLNDLGYGVDVSDTMLKEQSMKEIIEKLDIIKINN FYFTKNNVNFMNRRPLLREWLSLEVTYFCYFAAIDREDQFSFQYIPNPRPCCLKYQQALG VLIYNGQLDIIVAAALTERSLMGMDWKGSQEYKKAEKKVWKIFKSDSEVAGYIRQAGDFH QKNIK >gi568815591r:28855614_29058046|GENSCAN_predicted_CDS_8|558_bp atgttctctgaatggattgtagacctaaaggtaaaacataaaactataacacttgtaggc attaacttaggagaaaatctaaatgaccttgggtatggtgttgatgtttcagatacaatg ctaaaggaacaatccatgaaagaaataattgagaagctggacatcattaaaattaataac ttctactttaccaaaaacaatgtcaacttcatgaaccgcagaccgctcctccgagagtgg ctgtccttggaagttacttacttctgttacttcgcagctattgacagagaggaccaattt tctttccagtatataccaaacccaagaccttgttgcttaaaataccagcaagccttaggg gttctgatctacaatggccaactggacatcatcgtggcagctgccctgacagagcgctcc ttgatgggcatggactggaaaggatcccaggaatacaagaaggcagaaaaaaaagtttgg aagatctttaaatctgacagtgaagtggctggttacatccggcaagcgggtgacttccat cagaaaaatatcaagtag