GENSCAN 1.0 Date run: 3-Nov-116 Time: 15:06:29 Sequence gi568815596f:234935003_235153813 : 218811 bp : 47.04% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 4788 4867 80 1 2 93 41 90 0.379 2.73 1.02 PlyA + 4966 4971 6 1.05 2.00 Prom + 14249 14288 40 -2.36 2.01 Init + 16636 16686 51 0 0 74 11 99 0.786 -0.03 2.02 Intr + 16978 17168 191 0 2 82 96 93 0.615 7.88 2.03 Intr + 17414 17784 371 2 2 55 46 136 0.570 0.85 2.04 Intr + 18448 18585 138 2 0 66 90 45 0.324 2.94 2.05 Intr + 29875 30188 314 2 2 61 90 165 0.353 9.90 2.06 Intr + 36171 36222 52 2 1 108 82 -7 0.164 -0.82 2.07 Intr + 43681 43898 218 2 2 63 71 175 0.600 11.52 2.08 Intr + 46621 46725 105 0 0 31 83 64 0.425 0.61 2.09 Intr + 48354 48404 51 2 0 84 48 70 0.729 1.70 2.10 Intr + 50667 50787 121 2 1 57 66 159 0.932 10.67 2.11 Intr + 50968 51168 201 2 0 -4 16 224 0.617 5.26 2.12 Intr + 60301 60374 74 2 2 126 96 98 0.572 13.53 2.13 Term + 60690 60749 60 2 0 80 43 37 0.134 -3.80 2.14 PlyA + 61277 61282 6 1.05 3.06 PlyA - 61356 61351 6 1.05 3.05 Term - 61856 61710 147 2 0 100 34 78 0.021 1.50 3.04 Intr - 66667 66573 95 2 2 50 58 86 0.141 1.48 3.03 Intr - 68562 68423 140 2 2 81 51 73 0.740 3.01 3.02 Intr - 73553 73464 90 1 0 44 54 120 0.703 3.31 3.01 Init - 87980 87835 146 2 2 97 84 64 0.439 6.49 3.00 Prom - 88595 88556 40 -7.46 4.00 Prom + 89534 89573 40 -5.46 4.01 Init + 100001 100118 118 1 1 86 105 123 0.999 14.22 4.02 Intr + 105886 108337 2452 2 1 93 91 3100 0.638 298.15 4.03 Intr + 110024 110339 316 2 1 50 41 225 0.375 10.07 4.04 Intr + 117560 117748 189 1 0 144 83 342 0.999 39.08 4.05 Term + 118590 118814 225 2 0 71 42 388 0.999 29.28 4.06 PlyA + 120600 120605 6 1.05 5.05 PlyA - 120634 120629 6 1.05 5.04 Term - 128297 128224 74 0 2 126 47 93 0.976 7.07 5.03 Intr - 130096 129976 121 1 1 76 37 95 0.916 3.27 5.02 Intr - 132174 132046 129 0 0 74 46 84 0.568 3.69 5.01 Init - 145577 145572 6 2 0 63 117 9 0.267 1.68 5.00 Prom - 146483 146444 40 -2.86 6.00 Prom + 152236 152275 40 -6.96 6.01 Init + 157570 157690 121 0 1 31 57 138 0.377 5.35 6.02 Intr + 158066 158178 113 0 2 78 21 57 0.528 -1.90 6.03 Intr + 158314 158519 206 0 2 78 45 92 0.046 1.90 6.04 Intr + 161920 162090 171 1 0 116 67 54 0.540 5.16 6.05 Term + 178713 178866 154 0 1 117 53 55 0.161 2.29 6.06 PlyA + 180237 180242 6 1.05 7.03 PlyA - 181037 181032 6 1.05 7.02 Term - 201253 200939 315 1 0 7 33 194 0.193 0.74 7.01 Init - 206622 206539 84 0 0 75 76 52 0.368 3.52 7.00 Prom - 207174 207135 40 -4.76 8.02 PlyA - 207339 207334 6 1.05 8.01 Sngl - 212230 212021 210 1 0 68 38 287 0.975 16.89 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 23024 22909 116 0 2 87 41 111 0.856 5.03 S.002 Term + 158314 158523 210 0 0 78 51 103 0.881 2.89 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:234935003_235153813|GENSCAN_predicted_peptide_1|26_aa XSVLIGELANLESMQRDLKMTIYAFA >gi568815596f:234935003_235153813|GENSCAN_predicted_CDS_1|81_bp nactcggtgcttattggggaacttgctaacctggaaagtatgcagagagatctgaagatg acaatctatgcctttgcctga >gi568815596f:234935003_235153813|GENSCAN_predicted_peptide_2|648_aa MGVRGGTGARASTQTCSARDHPPPAEAGAQRARHSRRSGPLATTAAAAAVSPAEPRAPPA GPSRWPTSGASGAGGDAMRARDPARGSGRRAAALRVRSGRPPTPALATSRRAALRGGLPG QPDASSGANRGARSAHGGRGGHGRAWRGRGAPLERRWGGGKDARSPVKLVGDNAYFNFFG DPSCEDEIENSGFWLGDRVTEQPRESLLATSPLPSGLGAFLEIRGPDEMMGEAWILTALC GFGSLPCKNEGGDPSIIDAWQGGYPVHHILFQTPIFTALALRVAESVSQPSAPVTHMWYF SLPACCEILIKLSLINHPVQLLKVTFITQNNTSETKLTVESLKESQHSFIPERKKLLAPT IPLSVAMNLITTVTLRGTTETETSTPTVGCFEVLEGSPGPVERDRVVQRPWPAFETEGQE AGEIRESCRCVEEHAEPLEERLYLQSQLAGVCLHKDSLVTICSRFLAECEVKSRLTFGME VVPGEQDKVLAFMEPVLHPPLMYVRVKQPSMHTYELRDCAQVNQPVTHAYELKEHGAETT VQVNQPLTLIYENRVHVQVYQPLIYTYELRVHVQVNQPLTQTYELRVHVQVNQPSTHTYE HRDHASPFSPGQKAVSWDFSDGERLRLYQRIHFLLPPVERSHSSKKPG >gi568815596f:234935003_235153813|GENSCAN_predicted_CDS_2|1947_bp atgggggtgcgggggggcacaggagcgcgggccagcacccagacctgctcggcgcgggac caccctccgcccgccgaggcgggggcccagcgcgcccggcactctcggcggtccgggccc ctcgccactaccgccgccgccgccgccgtgagtcccgcggagccgcgcgcgcccccggct gggccgagccgctggccgacgagcggagcctcaggagccggcggggacgccatgcgagcc agggacccggcccggggttccgggcgccgagccgcagccctccgcgtgcgctcgggccgc cccccaaccccggctctggccacttcccggcgggcggccctgcggggtggcctccctggg caaccggacgcgtcctcgggcgcaaatcggggggcgcgctcggctcacgggggccgtggg ggccacgggagggcgtggaggggtcggggcgcccccctcgagcggcgctggggaggaggg aaggacgcgcggtctccggttaaacttgttggtgacaatgcatatttcaacttcttcgga gatccctcctgtgaggatgaaattgaaaactccggtttctggcttggcgatcgggtgacc gagcagccgcgagaatctctgctggccacgagtccccttcccagtgggctgggggctttc cttgagatccgtggcccagatgaaatgatgggagaggcctggatcctcaccgccctctgt ggttttggttccttgccttgtaaaaatgaagggggtgacccatccatcatcgacgcctgg cagggaggctaccctgtgcaccacatcctgttccagacccccatcttcacagccctggca ctacgtgtagcagagtcggtctcccagcccagtgctcccgtgacccacatgtggtacttc tccctcccggcctgctgtgagattcttataaaattgtctctgattaatcaccctgtacag cttttaaaagttacttttattacccaaaacaacacttcagaaactaagctgacggttgaa tcattgaaggagtcacagcattctttcatcccagagagaaaaaagctcttggcacccacc attccactttctgttgctatgaatttgattactacagttaccttaagagggacaactgaa acagagacaagcaccccaacagtagggtgcttcgaggtcctggaggggagccctggccct gtggagagagaccgagtcgtgcagcgtccgtggcctgcatttgaaactgaagggcaagag gctggtgagatccgagagtcatgccgctgcgtggaggaacatgctgaacctctggaggaa cggctctacctccagtcgcagttggctggtgtttgtttgcacaaggattctttggtaaca atctgcagtcggtttctggcggaatgtgaggtgaagtcacgactgactttcgggatggag gttgttcctggtgaacaggacaaagtccttgcattcatggagcctgttctgcatcctccc ttgatgtatgttcgggtgaagcagccctcgatgcacacctatgagctcagggactgtgcg caggtgaaccagcctgtgacacacgcctacgagctcaaggagcacggtgcagagaccaca gtgcaggtgaaccagcctctgacactcatctacgagaatagggtccacgtgcaggtgtac cagcccttgatatacacctatgagctcagggtccatgtgcaggtgaaccagcccttgaca caaacctatgagctcagggtccatgtgcaggtgaaccagccctccactcacacctatgag cacagagaccatgcgtctcccttctctcctggacagaaggccgtgtcctgggacttctct gatggcgagaggctgcggctgtaccagcgaatccactttctgttgccacccgtggaacgt tctcactcttccaagaagccaggctag >gi568815596f:234935003_235153813|GENSCAN_predicted_peptide_3|205_aa MGYWLERCEEGQGQGVGPNQTHQAPTATLQREHHPDPAPPSKTATFLLVKYLRSIILHKG YQKENEEKITTYKEKRYKRPESLVPESRLHTGAGKMGICFSADKTVNRHSILRMTLILLS CPDLYREHLRRSGIVQNKTKFNEKPKDNFQKPGLPCSDGACSGDAPNAKSSEASPCGGPK RTMPHNPLVSLQFIYRQEPSCSARL >gi568815596f:234935003_235153813|GENSCAN_predicted_CDS_3|618_bp atgggctactggctagaaagatgtgaggagggacaaggtcaaggagtggggcccaaccag acccatcaggccccaactgcaactctgcagagggagcaccaccctgaccctgctcctccc agcaagactgccacatttttgctagtaaagtacctaaggagcattattctacacaaaggg taccaaaaagagaatgaggagaaaataaccacctacaaggagaagcggtacaagagacca gagagtttagttccagagtccaggctacacacaggtgctggcaaaatgggaatctgcttc tcagcagataaaacagtaaacagacactctatccttcgaatgaccctcattctgctctca tgccccgacctgtacagggagcaccttcgacgttcaggcattgttcagaataagaccaaa ttcaatgaaaaacctaaagataattttcagaagccaggtcttccctgttctgatggggcc tgctctggcgatgccccaaatgcaaagtccagcgaggccagtccatgtgggggcccaaag agaaccatgccgcacaaccccctggtgtctctccagtttatctacagacaagaaccaagc tgcagtgctcgcctttga >gi568815596f:234935003_235153813|GENSCAN_predicted_peptide_4|1099_aa MAAQRIRAANSNGLPRCKSEGTLIDLSEGFSETSFNDIKVPSPSALLVDNPTPFGNAKEV IAIKDYCPTNFTTLKFSKGDHLYVLDTSGGEWWYAHNTTEMGYIPSSYVQPLNYRNSTLS DSGMIDNLPDSPDEVAKELELLGGWTDDKKVPGRMYSNNPFWNGVQTNPFLNGNVPVMPS LDELNPKSTVDLLLFDAGTSSFTESSSATTNSTGNIFDELPVTNGLHAEPPVRRDNPFFR SKRSYSLSELSVLQAKSDAPTSSSFFTGLKSPAPEQFQSREDFRTAWLNHRKLARSCHDL DLLGQSPGWGQTQAVETNIVCKLDSSGGAVQLPDTSISIHVPEGHVAPGETQQISMKALL DPPLELNSDRSCSISPVLEVKLSNLEVKTSIILEMKVSAEIKNDLFSKSTVGLQCLRSDS KEGPYVSVPLNCSCGDTVQAQLHNLEPCMYVAVVAHGPSILYPSTVWDFINKKVTVGLYG PKHIHPSFKTVVTIFGHDCAPKTLLVSEVTRQAPNPAPVALQLWGKHQFVLSRPQDLKVC MFSNMTNYEVKASEQAKVVRGFQLKLGKVSRLIFPITSQNPNELSDFTLRVQVKDDQEAI LTQFCVQTPQPPPKSAIKPSGQRRFLKKNEVGKIILSPFATTTKYPTFQDRPVSSLKFGK LLKTVVRQNKNHYLLEYKKGDGIALLSEERVRLRGQLWTKEWYIGYYQGRVGLVHTKNVL VVGRARPSLCSGPELSTSVLLEQILRPCKFLTYIYASVRTLLMENISSWRSFADALGYVN LPLTFFCRAELDSEPERVASVLEKLKEDCNNTENKERKSFQKELVMVSAQQVPGRSGVLL SKAFPPSQCTWAWAVVSEGMACVPPISHHPCVIQSTPPGSPEPGAGPWPGLLTGAPSTVA DKAPPSRESLCAGGPETTAVAVHAVVEPDSIAEPRTLTSTLHQRWKNDTTRSTTPAVENN RRALLKMDCQGLVVRLIQDFVLLTTAVEVAQRWRELAEKLAKVSKQQMDAYESPHRDRNG VVDSEAMWKPAYDFLLTWSHQIGDSYRDVIQELHLGLDKMKNPITKRWKHLTGTLILVNS LDVLRAAAFSPADQDDFVI >gi568815596f:234935003_235153813|GENSCAN_predicted_CDS_4|3300_bp atggcggctcagcggatccgagcggccaactccaatggcctccctcgctgcaagtcagag gggaccctgattgacctgagcgaagggttttcagagacgagctttaatgacatcaaagtg ccttctcccagtgccttgctcgtagacaaccccacacctttcggaaatgcaaaggaagtg attgcgatcaaggactattgccccaccaacttcaccacactgaagttctccaagggcgac catctctacgtcttggacacatctggcggtgagtggtggtacgcacacaacaccaccgaa atgggctacatcccctcctcctatgtgcagcccttgaactaccggaactcaacactgagt gacagcggtatgattgataatcttccagacagcccagacgaggtagccaaggagctggag ctgctcgggggatggacagatgacaaaaaagtaccaggcagaatgtacagtaataaccct ttctggaatggggtccagaccaatccatttctgaatgggaacgtgcccgtcatgcccagc ctggatgagctgaatcccaaaagtactgtggatttgctcctttttgacgcaggtacatcc tccttcaccgaatccagctcagccaccacgaatagcactggcaacatcttcgatgagctt ccagtcacaaacggactccacgcagagccgccggtcaggcgggacaaccccttcttcaga agcaagcgctcctacagtctctcggaactctccgtcctccaagccaagtccgatgctccc acatcgtcgagtttcttcaccggcttgaaatcacctgcccccgagcaatttcagagccgg gaggattttcgaactgcctggctaaaccacaggaagctggcccggtcttgccacgacctg gacttgcttggccaaagccctggttggggccagacccaagccgtggagacaaacatcgtg tgcaagctggatagctccgggggtgctgtccagcttcctgacaccagcatcagcatccac gtgcccgagggccacgtcgcccctggggagacccagcagatctccatgaaagccctgctg gaccccccgctggagctcaacagtgacaggtcctgcagcatcagccctgtgctggaggtc aagctgagcaacctggaggtgaaaacctctatcatcttggagatgaaagtgtcagccgag ataaaaaatgacctttttagcaaaagcacagtgggcctccagtgcctgaggagcgactcg aaggaagggccatatgtctccgtcccgctcaactgcagctgtggggacacggtccaggca cagctgcacaacctggagccctgtatgtacgtggctgtcgtggcccatggcccaagcatc ctctacccttccaccgtgtgggacttcatcaataaaaaagtcacagtgggtctctacggc cctaaacacatccacccatccttcaagacggtagtgaccatttttgggcatgactgtgcc ccaaagacgctcctggtcagcgaggtcacacgccaggcacccaaccctgccccggtggcc ctgcagctgtgggggaagcaccagttcgttttgtccaggccccaggatctcaaggtctgt atgttttccaatatgacgaattacgaggtcaaagccagcgagcaggccaaagtggtgcga ggattccagctgaagctgggcaaggtgagccgcctgatcttccccatcacctcccagaac cccaacgagctctctgacttcacgctgcgggttcaggtgaaggacgaccaggaggccatc ctcacccagttttgtgtccagactcctcagccaccccctaaaagtgccatcaagccttcc gggcaaaggaggtttctcaagaagaacgaagtcgggaaaatcatcctgtccccgtttgcc accactacaaagtacccgactttccaggaccgcccggtgtccagcctcaagtttggtaag ttgctcaagactgtggtgcggcagaacaagaaccactacctgctggagtacaagaagggc gacgggatcgccctgctcagcgaggagcgggtcaggctccggggccagctgtggaccaag gagtggtacatcggctactaccagggcagggtgggcctcgtgcacaccaagaacgtgctg gtggtcggcagggcccggcccagcctgtgctcgggccccgagctgagcacctcggtgctg ctggagcagatcctgcggccctgcaaattcctcacgtacatctatgcctccgtgaggacc ctgctcatggagaacatcagcagctggcgctccttcgctgacgccctgggctacgtgaac ctgccgctcacctttttctgccgggcagagctggatagtgagcccgagcgggtggcgtcc gtcctagaaaagctgaaggaggactgtaacaacactgagaacaaagaacggaagtccttc cagaaggagcttgtgatggtgagtgctcagcaggtgcctgggcgttcaggggtgctgctc agcaaagcctttccgccttcccagtgcacatgggcgtgggccgtggtgtcggaggggatg gcctgcgtccctcccatcagccatcacccatgtgtgattcagagcacacctcctggctct ccagaacctggtgccgggccgtggcctggtctcctcacgggtgcacccagcacagtggca gacaaagcgcctccatcccgtgagagcctctgtgcgggcggccccgagaccacagccgtg gctgtgcatgctgtcgtggaacctgacagcattgctgagcccaggacactcacctcgacg ttgcaccagaggtggaaaaatgacacaaccaggagtaccacaccagctgtggaaaataac cgccgagccctactgaagatggactgccagggcctggtggtcagactcatccaggacttt gtgctcctgaccacggctgtagaggtggcccagcgctggcgggagctggctgagaagctg gccaaggtctccaagcagcagatggacgcctacgagtctccccaccgggacaggaacggg gttgtggacagcgaggccatgtggaagcctgcgtatgacttcttactcacctggagccat cagatcggggacagctaccgggatgtcatccaggagctgcacctgggcctggacaagatg aaaaaccccatcaccaagcgctggaagcacctcactgggactctgatcttggtgaactcc ctggacgttctgagagcagccgccttcagccctgcggaccaggacgacttcgtgatttga >gi568815596f:234935003_235153813|GENSCAN_predicted_peptide_5|109_aa MPQGQPNRSQSRGLHQAGSLNTTAWILIKVQQEVTASPTMIGSQEVSGDCSERTFNPQKA HLRLQMSTKRTHTCYSLLRGIVYTLGTSESFIGTHNDLQAGIWEPPFIG >gi568815596f:234935003_235153813|GENSCAN_predicted_CDS_5|330_bp atgccgcaggggcagcccaacaggtcccagtcacgtggtctgcatcaagcaggctccctg aacacgactgcctggattttaattaaagttcagcaggaggtgactgcaagtccaacaatg attggaagccaggaggtgtcaggggactgttcagagagaactttcaacccccagaaagct cacctgaggctacagatgtccaccaagcgtacccacacttgctattctctcctacgtggg atcgtctacaccctgggtaccagtgaaagcttcatcgggacccacaatgacctccaagct ggcatttgggagcctccattcatcggctga >gi568815596f:234935003_235153813|GENSCAN_predicted_peptide_6|254_aa MLPPPPHKHQGILTDAANPVCSFAVCIKVRHTKGALALETGGQASLSVPESLRCSAAIYL VLLMSSCCRMRDWEELLERAYASIYRNPICASITRTWGSTDGDFQASGVVLVKSTPWTSF TTEAANLGDFPEDEDEEFGSTSDASWQAVFPVSLCVSASFFWWCDIALYYSYATLGEVGR KLYEISLQYFLQLHMNVQASKLKSSDLVAPCSLLPRRRKSLSPHTPHRGVYGPGGGCPVK SERDGACLCVIRGC >gi568815596f:234935003_235153813|GENSCAN_predicted_CDS_6|765_bp atgcttcctcctcctccccacaagcaccagggcatcctcacagatgcagcaaatcctgtt tgctcttttgccgtgtgcatcaaggtcaggcacaccaagggcgcgctggccttggagaca ggcgggcaggcctcactgtctgtgcctgagtccctgcgctgctctgcagccatctacctc gtcttgctgatgagcagctgctgcagaatgagggattgggaggaattactagagagagcc tatgccagcatctatagaaacccaatatgtgcatcgataaccagaacctggggcagcact gatggggatttccaagcatctggagtagtgctggtaaaatccacaccctggacttcattc accactgaggcagcaaacttaggggattttccagaagatgaggatgaagagtttgggagc acatcagatgccagctggcaggctgtcttccccgtgtctctctgtgtatcagcgtcattt ttctggtggtgtgatattgccttatactatagttatgctacccttggggaagttggacga aagttatatgagatctctctgcagtatttcttacaactgcatatgaatgtgcaagcatcc aaactaaaaagctcagaccttgttgctccctgttctttgcttccccgaagaagaaagtct ctcagcccccacactcctcatagaggtgtctatggccccggtgggggctgtccggtgaag tcagagcgcgacggagcctgcctgtgcgtgattcgtggctgctga >gi568815596f:234935003_235153813|GENSCAN_predicted_peptide_7|132_aa MSSGTVAASLDKDQDCYIQRCGVISDLQVTAIFFRPRELHFSYWNVAWEPLTTCGVRVLS PPGGCVNIGCRDEQLWASIPHIVHTLKQQRRWTRGAVFAYCLVESDAVSPSTIRFWGSDA AELLQLFLYREG >gi568815596f:234935003_235153813|GENSCAN_predicted_CDS_7|399_bp atgtcctcaggcactgtggctgcatcgctagataaggaccaggattgctacatccagcgc tgcggtgtgatctcagacttgcaggtgactgccatcttcttccgtccccgggaacttcat ttctcctactggaacgtggcgtgggaaccactcactacgtgtggggttagagtcctttcg ccccctggtggttgcgtgaacattggttgcagggatgaacagctctgggcctccatcccg cacatcgtccataccctcaagcagcaacgcagatggactcgaggtgcggtgttcgcctac tgccttgtggagtccgacgctgtgtctccctccacgatccggttttggggctcagatgct gcagagttgctccaattatttttatatagagagggatag >gi568815596f:234935003_235153813|GENSCAN_predicted_peptide_8|69_aa MGQIGPETTIDPGKDLNTLDDKELAKRKSIMDELFEKNQKNDPNFVYNIEVKIPQDEQLQ SCGWDTVSS >gi568815596f:234935003_235153813|GENSCAN_predicted_CDS_8|210_bp atgggacaaattggaccggaaacaaccattgatcctggaaaagacctgaacacactggat gacaaggagcttgccaaaaggaagagcatcatggatgaactttttgagaaaaatcagaag aatgatccaaattttgtttacaacattgaggtcaaaatcccacaggatgaacagctgcag tcctgtggctgggacacagtcagctcataa