GENSCAN 1.0 Date run: 6-Nov-116 Time: 22:14:15

Sequence gi568815596f:227713912_227916384 : 202473 bp : 42.06% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   9742   9790   49  0  1   62   53    72 0.893   2.26
 1.02 Intr +  10668  10897  230  1  2    9   78   188 0.374   6.57
 1.03 Intr +  14537  14746  210  1  0   54   97   202 0.784  15.89
 1.04 Intr +  16779  16844   66  2  0   40   86    92 0.322   2.38
 1.05 Term +  19016  20098 1083  1  0  -49   38   341 0.163   6.51
 1.06 PlyA +  20754  20759    6                               1.05

 2.04 PlyA -  21448  21443    6                               1.05
 2.03 Term -  30459  30304  156  0  0   79   43   178 0.548   9.35
 2.02 Intr -  32305  32121  185  2  2   22   26   253 0.208  11.09
 2.01 Init -  44241  43941  301  0  1   55   33   175 0.462   6.22
 2.00 Prom -  51839  51800   40                              -5.55

 3.00 Prom +  53619  53658   40                              -8.15
 3.01 Sngl +  53712  54332  621  2  0   70   53   548 0.993  45.44
 3.02 PlyA +  54979  54984    6                               1.05

 4.00 Prom +  67663  67702   40                              -2.95
 4.01 Init +  79069  79244  176  0  2   86   89    42 0.430   2.97
 4.02 Intr +  89244  89352  109  0  1   95   85    57 0.088   5.47
 4.03 Term +  91785  91916  132  2  0   -3   43   159 0.053  -0.59
 4.04 PlyA +  93334  93339    6                               1.05

 5.00 Prom +  94202  94241   40                              -7.15
 5.01 Init + 100001 100076   76  1  1   61  115   117 0.998  11.01
 5.02 Intr + 101543 101657  115  0  1   82  103   135 0.556  12.99
 5.03 Intr + 105342 105421   80  0  2   69  110     2 0.094  -1.22
 5.04 Intr + 111350 111480  131  0  2   22   65    81 0.040  -1.31
 5.05 Term + 118116 118256  141  2  0   59   38   157 0.306   4.75
 5.06 PlyA + 118430 118435    6                               1.05

 6.03 PlyA - 119699 119694    6                               1.05
 6.02 Term - 122890 122737  154  0  1   32   48    82 0.439  -4.99
 6.01 Init - 124899 124676  224  0  2   78  116   179 0.870  17.78
 6.00 Prom - 130184 130145   40                              -6.05

 7.00 Prom + 131526 131565   40                              -6.45
 7.01 Init + 135450 135529   80  2  2   61   99    66 0.465   5.58
 7.02 Term + 139919 140219  301  2  1   29   54   172 0.346   1.61
 7.03 PlyA + 140882 140887    6                               1.05

 8.04 PlyA - 140969 140964    6                               1.05
 8.03 Term - 146450 146174  277  1  1   71   33   131 0.557  -0.15
 8.02 Intr - 146648 146561   88  0  1   77   64   130 0.860   7.61
 8.01 Init - 151750 151681   70  1  1   83   78    19 0.438   1.66
 8.00 Prom - 154956 154917   40                              -5.15

 9.03 PlyA - 155434 155429    6                               1.05
 9.02 Term - 156610 156203  408  1  0   52   37   286 0.952  14.13
 9.01 Init - 158700 158566  135  0  0   69   50   159 0.398  10.39
 9.00 Prom - 168892 168853   40                              -5.45

10.00 Prom + 169678 169717   40                              -4.95
10.01 Init + 171445 171512   68  0  2   42  111    77 0.499   6.00
10.02 Intr + 179884 180006  123  1  0   50   65   157 0.539   8.38
10.03 Intr + 184271 184370  100  2  1   64   73    50 0.636   0.29
10.04 Intr + 191018 191124  107  1  2   88   86   100 0.836   7.89
10.05 Intr + 192325 192427  103  1  1   69  105   -34 0.842  -4.34
10.06 Intr + 193227 193341  115  2  1   59   78   137 0.879   8.90
10.07 Term + 194978 195030   53  0  2   26   48   112 0.772  -2.39
10.08 PlyA + 202120 202125    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:227713912_227916384|GENSCAN_predicted_peptide_1|545_aa
MCRRAQEFLSVTLMTKAASTAAAAAGAAAGAAAHPVAPAGGAAATATTAATTTATTAATT
AATAATTAATTPTATHGVSREDSGEVTRKDSAEVLPGGLLLQLLPLLPRLLWGLLQHTCD
LLLPPHLRLMWLRLWEGLLPAERLLPEAMLLLGGPPASGKMLQQGMEKGTEDFSVVLAAT
LGLRKNKIPRNPTYKGCEGPLQGELQTTAQQNKRGHKQIEEHSMLMGRKNQYCEMAILPK
VIYRFNAIPIKLPMSFFTELEKTTLKFIWNQKRARIAKTILSQKNKAGSITLPDFKLHYK
ATVTKTAWYWYQNRDIDQWNRTEPSEIIPHIYNHLIFDKPDENKKWGKDSLFNKWCLENW
LAIYRKLKLDPFLTPSTKINSRWIKDLNVRPKTIKTLEENLSITIQDIGIGKDFMSKASK
AMATKAKIEKWDLIKLKSFCTAKETTIRVNRQPTEWEKMFAIYPSDKRLIPRIYKELKQI
YKKKSNKPIKKWGKDMNRHFSKDNIYAANRHMKKSSSSLAIREMQIKTTMRYLTPVRMAI
IKKSG

>gi568815596f:227713912_227916384|GENSCAN_predicted_CDS_1|1638_bp
atgtgtagacgagcacaggaatttctgtcagtgacactgatgaccaaagcagcctccaca
gcagccgcggcagcaggggcagcagctggagcagcagcccacccggtagcacctgcaggt
ggtgcagctgccacagccaccaccgcagccaccaccacagccaccactgcagccaccacc
gcagccaccgcagccaccaccgcagccaccacacccacagcaacccatggtgtcagtaga
gaggactcaggtgaagtgacgaggaaggactcagcagaggtgctaccgggtgggctgctg
ctccagctgctgcccctgctgccgcggctgctgtgggggctgctgcagcacacctgtgat
ctgctgctgccgccgcacctgcggctcatgtggctgcggctgtgggaagggctgttgcca
gcagaaaggctgctgccagaagcaatgctgctgctaggcgggcccccggcctctgggaag
atgctgcagcaagggatggagaaagggacagaagacttctccgtggtgttggcagccacc
ctgggcctgagaaagaataaaatacctaggaatccaacttacaagggatgtgaaggacct
cttcaaggagaattacaaaccactgctcaacaaaataaaagaggacacaaacaaatagaa
gaacattccatgctcatgggtaggaagaatcaatattgtgaaatggccatactgcccaag
gtaatttatagattcaatgccatccccatcaagctaccaatgagtttcttcacagaactg
gaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcattgccaagacaatc
ctaagtcaaaagaacaaagctggaagcatcacgctacctgacttcaaactacactacaag
gctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagaccaatggaac
agaacagagccctcagaaataataccacacatctacaaccatctgatctttgacaaacct
gatgaaaacaagaaatggggaaaggattccctatttaataaatggtgcttggaaaactgg
ctggccatatatagaaagctgaaactggatcccttccttacaccttctacaaaaattaat
tcaagatggattaaagacttaaatgtcagacctaaaaccataaaaaccctagaagaaaac
ctaagcattaccattcaggacataggcataggcaaggacttcatgtctaaagcatcaaaa
gcaatggcaacaaaagccaaaattgagaaatgggatctaattaaactaaagagcttctgc
acagcaaaagaaactaccatcagagtgaacaggcagcctacagaatgggagaaaatgttt
gcaatctacccatctgacaaaaggctaatacccagaatctacaaagaactcaaacaaatt
tacaagaaaaaatcaaacaaacccatcaaaaagtggggaaaggatatgaacagacacttc
tcaaaagacaacatttatgcagccaacagacacatgaaaaaaagctcatcatcactggcc
atcagagaaatgcaaatcaaaaccacaatgagatatctcacaccagttagaatggcaatc
attaaaaagtcaggataa

>gi568815596f:227713912_227916384|GENSCAN_predicted_peptide_2|213_aa
MHRVRPLGLGQAPLKALCKGHTQFSLQNNVVGRYYCPHFTDEQSNTEVRYLCLIHMNKQL
PEKGAVMRNLHEEMSLGSLTLKEVPHAPSRLAREIPSVNKTAALLLAAALLLTASLLLAT
ALPIATATRAAGAAAAADHGSAAAASTAATAAGAAAGAAAHPTTHRRGPCGGDPAVPLGT
YAAGSDQQPPGSAGGCMSPRMSVSVVLQLAPLS

>gi568815596f:227713912_227916384|GENSCAN_predicted_CDS_2|642_bp
atgcacagagtacgtcccctgggcctgggccaggcacctctcaaagctctctgcaaggga
catacacaattttctctccagaacaacgttgttggaagatattactgtccccattttaca
gatgagcaaagtaacacagaggttcgttatctctgcttaatacatatgaacaagcaattg
ccagaaaaaggagcagtgatgaggaatttacatgaggaaatgagcctgggatccctgaca
ctcaaggaagtgcctcatgctccttccagattggcaagagaaatacccagtgttaacaag
acagcagcattgcttctggcagcagcacttctgctgacagcatcccttctgctggcaaca
gcccttcccatagccacagccacacgagctgcaggtgcggcggcagcagcagatcacggg
agtgctgcagcagcctccacagcagccacggcagcaggggcagcagctggagcagcagcc
cacccgacaacgcacagaagaggcccatgcggtggagaccccgcagtccctcttggaact
tacgctgcaggttctgatcaacaaccaccaggctctgcagggggctgcatgtcccccagg
atgtcggtgtccgtggttttacaactggcacctctttcataa

>gi568815596f:227713912_227916384|GENSCAN_predicted_peptide_3|206_aa
MDVGDKAQGDQCRAARKEPASSQVTRTPAVTEWGEHRFKKYLEGKAVSTWRMNVMRNGQS
GRSRLQTPGKRSFQSLFGCSEPYSGILNQGPFRPPGDNQLCLGQFCWSQQAAGVAVLEDA
AKPPWCTGQAPWDGNPGLGARLPGSPLAAPAPGGRKRGGPGSRSVLLQKQALALRADVRG
SPADPLPSMESRSSQGSPPPFTPPGS

>gi568815596f:227713912_227916384|GENSCAN_predicted_CDS_3|621_bp
atggatgtgggggacaaggcccagggagaccaatgcagagccgccaggaaagagcctgca
agttcccaagtcaccaggaccccagcggtgacggagtggggagagcacaggttcaagaaa
tatttagaaggcaaagctgtcagcacttggcgaatgaacgtgatgcgaaatggacagagt
gggagaagcaggctacagacaccaggaaaaagatcgttccaatcactctttggctgttcc
gagccttatagcgggatcctcaaccaggggccattccgccccccaggagacaaccagctg
tgtctcggacagttttgctggtcacagcaagcggccggggtggcggtgctagaggacgct
gctaaacccccttggtgcacagggcaggccccatgggacgggaatcctgggctcggggcg
cggctgcctgggtcccctctggcggcgccagctccaggaggcaggaaacgaggaggccct
ggctcacggagtgtgcttttacagaagcaagccctggcgctccgggcagatgtccggggg
tctcccgcagaccccttgccctccatggaatcccgttcctcccagggttcccctccccca
tttacccctccaggctcctga

>gi568815596f:227713912_227916384|GENSCAN_predicted_peptide_4|138_aa
MEGVSYWLAFKKFQTFSGDIILIFVPVEITFYSYSSTITHSSSLTCGQLGKIRTTEANQE
TKYECNELSVGEKRSELSLIINSGGCPNVYSKIGLQDEPQTKLLRHRIKEGSGLFGPERQ
QTRVFKSRAPRKRNSWPF

>gi568815596f:227713912_227916384|GENSCAN_predicted_CDS_4|417_bp
atggaaggtgtatcctattggttggctttcaaaaaatttcagactttttctggagatatt
atactgatttttgtgccagttgaaatcacattttattcttattcatctaccatcactcac
tcaagctcccttacatgtggccagcttggaaagatccgcacgacagaggcaaatcaggaa
acgaaatatgaatgtaatgaactctcagtaggtgaaaaaaggagtgaactttctttgatc
atcaactcaggtgggtgtccaaatgtttattcaaaaattggtttgcaggatgagccacag
acgaaactcctcagacaccggattaaagaaggaagtggtttattcggcccggagcgtcag
cagactcgcgtctttaagagccgagctccccgaaaaagaaattcttggcctttttaa

>gi568815596f:227713912_227916384|GENSCAN_predicted_peptide_5|180_aa
MCCTKSLLLAALMSVLLLHLCGESEAASNFDCCLGYTDRILHPKFIVGFTRQLANEGCDI
NAIIHCWNSTVKARHFQGAADLSHCWKASAGDEESIECRFIHTQAIMGLVLHRLKSSEER
ACLVLRVEIYKQIHTTKKLETGPPIATIQGLESLIKKPRQLATLEVGEGYGMLAETLHDL

>gi568815596f:227713912_227916384|GENSCAN_predicted_CDS_5|543_bp
atgtgctgtaccaagagtttgctcctggctgctttgatgtcagtgctgctactccacctc
tgcggcgaatcagaagcagcaagcaactttgactgctgtcttggatacacagaccgtatt
cttcatcctaaatttattgtgggcttcacacggcagctggccaatgaaggctgtgacatc
aatgctatcattcactgttggaacagcacagtcaaagcaaggcattttcagggggcagca
gatttgtctcattgctggaaggcctcagcaggtgatgaggaaagtattgaatgtagattc
attcatactcaggccataatgggcttagtgcttcataggcttaaatccagcgaagaaaga
gcctgcttagtactgagagttgaaatttacaagcagatccatactactaagaaactagag
actggacccccaatagctacaatccagggattagaaagcctcatcaagaagccccgacag
cttgctacccttgaagttggtgaaggatatggaatgcttgcagaaacacttcatgatcta
tga

>gi568815596f:227713912_227916384|GENSCAN_predicted_peptide_6|125_aa
MRKNQRKKAENSKNQNASSSPKDHNSSPAREQNWTENETDELTEVGFRRWVITNSSELKE
HVLSQCKEVKNLEKRIKHERSKINTLTSQLKELEKQEQTNSKASRRQEITKIRAGLKEIE
TQKTL

>gi568815596f:227713912_227916384|GENSCAN_predicted_CDS_6|378_bp
atgaggaaaaaccagcgcaaaaaggctgaaaattccaaaaaccagaacgcctcttcttct
ccaaaggatcacaactcctcaccagcaagggaacaaaactggacagagaatgagactgac
gaattgacagaagtaggcttcagaaggtgggtaataacaaactcctccgagctaaaggag
catgttctgtcccaatgcaaggaagttaagaaccttgaaaaaaggataaagcatgaaaga
tctaaaatcaacaccttaacatcacaattaaaagaactagagaagcaagagcaaacaaat
tcaaaagctagcagaagacaagaaataactaagatcagagcaggactgaaggagatagaa
acacagaaaaccctttga

>gi568815596f:227713912_227916384|GENSCAN_predicted_peptide_7|126_aa
MPHYTLLPFGIQEKPTSINIDTDLEVCTYPLNQNAGQEEEELVWERVWRGLLRRKVNGEA
PSMLASKGESALLSAEAGGKALGNIPGWEGELVHEVCLIKDKLPCPISKIPQSFVPKGED
EDCFSV

>gi568815596f:227713912_227916384|GENSCAN_predicted_CDS_7|381_bp
atgccacattacaccctccttccttttggaattcaggaaaagccaaccagcattaatatc
gacacagaccttgaagtctgcacttatccccttaaccagaacgctgggcaggaggaagaa
gagcttgtctgggaaagggtttggaggggattgttgagaaggaaagttaatggagaagct
cccagcatgttggcaagtaagggtgagtcagctttgctaagtgcagaagctggtggaaag
gcacttgggaacatccctgggtgggaaggagagttggtgcatgaagtttgtttgatcaaa
gacaaacttccttgtcctatttcaaaaattccccagagctttgtacccaaaggagaggac
gaagactgtttttcggtgtga

>gi568815596f:227713912_227916384|GENSCAN_predicted_peptide_8|144_aa
MGDKSETPSKKTKQKRRKEEMIVVGLVPAGDRRQLQGPGSTCGLIPIKQATPSSSNRALH
SWALASWMVTRQVQAIQNASEIVAHSHQLCAGVPVPYSLIIAESATGLVNSSAVLLWQTH
LKILLRICSRPLMIWKTYFPFRSL

>gi568815596f:227713912_227916384|GENSCAN_predicted_CDS_8|435_bp
atgggtgacaagagcgaaactccatctaaaaaaacaaaacaaaaaagaagaaaggaagaa
atgattgtagtgggcctagtgcctgctggagaccgcagacagcttcagggcccaggttcc
acgtgtggcctgattcccatcaagcaggcaacacccagcagctctaacagagctctccac
agctgggctctagcatcatggatggtgacaagacaagtgcaggccattcagaatgccagc
gagattgtggcccacagccatcagctgtgtgctggtgtcccagttccctattccctgatc
atagcagagtcagccacaggcctagtgaacagttctgcagtgttgctttggcagacccac
ctgaaaattctgctcagaatctgttctcgcccactaatgatttggaaaacgtacttcccg
tttcgatcactttag

>gi568815596f:227713912_227916384|GENSCAN_predicted_peptide_9|180_aa
MPSTGRAQFEEAAGLGWDWKKEIMRSLCDTSSAMLTKQLDVLQGKGDLAFRDDSIQPQEE
PAIRPRSSQLVPPMGIQDSKEPNRTCCLNGGTCMLGSFCACLPSFYGWNCEHGVRKENCG
SVPHDTRLPKKCSMCKCWHGQLRCFPQAFLPGCDGLVMDEHLMAPRTPELPPSACTTFIC

>gi568815596f:227713912_227916384|GENSCAN_predicted_CDS_9|543_bp
atgccttctactggcagggcacagtttgaggaagcagcaggattgggatgggactggaaa
aaagaaatcatgagatctctatgcgatacctcatcggcgatgttgaccaagcagttggat
gtgttgcagggaaaaggagacctggccttcagagatgacagcattcagccccaggaggag
cctgcaattcgtcctcggtcttcccagcttgtgccccccatggggatacaggacagtaag
gagccaaacagaacctgctgcctgaatgggggaacctgcatgctggggtccttttgtgcc
tgcctcccctccttctatggatggaactgtgagcacggtgtacgcaaagagaattgtggg
tctgtgccccatgacacccggctgcccaagaagtgttccatgtgtaaatgctggcacggg
cagctccgctgctttcctcaggcatttctacctggctgtgatggccttgtgatggatgag
cacctcatggctcccaggactccagaactaccaccgtctgcatgcaccacttttatatgc
tag

>gi568815596f:227713912_227916384|GENSCAN_predicted_peptide_10|222_aa
MLEYEKHGELKTKSIDLLDLGPSFITGSYDRTCKLWDTASGEELNTLEGHRNVVYAIAFN
NPYGDKIATGSFDKTCKLWSVETGKCYHTFRGHTAEIGHSAEIISLSFNTSGDRIITGSF
DHTVVVWDADTGRKVNILIGHCAEISSASFNWDCSLILTGSMDKTCKLWDATNGKCVATL
TGHDDEILDSCFDYTGKLIATASADDTDDAKLHRYHLPDDHV

>gi568815596f:227713912_227916384|GENSCAN_predicted_CDS_10|669_bp
atgttggaatatgaaaaacatggagaattaaagactaagtccatagatttgcttgatctt
ggtcccagctttatcacaggaagctatgatcggacgtgcaagctctgggacactgcgtct
ggagaggagctgaacacgctggagggccacaggaatgtggtttatgccatagcattcaac
aatccttacggtgacaaaatcgccactgggtcctttgataaaacttgtaaactctggagt
gtggaaacaggaaaatgttaccataccttcaggggtcatacagcagaaataggacattct
gccgaaatcatctccttgtcatttaacacctcaggagacagaatcatcacggggtctttt
gatcataccgttgtagtgtgggacgctgatactggaaggaaggtaaatatcttaattggt
cattgtgctgagattagcagtgcctcattcaattgggattgctctctaatattaactggc
tctatggacaaaacctgcaagctgtgggatgctacaaatggaaaatgtgtggcaacctta
acaggccatgatgatgaaatactagacagctgctttgattacactggaaagcttattgca
actgcttcagctgatgatacagatgatgctaaactacatcgataccatctacctgatgac
cacgtgtga