GENSCAN 1.0    Date run: 6-Nov-116    Time: 06:12:08

Sequence gi568815590r:7871664_7976395 : 104732 bp : 41.98% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   9759   9816   58  2  1   74  110    43 0.875   4.85
 1.02 Term +  10760  10905  146  0  2  107   42   162 0.684  10.59
 1.03 PlyA +  12660  12665    6                               1.05

 2.06 PlyA -  15679  15674    6                               1.05
 2.05 Term -  17153  16981  173  0  2   75   39   130 0.941   3.81
 2.04 Intr -  19674  19348  327  1  0   38   22   289 0.561  12.05
 2.03 Intr -  20008  19829  180  2  0   -3   81   174 0.811   6.52
 2.02 Intr -  20334  20071  264  1  0    9   37   335 0.557  17.06
 2.01 Init -  20603  20489  115  2  1   28   20   188 0.098   6.42
 2.00 Prom -  20732  20693   40                             -10.84

 3.00 Prom +  21663  21702   40                              -8.45
 3.01 Init +  22488  22674  187  2  1   92  109   179 0.594  19.82
 3.02 Intr +  24887  25006  120  0  0   51   11   134 0.223   1.55
 3.03 Intr +  25324  25460  137  2  2   24   92    48 0.088  -1.83
 3.04 Intr +  34994  35112  119  1  2    3  109   157 0.962   7.64
 3.05 Intr +  36609  36714  106  0  1   70   65    72 0.466   2.40
 3.06 Term +  36932  37006   75  1  0  108   54    65 0.548   1.96
 3.07 PlyA +  39267  39272    6                               1.05

 4.02 PlyA -  42189  42184    6                               1.05
 4.01 Sngl -  43893  43582  312  0  0   84   53   297 0.950  21.48
 4.00 Prom -  45032  44993   40                              -6.45

 5.00 Prom +  45539  45578   40                             -11.44
 5.01 Init +  47035  47255  221  0  2   92   58   127 0.646   7.46
 5.02 Intr +  50064  50253  190  0  1   15  113    93 0.336   3.17
 5.03 Intr +  54672  54734   63  2  0   59   98    51 0.133   1.20
 5.04 Intr +  59901  59987   87  2  0   16  109    63 0.304   0.45
 5.05 Intr +  60649  60705   57  0  0   82   86    67 0.318   4.06
 5.06 Intr +  77466  77592  127  2  1   51  109   147 0.472  12.43
 5.07 Intr +  78370  78465   96  2  0   84  115    34 0.963   4.76
 5.08 Intr +  79002  79084   83  1  2   64  115    34 0.995   2.14
 5.09 Term +  80144  80728  585  1  0   55   44   375 0.996  23.32
 5.10 PlyA +  81029  81034    6                               1.05

 6.04 PlyA -  81228  81223    6                               1.05
 6.03 Term -  83655  83083  573  0  0   47   42   236 0.404   8.36
 6.02 Intr -  90475  90106  370  0  1   81   74   147 0.553   6.78
 6.01 Init -  92108  92053   56  2  2   86   39    43 0.522   0.01
 6.00 Prom -  92733  92694   40                              -7.15

 7.03 PlyA -  93102  93097    6                               1.05
 7.02 Term -  96103  95681  423  1  0   50   38   396 0.866  25.11
 7.01 Init -  96881  96651  231  2  0  111   44   237 0.964  19.91
 7.00 Prom -  98299  98260   40                             -12.72

 8.02 PlyA -  99220  99215    6                               1.05
 8.01 Sngl - 101590  99998 1593  1  0  111   43  1245 0.992 117.39
 8.00 Prom - 101715 101676   40                              -4.65

 9.03 PlyA - 102615 102610    6                               1.05
 9.02 Term - 103346 103205  142  1  1   96   48    95 0.626   2.72
 9.01 Intr - 103733 103598  136  0  1   63   33   149 0.504   5.61

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590r:7871664_7976395|GENSCAN_predicted_peptide_1|67_aa
MRIHYLLFALLFLFLVPVPGHGGIINTLQKYYCRVRGGRCAVLSCLPKEEQIGKCSTRGR
KCCRRKK

>gi568815590r:7871664_7976395|GENSCAN_predicted_CDS_1|204_bp
atgaggatccattatcttctgtttgctttgctcttcctgtttttggtgcctgttccaggt
catggaggaatcataaacacattacagaaatattattgcagagtcagaggcggccggtgt
gctgtgctcagctgccttccaaaggaggaacagatcggcaagtgctcgacgcgtggccga
aaatgctgccgaagaaagaaataa

>gi568815590r:7871664_7976395|GENSCAN_predicted_peptide_2|352_aa
MLAVDAVIAELKKQSKPVTKPEEIAQVATISANGDKEIGEKCEFQDAYVLLHEKKISSVQ
SIVPALEIANAYCKPLVIIAEDIDGEALTTLILNRLKVGLQVVAVKAPGFGDNRKNQLKD
TVIATGGEVGEVTVIKDDAMLLKGKGNKSQIEKCVQEIIDQSDVTTSEYEKEKLNGETFR
WSSCAEANEDKIIGIEIIKRTLKIPAMTIAKNAGVDGFLIVEKIMQSSSEVGYDTMLGDV
VNMVEKDIIDPTKVVRTALLDAAGMASLLTTAAVVVTKIPKEGNSPGMGAMCGMGESSLE
LLHKDLPPDLSTRTGTIGAQIANWEERSTKQKCAHGHCNKVYQEVLGILQRV

>gi568815590r:7871664_7976395|GENSCAN_predicted_CDS_2|1059_bp
atgttagctgttgatgctgtaattgctgaacttaaaaagcagtctaaacctgtgaccaaa
cctgaagaaattgcacaggttgctacaatttctgcaaatggagacaaagaaattggtgag
aaatgtgaattccaggatgcctatgttctgttgcatgaaaagaaaatttctagtgtccag
tccattgtacctgctcttgaaattgccaatgcttactgtaagcctttggtcataattgct
gaagacattgatggagaagctctaactacactcatcctgaataggctaaaggttggtctt
caggttgtggcagtcaaagctccagggtttggtgacaatagaaagaaccagcttaaagat
acggttattgctactggtggagaagttggagaggtcactgtgatcaaagatgatgccatg
ctcttaaaaggaaaaggtaacaagtctcaaattgaaaaatgtgttcaagaaatcattgac
cagtcagatgtcacaactagtgaatacgaaaaggaaaaactgaatggagaaactttcaga
tggagtagctgtgctgaagctaatgaagataaaataattggcatagaaattattaaaaga
acactcaaaattccagcaatgactattgctaagaatgcaggtgttgatggatttttgata
gttgagaaaattatgcaaagttcctcagaagttggttatgatactatgttaggagatgtc
gtgaatatggtggaaaaagacattattgacccaacaaaggttgtgagaactgctttattg
gatgctgctggcatggcctctctattaactacagcagctgttgtagtcacaaaaattcct
aaagaagggaacagccctggaatgggtgcaatgtgtggaatgggagagtcatccttggag
cttctgcacaaagatttaccccctgacctgagcaccaggacaggaaccataggtgctcag
atagcaaactgggaggagagaagcacaaaacaaaagtgtgcccatggacattgcaataaa
gtataccaggaagttcttggaattcttcagagagtttag

>gi568815590r:7871664_7976395|GENSCAN_predicted_peptide_3|247_aa
MLPVLPEALSPSRVPGFEPALYHPQELNVRAMDRIRWKELSIWPETVPRYSGKTGRRTEW
AAEGINKLAPVVSLEQNAAKSHEEAKKLLWLMRIQKGLPHQRPSPTQRPVPSWNHSGSAK
GGDAQSSVLHKDSLLLAKVSYAIMPPTEGASEAIGQCQSSATKRRRSLKESVREPWARVP
GAVGMAARKAGLAAKGEGEGVEGYLPLSQKSREGVETRREGVEKMKGIEIKRRERLKSGK
EKVVEGQ

>gi568815590r:7871664_7976395|GENSCAN_predicted_CDS_3|744_bp
atgctcccggtgctgccagaggccctgagcccctccagggtccctgggtttgagccagcc
ctgtatcatccccaggagctgaatgtccgagcaatggatagaattagatggaaagagctc
tcaatttggcctgagactgtccccagatactcaggaaaaacaggacgtcgcacagagtgg
gcagcagaaggtataaacaaattggcacctgtggtctccctggaacaaaatgctgcaaaa
agccatgaggaggccaagaagctgctgtggctgatgcggattcagaaagggctccctcat
cagagaccaagcccaactcaaagaccagttccctcatggaatcatagtggatctgccaag
ggaggggatgcccagtcctctgttcttcacaaggactcccttcttctggctaaggtttct
tatgcaattatgcctcctacagagggggcttctgaggcgatcgggcagtgtcagtcttca
gccactaagcggagaagatctctgaaggagtcagtcagagagccttgggccagagttcca
ggggctgtgggaatggctgccagaaaagcgggacttgccgctaagggtgaaggagaaggg
gttgaagggtacttgcccctctcccagaaaagcagagaaggggtagagacaaggcgagaa
ggagttgagaaaatgaaaggaattgaaattaagagaagggagagattgaagagtggaaag
gagaaagtggttgagggacagtga

>gi568815590r:7871664_7976395|GENSCAN_predicted_peptide_4|103_aa
MEQAHEQSGYEGGQGGQAQAQGHEGHMLELAALSTGALERPRHISTGKEDGNGPGWEVAA
AAAGVLSPGVKGGSLCGRSTDTSVPVHSQSHPWSFCTNIYPLT

>gi568815590r:7871664_7976395|GENSCAN_predicted_CDS_4|312_bp
atggagcaggcacatgagcagagtggctatgaggggggacaaggtgggcaggctcaggcc
cagggacatgaaggccacatgctggagcttgctgccctgagcacgggtgccttggagcgg
cccaggcacatctcgactgggaaggaagatggcaacggaccaggatgggaggtggcagca
gcagcagcaggtgtcctcagccctggggtgaaaggagggtctctgtgtggacgcagcact
gacacctctgtgcctgtgcattctcagagtcatccttggagcttctgcacgaatatttac
cccctgacctga

>gi568815590r:7871664_7976395|GENSCAN_predicted_peptide_5|502_aa
MLLELPEAVSPSQVPGFEPALYHPQELNVPAMDRTRWNRLPVWPETVPRHSGKTGHPTER
AGDLQFTDPESVPLESLTSLECPSQLSRPGVRDGTGGVFWVAPPVTQNQQEQRDIRIQNP
MSERHKVCVSIILNKSTVSLSQGTYSTQPVEGGGTNCKRPGQRPWVVVVTAMGGMGLCSG
PGIVPRQRFLVNLLLNSSALFRALFKEKVTFEDVAIDFTQEEWDMMDTSKRKLYRDVMLE
NISHLVSLGYQISKSYIILQLEQGKELWWEGRVFLQDQNPDRESALKKKHMISMHPIIRK
DTSTSMTMENSLILEDPFEYNDSGEDCTHSSTITQCLLTHSGKKPCVSKQCGKSLRNLLS
PKPRKQIHTKGKSYQCNLCEKAYTNCFYLRRHKMTHTGERPYACHLCGKAFTQCSHLRRH
EKTHTGERPYKCHQCGKAFIQSFNLRRHERTHLGQKCYECDKSGKAFSQSSGFRGNKIIH
IGEKPPACLLCGKAFSLSSDLR

>gi568815590r:7871664_7976395|GENSCAN_predicted_CDS_5|1509_bp
atgctcctggagctgccagaagccgtgagcccctcccaggtccctgggtttgagccagcc
ctgtatcatccccaggagctgaatgtcccagcaatggatagaactagatggaaccggctc
ccagtttggcctgagactgtccctagacattcaggaaaaacaggacatcccacagagcgg
gcaggtgatctccagttcacagaccctgagtctgttcccctcgaatctctgacatctttg
gaatgtccttcccagctctccagaccaggggtcagggatggcacaggaggagttttttgg
gttgctcctccagtaacccaaaaccaacaggagcaacgggatattagaattcaaaacccc
atgtcagaaagacacaaagtctgtgtcagtatcatcttgaataaatccacggtgtccttg
tcccaagggacatacagcacacagcctgtggagggtggaggaaccaactgcaagcgtcct
ggacaacgaccttgggtagtggtagtgacagcaatgggtgggatgggcctgtgctctggc
cctggaatagtgcccaggcagagattcctcgtcaatttgctgctgaattctagtgccctc
tttcgtgccctcttcaaggagaaagtgacttttgaagatgtagctattgacttcacccag
gaagagtgggacatgatggacacatccaaaagaaagctgtacagagatgtgatgctggaa
aatatcagtcacctggtgtccctcgggtaccagataagcaaatcctatataattttgcag
ctggagcaaggaaaagagctgtggtgggaaggaagagtatttcttcaagaccagaatcca
gacagggaaagtgcccttaagaaaaaacacatgatatccatgcatcctatcatcagaaaa
gacacatccaccagtatgacaatggagaactctctcattctggaggatccttttgaatat
aatgattcgggagaagattgcactcacagttccacaataactcagtgtttgttaactcac
agtggaaagaaaccctgtgtcagcaaacagtgtggaaaatcccttcgtaatcttttgtcc
cctaaaccacgtaaacaaattcatactaaaggtaaatcatatcaatgtaatctatgtgaa
aaggcctatactaattgcttttaccttagacggcacaagatgactcacactggagagagg
ccatatgcatgtcatctatgtggaaaagccttcactcagtgttctcaccttagaagacat
gagaaaactcacacgggagagagaccatataagtgtcatcaatgtgggaaagcctttatt
caatcctttaaccttcgaagacatgagagaactcaccttggacaaaagtgttatgaatgt
gataaaagtgggaaagcctttagtcaaagctctggctttagaggaaacaaaataattcac
attggagagaaaccacctgcttgtcttctatgtgggaaggccttcagtctgtcctccgac
cttagatga

>gi568815590r:7871664_7976395|GENSCAN_predicted_peptide_6|332_aa
MNSVKPEARTNIKFTVKLGKVTLNFNSYYSVPQGTRSTFKFSGLTTLSFGRKVLETFNVQ
WMKQPGPTAHNTLTGTQRKRLRICWVELRTDPETQGVGRGSNQEGSGPDLLLGPAPLELA
VFSDPEADFTDGKEVQYFWSSPPLRGSGRAAPEDAPGPAGASGPSGGIQPQADTDALRAR
NRAACAGPAVSGLAKRAASPTPLPEPPRRPQPQSQGDGASLTMGEENSGPPWRPGPPREA
DRACALHGRKGGFHCPLPAMRWQHRTFGLSGGPESESLKFRCGLFSTFFWKIKWKLSTIS
CALIKEDGNKEANSKISIQKSIDSLYVEGRRA

>gi568815590r:7871664_7976395|GENSCAN_predicted_CDS_6|999_bp
atgaattctgtcaaacctgaagcaagaacaaacatcaaatttacggtgaagctggggaaa
gtgactttaaacttcaatagctattactctgttccacaaggaaccaggtcaacattcaaa
ttcagtggtctgacaactctaagctttggccggaaagtattggaaacatttaatgtgcag
tggatgaagcagcccggccccactgcacacaacacactcacggggactcaaaggaagaga
ctcaggatctgctgggtggaactgaggacagacccagaaacacagggggtggggaggggg
tcaaaccaggagggctcaggacctgacctcctcctaggccctgcccctctggaactcgca
gttttttctgacccagaagcggatttcactgatggaaaagaagttcagtatttctggtcc
agcccacctctgcggggctctgggagggcagcgccggaggatgctccgggcccagcgggg
gcatccgggcccagcgggggtatccagcctcaggctgatactgacgccctgagggcgcgg
aatagggcggcctgcgcagggcccgccgtctcgggccttgcaaaaagagcggcctctcca
acgcccctaccggaacctccccggaggccccagccccaaagccagggcgatggcgcctcc
ctgacaatgggtgaagaaaactcaggtcctccctggagacccggcccgccgcgggaggca
gaccgcgcatgcgccctgcatggccggaaaggtgggtttcattgccctctgccggccatg
aggtggcagcacaggacgtttggtcttagcggtggacctgagtctgaatcactgaaattc
aggtgtggattattcagtactttcttttggaagatcaaatggaaattgagtacgatatct
tgtgctttaattaaagaagatggaaataaagaagcaaattcaaaaatcagtatacaaaag
tcgattgattccctctatgtggagggaagacgagcttga

>gi568815590r:7871664_7976395|GENSCAN_predicted_peptide_7|217_aa
MEDDSLYLGGEWQFNHFSKLTSSRPDAAFAEIQRTSLPEKSPLSSETRVDLCDDLAPVTR
QLAPREKLPPSSRRPAAKAPAAKTLTLHTSAKVLILVLKRFSDVTGNRLAKNVQYPECVD
MQPYMSQQNTGPLFYVLYAVLIVTGWSCHNGHYFSCVKAQEGQWYKMDDAEVTASGITSP
LSQQAYVLFYIQKNEFGRPSYRVSAGREPRALCAEDN

>gi568815590r:7871664_7976395|GENSCAN_predicted_CDS_7|654_bp
atggaggacgactcactctacttgggaggtgagtggcagttcaaccacttttcaaaactc
acatcttctcggccagatgcagcttttgctgaaatccagcggacttctctccctgagaag
tcaccactctcatcggagacccgtgtcgacctctgtgacgatttggctcctgtgacaaga
cagcttgctcccagggagaagcttcctccgagtagcaggagacctgctgcgaaggcgcct
gccgccaagacgttaactttacacacttctgccaaggtcctcatccttgtcttgaagaga
ttctccgatgtcacaggcaacagacttgccaagaatgtgcaatatcctgagtgcgttgac
atgcagccatacatgtctcagcagaacacaggacctcttttctatgtcctctatgctgtt
ctcatcgtcaccgggtggagttgtcacaacggacattacttctcttgtgtcaaagctcaa
gaaggccagtggtataaaatggatgatgccgaggtcactgcctctggtatcacttctcct
ttgagtcaacaggcctatgtcctcttttacatccagaagaatgaatttggaagacccagt
tacagggtgtccgcaggcagggaaccaagagctctttgtgctgaagacaattga

>gi568815590r:7871664_7976395|GENSCAN_predicted_peptide_8|530_aa
MEDDSLYLGGEWQFNHFSKLTSPRPDAAFAEIQRTSLPEKSPLSSETRVDLCDDLAPVAR
QLAPREKLPLSSRRPAAVGAGLQNMGNTCYLNASLQCLTYTPPLANYMLSREHSQTCQRP
KCCMLCTMQAHITWALHSPGHVIQPSQALAAGFHRGKQEDAHEFLMFTVDAMKKACLPGH
KQVDHHSKDTTLIHQIFGGCWRSQIKCLHCHGISDTFDPYLDIALDIQAAQSVKQALEQL
VKPEELNGENAYPCGLCLQRAPASNTLTLHTSAKVLILVLKRFCDVTGNKLAKNVQYPEC
LDMQPYMSQQNTGPLVYVLYAVLVHAGWSCHNGYYFSYVKAQEGQWYKMDDAEVTACSIT
SVLSQQAYVLFYIQKSEWERHSESVSRGREPRALGAEDTDRPATQGELKRDHPCLQVPEL
DEHLVERATEESTLDHWKFPQEQNKMKPEFNVRKVEGTLPPNVLVIHQSKYKCGMKNHHP
EQQSSLLNLSSMNSTDQESMNTGTLASLQGRTRRSKGKNKHSKRSLLVCQ

>gi568815590r:7871664_7976395|GENSCAN_predicted_CDS_8|1593_bp
atggaggacgactcactctacttgggaggtgagtggcagttcaaccacttttcaaaactc
acatctcctcggccagatgcagcttttgctgaaatccagcggacttctctccctgagaag
tcaccactctcatctgagacccgtgtcgacctctgtgatgatttggctcctgtggcaaga
cagcttgctcccagggagaagcttcctctgagtagcaggagacctgctgctgtgggggct
ggtctccagaatatgggaaatacctgctacttgaatgcttccctgcagtgcctgacatac
acaccgccccttgccaactacatgctgtcccgggagcactctcaaacatgtcagcgtccc
aagtgctgcatgctctgtactatgcaagctcacatcacatgggccctccacagtcctggc
catgtcatccagccctcacaggcattggctgctggcttccatagaggcaagcaggaagat
gcccatgaatttctcatgttcactgtggatgccatgaaaaaggcatgccttcccggccac
aagcaggtagatcatcactctaaggacaccaccctcatccaccaaatatttggaggctgc
tggagatctcaaatcaagtgtctccactgccacgggatttcagacacttttgacccttac
ctggacatcgccctggatatccaggcagctcagagtgtcaagcaagctttggaacagttg
gtgaagcccgaagaactcaatggagagaatgcctatccttgcggtctttgtctccagagg
gcgccggcctccaacacgttaactttacacacttctgccaaggtcctcatccttgtcttg
aagagattctgcgatgtcacaggcaacaaacttgccaagaatgtgcaatatcctgagtgc
cttgacatgcagccatacatgtctcagcagaacacaggacctcttgtctatgtcctctat
gctgtgctggtccacgctgggtggagttgtcacaacggatattacttctcttatgtcaaa
gctcaagaaggccagtggtataaaatggatgatgccgaggtcactgcctgtagcatcact
tctgtcctgagtcaacaggcctatgtcctcttttacatccagaagagtgaatgggaaaga
cacagtgagagtgtgtcaagaggcagggaaccaagagcccttggcgctgaagacacagac
aggccagcaacgcaaggagagctcaagagagaccacccttgcctccaggtacccgagttg
gacgagcacttggtggaaagagccactgaggaaagcaccttagaccactggaaattcccc
caagagcaaaacaaaatgaagcctgagttcaacgtcagaaaagttgaaggtaccctgcct
cccaacgtacttgtgattcatcaatcaaaatacaagtgtgggatgaaaaaccaccatcct
gaacagcaaagctccctgctaaacctctcttcgatgaactcgacagatcaggagtccatg
aacactggcacactcgcttctctgcaagggaggaccaggagatccaaagggaagaacaaa
cacagcaagagatctctgcttgtgtgccagtga

>gi568815590r:7871664_7976395|GENSCAN_predicted_peptide_9|92_aa
XAVIKVDQPQRKAAQGTTQGSIEPQNLGRNPAQAPKCAYEQGLRVTTLYIRRTKPSDGPH
LAHEKTRERNGAKGHSPLIPSSSLKSSEFHGP

>gi568815590r:7871664_7976395|GENSCAN_predicted_CDS_9|279_bp
ngtgcagtgatcaaagttgaccaaccccagaggaaagctgcccagggcaccactcagggc
tccatagaaccacagaatcttggacgcaaccctgctcaagcacccaaatgtgcatacgaa
cagggtctccgtgtgacgacactttacatcaggcgcacgaagccttctgatggaccacac
ctggcccatgaaaagacaagggaaagaaacggggccaaaggtcacagtcctctcattcca
tcatcctccttaaaatcatccgaatttcatgggccctga