GENSCAN 1.0 Date run: 3-Nov-116 Time: 21:10:00

Sequence gi568815586f:4231280_4452688 : 221409 bp : 44.62% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   5464   5470    7  0  1  110   59     2 0.094   0.55
 1.02 Term +  22485  22549   65  1  2  102   44    79 0.564   2.95
 1.03 PlyA +  25866  25871    6                               1.05

 2.00 Prom +  31812  31851   40                              -0.86
 2.01 Init +  37918  37970   53  0  2   85    5   118 0.565   1.83
 2.02 Intr +  41013  41095   83  0  2  119   85     4 0.783   2.48
 2.03 Intr +  41567  41726  160  0  1   73  -63   176 0.701   0.05
 2.04 Intr +  41870  41974  105  2  0   80   59    83 0.770   3.93
 2.05 Intr +  42722  42956  235  2  1   37   92   457 0.839  38.69
 2.06 Intr +  44726  44941  216  1  0   98   55   222 0.999  18.60
 2.07 Intr +  47481  47640  160  2  1  111  111   289 0.999  33.06
 2.08 Intr +  48464  48545   82  0  1   82   66    25 0.534  -1.60
 2.09 Intr +  52932  53044  113  0  2   50   93    65 0.317   3.22
 2.10 Intr +  57563  57711  149  0  2  137   66   187 0.864  21.35
 2.11 Intr +  58952  59170  219  1  0   50   76    81 0.639   1.60
 2.12 Term +  68581  68730  150  0  0   81   55   278 0.909  21.71
 2.13 PlyA +  68907  68912    6                               1.05

 3.03 PlyA -  69377  69372    6                               1.05
 3.02 Term -  93474  93093  382  2  1   28   40   838 0.981  67.31
 3.01 Init -  96624  96620    5  0  2   76   55     0 0.203  -5.03
 3.00 Prom - 100714 100675   40                              -1.06

 4.00 Prom + 105481 105520   40                              -4.66
 4.01 Init + 112367 113032  666  1  0   66   41   305 0.111  18.83
 4.02 Term + 113741 114673  933  1  0   12   47   280 0.120   8.43
 4.03 PlyA + 114721 114726    6                              -0.45

 5.00 Prom + 115004 115043   40                              -3.26
 5.01 Init + 118543 118617   75  0  0   71   96    52 0.619   5.39
 5.02 Intr + 119988 120098  111  2  0   40   91   105 0.814   6.58
 5.03 Intr + 120981 121184  204  2  0  133   62    22 0.809   3.50
 5.04 Term + 123159 123176   18  2  0   99   47    -9 0.101  -5.58
 5.05 PlyA + 123280 123285    6                               1.05

 6.03 PlyA - 123310 123305    6                               1.05
 6.02 Term - 124594 124468  127  0  1  112   47   114 0.981   7.46
 6.01 Init - 136388 136327   62  2  2   61  113    30 0.287   3.52
 6.00 Prom - 136612 136573   40                              -7.06

 7.06 PlyA - 137120 137115    6                               1.05
 7.05 Term - 139504 139064  441  1  0  109   46   584 0.998  51.56
 7.04 Intr - 141418 141315  104  2  2  121   70    36 0.945   4.99
 7.03 Intr - 148259 148093  167  1  2   81   79   122 0.029  10.20
 7.02 Intr - 160640 160520  121  0  1   79   52    59 0.081   0.95
 7.01 Init - 193951 193870   82  1  1   89   98    43 0.506   4.68
 7.00 Prom - 196759 196720   40                              -4.86

 8.09 PlyA - 196898 196893    6                               1.05
 8.08 Term - 197674 197520  155  2  2  134   44    49 0.858   3.18
 8.07 Intr - 198109 197929  181  1  1   92   42    48 0.782   0.04
 8.06 Intr - 203112 202999  114  0  0  102   70   173 0.738  17.54
 8.05 Intr - 212957 212854  104  0  2   87  113    52 0.955   7.49
 8.04 Intr - 213483 213439   45  1  0  108   49    39 0.562   0.38
 8.03 Intr - 214206 213946  261  1  0   94   79   311 0.611  28.26
 8.02 Intr - 217294 217184  111  2  0   72   47    87 0.750   3.35
 8.01 Intr - 220308 220168  141  1  0   59  110    95 0.965   9.12

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:4231280_4452688|GENSCAN_predicted_peptide_1|23_aa
MPDCKAVEDGDLKMLILFLGPLN

>gi568815586f:4231280_4452688|GENSCAN_predicted_CDS_1|72_bp
atgcctgactgtaaggctgttgaagatggagacctcaagatgctcattctctttcttgga
ccacttaattga

>gi568815586f:4231280_4452688|GENSCAN_predicted_peptide_2|574_aa
MARGGGRGGRRVAALAPGEPSVRLTQPAPHPALLALRAPARAPSLESCIGVATLSADTSG
GLSADAGARKRVFPAWPLAAGEPLGALPPACGGPRRCTAGGQKGRCSGPFNRGFRNSFEV
IRNTDFRDMTFISGPGKAGGRGAAGLAMELLCHEVDPVRRAVRDRNLLRDDRVLQNLLTI
EERYLPQCSYFKCVQKDIQPYMRRMVATWMLEVCEEQKCEEEVFPLAMNYLDRFLAGVPT
PKSHLQLLGAVCMFLASKLKETSPLTAEKLCIYTDNSIKPQELLEWELVVLGKLKWNLAA
VTPHDFIEHILRKLPQQREKLSLIRKHAQTFIALCATVAMAETLEPYSSKGEMSGVHVLT
IRNLRHCAFTRIILSESSEQHYNVGIAVVIICRSENENSERSDFKFAMYPPSMIATGSVG
AAICGLQQDEEVSSLTCDALTELLAKITNTDVLREMAQGDADGANPGSSSEIVRRYGFKM
RAPNIRFLNEDSRTQTENLPPVQHEECFSNPPEENRGRDLTFIEMDCLKACQEQIEAVLL
NSLQQYRQDQRDGSKSEDELDQASTPTDVRDIDL

>gi568815586f:4231280_4452688|GENSCAN_predicted_CDS_2|1725_bp
atggcgaggggcggggggcggggagggaggcgggtcgcggcgctggctccgggggaacct
agtgtacggctcacccagcccgcgccccaccccgccttgctggctctccgcgcccctgcc
cgggccccctctctcgaaagctgcatcggtgtggccacgctcagcgcagacacctcgggc
ggcttgtcagcagatgcaggggcgaggaagcgggtttttcctgcgtggccgctggccgcg
ggggaaccgctgggagccctgcccccggcctgcggcggccctagacgctgcaccgcgggg
gggcagaagggacgttgttctggtccctttaatcggggctttcgaaacagcttcgaagtt
atcaggaacacagacttcagggacatgacctttatctctgggccggggaaagcaggaggg
agaggggccgccgggctggccatggagctgctgtgccacgaggtggacccggtccgcagg
gccgtgcgggaccgcaacctgctccgagacgaccgcgtcctgcagaacctgctcaccatc
gaggagcgctaccttccgcagtgctcctacttcaagtgcgtgcagaaggacatccaaccc
tacatgcgcagaatggtggccacctggatgctggaggtctgtgaggaacagaagtgcgaa
gaagaggtcttccctctggccatgaattacctggaccgtttcttggctggggtcccgact
ccgaagtcccatctgcaactcctgggtgctgtctgcatgttcctggcctccaaactcaaa
gagaccagcccgctgaccgcggagaagctgtgcatttacaccgacaactccatcaagcct
caggagctgctggagtgggaactggtggtgctggggaagttgaagtggaacctggcagct
gtcactcctcatgacttcattgagcacatcttgcgcaagctgccccagcagcgggagaag
ctgtctctgatccgcaagcatgctcagaccttcattgctctgtgtgccaccgtggccatg
gcagagaccctggaaccttactcatccaagggagagatgtcaggagttcatgttttgaca
atcagaaaccttaggcactgtgcttttacaaggattattttaagtgaatcctcagaacag
cactacaacgtgggtattgctgttgtcattatttgcagatcagaaaatgaaaactcagag
aggtcagactttaagtttgccatgtacccaccgtcgatgatcgcaactggaagtgtggga
gcagccatctgtgggctccagcaggatgaggaagtgagctcgctcacttgtgatgccctg
actgagctgctggctaagatcaccaacacagacgtgctcagagaaatggcgcagggagat
gctgacggagcaaatccggggtcctcctctgagatagttcgtagatatggttttaaaatg
cgggctccaaacatacgctttttaaacgaggactccagaacacagactgaaaacctccct
ccagtgcagcatgaagaatgcttttctaatcctccagaggaaaatcgtggacgggactta
acgtttatagaaatggattgtctcaaagcttgccaggagcagattgaggcggtgctcctc
aatagcctgcagcagtaccgtcaggaccaacgtgacggatccaagtcggaggatgaactg
gaccaagccagcacccctacagacgtgcgggatatcgacctgtga

>gi568815586f:4231280_4452688|GENSCAN_predicted_peptide_3|128_aa
MRLGPAAGRDVSCEQLTQLYSACQRPQVNPGLRRKQNSLLKRLRKAKKEAPPMEKPEVVK
THLRDMIILPKMVGSMVGVYNGKTFNQVEIKPEMISHYLGEFSITYKLVKHCRPGIGATH
SSRFIPLK

>gi568815586f:4231280_4452688|GENSCAN_predicted_CDS_3|387_bp
atgagacttggaccagctgcgggacgggacgtgtcctgcgagcagctgacgcagctgtac
agtgcgtgccagcggccgcaagtgaacccgggcctgcggcggaaacagaactcgctgctg
aagcgcctgcgcaaggccaagaaggaggcaccgcccatggagaagccggaagtggtgaag
acgcacctgcgggacatgatcatcctgcccaagatggtgggcagcatggtgggcgtctac
aacggcaagaccttcaaccaggtggagatcaagccggagatgatcagtcactacctgggc
gagttctccatcacctataagctggtgaagcactgccggcccggcatcggggccacccac
tcctcccgcttcatccccctcaagtag

>gi568815586f:4231280_4452688|GENSCAN_predicted_peptide_4|532_aa
MKAEIKMFFETNENKETTYQNLWDTFKAVCRGKFIALNAHRRKQERSKVGTLTSQLKELE
KQEQTNSKASRRQEITKIRAELEETETQKTLPKINESRSWFFEKINKIDRPLARPIKKRE
KNQMEAIKNDKGDITTDPTEIRTTIREYYKHLYTNKLESLEEMDKFLDTYTLPRLKQEEV
ESLIRPVTGSEIEAIINSLPTKKSPGPDGFTAEFFQRYKEELRIKYLGIQLTRDMKDLFK
EHYKPLLNEIKEDTNKWKNIPCSWIGRINIMKMAILPKVIYRFNAIPFELPMTFFTDLEK
TTLKFIWNQKRAHIAKSILSQKNKAGGITLPDFKLCYKATVTKTAWYWYQNRDIDQWNRT
EPSEIIPHIYNYLNFDKPDKNKKWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSR
WSKDLHVRPKTIKTLEENLGNTIQDIGMGKDFMTKTPKAMATKAKVDEWDLIKLKSFCTA
KETTIRVNRQPTEWEKIFPIYSSDKGLISRIYKELKQICKKKTTPSTSGQRI

>gi568815586f:4231280_4452688|GENSCAN_predicted_CDS_4|1599_bp
atgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagagacaacataccag
aatctctgggacacatttaaagcagtgtgtagagggaaatttatagcactaaatgctcac
aggagaaagcaggaaagatctaaagttggcaccctaacatcacaattgaaagaactagag
aagcaagagcaaacaaattcaaaagctagcagaaggcaagaaataactaagatcagagca
gaactggaggagacagagacacaaaaaacccttccaaaaatcaatgaatccaggagctgg
ttttttgaaaagatcaacaaaattgatagaccgctagcaagaccaataaagaaaagagag
aagaatcaaatggaagcaataaaaaatgataaaggggatatcaccacagatcccacagaa
atacgaactaccatcagagaatactataaacacctctacacaaataaactagaaagtcta
gaagaaatggataaattcctcgacacctacaccctcccaagactaaaacaggaagaagtt
gaatctctgattagaccagtaacaggctctgaaattgaggcaataattaatagcttacca
accaaaaaaagtccaggaccagacggattcacagccgaattcttccagaggtacaaggag
gagctgaggataaaatacctaggaatccaacttacaagggatatgaaggacctcttcaag
gagcactataaaccactgctcaatgaaataaaagaggacacaaacaaatggaagaacatt
ccatgctcatggataggaagaatcaatatcatgaaaatggccatactgcccaaggtaatt
tatagattcaatgccatccccttcgagctaccaatgactttcttcacagacttggaaaaa
actactttaaagttcatatggaaccaaaaaagagcccacattgccaagtcaatcctaagc
caaaagaacaaagctggaggcatcacgctacctgacttcaaactatgctacaaggctaca
gtaaccaaaacagcatggtactggtaccaaaacagagatatagaccaatggaacagaaca
gagccctcagaaataataccacacatctacaactatctgaactttgacaaacctgacaaa
aacaagaaatggggaaaggattccctattcaacaaatggtgctgggaaaactggctagcc
atatgtagaaagctgaaactggatcccttccttacaccttatacaaaaattaattcaaga
tggagtaaagacttacatgttagacctaaaaccataaaaaccctagaagaaaacctaggc
aataccattcaggacataggcatgggcaaggacttcatgactaaaacaccaaaagcaatg
gcaacaaaagccaaagttgacgaatgggatctaattaaactaaagagcttctgcacagca
aaagaaactaccatcagagtgaacaggcaacctacagaatgggagaaaatttttccaatc
tactcatctgacaaagggcttatatccagaatctacaaagaactcaaacaaatttgcaag
aaaaaaacaaccccatcaacaagtgggcaaaggatatga

>gi568815586f:4231280_4452688|GENSCAN_predicted_peptide_5|135_aa
MHGILERSKFCKDMTVKYDSRLRERKYGVVEGKALSELRAMAKAAREECPVFTPPGGETL
DQVKMRGIDFFEFLCQLILKEADQKEQFSQGSPSNCLETSLAEIFPLGKNHSSKVNSDSG
IPGLAASVLVRQPLQ

>gi568815586f:4231280_4452688|GENSCAN_predicted_CDS_5|408_bp
atgcatggaattttggagagaagcaaattttgcaaagatatgacggtaaagtatgactca
agacttcgggaaaggaaatacggggttgtagaaggcaaagcgctaagtgagctgagggcc
atggccaaagcagccagggaagagtgccctgtgtttacaccgcccggaggagagacgctg
gaccaggtgaaaatgcgtggaatagacttttttgaatttctttgtcaactaatcctgaaa
gaagcggatcaaaaagaacagttttcccaaggatctccaagcaactgtctggaaacttct
ttggcagagatatttcctttaggaaaaaatcacagctctaaagttaattcagacagcggt
attccaggattagcagccagtgtcttagttaggcagccactgcagtga

>gi568815586f:4231280_4452688|GENSCAN_predicted_peptide_6|62_aa
MSIHYRSFAKDWVAHARGKIRAADLTSFSADTIESSLPILDKAAFLPPVSIPLTLTYFAQ
QH

>gi568815586f:4231280_4452688|GENSCAN_predicted_CDS_6|189_bp
atgagcattcactacaggtccttcgccaaagactgggtagcacatgcacgaggtaagata
cgagcagcagatcttacatccttcagtgctgacaccatcgagtcctccctgcccatcctg
gataaagcagcattcctcccaccagtctccatcccccttactctgacctattttgctcaa
cagcactga

>gi568815586f:4231280_4452688|GENSCAN_predicted_peptide_7|304_aa
MPGRKEVHVCGPSAVAHACKLSTLADQVPALGPPRAHPGGFSPEDTFLETGTFIEEGCGV
TDKDLNSDVCSMSVLRAYPNASPLLGSSWGGLIHLYTATARNSYHLQIHKNGHVDGAPHQ
TIYSALMIRSEDAGFVVITGVMSRRYLCMDFRGNIFGSHYFDPENCRFQHQTLENGYDVY
HSPQYHFLVSLGRAKRAFLPGMNPPPYSQFLSRRNEIPLIHFNTPIPRRHTRSAEDDSER
DPLNVLKPRARMTPAPASCSQELPSAEDNSPMASDPLGVVRGGRVNTHAGGTGPEGCRPF
AKFI

>gi568815586f:4231280_4452688|GENSCAN_predicted_CDS_7|915_bp
atgcctggacggaaagaagtccacgtttgtgggccgagcgcggtggctcatgcctgtaaa
ctcagcactttggcagaccaagtccctgcccttggcccgccacgtgcccatcctggtggc
ttcagtcctgaagatacgttcttggaaacagggacattcatagaagagggctgtggggtc
actgacaaagatctgaattcagacgtctgcagcatgagcgtcctcagagcctatcccaat
gcctccccactgctcggctccagctggggtggcctgatccacctgtacacagccacagcc
aggaacagctaccacctgcagatccacaagaatggccatgtggatggcgcaccccatcag
accatctacagtgccctgatgatcagatcagaggatgctggctttgtggtgattacaggt
gtgatgagcagaagatacctctgcatggatttcagaggcaacatttttggatcacactat
ttcgacccggagaactgcaggttccaacaccagacgctggaaaacgggtacgacgtctac
cactctcctcagtatcacttcctggtcagtctgggccgggcgaagagagccttcctgcca
ggcatgaacccacccccgtactcccagttcctgtcccggaggaacgagatccccctaatt
cacttcaacacccccataccacggcggcacacccggagcgccgaggacgactcggagcgg
gaccccctgaacgtgctgaagccccgggcccggatgaccccggccccggcctcctgttca
caggagctcccgagcgccgaggacaacagcccgatggccagtgacccattaggggtggtc
aggggcggtcgagtgaacacgcacgctgggggaacgggcccggaaggctgccgccccttc
gccaagttcatctag

>gi568815586f:4231280_4452688|GENSCAN_predicted_peptide_8|370_aa
XGWPLFENNASSEIHGFKAFNSTTLIQTFHKLYCMAGTVLRSEDTETEHLRNLGCLLLHM
LAAITTYAVADGNEELEAHVSLPHRILVGMVVPSPAGTRANNTLLDSRGWGTLLSRSRAG
LAGEIAGVNWESGYLVGIKRQRRLYCNVGIGFHLQVLPDGRISGTHEENPYSSYPASCLQ
AASLLYGLLEISTVERGVVSLFGVRSALFVAMNSKGRLYATPSFQEECKFRETLLPNNYN
AYESDLYQGTYIALSKYGRNRWRMRLCPKQVLNLVHHADSQRRTMFVYLIERLKFRHQRA
VSCVVSSFCKSLSTSGWSQGRQAKRTRKHRFMVLLIIKAGRSPVAYSVQLPCSLPMGKLA
QEGEDSDGRL

>gi568815586f:4231280_4452688|GENSCAN_predicted_CDS_8|1113_bp
nagggctggcccttgtttgagaataacgcgtctagtgaaatccatggatttaaagccttt
aattcaacaacacttattcaaacatttcataaactgtattgcatggcgggcacggtgctc
cgttctgaagatacagagacagaacatttgaggaatctgggctgcctgcttctgcatatg
ttagcagccatcacaacatatgctgtggcggatggcaatgaagaactggaagcccacgtg
tctctcccacaccgcatcctagtgggcatggtggtgccctcgcctgcaggcacccgtgcc
aacaacacgctgctggactcgaggggctggggcaccctgctgtccaggtctcgcgccggg
ctagctggagagattgccggggtgaactgggaaagtggctatttggtggggatcaagcgg
cagcggaggctctactgcaacgtgggcatcggctttcacctccaggtgctccccgacggc
cggatcagcgggacccacgaggagaacccctacagttcttatcctgcaagctgcctgcaa
gctgccagcttgctgtatggcctgctggaaatttccactgtggagcgaggcgtggtgagt
ctctttggagtgagaagtgccctcttcgttgccatgaacagtaaaggaagattgtacgca
acgcccagcttccaagaagaatgcaagttcagagaaaccctcctgcccaacaattacaat
gcctacgagtcagacttgtaccaagggacctacattgccctgagcaaatacggacggaac
aggtggaggatgcgattatgccccaagcaggtcctgaatctggtccatcatgcagatagc
caacgcagaaccatgtttgtatacctcattgaacgactgaaattccggcaccagagagct
gtgtcctgcgtggtttcctctttctgtaaatcgctgagtacctcaggctggtcccaggga
aggcaggcaaagagaacccgaaagcaccgattcatggtgctcttgattattaaggcagga
agaagtcctgtggcgtactcagtccaactaccctgtagcttgccaatggggaaattggcc
caggagggtgaagacagcgatggaagactatag