GENSCAN 1.0 Date run: 7-Nov-116 Time: 01:48:35 Sequence gi568815595f:150452354_150683908 : 231555 bp : 38.90% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4723 4774 52 1 1 66 115 49 0.845 3.39 1.02 Term + 6023 6283 261 1 0 110 42 178 0.994 9.94 1.03 PlyA + 7138 7143 6 1.05 2.00 Prom + 17113 17152 40 -8.25 2.01 Init + 17467 17651 185 0 2 51 27 170 0.133 5.84 2.02 Term + 43704 43824 121 0 1 100 43 113 0.089 4.97 2.03 PlyA + 43861 43866 6 1.05 3.03 PlyA - 44555 44550 6 1.05 3.02 Term - 49018 48783 236 2 2 44 49 202 0.367 7.40 3.01 Init - 49519 49081 439 1 1 48 28 277 0.358 14.02 3.00 Prom - 54606 54567 40 -3.85 4.00 Prom + 57425 57464 40 -5.75 4.01 Init + 59241 59293 53 2 2 106 94 38 0.566 6.88 4.02 Intr + 81798 82132 335 0 2 -18 52 265 0.006 6.59 4.03 Intr + 93454 93950 497 2 2 38 25 388 0.000 19.28 4.04 Intr + 94338 94477 140 2 2 -18 109 135 0.002 3.34 4.05 Intr + 103584 103636 53 0 2 62 90 23 0.002 -2.47 4.06 Intr + 106035 106069 35 1 2 126 55 24 0.296 -0.08 4.07 Intr + 115549 115693 145 0 1 76 95 99 0.850 8.33 4.08 Intr + 115733 115939 207 0 0 71 83 61 0.449 2.03 4.09 Intr + 119605 120176 572 2 2 67 111 313 0.987 23.05 4.10 Intr + 123296 123409 114 1 0 51 68 107 0.953 4.72 4.11 Intr + 129265 129393 129 0 0 50 100 79 0.906 5.27 4.12 Intr + 130847 130912 66 1 0 87 80 91 0.973 6.38 4.13 Intr + 135711 135785 75 2 0 72 103 39 0.519 2.59 4.14 Term + 135825 135992 168 2 0 55 48 155 0.565 5.10 4.15 PlyA + 136012 136017 6 -0.45 5.00 Prom + 136402 136441 40 -5.15 5.01 Init + 151010 151146 137 1 2 51 97 235 0.946 18.38 5.02 Intr + 159440 159609 170 2 2 64 6 175 0.033 5.57 5.03 Term + 163169 164385 1217 0 2 66 37 264 0.122 9.90 5.04 PlyA + 165226 165231 6 1.05 6.00 Prom + 169204 169243 40 -5.55 6.01 Init + 170069 170142 74 1 2 48 82 114 0.991 7.39 6.02 Intr + 170690 170816 127 2 1 100 95 40 0.968 5.66 6.03 Intr + 186416 186566 151 1 1 -16 76 175 0.208 4.71 6.04 Term + 200964 201013 50 1 2 110 41 32 0.057 -2.81 6.05 PlyA + 201439 201444 6 1.05 7.08 PlyA - 201452 201447 6 1.05 7.07 Term - 207957 207883 75 0 0 68 42 89 0.210 -0.84 7.06 Intr - 214662 214434 229 2 1 77 100 94 0.604 6.45 7.05 Intr - 217142 216943 200 2 2 9 107 116 0.380 2.83 7.04 Intr - 226201 226056 146 2 2 82 90 92 0.861 7.88 7.03 Intr - 228185 228115 71 1 2 85 101 25 0.950 1.41 7.02 Intr - 228577 228420 158 1 2 77 87 152 0.995 11.89 7.01 Intr - 229963 229865 99 1 0 87 103 88 0.601 9.59 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 81904 82132 229 0 1 34 52 222 0.950 11.78 S.002 Term + 85408 85535 128 2 2 78 43 85 0.846 0.46 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:150452354_150683908|GENSCAN_predicted_peptide_1|104_aa XASGGGVVAIDNKIEQAMDLVKSHLMYAVREEVEVLKEQIKELVERNSLLERENALLKSL SSNDQLSQLPTQQANPGSTSQQQAVIAQPPQPTQPPQQPNVSSA >gi568815595f:150452354_150683908|GENSCAN_predicted_CDS_1|315_bp nntgcatctgggggaggtgttgtagccattgacaacaaaatagaacaagcaatggatctg gtgaaaagccatttgatgtatgcagtaagagaagaagtggaagttttaaaggaacaaata aaagaattagttgaaagaaactctttacttgaacgagaaaatgcactgttaaaatctctt tcaagcaatgatcaattatcccaactcccaacccaacaggccaatcctggtagcacttct caacagcaagcagtgatagcacagcctccgcagccaacgcaacctccacagcagccgaat gtctcctcagcataa >gi568815595f:150452354_150683908|GENSCAN_predicted_peptide_2|101_aa MAVMTADSSGNAAFLEASSAQIITLWEGKKVTTIFYPRILKNVIPQKGVLRGGGECQPQE GSKKLPPSSPLAPPTPDAFTVFSVDSKFGTEKLTALKVLLL >gi568815595f:150452354_150683908|GENSCAN_predicted_CDS_2|306_bp atggctgtgatgacagctgactcatctggtaatgcagctttcctggaggcttcctctgct cagataatcacactctgggagggaaaaaaggttacaacaatcttctaccccaggatactc aagaatgtcatcccccagaagggagttctacgaggaggaggagaatgtcagccccaggag gggagcaaaaagctgccgccctccagtcccctggctccccccactccagatgcattcaca gtttttagtgtagattccaaatttggaactgaaaaactcacagccttgaaagtcctacta ttgtag >gi568815595f:150452354_150683908|GENSCAN_predicted_peptide_3|224_aa MDNEVQAEVFSDGDEELVGNWSKGYSCYALAKRLVAFCPCLRDLWNFELEGDDLGYLVKE ISKWQSVQKEAEDNSLEILQPNEAVEKENPFSGEKFKPAAEICISNEELNVNHQDNGENV SGHVRELHNSPSHHRPRVPGGKDGFVAMAKRGQGTAWTTASQGASPKPGQLPHDIEPVGA EMSRIEIWEPPPRFQRMYGNTWMSKQKFAGEGEALMENLLENLC >gi568815595f:150452354_150683908|GENSCAN_predicted_CDS_3|675_bp atggacaatgaagtccaggctgaggtgttctcagatggagatgaggaacttgttgggaac tggagtaaaggttactcttgctatgctttagcaaagagactggtggcattttgcccctgc cttagagatctgtggaactttgaacttgagggagatgatttagggtatctggtgaaagaa atttctaagtggcaaagcgttcaaaaggaagcagaggacaatagtttggaaattttgcaa cccaatgaagcagtagaaaaggaaaaccccttttctggggagaaattcaagccagctgca gaaatttgcataagtaatgaggagctgaatgttaatcaccaagacaatggggaaaatgtc tcagggcatgtcagagaacttcacaacagcccctcccatcacaggcccagagtcccagga gggaaagatggttttgtggccatggctaaaagaggccaaggcacagcttggaccactgct tcacagggtgcaagccccaagcctgggcagcttccacatgatattgagcctgtgggtgca gagatgtcaagaattgagatttgggaacctccacctagatttcagaggatgtatggaaac acctggatgtccaagcagaagtttgctggcgagggggaagccctcatggagaacctcttg gagaacctctgctag >gi568815595f:150452354_150683908|GENSCAN_predicted_peptide_4|862_aa MDNPGGLYVKQNKSGTERWGKKEIEFVTKQLSTKKSLGPDGFIGEFYKTYKEFMPILHKL FQETEENNSRLILRSQYYPDNKTYTTRNENYTKISYEHKCKNSTEILANQNLYLKKGIFK KLTTNIILHDAEKPLGPQAPTPARRSPAPSPHRSPLPFPSPHSLAVRRLGPLPLHGTSPR SAGSGRKRDAAPGTRTQLRLRNAEGRPLFPYRGLGDVAALGDVLAVLLVGHTDPLLGDHL RGATTCPGHPSDSLTRLPPLEPLRGSARGARAAGAPRFSGQTLATAAGPGRRKREADGSG NTNSLSPRMRRRVGAGFGPGELSPERFLFPGQHGAVHAALDRKAVWNICTSEQSVVWRQG NLGRIAKSVSLVAVYVPGSKGAPSFVRLYQYPNFAGPHAALANKSFFKADKVTMLWNKKV VNLGYIPMCITEESMPICVLNLIKVFTVIATAVLVIASTDVDKTGASYYGEQTLHYIATN GESAVVQLPKNGPIYDVVWNSSSTEFCAVYGFMPAKATIFNLKCDPVFDFGTGPRNAAYY SPHGHILVLAGFGNLRGQMEVWDVKNYKLISKPVASDSTYFAWCPDGEHILTATCAPRLR VNNGYKIWHYTGSILHKYDVPSNAELWQVSWQPFLDGIFPAKTITYQAVPSEVPNEEPKV ATAYRPPALRNKPITNSKLHEEEPPQNMKPQSGNDKPLSKTALKNQRKHEAKKAAKQEAR SDKSPDLAPTPAPQSTPRNTVSQSISGDPEIDKKIKNLKKKLKAIEQLKEQAATGKQLEK NQLSMERYNPLIVIDLLTVCNPCVRSVGQLSPVESMRLSGRRGHGGSCEELLTPKPCPEC RADCLLTVGLVTDKCPVSVVVG >gi568815595f:150452354_150683908|GENSCAN_predicted_CDS_4|2589_bp atggataaccctggagggctttatgttaagcaaaataagtcaggcacagaaaggtgggga aaaaaagaaattgaatttgtaacaaaacaactatccacaaagaaaagcctaggcccagat ggctttattggtgaattctacaaaacatacaaagaattcatgccaattcttcacaagctt ttccaggaaaccgaagagaacaactcccgacttattctacgaagccagtattaccctgat aacaaaacctacacaacaagaaatgaaaactacaccaagatctcttacgaacataaatgt aaaaattctacagagatactagcaaaccaaaatctctacctgaaaaagggcatcttcaaa aaactcacaactaacatcatacttcatgatgcagagaaaccactcggaccccaggctccc acgcctgcacgccggtcgccggccccctcacctcacaggtctccccttccgttcccgagc ccccacagtctggcggttcgcagactcgggcccctacccctccacggcaccagcccacgg tcggccggatccgggagaaaacgcgacgcggccccagggacccggactcagctccggctg cggaacgcagaggggcgcccgctctttccttaccgaggtcttggcgacgttgccgcgctg ggtgatgttcttgctgtgcttctcgttggccatacggatcctttgcttggcgaccatctt cgcggcgccaccacctgccccggccacccctcggactcgctcactcgcctgcctcctctg gagccgctgcgaggctcggctcgtggtgcccgcgccgccggagcgccgaggttctcaggc cagacgctagctacggccgctgggcctgggcgccgcaagcgcgaggctgatggctccgga aacaccaattcgctgtctccacgcatgaggagacgtgtaggggccgggttcggccctggt gaactctcacccgagcggtttctctttccgggacaacatggcgccgtccacgccgctctt gacagaaaggctgtatggaacatttgtacttcagaacagtctgttgtgtggagacagggg aatctgggaagaattgcaaagtctgtatctttagtggctgtctatgttccaggaagtaaa ggtgcaccttcatttgttagattatatcagtaccccaactttgctggacctcatgcagct ttagctaataaaagtttctttaaggcagataaagttacaatgctgtggaataaaaaagtt gttaatttaggctatattcctatgtgtataacagaagaatcaatgcccatttgtgtttta aatctaattaaagtttttactgttatagctactgctgtgttggtaatagctagcacagat gttgacaagacaggagcttcctactatggagaacaaactctacactacattgcaacaaat ggagaaagtgctgtagtgcaattaccaaaaaatggccccatttatgatgtagtttggaat tctagttctactgagttttgtgctgtatatggttttatgcctgccaaagcgacaattttc aacttgaaatgtgatcctgtatttgactttggaactggtcctcgtaatgcagcctactat agccctcatggacatatattagtattagctggatttggaaatctgaggggacaaatggaa gtgtgggatgtgaaaaactacaaacttatttctaaaccggtggcttctgattctacatat tttgcttggtgcccggatggtgagcatattttaacagctacatgtgctcccaggttacgg gttaataatggatacaaaatttggcattatactggctctatcttgcacaagtatgatgtg ccatcaaatgcagaattatggcaggtttcttggcagccatttttggatggaatatttcca gcaaaaacaataacttaccaagcagttccaagtgaagtacccaatgaggaacctaaagtt gcaacagcttatagacccccagctttaagaaataaaccaatcaccaattccaaattgcat gaagaggaaccacctcagaatatgaaaccacaatcaggaaacgataagccattatcaaaa acagctcttaaaaatcaaaggaagcatgaagctaagaaagctgcaaagcaggaagcaaga agtgacaagagtccagatttggcacctactcctgccccacagagcacaccacgaaacact gtctctcagtcaatttctggggaccctgagatagacaaaaaaatcaagaacctaaagaag aaactgaaagcaatcgaacaactgaaagaacaagcagcaactggaaaacagctagaaaaa aatcagctctctatggaaagatacaatccattaatagtcatagacctgctgactgtctgt aacccctgcgtcagatcagtgggacagcttagccctgtagagtcgatgaggttaagtggc aggcgtggacacggcggcagttgtgaagagctgcttacaccaaagccttgcccggaatgc agggctgattgcctgctcacagttggtcttgtgactgacaagtgtcctgttagtgtcgtt gtaggatga >gi568815595f:150452354_150683908|GENSCAN_predicted_peptide_5|507_aa MRLLLLLLVAASAMVRSEASANLGGVPSKRLKMQYATGPLLKFQICPDFPDEALATTAAT AAAASPLPRREAVHVSELRGTGADRNPHSPGRSVGGSSLCTPTPVVSDFLYTNNRQTESQ IMSELPFTIASKRIKYLGIQLTRDVKNLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINI VKMAILPKVIYRFNAIPIKLPMPFFTELEKTTLKFIWNQKRARITKSILSQKNKAGGITL PDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEMMPHIYNYLIFDKPEKNKQWGKDSLF NKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNIRPKTIKTLEENLGITIQDIGMGK DFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTKWEKIFTTYSSDKGLISR IYNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYVAKKHMKKCLPSLAIREMQIKTTMRY HLTPVRMAVIKKSEIHCHLNKVGLCQY >gi568815595f:150452354_150683908|GENSCAN_predicted_CDS_5|1524_bp atgaggcttctgctgcttctcctagtggcggcgtctgcgatggtccggagcgaggcctcg gccaatctgggcggcgtgcccagcaagagattaaagatgcagtacgccacggggccgctg ctcaagttccagatttgccccgatttcccggacgaggcactggcgaccacagcggccact gcggctgccgcctccccactgcccaggcgtgaagccgttcatgtctccgagctccggggt accggcgctgacagaaatccccactctcctggccgctccgtcggcggcagttcactctgc acaccaacacctgttgtttcagatttcctatacaccaacaacagacaaacagagagccaa atcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaa cttacaagggatgtgaagaacctcttcaaggagaactacaaaccactgctcaaggaaata aaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaataaatatc gtgaaaatggccatactgcccaaggtaatttacagattcaatgccatccccatcaagcta ccaatgcctttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaa agagcccgcatcaccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactg cctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaa aacagagatatagatcaatggaacagaacagagccctcagaaatgatgccgcatatctac aactatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctattt aataaatggtgctgggaaaattggctagccatatgtagaaagctgaaactggatcccttc cttacaccttatacaaaaatcaattcaagatggattaaagacttaaacattagacctaaa accataaaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaag gacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggat ctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaa cctacaaaatgggagaaaattttcacaacctactcatctgacaaagggctaatatccaga atctacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgg gcaaaggacatgaacagacacttctcaaaagaagacatttatgtagccaaaaaacacatg aaaaaatgcttaccatcactggccatcagagaaatgcaaatcaaaaccacaatgagatac catctcacaccagttagaatggcagtcattaaaaagtcagaaattcattgtcatttaaat aaagtgggcctgtgtcaatattaa >gi568815595f:150452354_150683908|GENSCAN_predicted_peptide_6|133_aa MRVISQRYPDIRIEGENYLPQPIYRHIASFLSVFKLVLIGLIIVGKDPFAFFGMQAPSIW QWGQENKKIIELEGCGNPRFMSDWSKVWVVQDLQLVSEVVTDLWNRTLNLWDLTLTPEVI VASFSTWSRTGFA >gi568815595f:150452354_150683908|GENSCAN_predicted_CDS_6|402_bp atgcgggttattagccagcggtacccagacatccgcattgaaggagagaattacctccct caaccaatatatagacacatagcatctttcctgtcagtcttcaaactagtattaataggc ttaataattgttggcaaggatccttttgctttctttggcatgcaagctcctagcatctgg cagtggggccaagaaaataagaaaattattgaacttgagggttgtgggaacccccgattt atgtctgattggtcaaaagtatgggtggtccaggacctgcaactggtgtctgaagtggta acagacttgtggaaccgaacacttaacttgtgggatctgacactaactccagaggtcatt gtggcctccttcagtacttggtcaaggacaggatttgcctga >gi568815595f:150452354_150683908|GENSCAN_predicted_peptide_7|325_aa DEEEETSPKCEFCGSDLRAFFSNVDVSSEPKGHASCCIAFQNLIDYIYEEQIKTKPPKAE LIAIDPHAAHGSEVDRLKAKEKALQRKQEQRMARHFAIISREQTHFSEDDSKRLKTISYQ LSVDIPEKQIIDDIVFDFQLRNSNMSIICCDSRIACGKSRSIRTIYMTLSDLSYPSGNLA IIRVPNKVNGFTCIVQEDMPTNPAILAVLDSSGRSSCYHPNGNVWVYINILGGQYSDQAG NRIRAWNWSNSITSSPFVSFKPVFLALNRYIGVRILEQDKISITFLAMGQQARISVGTKV KSRITPPPLLTDELEWTLIRESNPT >gi568815595f:150452354_150683908|GENSCAN_predicted_CDS_7|978_bp gatgaagaggaggagacatcacccaaatgtgaattttgtggcagcgatctaagagcattt ttttctaatgtggatgtttcctctgaaccaaaagggcatgcctcctgttgtattgctttc caaaatctgattgactatatctatgaggagcaaataaaaaccaaaccccctaaagctgaa ttaattgctattgaccctcatgcagcccatggtagtgaggtcgacagactcaaggcaaaa gaaaaagccctgcaaaggaaacaggagcaacgaatggccagacattttgcaataatatca agggaacagactcatttctctgaagatgattcaaagcgcttaaaaacaatttcttatcaa ctttctgtggatattccagaaaaacagataattgatgacattgtatttgattttcaacta agaaacagtaacatgtctatcatttgttgtgattctcggatagcatgtggaaagtcaagg agcattagaacaatttatatgaccttatctgatctcagctatccatctggaaacctagcc atcattcgagtgcccaacaaggtaaatggttttacttgtatagtccaagaagatatgccc actaaccctgctatcctagcagtgctggattcctctggcagaagttcctgctatcatccc aatggaaatgtctgggtatacatcaatatcttgggaggtcaatattcagatcaagccggc aacagaataagggcttggaattggtcaaattccatcacttcctcaccctttgtttcattt aaacctgtctttctggctttgaaccgttatattggagtccgcatcttagaacaagacaag atttctataacttttctagcaatgggccaacaggcaagaatcagtgttggaaccaaagtg aagagtagaataacaccaccgcccctgcttacggatgaattagagtggaccctcatcagg gaatccaaccccacataa