GENSCAN 1.0    Date run: 8-Nov-116    Time: 13:29:18

Sequence gi568815592f:54208551_54489922 : 281372 bp : 35.65% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  22164  22367  204  2  0  111  100    50 0.490   6.97
 1.02 Intr +  42436  42520   85  0  1   95   80    25 0.336   0.87
 1.03 Term +  56213  56307   95  0  2  111   52    50 0.223   0.71
 1.04 PlyA +  57258  57263    6                               1.05

 2.00 Prom +  72174  72213   40                              -3.65
 2.01 Init +  86526  86958  433  2  1   54   61   236 0.073  13.99
 2.02 Intr + 100047 100355  309  1  0   78  102   172 0.220  12.96
 2.03 Intr + 118248 118366  119  1  2   34   98    64 0.043   1.26
 2.04 Intr + 120023 120152  130  1  1   67   73    68 0.084   2.55
 2.05 Intr + 138817 138967  151  2  1   63   34   166 0.536   7.00
 2.06 Intr + 141166 141346  181  1  1   87  111   104 0.616  11.55
 2.07 Intr + 142058 142133   76  1  1   67   98    66 0.734   3.67
 2.08 Intr + 145963 146086  124  2  1   47   82    99 0.406   3.92
 2.09 Intr + 154255 154422  168  1  0    6   70   142 0.061   2.34
 2.10 Intr + 171381 171459   79  0  1   67   70    36 0.011  -1.67
 2.11 Term + 181241 181375  135  1  0  116   38   174 0.917  12.24
 2.12 PlyA + 181564 181569    6                               1.05

 3.05 PlyA - 182665 182660    6                               1.05
 3.04 Term - 191414 191305  110  0  2   28   54   130 0.439   1.29
 3.03 Intr - 199304 197733 1572  0  0   55   60   351 0.296  17.06
 3.02 Intr - 200394 199455  940  0  1    8   40   445 0.820  21.51
 3.01 Init - 205568 205554   15  2  0   71   91     7 0.146  -0.29
 3.00 Prom - 212419 212380   40                              -3.75

 4.00 Prom + 217038 217077   40                              -7.35
 4.01 Init + 218857 218863    7  0  1   66   93     0 0.069  -0.51
 4.02 Intr + 220993 221080   88  2  1  115   70    46 0.023   3.61
 4.03 Intr + 232549 232774  226  1  1   79   32   103 0.003   0.86
 4.04 Intr + 235448 235538   91  1  1  121   57    73 0.025   6.15
 4.05 Term + 249975 250609  635  1  2   46   50   181 0.002   3.56
 4.06 PlyA + 250628 250633    6                               1.05

 5.00 Prom + 251389 251428   40                              -3.65
 5.01 Init + 253912 253986   75  0  0   87   75    62 0.802   5.94
 5.02 Term + 257469 257486   18  2  0  133   41    18 0.749  -1.06
 5.03 PlyA + 257693 257698    6                               1.05

 6.00 Prom + 259674 259713   40                              -4.05
 6.01 Init + 262001 262104  104  1  2   83   56    80 0.775   4.06
 6.02 Term + 272299 272407  109  1  1   92   50   100 0.268   3.60
 6.03 PlyA + 273212 273217    6                               1.05

 7.02 PlyA - 273330 273325    6                               1.05
 7.01 Term - 277298 276794  505  1  1  -40   42   672 0.593  42.73

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:54208551_54489922|GENSCAN_predicted_peptide_1|127_aa
DVTVPPKPVSLHPLYQTKLYPPAKSLLHPQTLSHADCLAPGPFSHLSFSLSDEQENSHTL
LSHNACNKTVVFKAGDNVCSLIVISLTSSQVPDLLAGLQRIPDDESYSLLLTFCLLIFYG
QNKPEQS

>gi568815592f:54208551_54489922|GENSCAN_predicted_CDS_1|384_bp
gatgtaacagtccctcccaagcctgtctcgctccatcctttatatcagactaaactctat
cctcctgctaagtcactgctgcatccacagaccctctcacatgctgactgtcttgcccca
ggacccttcagtcatctgtccttctccttgagtgatgaacaggagaattctcacaccctc
ctcagtcacaacgcatgcaacaagactgtagtcttcaaggcaggtgacaacgtctgttca
ctgattgttatatccttaacatctagccaagtgcctgacctattagcagggcttcagaga
attccagatgatgaaagctacagcttacttcttactttttgcctgttgattttctatggc
cagaacaaaccagagcagtcatga

>gi568815592f:54208551_54489922|GENSCAN_predicted_peptide_2|634_aa
MLFHTCQQQQQQQRGGLLTHLLGQSLAGSAGAILHAGIHRGIGGSIGRQGCWHPCVHLHQ
QQCQCKTWVIGGLHACIHPNGGGGGAELRGWAVAVHLHVCTGSNGGTGWRRASGLRVCAC
IHAGNGSKAGREQGVLMVAVVALQEIWMEKQYLSQREVDLEAYFTRNHTVLQGTRFKRAI
FQGQYCRNFGCCEDRDDGCVTEFYAANALCYCDKFCDRENSDCCPDYKSFCREEKEWPPH
TQPWYPEGRWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTDHNSLQASEQNWT
ENEFDELSEVGFRRWIITNSSELREHVLTECVAADRIAIQSKGRYTANLSPQNLISCCAK
NRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATKPCP
NNVEKSNRIYQCSPPYRVSSNLGDNSSDANGYQAFPSRMPSKCQGPAIMQVREDFFHYKT
GIYRHVTSTNKESEKYRKLQTHAVKLTGSLEEVDSSPQGLLWVDFETLVEKETADVVEIE
KQLELEVEPEVLTEFLQFYFIKESPRPPNRPQCVMVPSLCPCVLIVQLSLIAANSWGKSW
GENGYFRILRGVNESDIEKLIIAAWGQLTSSDEP

>gi568815592f:54208551_54489922|GENSCAN_predicted_CDS_2|1905_bp
atgctctttcacacgtgccagcagcagcagcagcagcagcggggtgggctgctcactcac
ctgctagggcagagcctagctggttcagcgggcgccatcctccatgctggtattcacaga
ggcattggtggcagcataggaaggcagggctgctggcatccatgtgtgcatttgcaccag
cagcagtgtcagtgcaaaacatgggtcattggtggcctccatgcatgcattcaccccaat
ggtggtggtggtggtgcagagttgaggggctgggctgttgctgtccatttgcatgtctgc
actggcagcaatggtggtacagggtggcgacgggcctctggtctccgtgtgtgtgcatgc
attcatgctggtaatggcagcaaggctgggagagaacaaggtgtgctcatggtggcagta
gtggcattgcaagaaatctggatggagaagcagtatttatctcaaagagaagtggaccta
gaggcttatttcactaggaatcacaccgttttgcaaggtactcgattcaaaagagccatt
ttccaagggcaatactgtagaaattttggctgttgtgaagacagagatgatggctgtgtc
actgagttctatgcggcgaatgcgttgtgctactgtgataaattctgtgacagagaaaat
tctgattgctgtcctgactacaagtccttttgccgtgaagagaaagaatggcctcctcac
acacagccttggtatccagaaggcagatggacagcacagaattacagccaattttgggga
atgactttagaagatggttttaaatttcgccttggcactttgccacctagtcccatgctc
ctgagcatgaatgaaatgacagatcacaactccttgcaagcaagtgaacaaaattggaca
gagaatgagtttgatgaattgtcagaagtaggcttcagaaggtggataataacaaactcc
tccgagctaagggagcatgttctaactgaatgtgtggctgctgaccgaatagcaattcag
tctaagggtcgatacacggccaatctatcccctcagaatttgatctcttgctgtgccaag
aaccgtcatggatgcaatagtggaagcatcgatagggcttggtggtacctgagaaaacgt
ggactggtatcccacgcatgctacccacttttcaaagaccaaaatgctaccaacaatgga
tgtgccatggcaagcaggtctgatgggcgaggaaaacggcatgccacgaagccatgtccc
aacaacgtagaaaaatctaacaggatctatcaatgttctcctccatacagagtctcttcc
aacctaggtgataattcctctgatgcaaatggttatcaagcctttccatctagaatgcca
agcaaatgtcaaggcccagccataatgcaagtccgtgaagatttcttccattataagaca
gggatatacagacatgttaccagcacaaataaagaatcagaaaaatatcgaaagcttcag
acacatgcagtcaaactcactgggagtttggaagaagttgattccagtcctcagggatta
ctgtgggtggatttcgagactttagtggagaaagaaactgcagatgtggtggaaatagaa
aaacaactagaattagaagtagagcctgaagttttgactgaatttctgcaattttatttt
ataaaggaaagcccccgacccccaaacaggccccagtgtgtgatggtcccttccctgtgc
ccatgtgttctcattgttcaactctcacttattgctgccaattcctggggaaagtcatgg
ggagagaatggctatttcaggattcttcgaggagtaaatgagtccgacattgaaaagttg
attatcgcagcttggggccaactgacgagttctgatgaaccataa

>gi568815592f:54208551_54489922|GENSCAN_predicted_peptide_3|878_aa
MKLNMHSIRSKIDTLTSQLKELEKREQTHSKASRRQEITKIRAELKDIKTQKTLQKINES
RSWFFERINKIDRPLARLIKKKREKNQIDSIKNDKGDITTDPTEIQTTIREYYKHLYANK
LENLEEMDKFLDTYTLPRLNQEEVESLNRPITGSETVAIINSLPTKKSPGPDGFTAEFYQ
RYKEELVPFLLKLFQSIDKERILPNSFYEATIILIPKPGRDTTKEENFTPISLMNIDAKI
LNKILVNRIQQHIQKLIHHDQVGFIPGMQGWFNIHKSINVIQHINRTKDKNHMIISIDAE
KAFDKIQLPFMLKTLNKLVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPI
VSAQNLLKLISNFSKVSGYKINVQKSQAFLYINNRQTESQIMSELPFTIASKIVKYLGIQ
LSRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRSSIVKMAILPKVMYRFNAIPIKL
PMTFFIELEKTTLKFIWNQKRARIAKSILSQKNKAGGIMLPDFKLYYKATVTKTAWYWYQ
NRDIDQWNRTEPSEIIPHIYNHLIFEKPDKNKQWGKDSLFNKWCWEKWLAICRKLKLDPF
LTPYAKINSRWIKDLNVRAKTIKTLDENLGLTIQDIGMGKDFMSKTPKAMATKAKVDKWD
LIKLKSFSTAKETTIRVNRQHTEWENIFATYLSDKGLISRICNELKQIYKKKTNNPIKKW
AKHMNRHFSKEDIYAAKKHMKKCSPSLAIREMQIKTAMRYHLTPVRMAIIKKSGKNRCWR
GCGEIGTLLHCWLDCKLVQPLWKSVWRFLRDLELEIPFDPAIPLLGIYPKDYKSCCYKDT
CTRILMFGSENVDADLTQDDFEGELLALKPPAGCDCRE

>gi568815592f:54208551_54489922|GENSCAN_predicted_CDS_3|2637_bp
atgaaactcaatatgcattcaataagatccaaaattgacaccctaacatcacaattaaaa
gaactagaaaagcgagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa
atcagagcagaactgaaggatataaagacacaaaaaacccttcaaaaaattaatgaatcc
aggagctggttttttgaaaggatcaacaaaattgataggccgctagcaagactaataaag
aagaaaagagagaagaatcaaatagactcaataaaaaatgataaaggggatatcaccacc
gatcccacagaaatacaaactaccatcagagaatactacaaacacctctatgcaaataaa
ctagaaaatctagaagaaatggataaattcctcgacacatacaccctcccaagactaaac
caggaagaagttgaatctctgaatagaccaataacaggctctgaaactgtggcaataatc
aatagcttaccaaccaaaaagagtccaggaccagatggattcacagctgaattctaccag
aggtacaaggaggaactggtaccattccttctgaaactattccaatcaatagataaagag
agaatcctccctaactcattttatgaggccaccatcatcctgataccaaagcctggcaga
gacacaaccaaagaagagaattttacaccaatatccttgatgaacattgatgcaaaaatc
ctcaataaaatactggtaaaccgaatccagcagcacatccaaaagcttatccaccatgat
caagtgggcttcatccctgggatgcaaggctggttcaatatacacaaatcaataaatgta
atccagcatataaacagaaccaaagacaaaaaccacatgattatctcaatagatgcagaa
aaggcctttgacaaaattcaactacccttcatgctaaaaactctcaataaattagtgttg
gaagtcctggccagggcaattaggcaggagaaggaaatcaagggtattcaattaggaaaa
gaggaagtcaaattgtccctgtttgcagatgacatgattgtatatctagaaaaccccatt
gtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaa
atcaatgtacaaaaatcacaagcattcttatacatcaataacagacaaacagagagccaa
atcatgagtgaactcccattcacaattgcttcaaagatagtaaaatacctaggaatccaa
ctttcaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaacgaaata
aaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaagcagtatc
gtgaaaatggccatactgcccaaggtaatgtatagattcaatgccatccccatcaagcta
ccaatgactttcttcatagaattggaaaaaactactttaaagttcatatggaaccaaaaa
agagcccgcattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcatgcta
cctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaa
aacagagatatagaccaatggaacagaacagagccctcagaaataataccacatatctac
aaccatctgatctttgaaaaacctgacaaaaacaagcaatggggaaaagattccctattt
aataaatggtgctgggaaaagtggctagccatatgtagaaagctgaaactggatcccttc
cttacaccttatgcaaaaattaattcaagatggattaaagacttaaacgttagagctaaa
accataaaaaccctagatgaaaacttaggactaaccattcaggacataggcatgggcaag
gacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaagttgacaaatgggat
ctaattaaactaaagagcttctccacagcaaaagaaactaccatcagagtgaacaggcaa
catacagaatgggagaacatttttgcaacctacttatctgacaaagggctaatatccaga
atttgcaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgg
gcgaagcacatgaacagacacttctcaaaagaagacatctatgcagccaaaaaacacatg
aaaaaatgctcaccatcactggccatcagagaaatgcaaatcaaaaccgcaatgagatac
catctcacaccagttagaatggcgatcattaaaaagtcaggaaaaaacaggtgctggaga
ggatgtggagaaataggaacacttttacactgttggttggactgtaaactagttcaaccc
ttgtggaagtcagtgtggagattcctcagggatctagaactggaaataccatttgaccca
gccatcccattactgggtatatacccaaaggactataaatcatgctgctataaagacaca
tgcacacgcatcctgatgtttggttcagaaaatgtggacgctgacctcacacaggatgac
tttgagggagagctgcttgctcttaagccacctgcaggatgtgattgtagggagtga

>gi568815592f:54208551_54489922|GENSCAN_predicted_peptide_4|348_aa
MHCEMSASSCSLTVLELHQLDLNNYQRHMHQRPKGSELPSGPRHIPRCYPRALKSACGEY
CLAWSSLFRAVGSLLAQGGPKMLPKIQVLKSGTPRTHLVLYSPVAVLDGVKAQMLFMDKE
QISRLATADSLAQNTYSAQNLLKLISNFRKASGYKINVQKSQAFLYTNNRQTESQIMSEL
PFKIFSKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNRWKNIPCSWIGRINIMKMAI
LPKVIYRFNAIPIKLPMTFFTELEKTILKFIWNQKRACTAKSILSQKNKAGSIMLPDFKL
YYKATVTKTAWYWYQNRDIDQWNRTEPSEIIPHIYNHLIFDKPEKNKQ

>gi568815592f:54208551_54489922|GENSCAN_predicted_CDS_4|1047_bp
atgcactgtgagatgtctgcctcctcctgctcactcactgtgctagagctacaccagttg
gatctcaataactatcagcgtcacatgcaccaaaggcccaagggcagtgagctcccttct
ggcccacggcacatcccaagatgctacccaagggctctgaagtcagcttgtggtgaatac
tgtctggcctggagctcactcttcagggcagtgggctcccttctggcccagggaggtcca
aagatgctgcctaagattcaagtcctgaaatcagggaccccaagaacccacctggtgctc
tactcccctgtggctgtgctggatggggtgaaggctcagatgctcttcatggacaaggaa
caaatctccaggttggccactgctgattccttagctcagaatacatattcagcccaaaat
ctccttaagctgataagcaactttagaaaagcttcaggatacaaaatcaatgtgcaaaaa
tcacaagcattcttatacaccaataacagacaaacagagagccaaatcatgagtgaactc
ccattcaaaattttttcaaagagaataaaatacctaggaatccaacttacaagggatgtg
aaggacctcttcaaggagaactacaaaccactgctcaacgaaataaaagaggacacaaac
agatggaagaacattccatgctcatggataggaagaatcaatatcatgaaaatggccata
ctgcccaaggtaatttacagattcaatgccatccccatcaagctaccaatgactttcttt
acagaattggaaaaaactattttaaagttcatatggaaccaaaaacgagcctgcactgct
aagtcaatcctaagccaaaagaacaaagctggaagcatcatgctacctgacttcaaacta
tactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagac
caatggaacagaactgagccctcagaaataataccacacatctacaaccatctgatcttt
gacaaacctgagaaaaacaagcaatag

>gi568815592f:54208551_54489922|GENSCAN_predicted_peptide_5|30_aa
MGLSVGQTVIILVKCPLLDCLEAQQRFQSE

>gi568815592f:54208551_54489922|GENSCAN_predicted_CDS_5|93_bp
atgggcctatcagttggacagacggtgataattctggttaagtgtcctcttctggactgc
ctggaagcacaacagcgctttcagtctgaataa

>gi568815592f:54208551_54489922|GENSCAN_predicted_peptide_6|70_aa
MKHRSKGKKYMNNFNSQNAHAFENFSSFCQIYFALVCDGGVTRFFSAQLLKPLGELKDRQ
SVGLRSHSRF

>gi568815592f:54208551_54489922|GENSCAN_predicted_CDS_6|213_bp
atgaaacatcgaagtaaaggaaagaagtacatgaataacttcaattctcagaatgcccat
gcatttgagaacttttcctctttctgccaaatctactttgcactggtatgtgatgggggt
gtgactcgcttcttcagtgcccagctgctcaaacctctaggggagcttaaggacaggcag
tctgtggggctccgatctcacagcaggttctag

>gi568815592f:54208551_54489922|GENSCAN_predicted_peptide_7|168_aa
XFLLPEPAEGHLQQQPDTKAVLNRKVLRTGTLYIAESHLSWLDSSGLGFSLEYPTISLLA
LSRDQSDCLGEHLYATVNDKFEESKESVADEEEEDSDDVELITEFIFVPSDKSALGAMFT
AMCECQALHPDPEDEDEDDYDGEEYDVEAHERGKGDIPKSYTYEGLSH

>gi568815592f:54208551_54489922|GENSCAN_predicted_CDS_7|507_bp
nntttcctgctacctgagccagcagaggggcacctgcagcagcagccagacaccaaggct
gtgctgaacaggaaggtcctccgcactggtaccctttatatcgctgagagccacctgtct
tggttagatagctctggattaggattctcactggaataccccaccattagtttacttgca
ttatccagggaccaaagtgactgtctaggagaacatttgtatgctacggtgaatgacaaa
tttgaagaatccaaagaatctgttgctgatgaagaagaggaagacagtgatgatgttgaa
cttattactgaatttatatttgtacctagtgataaatcagcactgggggcaatgttcact
gcaatgtgtgaatgccaggccttgcatccagatcctgaggatgaggatgaggatgactac
gatggagaagaatatgatgtggaagcacatgaacgaggaaaaggggacatccctaaatct
tacacctatgaaggattatcccattaa