GENSCAN 1.0	Date run: 6-Nov-116	Time: 00:37:43

Sequence gi568815590f:42597547_42836615 : 239069 bp : 44.68% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  29845  30061  217  0  1   91   58   109 0.072   6.28
 1.02 Term +  48065  48354  290  0  2   18   47   196 0.276   4.04
 1.03 PlyA +  48769  48774    6                               1.05

 2.00 Prom +  49785  49824   40                              -4.56
 2.01 Init +  59968  60014   47  0  2   73   52    37 0.080  -1.25
 2.02 Intr +  62217  62364  148  0  1   91   61    48 0.066   2.64
 2.03 Intr +  80995  81060   66  0  0  105   66    44 0.038   3.00
 2.04 Intr +  89262  89421  160  2  1   -4   86   161 0.061   6.26
 2.05 Term +  93769  94067  299  2  2   94   49   164 0.864   8.53
 2.06 PlyA +  95385  95390    6                               1.05

 3.00 Prom +  97263  97302   40                              -4.56
 3.01 Init + 100001 100052   52  1  1   71  115    45 0.476   6.47
 3.02 Intr + 111171 111322  152  1  2   77   89    47 0.440   3.58
 3.03 Intr + 133048 133157  110  0  2   43  106    56 0.182   1.98
 3.04 Intr + 134121 135003  883  0  1   49  109   688 0.925  58.33
 3.05 Term + 138938 139072  135  1  0  117   49    33 0.895   0.32
 3.06 PlyA + 140984 140989    6                               1.05

 4.00 Prom + 143026 143065   40                              -2.56
 4.01 Init + 146802 146958  157  2  1   75   79    38 0.574   1.67
 4.02 Intr + 148262 148429  168  0  0   50   68   100 0.379   4.12
 4.03 Term + 156224 156378  155  0  2   87   34    55 0.124  -1.92
 4.04 PlyA + 156637 156642    6                              -0.45

 5.07 PlyA - 157471 157466    6                               1.05
 5.06 Term - 158254 158201   54  1  0   77   54    50 0.421  -1.94
 5.05 Intr - 159278 158300  979  1  1   72   84   472 0.530  36.14
 5.04 Intr - 159491 159382  110  2  2   60   27   101 0.609   0.28
 5.03 Intr - 164288 164010  279  2  0    2   53   236 0.599   9.17
 5.02 Intr - 167658 167519  140  1  2  122   89   143 0.951  17.98
 5.01 Init - 170884 170806   79  1  1   72  100    43 0.918   3.12
 5.00 Prom - 173460 173421   40                              -4.86

 6.07 PlyA - 175283 175278    6                               1.05
 6.06 Term - 175313 175293   21  2  0  104   49     4 0.008  -3.59
 6.05 Intr - 190305 190119  187  2  1   62   55   110 0.010   4.79
 6.04 Intr - 216955 216875   81  0  0  100   89    40 0.417   4.05
 6.03 Intr - 217192 217109   84  0  0   92   33   109 0.389   4.74
 6.02 Intr - 219190 219123   68  1  2   45   97    51 0.365  -0.60
 6.01 Init - 225766 225692   75  1  0   52  108    47 0.477   4.19

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  80995  81087   93  0  0  105   49    89 0.828   4.53
S.002 Sngl +  87828  88091  264  2  0   69   43   183 0.887   7.10
S.003 Term - 128264 127998  267  2  0   10   41   253 0.858   8.29

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590f:42597547_42836615|GENSCAN_predicted_peptide_1|168_aa
RGSSLREKSRGKQQVLVHEKRGHCGYSSEGMTHGEGCSKSLIPPTTNIQDYKYVAKVRYP
LVAHFVIKVNLFYPFHKLEEVDIEGVFNINNEASITHSLTNGEIVEMVLNPGDRGNSDEE
DDIVNTAEKVPVDEMMKTCTGLIKGLEQCRFITEQEIIMQFIKSKGDI

>gi568815590f:42597547_42836615|GENSCAN_predicted_CDS_1|507_bp
agagggtcaagtttacgagagaaatccagggggaaacagcaagtcctggttcatgagaaa
cggggacactgtgggtattcatcagagggcatgacacacggggagggctgttcaaagagc
cttattccccctaccacaaacatacaggactacaaatatgtggcaaaagtgcgctaccct
ttggtagcacactttgtaataaaagtcaacctcttctacccctttcataagctggaagaa
gtggatattgaaggagtttttaacataaataatgaggcttccatcactcattcattgacc
aatggtgaaattgttgaaatggttctgaatccaggtgatcgtggtaacagtgatgaggaa
gatgacattgttaacactgcagaaaaagtgcctgttgatgagatgatgaaaacatgcact
gggcttattaaaggactagagcagtgcagattcataacagaacaagaaatcatcatgcag
tttataaaatcaaagggagacatctaa

>gi568815590f:42597547_42836615|GENSCAN_predicted_peptide_2|239_aa
MSDHHLGKNANPRNLDCLPDFSKATGPLLQDFSHGAAEEGDLWGDGTVCSGHQVASRFWG
DSARRDQIQHAGDYSPIPRRYLTVVQKRIKYVGIQIKRDVKDLFKENYKPLLNEIKEDTN
KHSMLMDRKNQYCENGHTAQALGVHLLWTLGWLTGARSPIHPQPVWPIRTQSCWDKASSN
PKPIVVEKQTLVPGQSEQDTSWSSDQAAALRNGGGGEDKVWTLLSVRQDEGKPQKRNND

>gi568815590f:42597547_42836615|GENSCAN_predicted_CDS_2|720_bp
atgagtgaccatcacctggggaaaaacgcaaacccaagaaaccttgactgcttgcctgac
tttagcaaagccactggcccccttctccaggacttttctcatggagctgctgaggaaggt
gacctctggggtgatggtactgtgtgctcaggccatcaggtggcttctcgcttctggggt
gacagtgctagaagggatcaaatccagcatgcaggtgactacagtcccatcccccggcgt
tatctcacagtggtacagaagagaataaaatacgtaggaatccaaattaaaagggatgtg
aaggacctcttcaaggagaactacaaaccgctgctcaacgaaataaaggaggacacaaac
aaacattccatgctcatggataggaagaatcaatattgtgaaaatggccatactgcccaa
gctctgggcgtccatctgctgtggacactggggtggctgaccggggccagaagcccaata
cacccgcagcctgtttggcccatccgcacccagagttgctgggacaaagcttcttcaaac
ccaaagcctatcgtggtggaaaagcagactttagtgcctggccagtctgagcaggacact
tcctggtcctccgaccaggctgcagccctgaggaacggtggcggcggcgaggacaaagtg
tggacgctcctgtcagtcagacaagatgaggggaagcctcaaaaaagaaacaacgactga

>gi568815590f:42597547_42836615|GENSCAN_predicted_peptide_3|443_aa
MLPDFMLVLIVLGIPSSATTGFNSIAENEDALLRHLFQGYQKWVRPVLHSNDTIKVYFGL
KISQLVDVEWTDHKLRWNPDDYGGIHSIKVPSESLWLPDIVLFENADGRFEGSLMTKVIV
KSNGTVVWTPPASYKSSCTMDVTFFPFDRQNCSMKFGSWTYDGTMVDLILINENVDRKDF
FDNGEWEILNAKGMKGNRRDGVYSYPFITYSFVLRRLPLFYTLFLIIPCLGLSFLTVLVF
YLPSDEGEKLSLSTSVLVSLTVFLLVIEEIIPSSSKVIPLIGEYLLFIMIFVTLSIIVTV
FVINVHHRSSSTYHPMAPWVKRLFLQKLPKLLCMKDHVDRYSSPEKEESQPVVKGKVLEK
KKQKQLSDGEKVLVAFLEKAADSIRYISRHVKKEHFISQVVQDWKFVAQVLDRIFLWLFL
IVSVTGSVLIFTPALKMWLHSYH

>gi568815590f:42597547_42836615|GENSCAN_predicted_CDS_3|1332_bp
atgctcccagattttatgctggttctcatcgtccttggcatcccttcctcagccaccaca
ggtttcaactcaatcgccgaaaatgaagatgccctcctcagacatttgttccaaggttat
cagaaatgggtccgccctgtattacattctaatgacaccataaaagtatattttggattg
aaaatatcccagcttgtagatgtggaatggacagaccacaagttacgctggaatcctgat
gattatggtgggatccattccattaaagttccatcagaatctctgtggcttcctgacata
gttctctttgaaaatgctgacggccgcttcgaaggctccctgatgaccaaggtcatcgtg
aaatcaaacggaactgttgtctggacccctcccgccagctacaaaagctcctgcaccatg
gacgtcacgtttttcccgttcgaccgacagaactgctccatgaagtttggatcctggact
tatgatggcaccatggttgacctcattttgatcaatgaaaatgtcgacagaaaagacttc
ttcgataacggagaatgggaaatactgaacgcaaaggggatgaaggggaacagaagggac
ggcgtgtactcctatccctttatcacgtattccttcgtcctgagacgcctgcctttattc
tataccctctttctcatcatcccctgcctggggctgtctttcctaacagttcttgtgttc
tatttaccttcggatgaaggagaaaaactttcattatccacatcggtcttggtttctctg
acagttttccttttagtgattgaagaaatcatcccatcgtcttccaaagtcattcctctc
attggagagtacctgctgttcatcatgatttttgtgaccctgtccatcattgttaccgtg
tttgtcattaacgttcaccacagatcttcttccacgtaccaccccatggccccctgggtt
aagaggctctttctgcagaaacttccaaaattactttgcatgaaagatcatgtggatcgc
tactcatccccagagaaagaggagagtcaaccagtagtgaaaggcaaagtcctcgaaaaa
aagaaacagaaacagcttagtgatggagaaaaagttctagttgcttttttggaaaaagct
gctgattccattagatacatttcgagacatgtgaagaaagaacattttatcagccaggta
gtacaagactggaaatttgtagctcaagttcttgaccgaatcttcctgtggctctttctg
atagtgtcagtaacaggctcggttctgatttttacccctgctttgaagatgtggctacat
agttaccattag

>gi568815590f:42597547_42836615|GENSCAN_predicted_peptide_4|159_aa
MASPVENNWYLNVFRMLYKNTIDWVASKLQKFIADSSGDWEAQGKVLAGLVCGAVFFPDP
PSSGLINLDDQDTPQSFILVIILSLKNQMIKHLRPHPAHGFEGSRMELEGRCTASVSASD
SRVCGKALTPWVTRDFPNCKKRSREATAQRARLILCNGG

>gi568815590f:42597547_42836615|GENSCAN_predicted_CDS_4|480_bp
atggcatctccagttgagaataactggtatctaaatgtattcaggatgctatataaaaat
accatagattgggtggcttctaaactgcagaaatttattgctgacagttctggagactgg
gaagcccaaggaaaggtgctggcaggtctggtgtgtggtgcggtgtttttcccagacccg
ccctcaagtggcttaataaacctcgatgatcaagacacgcctcagtccttcattttggtc
atcattctttctctgaaaaatcagatgatcaagcacctgaggccacatccagcacatggc
tttgagggctcccgaatggagctggagggcagatgcacagcctctgtgagtgcctcagac
agcagagtctgcggcaaagccctgactccgtgggtcaccagggactttcccaactgcaag
aaacgctccagagaagcaactgctcagagagctagactcattctttgcaacggtgggtag

>gi568815590f:42597547_42836615|GENSCAN_predicted_peptide_5|546_aa
MLTSKGQGFLHGGLCLWLCVFTPFFKGCVGCATEERLFHKLFSHYNQFIRPVENVSDPVT
VHFEVAITQLANVPLMQLLCQLIDFSKKEVQGDGTRNGVDAVGLWTPPAEMSDSSAMPIG
GIWNWMEREYDLPDRSCSIGSADISEGVHIPMGHVHTTWSFSEPAEIWNDYKLRWDPMEY
DGIETLRVPADKIWKPDIVLYNNAVGDFQVEGKTKALLKYNGMITWTPPAIFKSSCPMDI
TFFPFDHQNCSLKFGSWTYDKAEIDLLIIGSKVDMNDFWENSEWEIIDASGYKHDIKYNC
CEEIYTDITYSFYIRRLPMFYTINLIIPCLFISFLTVLVFYLPSDCGEKVTLCISVLLSL
TVFLLVITETIPSTSLVVPLVGEYLLFTMIFVTLSIVVTVFVLNIHYRTPTTHTMPRWVK
TVFLKLLPQVLLMRWPLDKTRGTGSDAVPRGLARRPAKGKLASHGEPRHLKECFHCHKSN
ELATSKRRLSHQPLQWVVENSEHSPEVEDVINSVQFIAENMKSHNETKEPMGTLDAQSDL
QCDFCS

>gi568815590f:42597547_42836615|GENSCAN_predicted_CDS_5|1641_bp
atgctgaccagcaaggggcagggattccttcatgggggcttgtgtctctggctgtgtgtg
ttcacacctttctttaaaggctgtgtgggctgtgcaactgaggagaggctcttccacaaa
ctgttttctcattacaaccagttcatcaggcctgtggaaaacgtttccgaccctgtcacg
gtacactttgaagtggccatcacccagctggccaacgtgcctttgatgcagctgctgtgt
caattaattgatttttccaagaaggaggtacagggtgatggcaccaggaatggagttgat
gctgtgggtctttggactcctccggcagagatgtctgatagcagtgccatgcccattggt
gggatctggaattggatggagagagagtatgacttgccagacaggtcctgctccataggg
tcagctgacatctctgaaggtgtccacatccccatgggccatgtccataccacctggagc
ttctccgagcctgcggaaatctggaatgattataaattgcgctgggatccaatggaatat
gatggcattgagactcttcgcgttcctgcagataagatttggaagcccgacattgttctc
tataacaatgctgttggtgacttccaagtagaaggcaaaacaaaagctcttcttaaatac
aatggcatgataacctggactccaccagctatttttaagagttcctgccctatggatatc
acctttttcccttttgatcatcaaaactgttccctaaaatttggttcctggacgtatgac
aaagctgaaattgatcttctaatcattggatcaaaagtggatatgaatgatttttgggaa
aacagtgaatgggaaatcattgatgcctctggctacaaacatgacatcaaatacaactgt
tgtgaagagatatacacagatataacctattctttctacattagaagattgccgatgttt
tacacgattaatctgatcatcccttgtctctttatttcatttctaaccgtgttggtcttt
taccttccttcggactgtggtgaaaaagtgacgctttgtatttcagtcctgctttctctg
actgtgtttttgctggtcatcacagaaaccatcccatccacatctctggtggtcccactg
gtgggtgagtacctgctgttcaccatgatctttgtcacactgtccatcgtggtgactgtg
tttgtgttgaacatacactaccgcaccccaaccacgcacacaatgcccaggtgggtgaag
acagttttcctgaagctgctgccccaggtcctgctgatgaggtggcctctggacaagaca
aggggcacaggctctgatgcagtgcccagaggccttgccaggaggcctgccaaaggcaag
cttgcaagccatggggaacccagacatcttaaagaatgcttccattgtcacaaatcaaat
gagcttgccacaagcaagagaagattaagtcatcagccattacagtgggtggtggaaaat
tcggagcactcgcctgaagttgaagatgtgattaacagtgttcagttcatagcagaaaac
atgaagagccacaatgaaaccaaggagcctatgggcactctggatgcacagtcagacctt
cagtgtgacttttgttcctga

>gi568815590f:42597547_42836615|GENSCAN_predicted_peptide_6|171_aa
MHRRTSLSNGSSINMIGPKHVLKAQRDLDKCFRLEEAHDTSPWVKICKAVCSGGITPSQR
ECELKCTGSLDRAGAELFALQVWRLPASDSYFGTGAHGLTSTCGVRGLAGSGVKLLTLTV
SVTALKAGRLELFILPGGFVVSLASGVKLQTFVVSVTAHKGSVDPKTSKLK

>gi568815590f:42597547_42836615|GENSCAN_predicted_CDS_6|516_bp
atgcatagaagaacttcattatcaaatggaagtagtataaacatgattgggcccaagcac
gtcctaaaggcacaaagagatctagacaaatgcttccgattagaagaggctcatgatact
tctccgtgggtcaagatttgtaaagcggtgtgctcgggcggcatcacacccagccagcgc
gagtgtgaactcaagtgcacgggcagccttgaccgggctggagctgagctgtttgcactg
caggtttggcgcctgcctgcatctgattcctactttgggacaggagcccatgggctgacg
agcacgtgtggggttcgtggtctcgctggctcaggagtgaagctgctgaccctcactgtg
agtgttacagctcttaaggcggggcgtctggagttgttcattcttcctggtgggttcgtg
gtctcactggcttcaggagtgaagctacagaccttcgtggtgagtgttacagctcataaa
ggcagtgtggacccaaaaacctccaaattgaaatga