GENSCAN 1.0 Date run: 4-Nov-116 Time: 00:58:16 Sequence gi568815588r:12798438_13101570 : 303133 bp : 44.72% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 7519 7634 116 0 2 98 47 58 0.249 0.49 1.02 Intr + 15758 15870 113 2 2 90 82 232 0.993 22.82 1.03 Intr + 17813 17891 79 0 1 68 91 99 0.893 6.81 1.04 Intr + 26028 26115 88 0 1 66 89 177 0.918 15.57 1.05 Intr + 27136 27253 118 0 1 57 36 76 0.559 -0.66 1.06 Term + 28390 28607 218 2 2 67 55 91 0.279 1.01 1.07 PlyA + 29839 29844 6 1.05 2.04 PlyA - 30587 30582 6 -0.45 2.03 Term - 32435 32272 164 0 2 45 48 147 0.753 4.50 2.02 Intr - 35219 35056 164 1 2 75 78 32 0.676 0.52 2.01 Init - 35807 35779 29 2 2 81 109 40 0.531 4.77 2.00 Prom - 37993 37954 40 -6.56 3.00 Prom + 53026 53065 40 -4.36 3.01 Init + 58225 58370 146 0 2 88 71 103 0.205 8.19 3.02 Intr + 69902 70098 197 2 2 76 96 94 0.836 8.06 3.03 Intr + 79607 79767 161 0 2 73 103 -9 0.069 -1.19 3.04 Term + 83967 84089 123 2 0 16 54 135 0.152 1.28 3.05 PlyA + 84315 84320 6 -0.45 4.08 PlyA - 84515 84510 6 1.05 4.07 Term - 86038 85926 113 2 2 75 44 102 0.866 3.22 4.06 Intr - 89695 89511 185 0 2 88 54 118 0.990 7.83 4.05 Intr - 92136 91997 140 0 2 72 95 48 0.949 3.16 4.04 Intr - 97777 97670 108 1 0 51 80 72 0.826 3.18 4.03 Intr - 99582 99411 172 2 1 -36 72 140 0.835 -0.05 4.02 Intr - 100242 99983 260 0 2 106 49 556 0.849 49.76 4.01 Init - 144517 144440 78 1 0 74 64 91 0.547 6.36 4.00 Prom - 154655 154616 40 -2.56 5.00 Prom + 165974 166013 40 -1.86 5.01 Init + 175544 175670 127 1 1 80 44 96 0.739 4.72 5.02 Intr + 176981 177142 162 0 0 82 37 124 0.543 6.65 5.03 Intr + 183238 183351 114 2 0 73 87 62 0.906 5.02 5.04 Term + 186116 186234 119 0 2 49 41 104 0.807 0.50 5.05 PlyA + 187795 187800 6 1.05 6.04 PlyA - 191352 191347 6 1.05 6.03 Term - 199738 199646 93 1 0 111 49 26 0.367 -1.17 6.02 Intr - 200075 199901 175 1 1 83 90 80 0.991 7.64 6.01 Init - 203133 202760 374 0 2 79 99 686 0.996 63.24 6.00 Prom - 204330 204291 40 -6.06 7.04 PlyA - 205365 205360 6 -0.45 7.03 Term - 206096 205857 240 2 0 8 49 128 0.452 -3.07 7.02 Intr - 207724 207613 112 0 1 78 91 108 0.486 10.48 7.01 Init - 227471 227464 8 2 2 114 100 0 0.320 4.30 7.00 Prom - 229817 229778 40 -4.26 8.00 Prom + 229946 229985 40 -6.46 8.01 Sngl + 230938 231402 465 0 0 88 43 481 0.740 39.75 8.02 PlyA + 231825 231830 6 1.05 9.00 Prom + 232363 232402 40 -4.96 9.01 Init + 232456 233035 580 0 1 49 16 171 0.809 1.32 9.02 Term + 234035 234606 572 0 2 62 46 212 0.290 9.00 9.03 PlyA + 234819 234824 6 1.05 10.00 Prom + 234936 234975 40 -4.46 10.01 Init + 234990 235295 306 2 0 71 19 174 0.082 6.10 10.02 Intr + 251294 251372 79 1 1 78 65 65 0.069 2.32 10.03 Term + 256742 256842 101 0 2 122 43 65 0.892 3.69 10.04 PlyA + 258230 258235 6 1.05 11.08 PlyA - 259120 259115 6 1.05 11.07 Term - 259917 259704 214 2 1 62 37 270 0.128 16.00 11.06 Intr - 261889 261768 122 1 2 70 83 33 0.093 0.29 11.05 Intr - 268830 268699 132 0 0 74 60 41 0.048 0.74 11.04 Intr - 273881 273697 185 0 2 76 103 29 0.003 2.71 11.03 Intr - 295867 295775 93 2 0 24 68 223 0.935 13.84 11.02 Intr - 297321 297193 129 1 0 16 83 114 0.622 4.37 11.01 Init - 299847 299769 79 0 1 46 105 76 0.971 6.32 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 18073 18097 25 1 1 104 54 15 0.824 -2.60 S.002 Term + 131229 131328 100 0 1 131 53 84 0.867 6.80 S.003 Sngl - 260024 259704 321 2 0 45 37 333 0.828 19.89 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:12798438_13101570|GENSCAN_predicted_peptide_1|243_aa MMVPSKLGEHQGGGLPWPGGIKKDITEHRPVHGALGESRLCGYPPFYDENDSKLFEQILK AEYEFDSPYWDDISDSAKDFIRNLMEKDPNKRYTCEQAARHPWIAGDTALNKNIHESVSA QIRKNFAKSKWRQAFNATAVVRHMRKLHLGSSLDSSNASVSSSLSLASQKDWNPTMDPFF TDSYTGIGSKARREHSLYGCLSPCLSTQTPWGPAPTSSVAAEASHIKAFDACTSSGGGLQ RSL >gi568815588r:12798438_13101570|GENSCAN_predicted_CDS_1|732_bp atgatggtgcctagcaaacttggagaacaccaaggtgggggcctgccctggccgggtggg atcaagaaagacatcactgagcatagacctgtgcacggggctttgggggagagcaggctc tgcggctaccctcctttttatgatgaaaatgactccaagctctttgagcagatcctcaag gcggaatatgagtttgactctccctactgggatgacatctccgactctgcaaaagacttc attcggaacctgatggagaaggacccgaataaaagatacacgtgtgagcaggcagctcgg cacccatggatcgctggtgacacagccctcaacaaaaacatccacgagtccgtcagcgcc cagatccggaaaaactttgccaagagcaaatggagacaagcatttaatgccacggccgtc gtcagacatatgagaaaactacacctcggcagcagcctggacagttcaaatgcaagtgtt tcgagcagcctcagtttggccagccaaaaagactggaaccccacgatggatcccttcttt acagattcatacactggcatcggcagcaaagcccgtcgggagcactccttgtatggctgt ttgtccccatgtctgagcacacagactccctgggggcctgcaccaacaagctcagtggcg gctgaggcctctcatatcaaggcttttgatgcctgcacatcctctgggggagggcttcag cgcagcctctga >gi568815588r:12798438_13101570|GENSCAN_predicted_peptide_2|118_aa MPRDHRTGARNPSRVLVKFCEAARALAEAAAVALLPPSALEQWFQRGLPNAKTQFWPRFW GRGRGVEVTKMTQPTGLPMTGQPEEEGMRTQPAALLGRDDADLPGHPRGLEALDGPES >gi568815588r:12798438_13101570|GENSCAN_predicted_CDS_2|357_bp atgccccgggaccaccgaacaggtgccaggaatcccagtcgtgtgcttgttaaattctgc gaagcagcgagggctctggctgaagctgcggcagtggccctacttccaccttcagcccta gagcaatggttccagcgtggcctccccaacgcgaagactcaattctggccaaggttctgg gggagagggagaggtgtagaagtgacaaagatgactcaaccaacaggacttcccatgact ggccagccagaggaagagggcatgaggacacagccagcagcgttactgggtcgtgatgac gcagacctgccgggacacccccgtgggctcgaggcattggatgggcctgagagctga >gi568815588r:12798438_13101570|GENSCAN_predicted_peptide_3|208_aa MGRNQSRKAENSENQSASSPPKDCSSLPAMEQSWTGNDFDELTEVDFRSLAGIRRDMKRE KTVVVVNSVVSSVYSGRLFIGVKRDAFHSPFAEAKIELSTKNSILTARTCFAGPGRWRIQ SHLSCGNLHQSNITQAGKDLIESSTSTHSFQDVETKAQRDKVTCPRSQAEGNAMWVHKEK IAVYKPEREPSPDRIFQHVDLDFQPPGL >gi568815588r:12798438_13101570|GENSCAN_predicted_CDS_3|627_bp atggggagaaaccagagcaggaaagctgaaaattctgaaaaccagagtgcctcttctcct ccaaaggattgcagctccttgccagcaatggaacaaagctggacagggaatgactttgac gagctgacagaagtagacttcagaagtcttgctggcatcagaagagacatgaaaagggag aaaacggtagttgtggtcaactctgtggtcagcagtgtttacagtggcaggctgtttata ggagtgaagagggatgccttccattctccctttgcagaggccaagattgaactctcaacg aagaacagcattcttactgccagaacgtgttttgctggcccagggcgttggagaattcaa tcccatctgtcatgtggaaatttacatcaatctaacattactcaagctggaaaggatctt atcgagtcttctacttcaacccattcatttcaagatgtggaaactaaggctcaaagagat aaagtgacttgtccaagatcccaggcagagggaaatgccatgtgggtgcataaggagaag attgctgtctacaagccagaaagggagccctcaccagatcgaatcttccagcatgttgat ctggacttccagcctccaggactgtga >gi568815588r:12798438_13101570|GENSCAN_predicted_peptide_4|351_aa MASFICGNTPRSLPHTKEIKDEDTQGLMCSSVQKALFEEEDHVKKLQQKVATLEKRNRQL RERVKKVKRSLRQARKKGRHLELANQKLSEKLAAGALPHINARGPVRPPYLRGPGQPLVL MSELTLVLEPAGTSLATSLEGDGGLASKMPENQHVQGLPIQPAFSFQLRQPACPGTISKI QTIVYFPSVVQGAQETSKLNSHNKSQVVELKPSIPPWLLAAFSSLPSVHPQHGCLLHQSV QTKRAKEKVCNKIGTFKDWMVAGKGNIITHKSEHPESNQTAFHKEPFYISEEGARSSTHP DHFGARHQMPTSLSAACPVDPIASVYPESTRFAPSLRPPAWSKPPVPVLVS >gi568815588r:12798438_13101570|GENSCAN_predicted_CDS_4|1056_bp atggcgtcattcatctgcggtaataccccacgttctttgcctcacaccaaggagatcaag gacgaggacacacaaggactcatgtgctcctcggtgcagaaggccttgtttgaggaggag gaccacgtcaagaaactgcagcagaaagtggccaccctggagaagcgcaaccggcagctc cgggagcgagtgaagaaggtcaagaggtccttgcggcaggcgcgtaagaagggccgccac ctggagctggcgaaccagaaactcagtgagaagctggcggcgggcgcgctgccgcacatc aatgcccgggggcccgtgcgccccccctacctgcggggtccaggacaaccattagtcctg atgagtgagctgacgctggtgctggaacctgctggcacctcactggccacatctttggaa ggggatggtggccttgcatccaagatgcctgaaaatcagcacgtgcagggcctccctatc cagccagcattttccttccagctgaggcagccagcctgtccgggcactatctccaagatc cagaccattgtttactttccctctgtggtacaaggagcccaggagacttccaaactcaac tcacataacaagagccaggttgttgaactgaagccctcaattcctccctggctgctggct gccttcagttccttgccaagtgtgcatccccagcatggctgcttgcttcatcaaagtgtg cagactaagagggcaaaagagaaagtctgcaacaagattgggactttcaaggattggatg gtggcaggaaaagggaacattatcacccacaaatcagagcacccagaaagtaaccagaca gcatttcacaaagaaccgttctacatctctgaggaaggggcccgatcctcaactcaccct gaccattttggtgccagacaccaaatgccaacaagcctgagtgcagcctgccctgttgac ccaatcgccagcgtgtatccagagtccacgcgctttgctccttctctaagaccaccagcc tggtccaagcccccggtgcctgtgctagtgtcatga >gi568815588r:12798438_13101570|GENSCAN_predicted_peptide_5|173_aa MAPMVVKIQQQEKYKHSKNQAAFQKGLEVPERSKVTVYSPFHERSKNKISFGSLLRGNKS TNSTHPWLFVNERKDGQINAEPPLLRFQDLQASLCAGAITEGIYEVVHGKSPVYLQSSDF SYYQIPSPMSYDGEISYTPMKNESLYPYKNLFIAALFITGKSGNKSNAKLLNE >gi568815588r:12798438_13101570|GENSCAN_predicted_CDS_5|522_bp atggcacctatggttgtgaagatccaacagcaagagaagtacaagcactccaagaaccag gcagcattccagaaaggcctagaggttccagaaaggtcaaaggtcacagtctatagcccc tttcatgaaaggagtaagaacaagatcagtttcggcagcctcttaagaggaaacaaatcc actaactcaactcacccctggctgtttgtcaatgagaggaaggatgggcaaatcaatgct gaaccacccctgctccggttccaggacttgcaagcctccctttgtgcaggtgctattact gagggcatctatgaagtagtccatggcaagagtcctgtttatttacaaagctctgacttt agttactaccaaattccatcccctatgagttacgatggagaaattagttacacacccatg aaaaatgaaagcttatatccgtacaaaaacctgttcatagcagcattatttataacaggc aagagtggaaacaaatcaaatgccaaactgctgaatgaataa >gi568815588r:12798438_13101570|GENSCAN_predicted_peptide_6|213_aa MLRQLLLAALCLAGPPAPARACQLPSEWRPLSEGCRAELAETIVYARVLALHPEAPGLYN HLPWQYHAGQGGLFYSAEVEMLCDQAWGSMLEVPAGSRLNLTGLGYFSCHSHTVVQDYSY FFFLRMDENYNLLPHGVNFQDAIFPDTQENRRMFSSLFQFSNCSQGQQLATFSSDWEIQE DSRLLKKTIHKLWIPEALLLDKEFFPKYENKMR >gi568815588r:12798438_13101570|GENSCAN_predicted_CDS_6|642_bp atgctgcgccagctgctgctcgccgcgctctgcctggcgggtcccccagcgcccgcgcgc gcctgccagctgccctccgagtggaggcccctgagcgagggctgccgcgccgagctggcc gagaccatcgtgtacgccagggtgctggcgctgcaccccgaggcgcccggcctctacaac cacctgccctggcagtaccacgccggccaggggggcctcttctactcggccgaggtcgag atgctgtgcgaccaggcgtggggcagcatgctggaggtgcccgccggctccaggctcaac ctcaccggcctgggctacttctcgtgccactcccacaccgtggtccaggactactcctat ttcttcttcctcaggatggatgaaaattataacctcttgcctcacggagtcaatttccaa gatgccatcttcccagacactcaagagaacagaaggatgttttctagccttttccagttt tcaaactgttcgcaagggcagcagctggcgactttctccagtgactgggaaatccaggaa gacagtaggctgttaaaaaagaccattcataaactttggatccctgaagctttactgttg gataaggagttcttcccaaagtatgagaacaagatgagatga >gi568815588r:12798438_13101570|GENSCAN_predicted_peptide_7|119_aa MPGDVDCLDVPQNITLDYLMDDSMMNGSDVHETPYTLEALTHSGHGYAFPAIMLLPVAPS MDSWGTLSTIMEFPIVMFLINDSHSIASFTAMKCSSGLKAIELTGFTTCPITWKQLDKF >gi568815588r:12798438_13101570|GENSCAN_predicted_CDS_7|360_bp atgcccggagatgttgattgtcttgacgttccacaaaacatcacattggattacctcatg gatgacagtatgatgaatggatctgatgttcatgaaacaccatatactttagaggccttg acacattctggacacggatatgccttccctgccataatgcttctgccagtagcaccatct atggactcatggggcaccttatccaccatcatggaatttcccatagtgatgtttctgatc aatgattcccacagcattgcttctttcacagcaatgaagtgcagcagtggtctcaaggcc atagaattaactggttttaccacatgtcccatcacctggaagcaactggacaaattttga >gi568815588r:12798438_13101570|GENSCAN_predicted_peptide_8|154_aa MGRNQSRKAENSKNRNASSPPKECSSSPTMEQSWTENDFDKLREEGFRRSVIINFSKLKE DVRTHRKEAKNHEERLDEWLTRTNSIEKTLNDLTELKIMVRELHDACTSFSSRFDQVEER VSVIEDQMNQTKREEKFREKRVKRNQQSLQEIWD >gi568815588r:12798438_13101570|GENSCAN_predicted_CDS_8|465_bp atggggagaaaccagagcagaaaagctgaaaattctaaaaaccggaacgcctcttctcct ccaaaggaatgcagctcatcgccaacaatggaacaaagctggacagagaatgactttgac aagttgagagaagaaggcttcagacgatcggtaataataaacttctccaagctaaaggag gatgtccgaacccatcgcaaagaagctaaaaaccatgaagaaagattagacgaatggtta actagaacaaacagcatagagaagaccctaaatgacctgacggagctgaaaatcatggta cgagaactacatgacgcatgcacaagcttcagcagccgatttgatcaagtggaagaaagg gtatcagtgattgaagatcaaatgaatcaaacgaagcgagaagagaagtttagagaaaaa agagtaaaaagaaatcaacaaagcctccaagaaatatgggactag >gi568815588r:12798438_13101570|GENSCAN_predicted_peptide_9|383_aa MGDFNTPLSTLDRSTRQKVNTDIQELNTALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS KIDHIVGNRALLSKCKRTEIITNCLSDHSAIKLELRIKKLTQNHSTTWKLNNLLLSDYWV NNEMKAEIKMFFETNENIDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQEQIHSKASMLEVLARAIRQEKEIKGIQLGKEEVKLSLLADDMIVYLENPIISAQN LLKRISNFSKVSRCKINVQKSQAFLYTNNRQTESQIMSQLPFTIASNRIKYLGIQLTRDV KDLFKKNYKTLLNEIKEDTNKGKNIPCSWIGRINIVNMAILPKVIYRFNAIPIKLPMTFF TELEKTTLKFIWNQKRAALPSQS >gi568815588r:12798438_13101570|GENSCAN_predicted_CDS_9|1152_bp atgggagactttaacaccccactgtcaacattagacagatcaacaagacagaaagttaac acggatatccaggaattgaacacagctctgcaccaagcagacctaatagacatctacaga actctccaccccaaatcaacagaatatacattcttctcagcaccacatcacacttattcc aaaattgaccacatagttggaaacagagcactcctcagcaaatgtaaaagaacagaaatt ataacaaactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaagctc actcaaaaccactcaactacatggaaactgaacaacctgctcctgagtgactactgggta aataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacatagacaca acataccagaatctctgggacacatttaaagcagtgtgtagagggaaatttattgcacta aatgcccacaaaagaaagcaggaaagatctaaaattgacaccctaacatcacaattaaaa gaactagagaagcaagagcaaatacattcaaaagctagcatgttggaagttctggccagg gcaatcaggcaggagaaagaaataaagggtattcaattaggaaaagaggaagtcaaattg tccctgcttgcagatgacatgattgtatatttagaaaaccccatcatctcagcccaaaat ctccttaagcggataagcaacttcagcaaagtctcaagatgcaagatcaatgtgcaaaaa tcacaagcattcctatacaccaataacagacaaacagagagccaaatcatgagtcaactc ccattcacaattgcttcaaacagaataaaatacctaggaatccaacttacaagggatgtg aaggacctcttcaagaagaactacaaaacactgctgaacgaaataaaagaggacacaaac aaagggaagaacattccatgctcgtggataggaagaatcaatatcgtgaacatggccata ctgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatgactttcttc acagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagctgcattgcca agtcaatcctaa >gi568815588r:12798438_13101570|GENSCAN_predicted_peptide_10|161_aa MGEDFMSKTPKAMATKDKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFTIYPSDKGL ISRIYKELKQIYKKKANNPIKKWAKDMNRHFSKEDIYAGNTHVTEKAAQKSPGRDNTIDT SDHPFWRAGYRDFLSARQQDFKRVEAAYAQNWHSTISAAFC >gi568815588r:12798438_13101570|GENSCAN_predicted_CDS_10|486_bp atgggcgaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagacaaaattgac aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg aacaggcaacctacagaatgggagaaaatttttacaatctacccatctgacaaaggccta atatccagaatctacaaagaacttaaacaaatttacaagaaaaaagcaaacaaccccatc aaaaagtgggcaaaggatatgaacagacacttctcaaaagaagacatttatgcaggcaac acacacgtcacagagaaggctgcccagaagagccctgggagagacaacaccatcgacacc tcagaccaccctttctggagggcggggtacagggacttcctctctgcaaggcaacaggat ttcaagagagtggaagctgcctacgcccagaactggcacagcaccatttcagctgcattc tgttag >gi568815588r:12798438_13101570|GENSCAN_predicted_peptide_11|317_aa MSISETQSQEKKVTHKLPMALVWEARELYKHHQKEHDLIIMSNKAGVPKPRVVNQYWPVA SQELDRTAGAWVSIIIIIIIIIIIIIIIIILKSKLTGKEPGGTVLGKTAVRSPSEAYLSI PSRKGHRPRLWLHDHLEFDSLKARIDNWVIRKHISKRNKEVTGEKLNCCPPLGTLGPWAM HPNPGWEGSGEPTIRQSISQVPAAIGFSQESQSPSLNSDAILKARIPRHAQQGHVFHGVE FQGLRPRILSRFGMDKIYEGQVKVTGDEYDVESIDGQPGAFTCYLDAGLARTTAGNQVFG TLKGAVDGGLSIPRNTK >gi568815588r:12798438_13101570|GENSCAN_predicted_CDS_11|954_bp atgagcatcagtgaaactcagagccaagagaagaaggtcacccacaagctgcccatggct ttggtgtgggaggctcgagagctgtacaaacaccaccaaaaagaacacgacctaataatt atgtccaataaagcaggggtccccaaaccccgggtggtgaaccagtactggcctgtggcc tctcaggaactagaccgaacagcaggagcctgggtgagtatcatcatcatcatcatcatc atcatcatcatcatcatcatcatcatcatcttgaaaagtaaactcactgggaaagagcca gggggcactgttttggggaaaacagcagtcagatctccctcagaggcttatctaagtatc cccagcagaaaaggccatcgtccgaggctctggttgcatgatcatttggagtttgatagc ctgaaggcgagaatagacaactgggttattaggaaacatatatcaaaacgaaacaaggag gttacgggagagaaacttaattgctgtccaccattgggcacccttgggccatgggccatg caccccaaccctggctgggaggggagcggggaacccaccattcgccagtctatctcgcag gtgcctgcagccattgggttttcccaagagagccaatccccttccctgaattcagacgcc atcttgaaagcacgaattcccagacacgcacagcaaggtcacgtcttccatggggttgag ttccaaggtctgaggccgaggattctcagtaggtttggcatggacaagatctatgaaggc caagtgaaggtgactggcgatgaatacgatgtggaaagcattgatggtcagccaggtgcc tttacctgctatttggatgcaggccttgccagaactaccgctggcaatcaagtttttggc accctgaagggagctgtggatggaggcttgtctatccctcgcaataccaaatga