GENSCAN 1.0    Date run: 6-Nov-116    Time: 07:40:37

Sequence gi568815596r:43124322_43326315 : 201994 bp : 49.23% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2673   2807  135  2  0   53   68    73 0.318   2.56
 1.02 Intr +   3660   3718   59  2  2   71   66    38 0.213  -2.42
 1.03 Intr +   6733   6806   74  1  2   82   71    95 0.540   6.25
 1.04 Intr +  30484  30664  181  2  1   56   49    80 0.072  -0.17
 1.05 Intr +  33954  34105  152  0  2  113   35    42 0.192   1.21
 1.06 Intr +  37751  37874  124  0  1   33   60   107 0.297   2.04
 1.07 Intr +  39041  39169  129  2  0  108   80     7 0.242   1.71
 1.08 Intr +  60086  60197  112  2  1   89  105     4 0.477   2.58
 1.09 Intr +  61835  61985  151  1  1   81   27    79 0.409   0.84
 1.10 Intr +  63977  64128  152  0  2   88  116   -11 0.322   1.58
 1.11 Intr +  65051  65137   87  1  0  109   90    -7 0.232   1.77
 1.12 Intr +  70862  70945   84  1  0   57  106    35 0.064   2.22
 1.13 Intr +  77235  77437  203  2  2   14   37   144 0.006  -0.02
 1.14 Intr +  90153  90311  159  0  0   58   77    55 0.007   0.50
 1.15 Intr +  93636  93768  133  0  1   95   59    22 0.073   0.65
 1.16 Intr +  94311  94391   81  2  0   84  105    18 0.439   2.93
 1.17 Intr +  95438  95639  202  1  1  141  121    79 0.830  15.36
 1.18 Intr +  96932  97047  116  0  2   58   68    92 0.719   4.37
 1.19 Term +  97467  97484   18  2  0   79   47     4 0.371  -6.28
 1.20 PlyA +  97708  97713    6                               -0.45

 2.16 PlyA -  98106  98101    6                                1.05
 2.15 Term - 101431  99998 1434  1  0   92   54  2326 0.987 220.47
 2.14 Intr - 101841 101639  203  1  2  110   64    24 0.378   1.20
 2.13 Intr - 102982 101944 1039  1  1   48  102   234 0.202  10.87
 2.12 Intr - 107558 107505   54  2  0   72  115    54 0.773   5.68
 2.11 Intr - 108561 108392  170  1  2  112   58   179 0.906  16.97
 2.10 Intr - 119394 119248  147  1  0  103   78    26 0.002   3.31
 2.09 Intr - 144594 144449  146  2  2   87   77    30 0.791   1.73
 2.08 Intr - 144761 144705   57  1  0  142   77     2 0.830   2.50
 2.07 Intr - 145238 145110  129  1  0   22   81   158 0.325   8.31
 2.06 Intr - 145933 145909   25  2  1   32   94    21 0.180  -5.82
 2.05 Intr - 155575 155444  132  2  0   81   82    89 0.968   8.22
 2.04 Intr - 162740 162587  154  2  1   75  111    99 0.998  10.55
 2.03 Intr - 168892 168513  380  2  2   77   98   170 0.523  11.58
 2.02 Intr - 182127 181936  192  1  0   42   99   110 0.008   6.96
 2.01 Intr - 196219 196125   95  0  2  104   80    15 0.013   1.91

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 115627 115720   94  0  1   79   78    94 0.952   8.04
S.002 Init - 182056 181936  121  1  1   77   99    80 0.941   8.35

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:43124322_43326315|GENSCAN_predicted_peptide_1|783_aa
SMWDGSGVGAALIKGWLLDLELTTGSLAGLRPSAEGFQQVPQSRMVLASHQNSGCRMDAL
CCPSRICQAAGYSAEKEKTEAVVMSPKELKLPPLHIPGPGSREANACLALCPEMHIPARY
DPAAVTTASPDVESPPDVLTIAAVPRMGLGTAPHHWAKRMTRHSAGDDLLLLGRPFSSSS
SPILTQRPVSCHCPRGLADLAPSSNAPALEPDKVRPPYQGHGPRFSSLARRDKGYGPYQP
RRTHQQLGRTQQGSAQAGTQTHLGGTDVSEFYPSFISPRPHNLMRKGLLTSLGSRPLPLT
SEWAATSSWFTYREWLRTPVTEFWVCNDEADVDTRVEGFDVNQRTKWKPSIWGPPEAAIY
RKGFVVIFKNRYGPSPKQNSEPSPLGCSVSCMNLALISPSVPILDGVPGPRVFRIFTSCV
SIKEASLNMPVKNTFSAVRWFTPIITTLWEAEKNGIIYSNNNSIDMLSFLFVGKPQMTVW
LRLQPTPSCCPLDWQPLNPLPSKCPPSHQFNWLPVQLQKRPNLSVEPKQEAGEKGANGQG
PLRLLQEVPCSGSGPTGLLLASSCLSAPQTLWAPSEKPAGWNIQGPAMLSLSAKKDLGLL
WLLVVQGKQTAGFQLPRSPFSSGAGLRLPACKQSPHQQAVTTGAESEFFLKMSLQTASGT
KACPWQVCPATEGSSWGGLSSCQAPPRHSPRCPPPSLSRASSGTELELPWTATALHIPQE
CGQEKINTIRVHDNGPLMEASNKHQNAPVNTSAIIVNVKQLPEAWRIHQTAPSPPCSARK
LSA

>gi568815596r:43124322_43326315|GENSCAN_predicted_CDS_1|2352_bp
tccatgtgggatgggagcggggtgggggcagccctaatcaagggatggctgctggacttg
gagctgaccactggctccctggcaggccttagaccttccgcagaaggatttcagcaggtt
cctcagagcagaatggtcctagcaagtcatcaaaacagcgggtgccgcatggacgccctg
tgctgcccttcacggatatgccaggctgcaggctacagtgccgagaaggagaagacggag
gcggtggtgatgtcacccaaggaattaaaactcccacccctgcacatccccgggccaggc
tccagggaagccaacgcctgcttggccttgtgcccagagatgcacattccagccaggtat
gatcctgctgctgttactactgcaagcccagatgtggaatcacctccagatgttctcacc
atagcagctgtgcctaggatgggattgggcactgctccccatcactgggctaaacgaatg
acccgccactcagcaggtgacgacttactcctccttggccggcccttcagctcctcctcc
tcacccatcctgactcagaggcctgtcagctgccactgcccccgaggcctggcagatttg
gcacccagcagcaatgccccggctctggaacctgacaaagtgagacctccctaccagggc
catggcccccgcttctccagcctcgctcgccgggacaaaggttacgggccttatcaacca
aggaggacccaccagcagcttggacgcacccaacaaggctctgcccaggctggaactcag
acacatctgggaggcacagatgtgtctgagttttatccaagctttatctcaccaaggccc
cataacctcatgaggaaaggactcctgacctcattaggctcgcgccccctcccgctgacc
tccgaatgggctgctacaagttcatggttcacctacagggagtggctgcgcacaccagtc
actgagttctgggtatgcaatgatgaagcagatgtggacaccagggtggagggttttgat
gtcaaccaaaggaccaaatggaagccttccatttggggtcctccagaagctgctatttac
agaaaaggctttgttgtcatttttaaaaacagatatggcccatcacccaaacaaaactca
gagccttctcccctggggtgctctgtttcctgcatgaatcttgctctcatctccccgtcc
gtgcccatcctagatggggtgccagggccaagggtctttaggattttcacctcctgtgtc
agcatcaaggaagcttccctgaacatgccagtcaagaacacattctcggccgtgcggtgg
ttcacacctataatcacaacgctttgggaggctgagaaaaatggaattatttattcaaat
aataactctattgatatgctctcattcctgtttgtggggaagccccagatgactgtgtgg
ctgaggctccaaccaacccccagctgctgccccctggactggcagcccctcaacccactc
cccagcaagtgtcctccctctcaccagttcaactggctcccagttcagctgcagaaacgg
ccaaacctctctgtggaacccaaacaggaagctggggagaagggagccaacggccaagga
cctttgaggctcctccaggaagtcccctgcagcggctctggacctacagggctattgttg
gcaagctcctgcctgtcagccccacagactctctgggccccctctgagaagcctgctgga
tggaacatccagggaccagccatgctgagcctctcagccaagaaggatttgggtttgctt
tggttattagtggtgcaggggaaacaaacagcaggtttccaattaccccgcagccccttc
tcaagtggtgcaggactcagactgcccgcctgcaagcagtcaccacaccagcaagcagtc
accacgggagcggagagtgaattctttctaaaaatgtccctgcagacagccagcggcacc
aaggcctgtccttggcaggtgtgcccagcaacagagggctcctcctggggaggcctgtcc
agctgccaggccccgccccgccacagcccccgctgtcctcctccctccctcagccgtgcc
agcagcggcacagaactggaattgccctggacggccacagctctgcatatcccccaggag
tgtggacaagaaaaaataaacacaattagagttcacgacaatgggcccctgatggaagca
agcaacaagcaccaaaacgctccagttaacaccagtgcaatcatcgttaacgtgaaacag
ctgcccgaggcctggcgtatccaccagaccgccccctcccctccatgctcagctaggaag
ttgtccgcataa

>gi568815596r:43124322_43326315|GENSCAN_predicted_peptide_2|1452_aa
XQNPCLVTRAVYIDILFLLTCCLNRSAKDNQPVSHGLSVEEKPQVLCELEFKLNRVMVDK
FLIPNPVLRTNSPKEPEVGRAQYLTPVVPTLWEAKAVLESLGFWEEVRGIISGSELITGF
PWAFKVPGLPQYLQSLTRLAIAAVWAAAAKSGERETNVPISFSQLLESAFPEVRSLTLEA
LLEKFLAAASGLGEKGVPPLLCNMGEKFLLLAMKENHPECFCKNRELIAAELKQWVQLVI
LSCEDHLPTESRLAVVEVLTSTTPLFLTNPHPILELQDTLALWKCVLTLLQSEEQAVRDA
ATETVTTAMSQENTCQSTDVETEAQEREPGPPDVHLHLSECVTGSGSHDVQKQVQTNAPV
FDLLASYECQNFRFPQQGHITMQDFSSLCEDDSSSPSFRPDSIGSIPPEHPSQHWHLPFA
LLSSLSHRLLPSQQPDEDLESHFPLKTERALRQVVPGKGLLLICGHLSAQELVRKGELGD
AFQNKGEFAFCQVDASIALALALAVLCDLLQQWDQLAPGLPILLGWLLGESDDLVACVES
MHQQAKGSKRIVPLYEKFGDMAPALKPCLLSLGGEGAEKRARAPPARLPRRPPAWPPPPP
PRLHVQSGWSRHRLDAGPEPPPPGRGTLCKPAGPRGWLRYSGILQFPRPVQNRRWELKAP
QVRLGEQGRERHVQGCRIPATLNVPGLSDRAAPEENSRAFPPPPGGIHWGRELHATPAAR
AAGPARRPPAPSPGGKKLAAANGRPGRRLTSASGRAALPGDPGGGRVLSPPPARPPRRSR
TPGLYIGRALGGRRVPAVRELPSAPRGGRAGHPGARRSAPRFSPSPLPHSAGPSRRPAPL
HSPSPLAPRQTFGPSPPLARYSSWLKPGHAAPRAPPDLPACRSGHCGIQKHVDHTSVRLL
RCRLLVQGRESPGVCKRVSERPGGGEGGKEGAGVAVEFLHCRPRPRCPGLPGQASRRPPR
QPGRRITCRNPARPTTEKSLANLNLNNMLDKKAVGTPVAAAPSSGFAPGFLRRHSASNLH
ALAHPAPSPGSCSPKFPGAANGSSCGSAAAGGPTSYGTLKEPSGGGGTALLNKENKFRDR
SFSENGDRSQHLLHLQQQQKGGGGSQINSTRYKTELCRPFEESGTCKYGEKCQFAHGFHE
LRSLTRHPKYKTELCRTFHTIGFCPYGPRCHFIHNADERRPAPSGGASGDLRAFGTRDAL
HLGFPREPRPKLHHSLSFSGFPSGHHQPPGGLESPLLLDSPTSRTPPPPSCSSASSCSSS
ASSCSSASAASTPSGAPTCCASAAAAAAAALLYGTGGAEDLLAPGAPCAACSSASCANNA
FAFGPELSSLITPLAIQTHNFAAVAAAAYYRSQQQQQQQGLAPPAQPPAPPSATLPAGAA
APPSPPFSFQLPRRLSDSPVFDAPPSPPDSLSDRDSYLSGSLSSGSLSGSESPSLDPGRR
LPIFSRLSISDD

>gi568815596r:43124322_43326315|GENSCAN_predicted_CDS_2|4359_bp
nngcaaaatccatgtttggtgaccagagctgtatatattgatattctcttcctattgact
tgctgcctcaacagatctgcaaaggacaaccagccagtatcccatggactaagtgtggaa
gagaagccccaggtcctttgtgagttagagtttaaattaaacagggtcatggtggacaaa
tttctgattcccaatcctgttcttcggaccaattcgccgaaagaaccagaagtaggccga
gcgcagtacctcacacctgtagtaccaacactttgggaggccaaggcagttctggagagt
cttggcttctgggaggaagtcagagggattatctcaggatcagagctgataacgggattc
ccttgggccttcaaggtgccaggcctgccccagtacctccagagcctcaccagactagcc
attgctgcagtgtgggccgcggcagccaagagtggagagcgggagacgaatgtccccatc
tctttctctcagctgttagaatctgccttccctgaagtgcgctcactaacactggaagcc
ctcttggaaaagttcttagcagcagcctctggacttggagagaagggcgtgccacccttg
ctgtgcaacatgggagagaagttcttattgttggccatgaaggaaaatcacccagaatgc
ttctgcaagaacagggaattgatagctgctgagctgaagcagtgggttcagctggtcatc
ttgtcatgtgaagaccatcttcctacagagtctaggctggccgtcgttgaagtcctcacc
agtactacaccacttttcctcaccaacccccatcctattcttgagttgcaggatacactt
gctctctggaagtgtgtccttacccttctgcagagtgaggagcaagctgttagagatgca
gccacggaaaccgtgacaactgccatgtcacaagaaaatacctgccagtcaacagatgtg
gaaactgaagcccaagaaagagagcctgggccccctgacgttcacctgcatctcagcgag
tgcgtcactggcagcggctcccatgatgtgcaaaagcaggtgcaaactaatgctcccgtc
ttcgatctgctggccagttatgagtgccagaactttcgctttcctcagcaaggccatatc
acaatgcaggatttctcttcactgtgtgaggatgactcctccagccccagcttcaggcct
gactctattgggagcatccctcctgagcaccccagccagcactggcacctgccctttgcc
ctcctgagttcactcagccacagacttcttccatcccaacagccagatgaagacctggaa
tcacattttcccctcaagacagaaagggctctgcggcaggttgtgcctgggaaggggctg
cttctcatttgtggccacctctctgcccaggagctggtgaggaagggtgaactaggggat
gcctttcagaacaaaggagagtttgccttctgccaggtggatgcctccatcgctctggcc
ctggccctggccgtcctgtgtgatctgctccagcagtgggaccagttggcccctggactg
cccatcctgctgggatggctgttgggagagagtgatgacctcgtggcctgtgtggagagc
atgcatcagcaggccaagggttcaaaacgcattgtccctctttatgagaaatttggagac
atggccccggcgctcaagccctgtttactgagcctgggcggggagggggcggagaaacga
gcccgggctccaccggcaagactgccgcggcggccgcccgcgtggccacccccaccccca
ccgcgactccacgtgcagtcgggctggagccgccaccgactggacgcaggccccgagccc
ccgcctcctggccggggcaccctttgcaaacccgccgggccgcggggatggttgcgatat
tctggcattttgcaattcccgcgcccagtacaaaaccgaaggtgggagcttaaagctcca
caggtccgcctcggagaacagggcagggaaagacacgtccagggctgcagaatcccggcc
acgctaaacgtaccggggctctccgaccgcgcagccccggaggagaacagccgtgccttc
ccgccgccacccggcggcatccactggggccgagagctacacgccacaccggccgcccgg
gccgccggccccgcccggaggcctccagcaccctcccccggaggaaaaaaattggcggcg
gccaatgggaggccgggaaggcgcctgacgtccgcgagcgggcgggcggcgttgcctgga
gaccccggcgggggccgagttctgtcccctcccccggcgcgcccgccccgccgcagccgc
actcccgggctctatatagggcgcgcgctcggaggccgccgagttccagcagtccgcgag
ctgccgtcggctccgcggggggggcgggccgggcaccccggggcgcggaggagcgctcct
cgcttctctccttcccccctgccgcactccgccggaccctcccgccggcccgcgccgctg
cactcgccctctcctctcgccccccggcaaactttcggcccctccccgcccctcgcccgt
tattcgtcgtggctcaagcccggccacgccgccccaagggctcctcccgacctcccggcc
tgccgctccggccactgcgggatccagaaacatgtcgaccacacttctgtccgccttcta
cgatgtcgacttcttgtgcaaggtcgggaaagtcccggggtttgcaaaagagtgtccgag
cgccctggaggcggggagggcggcaaggagggcgccggtgtcgcggttgagtttctccac
tgccgaccgcggccacgctgcccggggcttcccggacaggcttcgcgccgcccacctcgg
cagccggggcggaggatcacgtgtcgaaacccagcgcggcccacgacagagaaatccctg
gccaacctcaacctgaacaacatgctggacaagaaggcggtggggacgcctgtggccgcc
gcccccagctcgggcttcgcgccgggattcctccgacggcactcggccagcaacctgcat
gcactcgcccaccccgcgcccagccccggcagctgctcgcccaagttcccgggcgccgct
aacggcagcagctgcggcagcgcggcggccggcggtccgacctcctacggcacccttaag
gagccgtcggggggcggcggcacagccctgctcaacaaggagaacaaattccgggaccgc
tcgtttagcgagaacggcgatcgcagccagcacctcctgcacctgcagcagcagcagaag
gggggcggcggctcccagatcaactccacgcgctacaagaccgagctgtgccggcccttc
gaggagagcggcacgtgcaagtacggcgaaaagtgccagttcgcgcatggcttccacgag
ctgcgcagcctgactcgccatccgaagtacaagaccgagctgtgccgcacctttcatacc
atcggcttctgcccctatgggccgcgctgccacttcatccacaacgcggacgagcggcgg
cccgcgccgtcggggggcgcctccggggacctgcgtgcctttggcacgcgcgatgcgttg
cacctgggcttcccgcgggagccgcggcccaagttgcaccacagcctcagcttctcgggc
ttcccgtcgggccaccatcagcccccgggcggcctcgagtcgccgctgctgctcgacagc
cccacgtcgcgcacgccgccgccgccctcctgctcttcggcctcgtcctgctcctcctcc
gcctcctcctgttcctcggcctccgcggcctccacgccctcgggcgccccgacatgctgc
gcctccgcggcggccgcggctgcggccgctctgctgtacggcaccgggggcgccgaggac
ctgctggcgccgggggccccgtgcgcggcctgctcgtcggcctcgtgcgccaacaacgcc
ttcgccttcggtccggagctcagcagcctcatcacgccgctcgccatccagacccacaac
tttgccgccgtggccgccgccgcctactaccgcagtcagcagcagcagcagcagcagggc
ctggcgccccccgcgcagccgccggcgccgcccagcgcgaccctccccgccggggccgcc
gcacctccctcgccgcccttcagcttccagctgccgcgccgcctgtccgactcgcccgtg
ttcgacgcgccccccagccccccggactcgctgtcggaccgcgacagctacctaagcggc
tccctgagctccggcagcctcagcggctctgagtctcccagcctcgaccctggccgccgc
ctgccaatcttcagccgcctctccatctccgacgactga